Skip to content

Dataset for URL benchmarking purposes

License

MIT, Apache-2.0 licenses found

Licenses found

MIT
LICENSE-MIT
Apache-2.0
LICENSE_APACHE
Notifications You must be signed in to change notification settings

ada-url/url-dataset

Repository files navigation

URL Dataset

Disclaimer: This repository is developed and released for educational purposes. Use at your own risk.

This repository crawls the top visited 100 websites and extracts unique URLs to be used for generating a dataset of unique real-world URL examples. The following script creates a out.txt file with each line containing a different URL.

This project uses Node.js. We recommend running the following with code with at least Node 18.

  • For installing dependencies, run npm install
  • To execute the script run npm start and the output will be written out.txt file.

About

Dataset for URL benchmarking purposes

Topics

Resources

License

MIT, Apache-2.0 licenses found

Licenses found

MIT
LICENSE-MIT
Apache-2.0
LICENSE_APACHE

Stars

Watchers

Forks