Skip to content

Latest commit

 

History

History
167 lines (118 loc) · 5.7 KB

sources.md

File metadata and controls

167 lines (118 loc) · 5.7 KB

Data Sources

This project uses data from various sources that are openly licensed or in the public domain. Below are the sources and their respective information:

CC Legal Tools

Description: A .txt file provided by Timid Robot containing all legal tool paths.

API documentation link:

API information:

  • No API key required
  • No query limits

Flickr

Description: With over 5 billion photos (many with valuable metadata such as tags, geolocation, and Exif data), the Flickr community creates wonderfully rich data. The Flickr API is how you can access that data. In fact, almost all the functionality that runs flickr.com is available through the API. (Flickr: The Flickr Developer Guide)

API documentation link:

API information:

  • API key required
  • Query limit: 3600 requests per hour
  • Data available through CSV format

GitHub

Description: A development platform for hosting and managing code.

API documentation link:

API information:

  • API key not required but recommended by GitHub
  • Query limit: 60 requests per hour if unauthenticated, 5000 requests per hour if authenticated
  • Data available through JSON format

GCS (Google Custom Search) JSON API

Description: The Custom Search JSON API allows user-defined detailed query and access towards related query data using a programmable search engine.

API documentation links:

API information:

  • API key required
  • Query limit: 100 queries per day
  • Data available through JSON format

Notes:

  • The data from Google Custom Search will only cover 50+ general, most significant categories of CC License for data collection quota constraint. As an additional note, the order of precedence of license the collected data's first column is sorted due to intermediate data analysis progress.

Internet Archive Python Interface

Description: A python interface to archive.org to achieve API requests towards internet archive.

API documentation link:

API information:

  • No API key required
  • No query limits

MediaWiki Action API

Description: The MediaWiki Action API is a web service that allows access to some wiki features like authentication, page operations, and search. It can provide meta information about the wiki and the logged-in user. (API:Main page - MediaWiki)

API documentation link:

API information:

  • No API key required
  • Query limit: depends on user status and request type
  • Data available through XML or JSON format

The Metropolitan Museum of Art Collection API

Description: The Met’s Open Access datasets are available through our API. The API (RESTful web service in JSON format) gives access to all of The Met’s Open Access data and to corresponding high resolution images (JPEG format) that are in the public domain. (The Metropolitan Museum of Art Collection API)

API documentation link:

API information:

  • No API key required
  • 80 queries per second

Vimeo API

Description: The Vimeo API allows users to perform filtered, advanced search on Vimeo videos.

API documentation link:

API information:

  • API key required
  • Query limit: 5000 authenticated requests per day
  • Data available through JSON format

YouTube Data API

Description: An API from YouTube for platform users to upload videos, adjust video parameters, and obtain search results.

API documentation link:

API information:

  • API key required
  • Query limit: depends on the type and number of requests
  • Data available through JSON format