Google BigQuery Public Datasets Includes GDELT HathiTrust and Internet Archive Book Data

A public dataset is any dataset that is stored in BigQuery and made available to the general public. This page lists a special group of public datasets that Google BigQuery hosts for you to access and integrate into your applications. Google pays for the storage of these datasets and provides public access to the data via BigQuery. You pay only for the queries that you perform on the data (the first 1 TB per month is free, subject to query pricing details). The list includes the GDELT HathiTrust and Internet Archive Book Data. This dataset contains 3.5 million digitized books stretching back two centuries, encompassing the complete English-language public domain collections of the Internet Archive (1.3 million volumes) and HathiTrust (2.2 million volumes).
From Google BigQuery Public Datasets — Google Cloud Platform
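
The pricing rule quoted above (you pay only for queries, with the first 1 TB per month free) can be made concrete with a small calculation. The following is a minimal sketch, not Google's billing logic: the function names are hypothetical, and it assumes that "1 TB" means 2**40 bytes and that the allowance is applied to the monthly total of bytes scanned. Check the current query pricing details before relying on either assumption.

```python
# Assumption: "1 TB" is taken to be 2**40 bytes; the pricing page is authoritative.
FREE_TIER_BYTES = 2**40


def billable_bytes(query_sizes, free_bytes=FREE_TIER_BYTES):
    """Bytes scanned this month beyond the free allowance (hypothetical helper).

    query_sizes: iterable of bytes scanned by each query in the month.
    """
    total = sum(query_sizes)
    return max(0, total - free_bytes)


def remaining_free_bytes(query_sizes, free_bytes=FREE_TIER_BYTES):
    """Bytes of the free allowance still unused this month (hypothetical helper)."""
    return max(0, free_bytes - sum(query_sizes))


if __name__ == "__main__":
    # Two queries of half the allowance each use it up exactly.
    month = [2**39, 2**39]
    print(billable_bytes(month), remaining_free_bytes(month))
```

The per-query byte counts have to come from somewhere. One option is a dry run in the `google-cloud-bigquery` client, which reports the bytes a query would process without running it. Because every query against the book data counts toward the monthly total, it is worth checking the size of a scan across many volumes before running it.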
