Submitted by Blake on March 29, 2016 - 6:26pm
Google BigQuery Public Datasets
A public dataset is any dataset that is stored in BigQuery and made available to the general public. This page lists a special group of public datasets that Google BigQuery hosts for you to access and integrate into your applications. Google pays for the storage of these data sets and provides public access to the data via BigQuery. You pay only for the queries that you perform on the data (the first 1 TB per month is free, subject to query pricing details). It includes the GDELT HathiTrust and Internet Archive Book Data. This dataset contains 3.5 million digitized books stretching back two centuries, encompassing the complete English-language public domain collections of the Internet Archive (1.3M volumes) and HathiTrust (2.2 million volumes).
From Google BigQuery Public Datasets — Google Cloud Platform