Exploring Google Dataset Search

Google Dataset Search is a search engine for datasets. It indexes thousands of datasets held in repositories across the web and lets users discover and access them from a single web-based tool. Users can search by subject, dataset creator, or other characteristics. Dataset Search returns a list of datasets that match the search keywords, with direct links to download or explore open datasets online. The number of datasets that can be found this way grows as more of them are added to the index.

What are some potential uses of Dataset Search?

  1. Students can use it to find data for building their own datasets and to broaden their skills

  2. Researchers can use it to see which datasets are being used in research, in order to improve the quality of their own work

  3. Journalists can use it to see what data scientists are using and creating, in order to better inform the public

  4. Scientists can use it to find existing datasets and to guide the creation of new ones for their research

  5. Companies can use it to learn what data is available in their field, in order to make better business decisions

Google's Datasets Mission

In addition to making datasets universally accessible and useful, the mission of Google's Dataset Search is to:

  • Foster a data sharing ecosystem that will encourage data publishers to follow best practices for data storage and publication

  • Give scientists a way to show the impact of their work through citation of datasets that they have produced

As more dataset repositories use schema.org and similar standards to describe their datasets, the variety and coverage of datasets that users find in Dataset Search will continue to grow.
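To make the idea of a standard dataset description more concrete, here is a minimal sketch in Python that builds a record using the schema.org Dataset type and prints it as JSON-LD, the kind of markup a publisher might embed in a dataset's web page. The function name build_dataset_jsonld and every value passed to it (the dataset name, the example.org URL, the publisher) are hypothetical and chosen only for illustration. The property names (name, description, url, keywords, license, creator) come from schema.org, but which fields a given repository should provide is an assumption here, not a statement of Google's requirements.

```python
import json


def build_dataset_jsonld(name, description, url, creator_name, license_url, keywords):
    """Return a schema.org Dataset description as a plain dict.

    The helper itself is hypothetical; only the "@context", "@type",
    and property names are taken from the schema.org vocabulary.
    """
    return {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
        "keywords": list(keywords),
        "license": license_url,
        "creator": {"@type": "Organization", "name": creator_name},
    }


# All values below are made up for illustration.
record = build_dataset_jsonld(
    name="Example City Air Quality Readings",
    description="Hourly air quality measurements from a fictional sensor network.",
    url="https://example.org/datasets/air-quality",
    creator_name="Example Environmental Agency",
    license_url="https://creativecommons.org/licenses/by/4.0/",
    keywords=["air quality", "environment", "sensors"],
)

# Serialize to JSON-LD text that could be placed in a web page.
print(json.dumps(record, indent=2))
```

A publisher would typically place the printed JSON inside a script element of type application/ld+json on the dataset's landing page, so that crawlers can read the description alongside the human-readable content.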



