Data Search Engines
- Google Dataset Search: A Google search engine for datasets. You will find information about usage rights, citation, creators, and more.
- Dimensions Data Search: There are many forms of research-related data available, often stored in unconnected repositories and systems. Dimensions provides results for datasets, grants, conference proceedings, preprints, books and chapters, journal articles, patents, and clinical trials.
Sites to Find Data
- DataLumos: DataLumos is an ICPSR archive for valuable government data resources. ICPSR has a long commitment to safekeeping and disseminating US government and other social science data. DataLumos accepts deposits of public data resources from the community and recommendations of public data resources that ICPSR itself might add to DataLumos.
- DOE Data Explorer: The DOE Data Explorer (DDE) is the Office of Scientific and Technical Information's (OSTI) search tool for finding DOE-funded, publicly available, scientific data submitted by data centers, repositories, and other organizations funded by the Department.
- DATA.gov: The home of the U.S. Government’s open data. Here you will find data, tools, and resources to conduct research, develop web and mobile applications, design data visualizations, and more.
- Explore Census Data: A platform for accessing data and digital content from the U.S. Census Bureau (U.S. Department of Commerce).
- Texas Open Data Portal: The official State repository of publicly accessible data. It features tools to search, filter, map, and visualize official State agency data sets. Users can access these data sets to meet their unique information needs for research, collaboration, innovation, and more.
- Texas Higher Education Data (THED): Texas’ primary source for statistics on higher education.
- Texas Data Repository: The Texas Data Repository is the research data management system for the Texas Digital Library (TDL) member institutions. It is a statewide collaboration of Texas higher education institutions.
- World Bank Data & Statistics: Time series statistics on national accounts, international transactions, manufacturing, and monetary indicators are provided for World Bank member countries.
- Dryad: An international repository of scientific and medical datasets.
- European Data Portal: The official portal for European data, funded by the European Union. It offers datasets and statistics from the public data portals of EU countries.
- Simmons Directory for Open Data: A large list of data repositories divided by subject.
- Association of Religion Data Archives (ARDA): The ARDA Data Archive is a collection of surveys, polls, and other data submitted by researchers and made available online by the ARDA.
- HealthData.gov: Designed for consumers, this website provides information and data regarding a variety of health topics. Topics include environmental health, medical devices, Medicare & Medicaid, social services, community health, mental health, and substance abuse.
- DataONE: DataONE is a community-driven program providing access to data across multiple member repositories, supporting enhanced search and discovery of Earth and environmental data. DataONE promotes best practices in data management through responsive educational resources and materials.
- Open Context: Open Context publishes archaeological data you can analyse, media you can reuse, and field notes you can explore. All content published by Open Context carries Creative Commons copyright licenses, which means you have explicit legal permission to reuse and adapt this content in your own works, as long as you provide appropriate citations to Open Context contributors.
- UCI Machine Learning Repository: The UCI Machine Learning Repository is a collection of databases, domain theories, and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms.
- ICPSR: The Inter-university Consortium for Political and Social Research (ICPSR) maintains a data archive for research in the social and behavioral sciences.
Considerations for Publishing Data
Data repositories are a great way to promote access and collaboration across scholarly communities, but it can be hard to know where to publish your dataset.
To help, consider the FAIR guidelines when choosing which data repository is right for your data:
- (F)indable: (Meta)data should be easy to find for both humans and computers.
- (A)ccessible: The user needs to know how the data can be accessed, possibly including authentication and authorization.
- (I)nteroperable: The data usually need to be integrated with other data, and they need to interoperate with applications or workflows for analysis, storage, and processing.
- (R)eusable: (Meta)data should be well described so that they can be replicated and/or combined in different settings.
Source: FAIR Principles
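As a rough illustration of how the four principles above might be turned into a pre-deposit checklist, the Python sketch below checks a dataset's metadata record for gaps. The field names (`identifier`, `access_conditions`, `metadata_standard`, and so on) and the example record are hypothetical assumptions chosen for illustration; they are not defined by the FAIR Principles or required by any particular repository.

```python
# Hypothetical mapping from each FAIR principle to metadata fields that
# could support it. These field names are illustrative assumptions only.
FAIR_FIELDS = {
    "Findable": ["identifier", "title", "keywords"],
    "Accessible": ["access_url", "access_conditions"],
    "Interoperable": ["file_format", "metadata_standard"],
    "Reusable": ["license", "description", "creators"],
}


def missing_fair_fields(record):
    """Return, per principle, the fields that are absent or empty in record."""
    gaps = {}
    for principle, fields in FAIR_FIELDS.items():
        missing = [name for name in fields if not record.get(name)]
        if missing:
            gaps[principle] = missing
    return gaps


# A made-up metadata record; the identifier and URL are placeholders.
example_record = {
    "identifier": "doi:10.0000/example",
    "title": "Example survey data",
    "keywords": ["survey"],
    "access_url": "https://repository.example/dataset/1",
    "file_format": "CSV",
    "license": "CC BY 4.0",
    "description": "Illustrative record only.",
}

for principle, missing in missing_fair_fields(example_record).items():
    print(f"{principle}: missing {', '.join(missing)}")
```

A checklist like this only flags missing descriptive fields. Whether a given repository actually supports persistent identifiers, open formats, and clear licensing still has to be judged from the repository's own documentation.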
- Where and how can I publish my dataset?: A page from the Princeton Research Data Service with helpful information and links to data resources.
- FAIR-Aware Assessment: FAIR-Aware helps you assess your knowledge of the FAIR Principles, and better understand how making your data(set) FAIR can increase the potential value and impact of your data. The tool is discipline-agnostic, making it relevant to any scientific field. You can use this tool at any point during your research before depositing your data(set) in a data repository.