F&M College Library

Data Science

Data sources by discipline

A repository where uses can make all of their research outputs available in a citable, shareable and discoverable manner.

A free data repository open to all within and outside the Harvard community. Users can share, archive, cite, access, and explore research data. Each individual Dataverse collection is a customizable collection of datasets for organizing, managing, and showcasing datasets.

Discover open data sets on a variety of topics. Within the online community users can share analysis methods.

Re3data is a global registry of research data repositories that covers research data repositories from different academic disciplines. It includes repositories that enable permanent storage of an access to data sets to researchers, funding bodies, publishers, and scholarly institutions, re3data promotes a culture of sharing, increased access and better visibility of research data.

The quality-assured, global Directory of Open Access Repositories. Search and browse thousands of registered repositories based on features such as location, software or type of material held.

A scientific online publication that focuses on large global problems - poverty, disease, hunger, climate change, war, existential risks, and inequality.

A yearly survey from the U.S. Census Bureau that provides data on occupation, educational attainment, home ownership status, and more.

The United States government's open data website. Provides access to datasets published by agencies across the federal government. It is intended to provide access to government open data to the public.

Produced by the National Center for Educational Statistics, a federal agency. Includes data from enrollment to outcomes, for gradeschool through post-secondary education.

The Federal Reserve Archival System for Economic Research is a project by the Research Division of the Federal Reserve Bank of St. Louis to expand on its mission to provide economic information and data to researchers interested in the U.S. economy. On this web site you will find links to scanned images of historical economic statistical publications, releases, and documents.

The data is collected and supplied from agencies from the U.S. Department of Health and Human Services as well as state partners. Includes the Centers for Medicare and Medicaid Services, Centers for Disease Control and Prevention, Food and Drug Administration, and the Agency for Health Care Research and Quality, and more.

Includes data and metadata for OECD countries and selected non-member economies.

Designed by Socrata, this initiative is to foster data-centered collaboration between governments and the private sector.

The case-level microdata for much of their research is available to the public for secondary analysis after a period of time. You can find more information here on how to use the datasets.

The Uniform Crime Reporting Program generates reliable statistics for use in law enforcement. It also provides information for students of criminal justice, researchers, the media, and the public. Crime statistics have been provided since 1930.

From the U.S. Department of Labor, the Bureau of Labor Statistics is the principal Federal agency responsible for measuring labor market activity, working conditions, and price changes in the economy.

The U.S. Census Bureau has a new content platform where one can access data collected from their decennial collection instrument.

The Program Data site provides selected statistical information on activity in all major Food and Nutrition Service programs.

This website provides access to several EPA databases to provide you with information covering environmental activities that may affect air, water, and land anywhere in the United States.

The IEA collects, assesses and disseminates energy statistics on supply and demand, compiled into energy balances in addition to a number of other key energy-related indicators, including energy prices, public RD&D and measures of energy efficiency, with other measures in development.

Intuitive tools to analyze NCSES data on R&D and the education and employment of the STEM workforce.

The U.S. Energy Information Administration (EIA) collects, analyzes, and disseminates independent and impartial energy information to promote sound policymaking, efficient markets, and public understanding of energy and its interaction with the economy and the environment.

World Environment Situation Room (WESR) is the UNEP online data, information and knowledge platform. It enables users to visualize, interrogate, access, link and download data, information and knowledge products regarding the World environment.

The data is collected from agencies from the U.S. Department of Health and Human Services as well as state partners. This includes the Centers for Medicare and Medicaid Services, Centers for Disease Control and Prevention, Food and Drug Administration, and the Agency for Health Care Research and Quality, among others. 

Explore data from various collection instruments including the National Survey of Family Growth, National Health and Nutrition Examination Survey, National Hospital Ambulatory Medical Care Survey, and more.

SAMHDA is a one-stop shop for SAMHSA public use data with online analysis tools. One can learn about different types of data files, what data are available as free downloadable PUFs, and what online analysis systems are available on the site.

Featured Datasets

General Social Survey (GSS) is a long running survey of social, cultural and political indicators. In addition to the GSS, topic modules include computer and Internet, racial and ethnic prejudice, and child mental-health stigma. The GSS has participated since 1985 in the International Social Survey Program.

Frequently used statistics

Awesome Public Datasets

GitHub logo

"This list of a topic-centric public data sources in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed are free, however, some are not."