0% found this document useful (0 votes)
70 views5 pages

50 Data Websites

Uploaded by

abdoalsenaweabdo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
70 views5 pages

50 Data Websites

Uploaded by

abdoalsenaweabdo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

50 Data Websites

 Famous Data Websites:


1- Kaggle (https://www.kaggle.com/):
- A wide range of datasets for machine learning, from government data to business and health.

2- Google Dataset Search (https://datasetsearch.research.google.com/):


- A search engine to find datasets across various domains like education, government, and
science.

3- AWS Open Data (https://registry.opendata.aws/):


- Open datasets hosted on AWS, ranging from environmental data to genomics and economics.

4- UCI Machine Learning Repository (https://archive.ics.uci.edu/):


- A collection of datasets for machine learning and data mining, used extensively in research.

5- Datahub (https://datahubproject.io/):
- A platform for finding, sharing, and publishing data in various domains.

6- OpenML (https://www.openml.org/):
- A platform for sharing datasets, machine learning tasks, and results.

7- World Data (https://data.world/):


- A collaborative platform for discovering, storing, and sharing datasets across various fields.

8- FiveThirtyEight (https://projects.fivethirtyeight.com/polls/):
- A data journalism website providing datasets on political polling, economics, and sports.

9- KDnuggets (https://www.kdnuggets.com/datasets/index.html):
- A collection of datasets and resources for data science, machine learning, and AI research.

10- Data Is Plural (https://www.data-is-plural.com/):


- A newsletter that curates interesting and diverse datasets across various domains.

11- Data Science Central LLC (https://www.datasciencecentral.com/):


- A platform for data science and machine learning practitioners to share and explore datasets.

12- Awesome Public Datasets on GitHub (https://github.com/awesomedata/awesome-public-


datasets?tab=readme-ov-file):
- A curated list of open datasets from various fields, available on GitHub for public use and
analysis.
13- Google Public Data Directory (https://www.google.com/publicdata/directory):
- A collection of public datasets from various sectors, including economics, environment, and
health.

 Computer Vision Websites :


14- Roboflow Datasets (https://public.roboflow.com/):
- Public datasets for computer vision tasks, such as object detection and classification.

15- Open Images Dataset V7 (https://storage.googleapis.com/openimages/web/index.html):


- A large-scale image dataset for machine learning, specifically designed for object detection
and image classification tasks.

16- Kaggle Computer Vision (https://www.kaggle.com/datasets?sort=votes&tags=13207-


Computer+Vision):
- A collection of popular computer vision datasets for tasks like image classification, object
detection, and segmentation.

17- Papers with Code - Computer Vision (https://paperswithcode.com/area/computer-vision):


- A platform linking state-of-the-art research papers with their corresponding code and
datasets in computer vision.

18- 10 Best Open Source Datasets for Computer Vision :


- A curated list of top open-source datasets for computer vision tasks in 2024.

19- DagsHub Computer Vision (https://dagshub.com/datasets/computer-vision/):


- A collection of open-source datasets for computer vision tasks, with tools to manage and
explore them collaboratively.

20- VisualData.io (https://visualdata.io/discovery):


- A discovery platform for datasets related to visual computing, primarily for computer vision
tasks.

21- YouTube-8M (https://research.google.com/youtube8m/):


- A large-scale labeled video dataset for research in video classification and machine learning.

 Environment and Climate Websites:

22- NASA Earth Data (https://www.earthdata.nasa.gov/):


- Earth observation data related to weather, climate, and environmental monitoring.
 Business and Industry Websites :
23- Dataverse (https://www.microsoft.com/ar/power-platform/dataverse):
- A cloud-based platform to manage and share business data for app development.

24- Google Trends (https://trends.google.com/trends/):


- Provides insights into the popularity of search queries over time across different regions and
languages.

25- Inside-BigData (https://insideainews.com/):


- A website focused on big data and AI news, including access to datasets and resources.

26- Yelp Dataset (https://www.yelp.com/dataset):


- A dataset of business reviews, check-ins, and user data for research and analysis.

27- NYC TLC Trip Record Data (https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page):


- Taxi and ride-hailing trip records from the New York City Taxi & Limousine Commission.

28- BFI Industry Data & Insights (https://www.bfi.org.uk/industry-data-insights):


- UK film industry statistics and insights, provided by the British Film Institute.

29- Inside Airbnb (https://insideairbnb.com/get-the-data/):


- Airbnb listings data for cities worldwide, providing insights on the short-term rental market.

30- IMDb Datasets (https://datasets.imdbws.com/):


- IMDb’s data for movies, TV shows, and actors, available for research and analysis.

 Finance and Economics Websites:


31- Quandl (https://data.nasdaq.com/publishers/QDL):
- Provides financial, economic, and alternative data for analysis and research.

32- World Bank Open Data (https://data.worldbank.org/):


- Global development data including economic indicators, poverty levels, and more.

33- KAPSARC Data Portal (https://datasource.kapsarc.org/pages/home/):


- Datasets focusing on energy and economics in Saudi Arabia and globally.

34- IMF Data (https://www.imf.org/en/Data):


- Global economic data from the International Monetary Fund, including statistics on trade,
GDP, and inflation.
 Government and Public Data Websites:
35- CKAN (https://ckan.org/):
- An open-source platform for managing public datasets, often used by government agencies.

36- IPUMS (https://www.ipums.org/):


- U.S. and international demographic data, including census and survey data.

37- Data.gov (https://data.gov/):


- U.S. government’s open data portal, providing datasets on a wide range of public sectors like
health, education, and environment.

38- EU Open Data Portal (https://data.europa.eu/en):


- The European Union’s portal for accessing open datasets on a variety of topics like economics,
environment, and social issues.

39- FBI Crime Data Explorer (https://cde.ucr.cjis.gov/LATEST/webapp/#/pages/home):


- Crime statistics and reports from the FBI, including data on violent crime, property crime, and
arrests.

40- CAPMAS (https://www.capmas.gov.eg/):


- Egypt's official statistical agency providing demographic, economic, and social data.

41- U.S. Census Bureau Data (https://www.census.gov/data.html):


- U.S. census data on population, housing, economic, and geographic topics.

42- Data.gov.uk (https://www.data.gov.uk/):


- Open data from the UK government on various topics like education, health, and the
environment.

43- Data.gov.sa (https://data.gov.sa/ar):


- Saudi Arabia’s official portal for open government data on topics like economy, health, and
education.

44- Humanitarian Data Exchange (https://data.humdata.org/dataset):


- Datasets related to global humanitarian crises, managed by the UN's OCHA.

45- Data.gov.in (https://www.data.gov.in/):


- India’s open data portal, providing datasets across sectors like agriculture, health, and
finance.
 Health and Medicine Websites:
46- CDC WONDER (https://wonder.cdc.gov/):
- Health-related data, including statistics on public health issues, managed by the CDC.

47- UNICEF Data (https://data.unicef.org/):


- Global datasets focusing on children’s health, education, and overall well-being.

48- Global Health Observatory Data Repository (https://www.who.int/data/gho):


- The World Health Organization’s global health data repository, covering statistics on various
health indicators.

49- HealthData.gov (https://healthdata.gov/browse):


- U.S. health-related datasets covering various public health issues.

 Education and Research Websites:


50- ICPSR (https://www.icpsr.umich.edu/web/pages/):
- Social science data for research and teaching, covering a range of societal topics.

51- Harvard Dataverse (https://dataverse.harvard.edu/):


- A repository for academic and research data across disciplines.

52- Academic Torrents (https://academictorrents.com/):


- A platform for sharing academic datasets, covering topics like machine learning, biology, and
physics.

 Science and Technology Websites:


53- SDSS Data Archive (https://cas.sdss.org/dr18/):
- Astronomical data from the Sloan Digital Sky Survey, including images and spectra.

54- CERN Open Data Portal (https://opendata.cern.ch/):


- High-energy physics data from CERN experiments like the Large Hadron Collider, available for
educational and research purposes.

55- IEEE DataPort (https://ieee-dataport.org/datasets):


- A wide range of datasets related to technology and engineering, shared by IEEE members.

56- KEEL Dataset Repository (https://sci2s.ugr.es/keel/datasets.php):


- A collection of datasets for machine learning and data mining, specifically for algorithms and
benchmarking.

You might also like