Find the perfect dataset for your Data Science project 🎯
Let’s say that you need to find a dataset for a Data Science project. Perhaps this is a project for school, or a practice project to build up your portfolio and showcase your skills.
Where should you look? Here are 5 sources I recommend checking out:
📈 Kaggle Datasets
It’s fun to browse, and the upvoting system makes it easy to discover high-quality datasets. Also, its Data Explorer lets you see a preview of the raw data.
💌 Data Is Plural
This is a fascinating weekly newsletter (since 2015!) that highlights “useful/curious datasets.” Search its archive via a Google Sheet or web app.
📚 Awesome Public Datasets
A gigantic list of high-quality datasets grouped into 30+ topics.
🏛️ Data.gov
Open data from the US government. It’s huge, well-organized, and more interesting than you would think!
🔎 Google Dataset Search
This is a great way to search for a dataset, especially if you already have a specific topic in mind. Also, the autocomplete feature is quite nice!
Want even more options?
Researcher Sebastian Raschka compiled a list of dataset repositories for Machine Learning and Deep Learning.
What did I miss?
Let me know in the comments section below if you have another great source for datasets!