Skip to content

Open Data Sets

October 11, 2016

Introduction

If you’re like me, whenever you’re working with a new piece of software or product that works with data, you like to actually get your hands on it and work with it. Sadly this sometimes requires you have data. You can create your own data, but this takes needless time out of your discovery work. Not only that, but if you want to practice data analytics or Machine Learning then you certainly do need some pre-existing dataset. This is what lead me to search for some free data sets out there. Here is a list of some of the best places I’ve found to get open data sets.

Datasets

US Government Data

Description

The US Government provides some data about business, agriculture, education, energy and other types of data. Some are specific to a city and some are more general.

URL

https://data.gov

Kaggle Datasets

Description

Kaggle is a Data Challenge competition website where people can compete to create the best predictive model on a variety of datasets. Along with this they have provided a repository of their datasets.

URL

https://www.kaggle.com/datasets

Sean Lahman – Baseball Database

Description

A free relational database of individual and team statistics that covers the game back to 1871 up to the early 2000’s.

URL

http://www.seanlahman.com/baseball-archive/statistics/

Advertisements

From → Data Science

Leave a Comment

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: