19 Free Public Data Sets For Your First Data Science Project

This post was originally published at: https://www.springboard.com/blog/free-public-data-sets-data-science-project/

Completing your first project is a major milestone on the road to becoming a data scientist. It’s also an intimidating process. The first step is to find an appropriate, interesting data set. You should decide how large and how messy a dataset you want to work with; while cleaning data is an integral part of data science, you may want to start with clean dataset for your first project so that you can focus on the analysis rather than on cleaning the data.

Based on the learnings from our Foundations of Data Science Workshop and the Data Science Career Track, we’ve selected datasets of varying types and complexity that we think work well read more

Tagged with: , , , , ,

Six weeks, 77 stories, 84 cities and 350 people

How the Bureau Local team of four grew to a UK-wide collaborative network

65 people came together in five cities across the UK to dig into the database and find stories in their communities

Three months ago we launched the Bureau Local,

Tagged with: , , , , ,

Open it to fix it

How Nigerians are shedding light on public deals with dataProcurement monitor Nonye Onumonu during a mission to inspect 37 government school contracting projects across Nigeria (Credit: PPDC)

On a hot, dry day in February 2017, Nonye Onumonu boards a small wooden boat

Tagged with: , , ,

Creative Commons and Hope — How Open Access Journals Helped Me in a Dark Time

I’m bad with linear time. Sometime about two months ago, my daughter was four months old. She was healthy and happy and life was good for our family. When we brought her in for her four-month vaccination, I asked our

Tagged with: , , , , ,

US EPA Orders Turn-Off of Open Data Service on 28-Apr-2017

The US Government’s largest civilian linked open data web service is scheduled to go dark at 12noon US ET Friday 28-April 2017.

23-April 2017 — Last week, after numerous conversations with the U.S. Environmental Protection Agency’s Office of Environmental Information (OEI), and various technical

Tagged with: , , , , ,

Using mySociety’s APIs for #ge2017

If you’re living in the UK, it probably hasn’t escaped your attention that the Prime Minister Theresa May has put the wheels in motion for a UK General Election on June 8th. This means that an estimated 47 million UK

Tagged with: , , , , ,

Trump Presidency Sees Spike in “Open Data Day” Events Across US Cities

After a two-year decline in the number of “Open Data Day” events held in US cities, the Trump presidency has seen renewed interest in organizing the event, and given it new meaning.

By Aaron Wytze

It’s hard to say what first alarmed data scientists

Tagged with: , ,

EPCs in 3D — Help needed!

Buildings in Manchester City Centre (not) coloured according to their energy rating

I was going through the recently-released energy performance certificate data, and wanted to join the ranks of people who’ve made awesome things with the data.

One of the things I really wanted

Tagged with: , , , , , ,

A Story About Open Data (and Snow)

Open access to publicly-funded research accelerates discovery and progress

By Jen Caltrider, Global Campaigner, Mozilla

It’s Tax Day. Let me tell you a story about my buddy Erik, snow, your taxes, and open data.

Here in Colorado where I live, people love snow. It fuels

Tagged with: , , , ,

‘Bring Open Data to Your School’: connecting teens with open data in Argentina

With access to more data about their communities, teenagers can use it to shape decisions and plan their futures. Supported by the ODI’s mini-grants programme, the Argentine Government set out to empower young people with tools to find and compare information that could make a difference


Photo: Esther Vargas

By Marisol Parnofiello

How can teenagers be empowered to generate and use data about their communities? This is something we at the Open Data and Innovation team in the Argentine government have been focusing on recently, with our ‘Bring open data to your school’ programme and application.

The problem

Opening data is only worthwhile if it is used. That is why we do everything we can to connect open data with citizens, helping them to realise that open data is a powerful resource for understanding their environment and for making better decisions every day.

We think that open data could be particularly useful for teenagers. In such a crucial time of their lives – when they have to make so many decisions about education and employment – having easy access to data could help enormously.

To enable this, we need to increase young people’s knowledge of open data and their data literacy skills to use it. We also need to bring them the right open data to inform them of their communities. We decided that developing an application would be the best way to achieve this, since many young people are already familiar with that technology.

Building the app

To help build the application, we engaged industry experts including digital agency Aerolab, to help design user experience for teens, and educational foundation Eidos, to design open data learning activities around the app.

Our aim was to create a simple and responsive tool that could be used for young people to search, find and compare information. The application drew upon data from the 2010 Argentine Census of Population and Housing Census, to allow teenagers using the app to compare:

  • their local reality versus the reality of different places around the country
  • their perception versus the reality

The app is structured around questions to help students apply this data to relatable issues, such as:

What percentage of teenagers between 15 and 18 years old do you think go to school in your neighbourhood?

Users are asked to answer the question with a number or percentage based on their knowledge or assumptions. The app then compares the user’s guess and the true answer, showing the data in an understandable way through a data visualisation. Pretty simple!

During the project, the government team also engaged in regular, remote mentoring sessions with the ODI team in London, who offered feedback and advised on project implementation.

Testing the app

After four months in development, the Eidos team launched the app with a sample group of 20 adolescents in December 2016. The session was held at La Casa Nacional del Futuro, a new innovative centre designed to gather young people together to improve their technical knowledge and skills.

Three mentors from Eidos began the session by describing what open data is and how important it is to make decisions based on evidence. Later, teens tried the app and created visualisations with data that were then shared on social media.

During the session, teens enjoyed using the app to compare data about their neighbourhoods with other places in the country, especially those ones far away from the capital, Buenos Aires. Tomas, a student at the session, described his reactions to the data he saw:

I was really surprised [to learn] about how many children go to school in my neighbourhood. I thought it was almost the half the correct number. That was shocking.

Another student, Ludmila, noted that “there is so much more information than I thought”, and that “there are many possibilities [with open data]”.

What we learned

Young people are really interested in open data, and want to use it to help make decisions. We’d recommend that other organisations working with young people consider how they can introduce data in a way that is relevant to young people, focusing on topics they understand, like sports or food.

We want to reach more young people to help them experience the world of open data. As a result, Eidos and the Open Data and Innovation team will be creating a guide, so mentors and teachers elsewhere in the country can deploy the same activities in their own classrooms. Watch this space!

Bring Open Data to Your School has been supported by the ODI’s mini-grant programme, which included remote mentoring and a grant of £6,500 to support the development and implementation of the project. The project was supported by the Open Data for Development (OD4D) programme, a partnership funded by Canada’s International Development Research Centre (IDRC), the World Bank, United Kingdom’s Department for International Development (DFID), and Global Affairs Canada (GAC).

Marisol Parnofiello is UX Content Manager of the Argentine Government Open Data Team. Follow @datosgobar on Twitter

If you have ideas or experience in open data that you’d like to share, pitch us a blog or tweet us at @ODIHQ

Tagged with: , , ,