Workshop Description:
In an ideal world, any data you collect or obtain would be clean and formatted perfectly for analysis and visualization. But the reality is that data can be really messy! Cleaning and reformatting your data can be a time-… Read more.
OpenRefine
To build your skills in OpenRefine, we suggest following our tutorials in this order:
i. (Optional) OpenRefine Installation Instructions
ii. OpenRefine Tutorial 1. Survey of Household Spending Activity
iii. OpenRefine Tutorial 2. Citizen Science Activity
iv. OpenRefine Tutorial 3. Regular Expressions (Regex) Activity
v. OpenRefine Tutorial 4. 311 Calls Activity
vi. OpenRefine Augmenting Activity 1: Preparing the data
vii. OpenRefine Augmenting Activity 2: Using Reconciliation Services
viii. OpenRefine Augmenting Activity 3: Using Add Column by Fetching URLs
ix. OpenRefine Augmenting Activity 4: Using Python
Please note that we also have converted some of this tutorial into a self-paced course with videos. U of T students, staff, and faculty can enroll in our OpenRefine Quercus course.
Course Description:
This workshop will provide an introduction to OpenRefine, a powerful open source tool for exploring, cleaning and manipulating “messy” data. Through hands-on activities, using a variety of datasets, participants will learn how to… Read more.
Getting started
Please visit our tutorial on installing OpenRefine
Please note that all computers in the Map and Data Library (on the fifth floor of Robarts Library), and the computer labs on the fourth and fifth floor of Robarts Library have… Read more.
This tutorial has been developed for OpenRefine version 3.7.5
Sometimes when you construct an API call and use the Add Column by Fetching URLs feature, it won’t work. In these cases, you can use python to help. So far we’ve been writing GREL… Read more.
This tutorial has been developed for OpenRefine version 3.7.5
You may have noticed from Activity 2 that sometimes you searched for a property to add from Wikidata, but got an error. Other times you might want to augment your dataset with data that… Read more.
This tutorial has been developed for OpenRefine version 3.7.5
To learn more about reconciliation services and how you can use them to augment your data, check out the official OpenRefine guide to reconciling Feel free to read just the introduction… Read more.
We are going to work with a different dataset for the next few activities. In order to start augmenting the dataset, we need to do a bit of preparation work first. This activity will showcase some new concepts and features for OpenRefine, as well as… Read more.
In this activity, you are going to:
Create a new project by making an API call to pull in data and parse the resulting JSON
Manipulate the data by using GREL date expressions and facet the data to make discoveries
Create a new project by… Read more.
In this activity, you are going to:
Open regex101.com and load some sample data
Practice some regex basics
Use regex in OpenRefine
Open regex101.com and load some sample data
1. Browse to the website regex101.com. The REGULAR EXPRESSION box at the… Read more.
In this activity, you are going to:
Create a new project from the citizen science dataset and use the clustering feature
Split and concatenate various columns in the dataset
Restructure the dataset by removing columns and rows, and then work with… Read more.