
Data cleaning using OpenRefine


The process of generating data can be messy, especially when data are hand-collected by multiple people. This month's Data and Donuts will discuss how to wrangle messy tabular data using OpenRefine, a free, open-source tool for working with messy data. We will discuss the concepts of faceting, clustering, and splitting data. We will also show you how to export scripts to help you automate the cleaning process.


The date shown on the video should be 3/28/17.
Multiple sessions held on: 11/9/16, 3/28/17, 2/13/18.
Accessibility features: Transcript.



data management
Data editing
data education
Data curation

