Your reading for today is Katie Rawson and Trevor Muñoz, “Against Cleaning.”
As you work with the class data and with your own data, Prepare Your Data for Tableau, chapters 6 – 10 provide further details on the different “cleaning” steps that we will discuss here.
So, what is data cleaning and why do you need to do it?
Think of data cleaning like editing and revising. The data file you are working with, whether you created it or downloaded it from someone else, was the first draft. Now you need to focus in on your thesis, rearrange paragraphs to make the argument easier to follow, clean up your grammar, and polish your prose.
Similarly, with data, you clean to focus in on the columns (variables) that help us answer our research question, reorganize to make it tidy, make the data consistent, and add some finishing touches, like data types, so that the data works well for visualization.
We are going to practice with a couple forms of data cleaning in class and your homework will be to practice on one of your datasets for your final project.
Continue reading “Lab 8 – Cleaning Data”