Data Pre-Processing and Visualization

Open In Colab

Data Pre-Processing and Visualization#

Welcome to Data Pre-Processing and Visulaization. This text was developed from lecture notes from a course taught in the Spring of 2022 at East Central University in Ada Oklahoma.

The text will cover three main ideas:

  1. Gathering Data

  2. Cleaning Data

  3. Visualizing Data

Each section takes about a month to cover and was concluded with an exam (solutions to the exam are found in the back of the text) Each lecture contains some problems to attempt (with some selected solutions). These were utilized as daily quizzes although some took much more time than others. The course concluded with two projects, recreating the masters and examining recent visualizations.

Data sets utilized in the course can be found in the github repository for this text. Most pages include a link to google colab so you should be able to run the code there. You will need to create a BigQuery account to utilize the SQL section.

I do not suggest running two projects at once. In future iterations of the course, I may switch the order, first running Visualize Data, doing the project on examining recent visualizations and then proceed with Gathering and cleaning and conlude with recreating the masters.