Module 2

Data science lifecycle & Exploratory data analysis using visualization

Are you ready for some data visualisations? This week is all about exploring data with ggplot2 R package. We will also learn about the data science lifecycle.

Learning Objectives

  1. Learners can identify four components of a Quarto file (YAML, code chunk, R code, markdown).
  2. Learners can list the six elements of the data science lifecycle.
  3. Learners can describe the four main aesthetic mappings that can be used to visualise data using the ggplot2 R Package.
  4. Learners can control the colour scaling applied to a plot using colour as an aesthetic mapping.
  5. Learners can compare three different geoms (bar/col, histogram, point) and their use case.

Slides

View slides in full screen | Download slides as PDF

Readings

  1. Read R for Data Science - Whole game
  2. Read R for Data Science - Section 3 - Workflow basics
  3. Read R for Data Science - Section 2 - Data visualization

Assignments

Please complete the following assignment by Wednesday, 6th March 2024.

Thank you for working through this assignment