COGS 137: Practical Data Science in R

Course Info

Practical Data Science in R focuses on teaching students how to think rigorously throughout the data science process. To this end, through interaction with unique data sets and interesting questions, this course helps students 1) gain fluency in the R programming language, 2) effectively explore & visualize data, 3) use statistical thinking to analyze data and rigorously evaluate their conclusions, and 4) effectively communicate their results. Course objectives are accomplished through hands-on practice, using real-world data to learn via case studies, and project-based learning.

Days & Times

Lecture: Tu/Th 2-3:20 (CSB 002)
Lab: Mon 5-5:50 (DIB 121) or Fri 2-2:50 (RWAC 0103)

Instructional Staff & Office Hours
Instructor Shannon Ellis sellis@ucsd.edu Tu 11A-12P CSB 243
Wed 10-11A Zoom by appt
TA Quirine (Q) van-Engen Th 8:30-9:30A Zoom (see Canvas for link)
IA Eric Song

Course Objectives

  • Program at the introductory level in the R statistical programming language

  • Employ the tidyverse suite of packages to interact with, wrangle, visualize, and model data

  • Explain & apply statistical and machine learning concepts for data analysis

  • Communicate data science projects through effective visualization, oral presentation, and written reports

Texts

Texts are freely available online:

Introduction to Modern Statistics Çetinkaya-Rundel and Hardin OpenIntro, 2e, 2024
R for Data Science Wickham, Çetinkaya-Rundel, and Grolemund O’Reilly, 2e, 2023
Tidy Modeling With R Max Kuhn and Julia Silge O’Reilly, 1e, 2022

Materials

You should have access to a laptop and bring it to every class, fully charged (as possible).

Note: If you do not have consistent access to the technology needed, please use this form to request a loaner laptop. (For any issues that you may have, please email vcsa@ucsd.edu, and they will work to assist you.)

Acknowledgements

I want to first recognize Dr. Mine Çetinkaya-Rundeland for her unparalleled efforts in support of education and educators in data science, statistics, and R programming. This course website was adapted from her course websites. Some of our course materials are also from her datascienceinabox project. I am so very indebted to Mine! I also want to thank the Open Case Studies team for their tireless work in putting together interesting and topical case studies, a handful of which have been used throughout the course over the years. And, finally, thanks to Allison Horst, whose artwork is inspiring, educational, and fun…and is used throughout this course. Further, thanks to the greater R (education) community who make planning and teaching R courses simply the best.