UC San Diego
COGS 137 - Fall 2024
2024-10-24
Q: I am confused on how to draw meaning from the combined data visualizations because I don’t understand what difference between variables like thc and thcooh are (and the other compounds).
A: This is something I want groups to discuss/look into! And, we’ll discuss more of this today. But, also, check out this piazza post.
Q: Would sensitivity and specificity be related to type I and type II errors?
A: Yes! Type I Error (measured as 1-Specificity) is known as the false positive rate. Type II Error (1- Sensitivity) is also known as the false negative rate. These concepts are directly related.
About the CSS MS Program:
UC San Diego’s one-year M.S. in Computational Social Science combines coursework approaches and formal models across social science disciplines with modern computational data analysis techniques. With a hands-on curriculum involving a summer bootcamp, core foundational training, in-depth electives across fields, and a portfolio-building capstone project, this program provides substantive and wide-ranging practice applying skill sets to real-world problems preparing graduates for careers in industry, public policy, education, non-profits, or for further academic study in a Ph.D. program.
Due Dates:
Notes:
Note: We’ll discuss oral communication closer to the end of the quarter, when you’ll have to present out loud.
❓ What does it mean to “consider your audience?”
Simply: You do the work so they don’t have to.
…also the aesthetic-usability effect exists.
General Audience
✔ background
🚫 limit technical details
🎉 emphasize take-home
Technical Audience
⬇ limit background
💻 all-the-details
🎉 emphasize take-home
On presentations: Balance b/w short and informative (goal: concise)
Avoid: “Analyzing NHANES”
Better: “Data from the NHANES study shows that diet is related to overall health”
On visualizations: emphasize the take-home! (what’s learned or what action to take)
Avoid: “Boxplot of gender”
Better: “Twice as many females as males included for analysis”
Avoid: “Tickets vs. Time”
Better: “Staff unable to respond to incoming tickets; need to hire 2 FTEs”
Your audience has time to process…but the explanation has to be there!
Visually: more on a single visualization
Yes, often there are different visualizations for reports/papers than for presentations/lectures.
❓ What makes this an effective visualization for a written communication?”
Source: Storytelling wtih data by cole nussbaumer knaflic
---
title: "Document Title"
output:
html_document:
toc: true
toc_float: true
---
---
title: "Document Title"
output:
html_document:
theme: united
highlight: tango
---
---
title: "Document Title"
output:
html_document:
fig_width: 7
fig_height: 6
fig_caption: true
---
---
title: "Document Title"
output:
html_document:
code_folding: hide
---
eval
: whether to execute the code chunkecho
: whether to include the code in the outputwarning
, message
, and error
: whether to show warnings, messages, or errors in the knit documentfig.width
and fig.height
: control the width/height of plotsknitr::opts_chunk$set(fig.width = 8, collapse = TRUE)
When are citations needed?
“We will be doing our analysis using two different data sets created by two different groups: Donohue and Mustard + Lott, or simply Lott”
“What turned from the idea of carrying firearms to protect oneself from enemies such as the British monarchy and the unknown frontier of North America has now become a nationwide issue.”
“Right to Carry Laws refer to laws that specify how citizens are allowed to carry concealed handguns when they’re away from home without a permit”
“In this case study, we are examining the relationship between unemployment rate, poverty rate, police staffing, and violent crime rate.”
“In the United States, the second amendment permits the right to bear arms, and this law has not been changed since its creation in 1791.”
“The Right to Carry Laws (RTC) is defined as”a law that specifies if and how citizens are allowed to have a firearm on their person or nearby in public.””
Reminder: You do NOT get docked points for citing others’ work. You can be at risk of AI Violation if you don’t. When in doubt, give credit.
How to specify a footnote in text:
Here is some body text.[^1]
How to include the footnote’s reference:
[^1]: This footnote will appear at the bottom of the page.
ggplot(penguins, aes(y = fct_rev(fct_infreq(species)), fill = species)) +
geom_bar() +
geom_text(stat='count', aes(label=after_stat(count)), hjust = 1.5, color = "white", size = 7) +
scale_x_continuous(expand = c(0, 0)) +
scale_fill_manual(values = c("#454545", rep("#adadad", 2))) +
labs(title = "Adelie Penguins are the most common in Antarctica",
subtitle = "Frequency of each penguin species studied near Palmer Station, Antarctica") +
theme_minimal(base_size = 14) +
theme(axis.text.x = element_blank(),
plot.title.position = "plot",
panel.grid.major = element_blank(), panel.grid.minor = element_blank(),
axis.title = element_blank(),
legend.position = "none")
Note
A reminder that when you’re doing EDA, you’re going to generate a lot of plots. Don’t spend your time making all of them beautiful. Save that for the “editing” portion of your case study. Once you know which are needed to tell your story, add them in.