Warning: package 'ggplot2' was built under R version 4.3.3
Warning: package 'tidyr' was built under R version 4.3.3
Warning: package 'readr' was built under R version 4.3.3
Warning: package 'dplyr' was built under R version 4.3.3
Warning: package 'stringr' was built under R version 4.3.3
Warning: package 'lubridate' was built under R version 4.3.3
β Quiz is due Tuesday (see Moodle)
β Homework later this week
β Statistics experience released
β Expect take-home grades by this Wednesday
β Final Exam is Dec 8th at 3:30 (expect an email this afternoon)
Warm up
A company HR department wants to know if the type of coffee an employee prefers (Black or With Milk/Cream) is independent of their primary method of commuting (Car or Public Transit).
What is \(H_o\), \(H_a\)?
Assuming the assumptions are met, whatβs my statistic and distribution?
Warm up
A veterinary nutritionist wants to know if a new vitamin supplement increases the average distance a dog can run on a treadmill before tiring. They decide to test two different breeds: Border Collies and Beagles.
The goal is to test if the mean running distance of Border Collies is significantly different from that of Beagles.
What is \(H_o\), \(H_a\)?
Assuming the assumptions are met, whatβs my statistic and distribution?
Questions
Learning objectives
What can we do when working with two quantitative variables?
β Summary statistics?
β What is simple linear regression?
β How a line of best fit is made
β How to talk about the line of best fit
We are reviewing exploratory data analysis
We will move to statistical inference in the following lesson
Airquality
We are going to look at daily air quality measurements in New York from May through September in 1973. Specifically, we are going to look at the relationship between wind (mph) and temperature (F).
β Can I analyze these data using difference in means?
β Difference in proportions?
β What kind of plot can we make?
Plot the data
How can we summarize these data?
Summary statistics
β correlation (r)
β slope + intercept (fit a line)
Correlation
β Is bounded between [-1, 1]
β Measures the strength + direction of a linear relationship