Simulated data for analyzing the relationship between watching TV and grades
Source: R/data-gradestv.R
This simulated dataset is intended for estimating the relationship between the number of hours per week students watch TV and the grades they received in a statistics class.
Format
A data frame with 25 observations on the following 2 variables.
- tv
Number of hours per week students watch TV.
- grades
Grades students got in a statistics class (out of 100).
Details
There are a few potential outliers in this dataset. When analyzing the data, one should consider how (if at all) these outliers affect the estimates of the correlation coefficient and the regression parameters.
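One way to explore this is to compare the estimates computed on all observations with those computed after setting aside points flagged as influential. The following is a minimal sketch, not a prescribed method: the helper name `compare_outlier_effect` is hypothetical, and the Cook's distance cutoff of 4/n is a common rule of thumb assumed here for illustration, not something specified by this dataset's documentation.

```r
# Sketch: compare correlation and regression estimates with and without
# observations flagged as influential.
# Assumption: a Cook's distance above 4/n marks a point as a potential
# outlier. This cutoff is a rule of thumb, not part of the dataset docs.
compare_outlier_effect <- function(data) {
  # Fit the simple linear regression on all observations
  fit_all <- lm(grades ~ tv, data = data)

  # Flag observations whose Cook's distance exceeds 4/n
  flagged <- cooks.distance(fit_all) > 4 / nrow(data)
  trimmed <- data[!flagged, , drop = FALSE]

  # Refit on the observations that were not flagged
  fit_trimmed <- lm(grades ~ tv, data = trimmed)

  list(
    n_flagged    = sum(flagged),
    cor_all      = cor(data$tv, data$grades),
    cor_trimmed  = cor(trimmed$tv, trimmed$grades),
    coef_all     = coef(fit_all),
    coef_trimmed = coef(fit_trimmed)
  )
}

# Usage, assuming `gradestv` is already loaded in the session:
# compare_outlier_effect(gradestv)
```

A large change in the slope or in the correlation between the two fits suggests the flagged points are driving the estimates. Flagged points should be inspected rather than dropped automatically.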
Examples
library(ggplot2)
ggplot(gradestv, aes(x = tv, y = grades)) +
  geom_point() +
  geom_smooth(method = "lm")
#> `geom_smooth()` using formula = 'y ~ x'