Details from the EPA.
Format
A data frame with 1129 observations on the following 28 variables.
- model_yr
Model year, a numeric vector.
- mfr_name
Manufacturer name.
- division
Vehicle division.
- carline
Vehicle line.
- mfr_code
Manufacturer code.
- model_type_index
Model type index.
- engine_displacement
Engine displacement.
- no_cylinders
Number of cylinders.
- transmission_speed
Transmission speed.
- city_mpg
City mileage.
- hwy_mpg
Highway mileage.
- comb_mpg
Combined mileage.
- guzzler
Whether the car is considered a "guzzler" or not, a factor with levels N and Y.
- air_aspir_method
Air aspiration method.
- air_aspir_method_desc
Air aspiration method description.
- transmission
Transmission type.
- transmission_desc
Transmission type description.
- no_gears
Number of gears.
- trans_lockup
Whether transmission locks up, a factor with levels N and Y.
- trans_creeper_gear
A factor with level N only.
- drive_sys
Drive system, a factor with levels.
- drive_desc
Drive system description.
- fuel_usage
Fuel usage, a factor with levels.
- fuel_usage_desc
Fuel usage description.
- class
Class of car.
- car_truck
Car or truck, a factor with levels car, 1, and 2.
- release_date
Date of vehicle release.
- fuel_cell
Whether the car has a fuel cell or not, a factor with levels N and Y.
Examples
library(ggplot2)
library(dplyr)
# Variable descriptions
distinct(epa2012, air_aspir_method_desc, air_aspir_method)
#> # A tibble: 4 × 2
#> air_aspir_method_desc air_aspir_method
#> <fct> <fct>
#> 1 Naturally Aspirated NA
#> 2 Turbocharged TC
#> 3 Supercharged SC
#> 4 NA NA
distinct(epa2012, transmission_desc, transmission)
#> # A tibble: 7 × 2
#> transmission_desc transmission
#> <fct> <fct>
#> 1 Manual M
#> 2 Automated Manual AM
#> 3 Semi-Automatic SA
#> 4 Automatic A
#> 5 Selectable Continuously Variable (e.g. CVT with paddles) SCV
#> 6 Continuously Variable CVT
#> 7 Other OT
distinct(epa2012, drive_desc, drive_sys)
#> # A tibble: 5 × 2
#> drive_desc drive_sys
#> <fct> <fct>
#> 1 2-Wheel Drive, Rear R
#> 2 All Wheel Drive A
#> 3 2-Wheel Drive, Front F
#> 4 4-Wheel Drive 4
#> 5 Part-time 4-Wheel Drive P
distinct(epa2012, fuel_usage_desc, fuel_usage)
#> # A tibble: 7 × 2
#> fuel_usage_desc fuel_usage
#> <fct> <fct>
#> 1 Gasoline (Premium Unleaded Recommended) GP
#> 2 Gasoline (Premium Unleaded Required) GPR
#> 3 Gasoline (Regular Unleaded Recommended) G
#> 4 Electricity EL
#> 5 Diesel DU
#> 6 Gasoline (Mid Grade Unleaded Recommended) GM
#> 7 Hydrogen H
# Guzzlers and their mileages
ggplot(epa2012, aes(x = city_mpg, y = hwy_mpg, color = guzzler)) +
  geom_point() +
  facet_wrap(~guzzler, ncol = 1)
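
The faceted plot above compares mileage for guzzlers and non-guzzlers visually. A numeric summary can complement it. The following is a minimal sketch, assuming `epa2012` is provided by the openintro package and that the three mileage columns are numeric; the object name `mileage_by_guzzler` and the summary column names are illustrative only.

```r
library(dplyr)
library(openintro)  # assumption: epa2012 ships with the openintro package

# Count vehicles and average the city, highway, and combined mileage
# within each level of the guzzler factor.
mileage_by_guzzler <- epa2012 %>%
  group_by(guzzler) %>%
  summarise(
    n = n(),
    mean_city_mpg = mean(city_mpg, na.rm = TRUE),
    mean_hwy_mpg = mean(hwy_mpg, na.rm = TRUE),
    mean_comb_mpg = mean(comb_mpg, na.rm = TRUE)
  )

mileage_by_guzzler
```

The `na.rm = TRUE` argument is a precaution in case any mileage values are missing; it has no effect when the columns are complete.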