Skip to contents

Details for all 19,961 runners in the 2017 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC. Most runners participate in a 10-mile run while a smaller fraction take part in a 5k run or walk.

Usage

run17

Format

A data frame with 19,961 observations on the following 9 variables.

bib

Number on the runner's bib.

name

Name of the runner, with only the initial of their last name.

sex

Gender of the runner.

age

Age of the runner.

city

Home city of the runner.

net_sec

Time to complete the race, after accounting for the staggered starting time, in seconds.

clock_sec

Time to complete the race, ignoring the staggered starting time, in seconds.

pace_sec

Average time per mile, in seconds.

event

The event the racer participated in, either the "10 Mile" race or the "5K".

Details

There was a time limit where all 10 Mile racers had to finish by. Can you figure out what that time is?

Examples


library(ggplot2)

# Finishing times
ggplot(run17, aes(x = net_sec)) +
  geom_histogram(binwidth = 300) +
  facet_wrap(~event, nrow = 2) +
  labs(
    title = "Finishing times for 2017 Cherry Blossom Run, by event",
    subititle = "After accounting for the staggered starting time",
    x = "Time to complete the race, in seconds",
    y = "Frequency"
    )
#> Warning: Removed 3 rows containing non-finite values (stat_bin).


# Pacing
ggplot(run17, aes(x = pace_sec)) +
  geom_histogram(binwidth = 100) +
  facet_wrap(~event, nrow = 2, scales = "free_y") +
  labs(
    title = "Pacing for 2017 Cherry Blossom Run, by event",
    x = "Average time per mile, in seconds",
    y = "Frequency"
    )
#> Warning: Removed 3 rows containing non-finite values (stat_bin).