Skip to contents

In 2004, the state of North Carolina released to the public a large dataset containing information on births recorded in this state. This dataset has been of interest to medical researchers who are studying the relation between habits and practices of expectant mothers and the birth of their children. This is a random sample of 1,000 cases from this dataset.

Usage

ncbirths

Format

A data frame with 1000 observations on the following 13 variables.

fage

Father's age in years.

mage

Mother's age in years.

mature

Maturity status of mother.

weeks

Length of pregnancy in weeks.

premie

Whether the birth was classified as premature (premie) or full-term.

visits

Number of hospital visits during pregnancy.

gained

Weight gained by mother during pregnancy in pounds.

weight

Weight of the baby at birth in pounds.

lowbirthweight

Whether baby was classified as low birthweight (low) or not (not low).

gender

Gender of the baby, female or male.

habit

Status of the mother as a nonsmoker or a smoker.

marital

Whether mother is married or not married at birth.

whitemom

Whether mom is white or not white.

See also

We do not have ideal provenance for these data. For a better documented and more recent dataset on a similar topic with similar variables, see births14.

Examples


library(ggplot2)

ggplot(ncbirths, aes(x = habit, y = weight)) +
  geom_boxplot() +
  labs(x = "Smoking status of mother", y = "Birth weight of baby (in lbs)")


ggplot(ncbirths, aes(x = whitemom, y = visits)) +
  geom_boxplot() +
  labs(x = "Mother's race", y = "Number of doctor visits during pregnancy")
#> Warning: Removed 9 rows containing non-finite outside the scale range
#> (`stat_boxplot()`).


ggplot(ncbirths, aes(x = mature, y = gained)) +
  geom_boxplot() +
  labs(x = "Mother's age category", y = "Weight gained during pregnancy")
#> Warning: Removed 27 rows containing non-finite outside the scale range
#> (`stat_boxplot()`).