This is a simulated dataset for photo classifications based on a machine learning algorithm versus what the true classification is for those photos. While the data are not real, they resemble performance that would be reasonable to expect in a well-built classifier.
Format
A data frame with 1822 observations on the following 2 variables.
- mach_learn
The prediction by the machine learning system as to whether the photo is about fashion or not.
- truth
The actual classification of the photo by a team of humans.