1.1 Download the data

To get the most out of this guide, follow along by replicating the code and analysis presented throughout. You will need two datasets to do this. Use the links below to download both, then save the files to a project folder for this course.

  • DCPS testing.RData (download DCPS data here)
    This dataset from the DC Public School System records the results of the PARCC (Partnership for Assessment of Readiness for College and Careers) Assessment from 2017-2018. This version of the data includes each school's name, level, and number of students tested, as well as the percentage of students performing at or above grade level in language and math. You can find more information about the test at the DC PARCC results page.

  • biopics.xls (download biopics data here)
    This is a shortened version of the data behind the story “‘Straight Outta Compton’ Is The Rare Biopic Not About White Dudes,” published on fivethirtyeight.com. It contains IMDb data on 177 biopics released from 1915 to 2014. Variables include the sex and race of the lead actor at the center of the biopic and the year in which the film was released.
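
Once both files are saved, a quick check like the one below can confirm that R can see them. This is only a minimal sketch: it assumes your working directory is the project folder, that the files kept the names listed above, and that the readxl package (not mentioned in this guide, used here as one common way to read .xls files) may or may not be installed.

```r
# Assumed file names, taken from the list above; adjust if your downloads differ.
data_files <- c("DCPS testing.RData", "biopics.xls")

# Check whether each file is in the current working directory
# (assumed here to be your project folder).
found <- file.exists(data_files)
names(found) <- data_files
print(found)

# Load the .RData file only if it is present. load() returns the names of
# the objects it restores, which tells you what the data set is called.
if (found["DCPS testing.RData"]) {
  loaded_objects <- load("DCPS testing.RData")
  print(loaded_objects)
}

# Read the Excel file only if it is present and readxl is installed.
# Using readxl is an assumption; the course may use a different package.
if (found["biopics.xls"] && requireNamespace("readxl", quietly = TRUE)) {
  biopics <- readxl::read_excel("biopics.xls")
  print(dim(biopics))
}
```

If either entry prints `FALSE`, the file is probably in a different folder or was saved under a different name.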