The following datafiles were used in creating the https://scari.sites.er.kcl.ac.uk/cpre/ analysis: ks4.parquet (6MB compressed, 200MB uncompressed) https://scari.sites.er.kcl.ac.uk/data/ks4.parquet ks5.parquet (7MB compressed, 200MB uncompressed) https://scari.sites.er.kcl.ac.uk/data/ks5.parquet These are apache arrow .parquet files, in this instance highly compressed .csv files. They cover all exam entries in English education institutions at key stage 4 and key stage 5 for the years 2012-2023. To open in Windows take a look at the following: https://github.com/mukunku/ParquetViewer Data used to create these files is from the English Department for Education's open datasets: https://get-information-schools.service.gov.uk/Downloads https://www.compare-school-performance.service.gov.uk All data is under the OGL 3.0 License: https://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/ To reference these files, please use the following: Contains English Department for Education public sector data modified by Peter Kemp for the Nuffield funded SCARI computing project, 2024, under the Open Government Licence v3.0. Bibtex entries @misc{kemp2014scaridata, title = {English school exam entries dataset 2012-2023}, author = {Peter Edward Joseph Kemp}, year = {2024}, note = {Contains English Department for Education public sector data modified by Peter Kemp under the Open Government Licence v3.0.}, howpublished = {https://scari.sites.er.kcl.ac.uk/} } @article{kemp2024future, title={The future of computing education: Considerations for policy, curriculum and practice}, author={Peter Edward Joseph Kemp and Billy Wong and Jessica Hamer and Megan Copsey-Blake}, publisher={{King's College London: United Kingdom}}, year={2024}, url={https://www.kcl.ac.uk/ecs/assets/kcl-scari-computing.pdf} }