Scientific journal paper Q3
A study on the quality of novel coronavirus (COVID-19) official datasets
Afshin Ashofteh (Ashofteh, A. ); Jorge Miguel Bravo (Bravo, J.);
Journal Title
Statistical Journal of the IAOS
Year (definitive publication)
2020
Language
English
Country
Netherlands
More Information
Web of Science®

Times Cited: 25

(Last checked: 2025-12-22 19:36)

View record in Web of Science®

Scopus

Times Cited: 31

(Last checked: 2025-12-18 22:06)

View record in Scopus


: 11.2
Google Scholar

Times Cited: 48

(Last checked: 2025-12-18 03:08)

View record in Google Scholar

This publication is not indexed in Overton

Abstract
Policy makers depend on complex epidemiological models that are compelled to be robust, realistic, defendable and consistent with all relevant available data disclosed by official authorities which is deemed to have the highest quality standards. This paper analyses and compares the quality of official datasets available for COVID-19. We used comparative statistical analysis to evaluate the accuracy of data collection by a national (Chinese Center for Disease Control and Prevention) and two international (World Health Organization; European Centre for Disease Prevention and Control) organisations based on the value of systematic measurement errors. We combined excel files, text mining techniques and manual data entries to extract the COVID-19 data from official reports and to generate an accurate profile for comparisons. The findings show noticeable and increasing measurement errors in the three datasets as the pandemic outbreak expanded and more countries contributed data for the official repositories, raising data comparability concerns and pointing to the need for better coordination and harmonized statistical methods. The study offers a COVID-19 combined dataset and dashboard with minimum systematic measurement errors, and valuable insights into the potential problems in using databanks without carefully examining the metadata and additional documentation that describe the overall context of data.
Acknowledgements
--
Keywords
SARS-CoV-2,Coronavirus disease (COVID-19),Measurement error,Data quality,Official statistics,Epidemiology,Health emergency,Public health
  • Mathematics - Natural Sciences
  • Economics and Business - Social Sciences
  • Other Social Sciences - Social Sciences