Resources


Standards

Practices

Data Quality Review


“A data quality review is a process whereby data and associated files are assessed and required actions are taken to ensure files are independently understandable for informed reuse. This is an active process, involving a review of the files, the documentation, the data, and the code. We strongly believe that data quality cannot be realized without a data quality review.”
Peer, Green, & Stephenson, 2014

REVIEW FILES

 

  • Assign persistent IDs
  • Create a citation to the study and a study-level metadata record
  • Record file details (size, format, checksums)
  • Check that all files are present
  • Verify that content of files matches expected format
  • Create non-proprietary versions of the files
  • Implement migration strategy for file formats
REVIEW DATA

 

  • Check for undocumented variable and value information or out-of-range codes
  • Review data for confidentiality issues
REVIEW DOCUMENTATION

 

  • Confirm comprehensive descriptive information for informed reuse including methodology and sampling information
  • Link to other research products
REVIEW CODE

 

  • Check and verify code for data analysis and replication