Checks the supplied data for common pitfalls in the harmonization process
Source:R/ql_check.R
ql_check.Rd
ql_check()
performs plausbility checks on the supplied data. It prints messages
if inconsistencies in the data are found which may influence the results of the harmonization.
Details
Three checks are performed:
(1) Multiple versions of the same question:
Each question should be associated with one set of response options. If a one set of response options was found for one year (e.g. 1,2,3,4 in the year 2000) and another set was found in another year (e.g. 1,2,3 in the year 2004) there may be a problem with the data, such as missing cases or falsly exculded cases. Make sure to include only one version of the question, as the harmonization may lead to wrong results.
(2) Negative responses:
Negative values are used to represent missing values of some sort in most survey data files. Check whether negative values are valid responses in your data (and ignore the message if they are valid).
(3) Response options not used in scale:
Missing response options (e.g. in a 4 point scale 1,2,4, response option 3 would be missing), can lead to
false harmonization results. If response options are missing, check if responses were falsely excluded from the
dataset. This check utilizes the information supplied by the scale_min_max
argument in ql_prepare()
.
If scale_min_max
is left empty, the minimum and maximum found in the data will be used.