Gittner, Kevin B., Lauren M. Matheny, Gregory D. Balkcom, et al. 2026. “Data Quality in Online Crowdsourced Surveys: Methodological Challenges and Analytic Insights in Survey Research.” Survey Practice 20 (May). https://doi.org/10.29115/SP-2026-0017.
Download all (3)
  • Figure 1. Summary of all responses received by data collection platform.
  • Figure 2. Percentage of good-quality data by data collection platform (n = 2,315).
  • Figure 3. Distribution of missed data quality checks by platform.

Abstract

Pay-for data collection platforms are increasingly popular in health sciences, due to the ease of use, accessibility, and speed of data collection. However, data quality among platforms may differ. The purpose of this study was to compare data quality from three commonly used, pay-for data collection crowdsourcing platforms, using predefined quality control measures. Data were collected using three major platforms, Qualtrics (N = 705), SurveyMonkey (N = 576), and Amazon Mechanical Turk (N = 1,034), for the methodological purpose of assessing data quality for a scale used by public health practitioners to assess ankle activity level. Data were collected using attempted quota sampling for gender, age, ethnicity and region, for representativeness of the general US population. Twenty quality checks in five categories were utilized to identify poor-quality data: attention verification (e.g. ‘please select response B’); demographic verification (e.g. state and zip code match); illogical responses (e.g. respondents separately reporting higher difficulty walking up one flight of stairs than walking up four flights); honesty/reliability verification (e.g. male reported to be pregnant); and assessment of open-ended questions (rejected based on similarity of content or nonsensical responses). Of the 2,315 completed responses received from the three different platforms, 947 (40.91%) passed our quality checks and were considered good-quality responses. SurveyMonkey yielded the highest proportion of good-quality data (70.5%), followed by Qualtrics (61.8%) and Amazon Mechanical Turk (10.2%); however, open-ended responses could not be included in SurveyMonkey due to their 80-item limit, which may have inflated its proportion of good-quality responses. This study demonstrates that data quality issues may differ across platforms and a variety of quality control measures should be implemented for researchers utilizing pay-for data collection services from crowdsourcing platforms.

Accepted: March 23, 2026 EDT