Determinants of Item Nonresponse to Web and Mail Respondents in Three Address-Based Mixed-Mode Surveys of the General Public

Benjamin Messer; Michelle Edwards; Don Dillman

doi:10.29115/SP-2012-0012

Three recent experiments demonstrate the efficacy of using mail contacts to convince address-based samples of general public households to respond to a Web survey (Smyth et al. 2010; Messer and Dillman 2011). Results show that mailing the Web request first to respondents, followed by a paper questionnaire at a later date, produced Web responses from over one-third of sampled respondents; the paper follow-up resulted in an additional 14-18%. This “Web+mail” design also obtained demographically dissimilar respondents via each mode: Web respondents were significantly younger with higher levels of income and education than mail respondents. Thus, it seems beneficial to offer mail as an alternative to Web to increase response rates and enhance the representativeness of respondents. However, research suggests that mail may also obtain higher item nonresponse than Web, which raises concerns about the additional value mail might add in this type of Web+mail mixed-mode design.

In this paper, we examine some of the determinants of item nonresponse for the Web and mail groups used in the three experiments reported in Smyth et al. (2010) and Messer and Dillman (2011). The experiments employ similar Web and mail combination strategies, questionnaire designs, and respondent demographic measures, making it possible to examine trends within and across the studies. Specifically, we report how survey mode, respondent demographics (e.g., gender, age, education, and income), and question format (e.g., close- and open-end) and type (e.g., attitudinal and factual) contribute to item nonresponse rates^[1].

Methods

In the three experiments, sampled households were obtained via the USPS Delivery Sequence File and were sent four mail contacts over the course of about three months. The experiments employed multiple Web+mail and mail-only^[2] groups to test the effects of mode sequence, cash incentives, Web instructions, and Priority Mail on response rates. In Web+mail treatment groups, we offered a Web request first to respondents, followed later by a mail alternative for those who did not respond initially via Web. In mail-only treatment groups, we offered only mail surveys throughout the data collection period. In this study, we combined data from all three experiments into one larger dataset. For example, the Web+mail mode reflects respondents from all Web+mail treatment groups across all three experiments. In addition, all experiments utilized a unified-mode design that provided quite similar visual layouts for Web and mail questionnaires and allowed respondents to skip questions without providing a response in both the Web and mail questionnaires (Dillman, Smyth, and Christian 2009).

Experiment 1, the 2007 Lewiston and Clarkston Quality of Life Survey (LCS), was conducted in a rural region in the Pacific Northwest (Smyth et al. 2010). Experiments 2 & 3, the 2008 Washington Community Survey (WCS) & the 2009 Washington Economic Survey (WES), were state-wide surveys of Washington households (Messer and Dillman 2011). The samples for Experiments 2 & 3 were stratified by urban and rural county, and post-stratification weights have been applied in analyses, as described in more detail in Messer and Dillman (2011).

Sample sizes and response rates for each experiment and survey mode are in Table 1. In all experiments, the mail-only treatment obtained higher unit response rates than the Web+mail treatment, while the Web+mail design obtained about two-thirds of its responses via the Web. Furthermore, respondent demographics and questionnaire characteristics are mostly consistent across the three experiments (see Messer, Edwards, and Dillman 2012): Web respondents are younger and have higher levels of education and income compared to mail-only and mail follow-up respondents. Each of the questionnaires employed the same question formats and types, although the number of each format and type varies across the experiments.

Table 1 Sample Sizes, Unit Response Rates¹, and Item Nonresponse Rates by Design and Mode, by Experiment.

				1st Mode			2nd Mode
Design	N (N’)²	Total Unit Response Rate %(n)	Total Number of Items³	Mode Used	Unit Response Rate % (n)	Item Nonresponse Rate	Mode Used	Unit Response Rate % (n)	Item Nonresponse Rate	Total Item Nonresponse Rate
LCS
Mail Only	800 (738)	66.3 (489)	92	Mail	64.4 (475)	5.0	Web	1.9 (14)	DNC⁵	5.0
Web+Mail	600 (566)	55.1 (312)		Web	40.8 (231)	2.7	Mail	14.3 (81)	6.2	3.6
WCS⁴
Mail Only	2200 (2069)	50.4 (1043)	110	Mail	49.2 (1017)	4.2	Web	1.3 (26)	DNC⁵	4.2
Web+Mail	3200 (2993)	40.1 (1200)		Web	25.0 (747)	2.7	Mail	15.1 (453)	6.9	4.2
WES⁴
Mail Only	1800 (1673)	62.2 (1040)	96	Mail	62.2 (1040)	8.1	Web	–	–	8.1
Web+Mail	2100 (1932)	50.2 (969)		Web	32.6 (630)	6.1	Mail	17.5 (339)	11.6	8.0

Notes: ¹Response rate = number of completes (I+P) / N’, where N’ = [(I+P) + (R+NC+O) + (UH+UO)] - undeliverables (AAPOR 2009); ²N’ = N - undeliverables, where undeliverables are the sampled addresses that were no longer in service, as determined by mailings returned to the sender; ³Number varies per respondent depending on branching questions; ⁴Weighted data; ⁵DNC = did not calculate due to small sample size.
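The unit response rate defined in note 1 can be checked against the totals in Table 1. The following is a minimal sketch, not the authors' code; the function name and the dictionary layout are illustrative assumptions, while the counts are copied from Table 1.

```python
def unit_response_rate(completes: int, eligible: int) -> float:
    """Percent of eligible addresses (N' = N - undeliverables) that returned a complete."""
    return round(100.0 * completes / eligible, 1)

# design label -> (N', total completes), copied from Table 1
table1 = {
    "LCS mail-only": (738, 489),
    "LCS Web+mail": (566, 312),
    "WCS mail-only": (2069, 1043),
    "WCS Web+mail": (2993, 1200),
    "WES mail-only": (1673, 1040),
    "WES Web+mail": (1932, 969),
}

rates = {label: unit_response_rate(c, n) for label, (n, c) in table1.items()}
for label, rate in rates.items():
    print(f"{label}: {rate}%")
```

Dividing the completes by N’ in this way reproduces the total unit response rates reported in Table 1 for all six designs.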

Item nonresponse rates are calculated the same way for mail and Web versions in each experiment. For each respondent, we divided the number of missing responses by the total number of possible complete responses and multiplied by 100. The total number of possible complete responses varied based on how respondents answered the branching questions, with the total number of items ranging between 92 and 110 (see Table 1). We calculated overall rates by averaging individual rates for a particular mode. An item was coded as missing only if the respondent provided no answer to it; only unanswered items counted toward item nonresponse. Non-substantive (i.e., “don’t know” or “not sure”) or incorrect responses were considered to be responses in these analyses. Respondents who completed fewer than 50 percent of the items were dropped as “partial completes.”
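The calculation described above can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: the data layout (a dictionary of answers with None for a skipped item) and the function names are hypothetical, and the three respondents are invented.

```python
from statistics import mean

def item_nonresponse_rate(answers, applicable_items):
    """Percent of a respondent's applicable items left unanswered.

    Any recorded answer counts as a response, including "don't know".
    """
    missing = sum(1 for item in applicable_items if answers.get(item) is None)
    return 100.0 * missing / len(applicable_items)

def mode_average(respondents, min_completion=50.0):
    """Average the individual rates after dropping partial completes."""
    rates = [item_nonresponse_rate(r["answers"], r["applicable"]) for r in respondents]
    kept = [rate for rate in rates if 100.0 - rate >= min_completion]
    return mean(kept)

# Invented example: ten applicable items per respondent.
items = [f"q{i}" for i in range(1, 11)]
respondents = [
    # One skipped item; the "don't know" on q3 still counts as a response.
    {"applicable": items,
     "answers": {**{q: "yes" for q in items}, "q3": "don't know", "q7": None}},
    # Every item answered.
    {"applicable": items, "answers": {q: "no" for q in items}},
    # Only two of ten items answered: a partial complete, dropped from the average.
    {"applicable": items, "answers": {"q1": "yes", "q2": "no"}},
]

print(mode_average(respondents))
```

Because the denominator is the list of items applicable to each respondent, the sketch accommodates branching: a respondent routed past a screened block simply has a shorter list.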

Results

As shown in Table 1, item nonresponse rates were lower for the Web mode in each experiment (2.7-6.1 percent) than for mail-only (4.2-8.1 percent) and the mail follow-up (6.2-11.6 percent). Item nonresponse rates are reported separately for mail used as a follow-up mode since respondents to this mode exhibit higher item nonresponse rates. Web item nonresponse rates are significantly lower than mail follow-up item nonresponse rates across all three experiments at the 0.01 level with the Bonferroni-Holm correction; for the WCS and the WES, the mail-only rates are also significantly lower than those obtained via mail follow-up (for more detail see Messer, Edwards, and Dillman 2012). These differences are shown graphically in Figure 1. The higher mail item nonresponse rates are likely a result of that mode obtaining respondents who are older with lower levels of education and income, which we test with multivariate analyses below. Also shown in Figure 1, when the Web and mail follow-up are combined, the total item nonresponse rates tend to be statistically similar to those obtained via mail-only (see Table 1).
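For readers unfamiliar with the Bonferroni-Holm correction, the step-down rule can be sketched as below. The p-values are hypothetical placeholders, not results from these experiments, and the function name is an assumption; only the 0.01 level comes from the text.

```python
def holm_reject(p_values, alpha=0.01):
    """Return True for each hypothesis rejected under Holm's step-down rule."""
    m = len(p_values)
    # Visit the hypotheses from the smallest p-value to the largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The k-th smallest p-value is compared with alpha / (m - k + 1).
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break  # once one test fails, all larger p-values are retained
    return reject

# Hypothetical p-values for three pairwise mode comparisons.
hypothetical_p = [0.0004, 0.0200, 0.0030]
print(holm_reject(hypothetical_p))  # [True, False, True]
```

The procedure is less conservative than a plain Bonferroni adjustment because only the smallest p-value faces the full alpha / m threshold.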

Figure 1 Item Nonresponse Rates by Mode, by Experiment.

Demographic Analyses

Table 2 displays the results of bivariate and multivariate Ordinary Least Squares (OLS) regression models predicting item nonresponse rates (i.e., the dependent variable) by survey mode and individual demographic characteristics for each experiment. In Models 1, 3, and 5, we only included survey mode as a predictor of item nonresponse rates. These variables were all statistically significant at the 0.05 level or lower. In Models 2, 4, and 6, we included survey mode and controlled for demographic characteristics. Using global F-tests, we found these models to all be significant improvements over the models with only survey mode. Survey mode continues to be statistically significant even when controlling for demographic characteristics, with one exception: the mail follow-up coefficient in the LCS (Model 2). On average, Web respondents tend to have significantly lower item nonresponse rates and mail follow-up respondents tend to have significantly higher item nonresponse rates than mail-only respondents, holding demographics constant. Demographic comparisons vary across the three experiments, but education and age tend to be consistent significant predictors of item nonresponse. With each additional year of age, the item nonresponse rate increases by about 0.10 to 0.13 percentage points, holding other variables constant. Compared to respondents with a high school degree or less, respondents with at least some college tend to have lower item nonresponse rates, holding other variables constant.
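The comparison of a mode-only model with a model that adds controls can be illustrated with a small nested-model F-test. This is a sketch on synthetic data, assuming a single added control (age) and invented effect sizes; it does not reproduce the models in Table 2, and the helper functions are written out by hand so that the example needs only the standard library.

```python
import random

def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def ols(X, y):
    """Return (coefficients, residual sum of squares) via the normal equations."""
    k = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(k)]
    beta = solve(xtx, xty)
    rss = sum((yi - sum(b * x for b, x in zip(beta, row))) ** 2
              for row, yi in zip(X, y))
    return beta, rss

def nested_f(rss_restricted, rss_full, extra_params, n, k_full):
    """F statistic for the improvement of the full model over the restricted one."""
    return ((rss_restricted - rss_full) / extra_params) / (rss_full / (n - k_full))

# Synthetic respondents (all values invented for illustration).
rng = random.Random(0)
rows, y = [], []
for i in range(300):
    mode = i % 3  # 0 = mail-only (reference), 1 = Web, 2 = mail follow-up
    web, followup = int(mode == 1), int(mode == 2)
    age = rng.uniform(20, 70) + (0, -5, 10)[mode]  # Web younger, follow-up older
    rows.append([1.0, web, followup, age])
    y.append(2.0 - 1.5 * web + 1.0 * followup + 0.1 * age + rng.gauss(0, 1))

b_mode, rss_mode = ols([r[:3] for r in rows], y)  # mode dummies only
b_full, rss_full = ols(rows, y)                   # mode dummies plus age
f_stat = nested_f(rss_mode, rss_full, extra_params=1, n=len(y), k_full=4)
print("mode-only coefficients:", [round(b, 2) for b in b_mode])
print("full-model coefficients:", [round(b, 2) for b in b_full])
print("F statistic for adding age:", round(f_stat, 1))
```

Because age is built to differ by mode in the synthetic data, the mail follow-up coefficient shrinks once age is controlled, which is the same qualitative pattern seen when moving from the bivariate to the multivariate models.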

Table 2 Bivariate and Multivariate OLS Regression Models¹ Predicting Item Nonresponse Rates by Survey Mode and Respondent Demographic Characteristics, by Experiment.

	LCS		WCS²		WES²
Mode	Model: 1	Model: 2	Model: 3	Model: 4	Model: 5	Model: 6
Mail-only	Reference	Reference	Reference	Reference	Reference	Reference
Web (of Web+Mail)	-1.81***(.476)	-1.19** (.444)	-1.27***(.207)	-0.55**(.179)	-2.17***(.328)	-1.49***(.313)
Mail (of Web+Mail)	1.52*(.741)	0.18(.695)	2.12*** (.434)	1.20**(.402)	2.97***(.631)	1.74** (.603)
Demographics³
Female	–	-0.46(.374)	–	0.63**(.204)	–	-0.17 (.350)
Age	–	0.13***(.011)	–	0.10*** (.009)	–	0.12***(.012)
HS or less	–	Reference	–	Reference	–	Reference
Some college, no deg.	–	-1.20*(.473)	–	-2.55***(.463)	–	-0.44(.519)
2-,4-Yr., Grad/prof deg.	–	-1.83*** (.473)	–	-2.76***(.406)	–	-1.57***(.366)
Less than $50K	–	Reference	–	Reference	–	Reference
$50K to less than $100K	–	-0.54(.436)	–	-0.87***(.238)	–	-0.69(.412)
$100K or more	–	-0.21(.695)	–	-0.79** (.250)	–	-1.3**(.411)
Prefer not to say	–	0.82(.609)	–	-0.58(.397)	–	-0.28(.618)
R²	0.02***	0.17***	0.05***	0.19***	0.05***	0.15***
N	991	991	2143	2143	1901	1901

Notes: *p≤.05; **p≤.01; ***p≤.001; ¹Unstandardized coefficients reported (standard errors in parentheses); ²Weighted data; ³Female is coded dichotomously, where 1=female respondents; Age is coded as a continuous variable; Education is coded ordinally with three categories (high school or less; some college but no degree; and 2-year, 4-year, or graduate/professional degree); Income is coded nominally with four categories (less than $50,000; $50,000 to less than $100,000; $100,000 or more; and “prefer not to say”).

Question Analyses

Table 3 displays results of OLS regression models predicting item nonresponse rates by survey mode and question characteristics for each experiment. We first conducted analyses with only survey mode as the predictor (Models 1, 3, & 5), and then ran models with question characteristics as controls. As before, global F-tests indicate that our second models controlling for questionnaire characteristics (2, 4, & 6) are significant improvements over the models with only survey mode variables. In Models 2, 4, and 6, survey mode was statistically significant even when controlling for questionnaire characteristics, with one exception: the mail follow-up coefficient in the LCS (Model 2). On average, Web respondents tend to have significantly lower item nonresponse rates and mail follow-up respondents tend to have significantly higher item nonresponse rates than mail-only respondents, holding questionnaire characteristics constant. In terms of questionnaire characteristics, the trends vary somewhat across the three experiments but some trends are consistent: screened, multi-item, and other factual questions tend to be significant predictors of item nonresponse across all three experiments in similar directions. Screened questions have higher item nonresponse rates than non-screened questions, even holding survey mode and other question characteristics constant. Similarly, multi-item questions have higher item nonresponse rates than single-item questions, controlling for other variables. Finally, other non-demographic factual questions have lower item nonresponse rates than demographic questions, holding other variables constant.
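The N values in Table 3 (276, 330, and 288) equal three times the item counts in Table 1 (92, 110, and 96), which is consistent with one observation per item per mode group. The text does not state this explicitly, so the sketch below treats it as an assumption. It shows how respondent-level answers could be collapsed into such item-by-mode rows with dummy-coded predictors; the item names, traits, and answers are invented.

```python
def item_by_mode_rates(responses):
    """Collapse respondent answers into percent missing per (item, mode).

    responses: list of (mode, {item: answer or None}); items a respondent was
    never routed to are simply absent from that respondent's dictionary.
    """
    counts = {}
    for mode, answers in responses:
        for item, value in answers.items():
            asked, missing = counts.get((item, mode), (0, 0))
            counts[(item, mode)] = (asked + 1, missing + int(value is None))
    return {key: 100.0 * missing / asked for key, (asked, missing) in counts.items()}

# Hypothetical question characteristics (1 = has the trait).
item_traits = {
    "q1": {"screened": 0, "multi_item": 0},
    "q2": {"screened": 1, "multi_item": 1},
}

# Invented answers: two respondents in each mode group.
responses = [
    ("mail_only", {"q1": "yes", "q2": None}),
    ("mail_only", {"q1": "no", "q2": 3}),
    ("web", {"q1": "yes", "q2": 4}),
    ("web", {"q1": "yes", "q2": 2}),
    ("mail_followup", {"q1": None, "q2": None}),
    ("mail_followup", {"q1": "no", "q2": 5}),
]

rates = item_by_mode_rates(responses)

# One regression row per item per mode; mail-only is the reference category,
# so it is represented by zeros on both mode dummies.
rows = [
    {"item": item,
     "web": int(mode == "web"),
     "mail_followup": int(mode == "mail_followup"),
     **item_traits[item],
     "rate": rate}
    for (item, mode), rate in sorted(rates.items())
]
for row in rows:
    print(row)
```

Under this layout every item appears once in each mode group, so question characteristics are balanced across modes. That would explain why the mode coefficients in Table 3 are identical with and without the question controls while their standard errors shrink.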

Table 3 Multivariate OLS Regression Models¹ Predicting Item Nonresponse Rates by Question Type and Format, by Experiment.

	LCS		WCS		WES
Mode	Model: 1	Model: 2	Model: 3	Model: 4	Model: 5	Model: 6
Mail-only	Reference	Reference	Reference	Reference	Reference	Reference
Web (of Web+Mail)	-3.78**(1.356)	-3.78**(1.116)	-2.67*(1.132)	-2.67**(.983)	-3.52(2.274)	-3.52*(1.583)
Mail (of Web+Mail)	1.11(1.356)	1.11(1.116)	3.03**(1.132)	3.03**(.983)	3.76(2.274)	3.76*(1.583)
Question Format³
Screened	–	10.17***(1.409)	–	6.92***(1.193)	–	6.87***(1.852)
Multi-Item	–	2.78*(1.134)	–	2.55**(.987)	–	15.26***(1.655)
Ordinal	–	-11.98***(1.709)	–	-13.14***(1.757)	–	7.52*(3.010)
Y/N	–	1.66(1.926)	–	-2.20(1.738)	–	0.33(2.994)
Other Nominal	–	-8.60***(2.041)	–	-11.31***(1.647)	–	-4.37(3.221)
Open-Ended	–	Reference	–	Reference	–	Reference
Question Type⁴
Demographic	–	Reference	–	Reference	–	Reference
Other Factual	–	-11.72***(2.292)	–	-6.17***(1.859)	–	-11.43***(2.078)
Attitudinal	–	2.31(1.878)	–	2.01(1.743)	–	-19.35***(2.477)
Behavioral	–	3.34(1.907)	–	1.03(1.633)	–	-29.44***(2.263)
R²	0.05***	0.38***	0.07***	0.32***	0.03**	0.55***
N	276	276	330	330	288	288

Notes: *p≤.05; **p≤.01; ***p≤.001; ¹Unstandardized coefficients reported (standard errors in parentheses); ³Screened items are coded dichotomously, where 1=items that followed a branching question; Multi-Item items are coded dichotomously, where 1=items that were part of a multi-item question (in which respondents were asked to provide answers to multiple items in the same question); Closed or Open-Ended items are coded nominally with four categories (ordinal, y/n, other nominal, and open-ended): Ordinal items contain answer categories that have a natural order (e.g., “Very Good” to “Very Poor”); Y/N items contain answer categories that are nominal and have the choices “yes” or “no”; Other Nominal items contain nominal answer categories that are not yes/no (e.g., marital status); Open-Ended items are those in which respondents are asked to write or enter their responses in a blank answer space; ⁴Question type is coded nominally with four categories (demographic, other factual, attitudinal, and behavioral): Demographic items ask about a factual, demographic characteristic of the respondent; Other Factual items ask about a factual, non-demographic characteristic of the respondent; Attitudinal items ask about the respondent’s attitude, opinion, or preference on a topic; and Behavioral items ask about the respondent’s behavior.

Conclusion

First, there appears to be a trade-off when using the Web+mail design. The mail follow-up increases response rates and attracts different types of respondents but obtains lower data quality in terms of item nonresponse. In each of the three experiments, combining Web and mail follow-up respondents resulted in item nonresponse rates statistically similar to those obtained by using mail alone. Second, there may be demographic sources of item nonresponse, net of those resulting from differential participation in Web and mail modes. For example, even when controlling for Web and mail mode of response, respondents who are older and have less education and income display higher rates of item nonresponse. Third, the results consistently show that question formats that require more effort than single-item, close-ended scale questions – branching, multi-item, and open-end questions – obtain higher rates of item nonresponse, net of mode of response.

It is likely that interactions between survey mode, demographics, and questionnaire characteristics contribute to variations in item nonresponse, but we are unable to conclusively determine if this is the case here. In addition, our studies are limited to regional and statewide samples from the Pacific Northwest and our measure of item nonresponse is made somewhat conservative by including all answers, whether invalid or not applicable, in the calculations and by setting the partial complete threshold at less than 50% of items answered.

Our overall conclusion from this analysis is that item nonresponse in a Web+mail survey design is not a major barrier to the use of this design for data collection.

Acknowledgments

Support for this research was provided by USDA-NASS and NSF-National Center for Science and Engineering Statistics, under a Cooperative Agreement to the Social and Economic Sciences Research Center (SESRC) at Washington State University. Additional support was provided by the SESRC.

[1] A Technical Report by Messer, Edwards, and Dillman (2012) provides additional details on study procedures and analyses.
[2] A “mail+Web” design was also employed but obtained so few Web respondents that we dropped them from analyses and refer to the design as “mail-only”.