Introduction
The digital age allows researchers to conduct new forms of “participant observation” studies, providing detailed insights into the thoughts, behaviors, and attitudes of those they study (Lai et al. 2010). Such an approach is tested here using a mobile-based, repeated-measures design that combines several digital measurements, including repeated mobile phone surveys and visual data capture, to provide real-time information about respondents.
Given that mobile-based survey designs are still in the early stages of development, the analysis focuses on several aspects of respondent compliance, including participation rates over time and across different segments of the day, as well as potential longitudinal changes in data quality measures meant to capture the level of respondent engagement in the study.
Methods
In June and July 2010, a panel of nearly 400 South Africans was recruited across the country's four largest cities: Johannesburg, Pretoria, Durban, and Cape Town. Recruitment was a two-phase effort, with telephone outreach to a quota sample of potential respondents who fit defined age and employment targets, followed by face-to-face orientation sessions to familiarize respondents with the smartphone device and study guidelines.
Each respondent was equipped with a BlackBerry Curve running the Techneos SODA® survey application (Confirmit, Oslo, Norway). With the exception of the phone, camera, and survey capabilities, all other features were hidden and locked from respondent use to minimize potential changes in respondent behavior. Phones were programmed in one of four languages: Zulu, Sotho, Afrikaans, and English. An initial set of cell phone minutes was provided as an upfront incentive for participation. Panelists were also told they could keep the BlackBerry as a gift (an approximate value of US$85) if they completed all 5 weeks of the study.
The self-administered mobile survey was delivered five times a day throughout the 33-day fielding period. Within each of five daily windows (6:00 a.m.–10:00 a.m.; 10:00 a.m.–1:00 p.m.; 1:00 p.m.–4:00 p.m.; 4:00 p.m.–7:00 p.m.; 7:00 p.m.–11:00 p.m.), a reminder alarm would sound, prompting respondents to complete the survey. Although the time windows themselves were known, the alarm and reminders were triggered at a random point within each window to deter panelists from anticipating the survey and potentially changing their behavior accordingly.
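To make the randomization concrete, the sketch below shows one way such a scheduler could be implemented; it is a minimal illustration in Python, assuming simple uniform draws within each window, and is not the Techneos SODA configuration actually used in the study.

```python
import random
from datetime import date, datetime, time, timedelta

# The five daily reporting windows used in the study (local time).
WINDOWS = [
    (time(6, 0), time(10, 0)),
    (time(10, 0), time(13, 0)),
    (time(13, 0), time(16, 0)),
    (time(16, 0), time(19, 0)),
    (time(19, 0), time(23, 0)),
]

def schedule_alarms(day: date) -> list[datetime]:
    """Draw one random alarm time inside each daily window so that
    respondents cannot anticipate exactly when the survey will arrive."""
    alarms = []
    for start, end in WINDOWS:
        start_dt = datetime.combine(day, start)
        end_dt = datetime.combine(day, end)
        offset = random.uniform(0, (end_dt - start_dt).total_seconds())
        alarms.append(start_dt + timedelta(seconds=offset))
    return alarms

# Example: alarm times for the first field day
print(schedule_alarms(date(2010, 6, 10)))
```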
The survey collected “in the moment” information about each respondent's activities: where he or she was, what he or she was doing, and with whom, as well as his or her current mood. The goal was to administer all questions within a 3–5 minute survey. The last question of each survey asked the respondent to take a picture of what he or she was currently focused on and to provide a caption describing the photo.
The analysis focuses on issues related to respondent compliance, in particular assessments of increasing survey nonresponse, declining item completeness, changing mood, and decreasing compliance with specific tasks (providing a photo and caption).
Results
Survey completion rate
Data were collected from 384 respondents between June 10 and July 12, 2010. Over the 33-day period, respondents were asked to complete 165 surveys (five per day across different parts of the day), resulting in 63,360 potential surveys across all respondents. The average respondent-level completion rate (number of fully completed interviews / number of surveys offered) was 78.7 percent.
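The arithmetic behind these counts is simple; the sketch below reproduces it using illustrative names (the variables and the example value of 130 completed interviews are hypothetical, not drawn from the study's data files).

```python
RESPONDENTS = 384
SURVEYS_PER_DAY = 5
FIELD_DAYS = 33

offered_per_respondent = SURVEYS_PER_DAY * FIELD_DAYS     # 165 surveys
potential_surveys = RESPONDENTS * offered_per_respondent  # 63,360 surveys

def completion_rate(completed: int, offered: int = offered_per_respondent) -> float:
    """Respondent-level completion rate: fully completed interviews / surveys offered."""
    return completed / offered

# e.g., a respondent who completed 130 of the 165 offered surveys
print(f"{completion_rate(130):.1%}")  # -> 78.8%
```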
The average completion rate varied significantly over time (Table 1). Over the 5-week period, we see a pattern of initial learning (week 1), followed by more intensive usage (weeks 2 and 3), followed by a slight drop-off in participation during the latter weeks.
There were no significant differences in participation levels across any of the weeks based on sex, work status, or language. The only significant difference noted for age was in week 1, where participation was significantly higher among those 16–24 years old (87 percent) than among those aged 25 and older (where participation was 79 percent or lower). This likely reflects greater familiarity with new technologies, such as smartphones, among younger people. This notion is buttressed by the cross-time pattern of participation, which shows what appears to be a “learning curve” during the first week for the older group, but not for the younger group.
Finally, looking at completion rates across the different times of the day, we find remarkable stability (Table 2). There were no significant differences across demographic groups or within subgroups over time. Nonresponse therefore appears to be driven primarily by whether the respondent chose to continue participating in the study, rather than by a tendency to participate only at selected times of the day.
Recording activities at the time of the survey
To gain a better understanding of whether or not respondents were “learning” to navigate the shortest path through the survey (a sign of respondent fatigue and potential measurement error), we examined responses provided to the question: “What were you doing before starting the survey?” Respondents could choose all that apply from a list of 14 potential activities. Most of these activities had specific follow-up questions to obtain more detail. A respondent wanting to minimize the number of questions he or she was asked could therefore limit the number of activities chosen and hence reduce the survey length.
Looking at the average number of activities reported by respondents over time, the weekly average across respondents was 1.63 activities over the 5-week period. This number was very stable across all weeks of the field period, with no significant variation over time (Table 3). Moreover, there was little significant variation across time within the various demographic groups examined. This would seem to indicate that respondents did not alter their reporting of concurrent activities from the start to the conclusion of the field period.
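For illustration, the per-week average number of reported activities could be computed from record-level data roughly as follows; the record keys used here ('week' and 'activities') are hypothetical field names, not those of the study's data set.

```python
from collections import defaultdict
from statistics import mean

def weekly_activity_averages(records):
    """records: completed interviews, each a dict with hypothetical keys
    'week' (1-5) and 'activities' (the list of activity codes selected)."""
    counts_by_week = defaultdict(list)
    for rec in records:
        counts_by_week[rec["week"]].append(len(rec["activities"]))
    # Average number of activities reported per interview, by field week
    return {week: mean(counts) for week, counts in sorted(counts_by_week.items())}
```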
Respondent mood
Another potential indicator of respondent engagement (or disengagement) in the survey process is his or her self-reported mood at the time of the survey. Halfway through each survey, respondents were asked to indicate on a 5-point scale the number that most accurately described their current mood. The scale had two descriptive anchors: 1=bored, 5=excited. If respondents were growing weary of completing surveys over the 5-week period, one would expect a higher percentage of “bored” responses or a shifting in the average rating to the lower end over time.
Over the 5-week period the weekly average rating for respondent mood was 3.45 (Table 4). There was a slight change over time across all respondents, with week 1 showing a marginally higher average rating than the other weeks. The overall trend was fairly flat, indicating that respondent mood did not appear to change appreciably after the second week of the study.
Photos and captions
Finally, we examined compliance with the photo task by looking at the percentage of completed interviews accompanied by a photo, as well as the percentage accompanied by both a photo and a brief caption recorded by the respondent.
On average 92.0 percent of all completed interviews were accompanied by a photograph, while 66.4 percent of all completes had both a photo and a caption (Table 5). Women were more likely than men across each of the five weeks to both take a picture and insert a caption. Like some of the other measures already discussed, the percentage of photos with captions remained remarkably stable over time, both overall and within the subgroups examined. Again, this is an indication that respondents were largely compliant with this specific task.
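A minimal sketch of this compliance calculation, again assuming hypothetical field names ('photo' and 'caption') indicating whether each element was supplied with a completed interview:

```python
def photo_compliance(completes):
    """completes: completed interviews, each a dict with hypothetical
    boolean keys 'photo' and 'caption'."""
    total = with_photo = with_both = 0
    for rec in completes:
        total += 1
        if rec.get("photo"):
            with_photo += 1
            if rec.get("caption"):
                with_both += 1
    return {
        "photo_rate": with_photo / total,             # e.g., 0.920 overall
        "photo_and_caption_rate": with_both / total,  # e.g., 0.664 overall
    }
```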
Implications
The 2010 World Cup study provides some critical insights into how researchers can utilize current technologies to expand the types of studies that can be conducted. Given the length of the panel and the requirement that surveys be completed at multiple points during the day, the study achieved a notably high completion rate (78.7 percent). Most of the other respondent engagement measures showed stability over time, both overall and within the demographic groups examined. Where there were differences in compliance, they tended to occur across demographic groups (as opposed to cross-time changes within specific subgroups) and they tended to be minimal. Of interest for future research is the apparent “learning curve” among older respondents as compared to younger ones. Participation levels among older respondents rose over the course of the first week, whereas participation among younger respondents started high from the outset. This “learning curve” is a phenomenon that researchers need to understand better and potentially account for in their study designs when using mobile forms of data collection.
The study has several potential limitations which should be noted. First, it was conducted in one country (South Africa) and during a time of heightened national awareness and unity, with the country hosting the World Cup matches. Second, the sample was drawn using a quota (non-probability) methodology. While this approach ensured the proper proportion of respondents in terms of sex, age, work status, and language, the fact that these respondents were not chosen via a random selection process could affect the generalizability of the findings.
The promise of the digital age for researchers lies in how we can best harness these new technologies to unlock insights into various avenues of inquiry. An approach utilizing periodic surveys via smartphones, combined with other survey and non-survey data, offers one potential option. The approach allows researchers to capture rich, detailed data in a manner heretofore unattainable through traditional focus group or survey approaches. As these technologies continue to evolve, researchers will need to keep pace, leveraging new hardware and applications to meet changing data needs.