journal of biomedical informaticspotent technologies that shape pa behavior over time [24]. adaptive...

16
Contents lists available at ScienceDirect Journal of Biomedical Informatics journal homepage: www.elsevier.com/locate/yjbin Modeling individual dierences: A case study of the application of system identication for personalizing a physical activity intervention Sayali S. Phatak a , Mohammad T. Freigoun b , César A. Martín b,c,1 , Daniel E. Rivera b , Elizabeth V. Korinek a , Marc A. Adams a , Matthew P. Buman a , Predrag Klasnja d , Eric B. Hekler a, a School of Nutrition and Health Promotion, Arizona State University, Phoenix, AZ 85004, USA b Control Systems Engineering Laboratory, School for Engineering of Matter, Transport, and Energy, Arizona State University, Tempe 85281, USA c Escuela Superior Politécnica del Litoral, ESPOL, Facultad de Ingeniería en Electricidad y Computación, Campus Gustavo Galindo Km 30.5 Vía Perimetral, P.O. Box 09- 01-5863, Guayaquil, Ecuador d Kaiser Permanente Washington Health Research Institute, Seattle, WA 98101, USA ARTICLE INFO Keywords: System identication Idiographic modeling Dynamical systems modeling Adaptive interventions Physical activity Behavior change Wearables ABSTRACT Background: Control systems engineering methods, particularly, system identication (system ID), oer an idiographic (i.e., person-specic) approach to develop dynamic models of physical activity (PA) that can be used to personalize interventions in a systematic, scalable way. The purpose of this work is to: (1) apply system ID to develop individual dynamical models of PA (steps/day measured using Fitbit Zip) in the context of a goal setting and positive reinforcement intervention informed by Social Cognitive Theory; and (2) compare insights on potential tailoring variables (i.e., predictors expected to inuence steps and thus moderate the suggested step goal and points for goal achievement) selected using the idiographic models to those selected via a nomothetic (i.e., aggregated across individuals) approach. Method: A personalized goal setting and positive reinforcement intervention was deployed for 14 weeks. Baseline PA measured in weeks 12 was used to inform personalized daily step goals delivered in weeks 314. Goals and expected reward points (granted upon goal achievement) were pseudo-randomly assigned using techniques from system ID, with goals ranging from their baseline median steps/day up to 2.5× baseline median steps/day, and points ranging from 100 to 500 (i.e., $0.20$1.00). Participants completed a series of daily self- report measures. Auto Regressive with eXogenous Input (ARX) modeling and multilevel modeling (MLM) were used as the idiographic and nomothetic approaches, respectively. Results: Participants (N = 20, mean age = 47.25 ± 6.16 years, 90% female) were insuciently active, over- weight (mean BMI = 33.79 ± 6.82 kg/m 2 ) adults. Results from ARX modeling suggest that individuals dier in the factors (e.g., perceived stress, weekday/weekend) that inuence their observed steps/day. In contrast, the nomothetic model from MLM suggested that goals and weekday/weekend were the key variables that were predictive of steps. Assuming the ARX models are more personalized, the obtained nomothetic model would have led to the identication of the same predictors for 5 of the 20 participants, suggesting a mismatch of plausible tailoring variables to use for 75% of the sample. Conclusion: The idiographic approach revealed person-specic predictors beyond traditional MLM analyses and unpacked the inherent complexity of PA; namely that people are dierent and context matters. System ID provides a feasible approach to develop personalized dynamical models of PA and inform person-specic tai- loring variable selection for use in adaptive behavioral interventions. 1. Introduction Since the National Research Council report [1], there has been an explosion of interest in precisionmedicine and related eorts, such as personalized digital health interventions to foster behavior change. While the exact denition of precision medicine is still being specied, a general theme of the various eorts is that individual dierences, contextual dierences, and changes to both, the person and context https://doi.org/10.1016/j.jbi.2018.01.010 Received 15 April 2017; Received in revised form 23 January 2018; Accepted 30 January 2018 Corresponding author at: School of Nutrition and Health Promotion, Arizona State University, 550 North 3rd Street, Phoenix, AZ 85004, USA. 1 Present address. E-mail addresses: [email protected] (S.S. Phatak), [email protected] (M.T. Freigoun), [email protected] (C.A. Martín), [email protected] (D.E. Rivera), [email protected] (E.V. Korinek), [email protected] (M.A. Adams), [email protected] (M.P. Buman), [email protected] (P. Klasnja), [email protected] (E.B. Hekler). Journal of Biomedical Informatics 79 (2018) 82–97 Available online 01 February 2018 1532-0464/ © 2018 Elsevier Inc. All rights reserved. T

Upload: others

Post on 12-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

Contents lists available at ScienceDirect

Journal of Biomedical Informatics

journal homepage: www.elsevier.com/locate/yjbin

Modeling individual differences: A case study of the application of systemidentification for personalizing a physical activity intervention

Sayali S. Phataka, Mohammad T. Freigounb, César A. Martínb,c,1, Daniel E. Riverab,Elizabeth V. Korineka, Marc A. Adamsa, Matthew P. Bumana, Predrag Klasnjad, Eric B. Heklera,⁎

a School of Nutrition and Health Promotion, Arizona State University, Phoenix, AZ 85004, USAb Control Systems Engineering Laboratory, School for Engineering of Matter, Transport, and Energy, Arizona State University, Tempe 85281, USAc Escuela Superior Politécnica del Litoral, ESPOL, Facultad de Ingeniería en Electricidad y Computación, Campus Gustavo Galindo Km 30.5 Vía Perimetral, P.O. Box 09-01-5863, Guayaquil, Ecuadord Kaiser Permanente Washington Health Research Institute, Seattle, WA 98101, USA

A R T I C L E I N F O

Keywords:System identificationIdiographic modelingDynamical systems modelingAdaptive interventionsPhysical activityBehavior changeWearables

A B S T R A C T

Background: Control systems engineering methods, particularly, system identification (system ID), offer anidiographic (i.e., person-specific) approach to develop dynamic models of physical activity (PA) that can be usedto personalize interventions in a systematic, scalable way. The purpose of this work is to: (1) apply system ID todevelop individual dynamical models of PA (steps/day measured using Fitbit Zip) in the context of a goal settingand positive reinforcement intervention informed by Social Cognitive Theory; and (2) compare insights onpotential tailoring variables (i.e., predictors expected to influence steps and thus moderate the suggested stepgoal and points for goal achievement) selected using the idiographic models to those selected via a nomothetic(i.e., aggregated across individuals) approach.Method: A personalized goal setting and positive reinforcement intervention was deployed for 14 weeks.Baseline PA measured in weeks 1–2 was used to inform personalized daily step goals delivered in weeks 3–14.Goals and expected reward points (granted upon goal achievement) were pseudo-randomly assigned usingtechniques from system ID, with goals ranging from their baseline median steps/day up to 2.5× baseline mediansteps/day, and points ranging from 100 to 500 (i.e., $0.20–$1.00). Participants completed a series of daily self-report measures. Auto Regressive with eXogenous Input (ARX) modeling and multilevel modeling (MLM) wereused as the idiographic and nomothetic approaches, respectively.Results: Participants (N=20, mean age= 47.25 ± 6.16 years, 90% female) were insufficiently active, over-weight (mean BMI=33.79 ± 6.82 kg/m2) adults. Results from ARX modeling suggest that individuals differ inthe factors (e.g., perceived stress, weekday/weekend) that influence their observed steps/day. In contrast, thenomothetic model from MLM suggested that goals and weekday/weekend were the key variables that werepredictive of steps. Assuming the ARX models are more personalized, the obtained nomothetic model wouldhave led to the identification of the same predictors for 5 of the 20 participants, suggesting a mismatch ofplausible tailoring variables to use for 75% of the sample.Conclusion: The idiographic approach revealed person-specific predictors beyond traditional MLM analyses andunpacked the inherent complexity of PA; namely that people are different and context matters. System IDprovides a feasible approach to develop personalized dynamical models of PA and inform person-specific tai-loring variable selection for use in adaptive behavioral interventions.

1. Introduction

Since the National Research Council report [1], there has been anexplosion of interest in “precision” medicine and related efforts, such as

personalized digital health interventions to foster behavior change.While the exact definition of precision medicine is still being specified,a general theme of the various efforts is that individual differences,contextual differences, and changes to both, the person and context

https://doi.org/10.1016/j.jbi.2018.01.010Received 15 April 2017; Received in revised form 23 January 2018; Accepted 30 January 2018

⁎ Corresponding author at: School of Nutrition and Health Promotion, Arizona State University, 550 North 3rd Street, Phoenix, AZ 85004, USA.

1 Present address.

E-mail addresses: [email protected] (S.S. Phatak), [email protected] (M.T. Freigoun), [email protected] (C.A. Martín), [email protected] (D.E. Rivera),[email protected] (E.V. Korinek), [email protected] (M.A. Adams), [email protected] (M.P. Buman), [email protected] (P. Klasnja), [email protected] (E.B. Hekler).

Journal of Biomedical Informatics 79 (2018) 82–97

Available online 01 February 20181532-0464/ © 2018 Elsevier Inc. All rights reserved.

T

Page 2: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

over time can all be taken into account to provide the optimal inter-vention for a particular person when they need it. Within the realm ofdigital health/mobile health and persuasive technologies (henceforthlabeled ‘mHealth’) for behavior change, taking individual differencesinto account requires the empirical development of decision policiesthat can use information about people in the form of tailoring variablesto specify the selection of one intervention option over another [2].Tailoring variables can be defined as predictors that are expected tomoderate the effect of the intervention for a given individual [2].Tailoring can be static, using time-invariant predictors (e.g., sex), ordynamic, using time-varying predictors, such as different states of theindividual (e.g., stress, busyness) or different contexts (e.g., at home vs.at work; day of the week) for more personalized adjustments of theintervention dosages over time [3–5].

Much within-person or intra-individual variability in behavior andpsychological processes exists. Specifically, since many of these pro-cesses have time-varying characteristics, they are thought to be non-ergodic processes, i.e. they are heterogeneous across people and acrosstime [6–8]. Yet, most researchers, even those studying time-varyingvariables, pool individuals together and make ‘aggregated' between andwithin-person predictions (referred to as nomothetic approaches) tomodel responses to interventions [6,9,10]. Although a significantoverall effect might indicate a strong influence of a variable on anoutcome or its role as a moderator of an intervention’s effectiveness fora typical subgroup, there are often many individuals who respond dif-ferently than the average person or whose data do not fit the populationmodel [11,12]. For example, a nomothetic model might identify stressas a moderator of the relationship between a physical activity (PA)intervention and PA. The model could indicate that, on average, in-dividuals respond well to the intervention on days that they are lessstressed, but not on days that they are highly stressed. A follow-upquestion might be: Is it possible that this relationship does not hold foreveryone? Could there be some individuals whose activity is not asaffected by their stress levels but instead by some other aspect of theirlife (e.g., day of the week)? Could there be some individuals that re-spond more favorably to the intervention when stressed? Moreover,could this relationship change over time? This information is largelyaveraged out when nomothetic approaches are utilized, unless veryclear moderation hypotheses are specified that would account for dif-ferential responses. Some of the above questions about individual dif-ferences can be answered by using analyses that allow specification ofnot only fixed (aggregated) but also random effects (individual devia-tions from the group mean). But even these analyses place greateremphasis on the nomothetic insights, with the random estimates in-corporated more as a mechanism of control as opposed to a uniqueresearch target, and the idiographic responses are examined in terms ofhow much individuals differ from group-level information [8,13].Secondary analyses comparing idiographic and nomothetic analysesconducted on the same data also show vast differences in inferencesbetween the two approaches [7,8,14], supporting the importance offurther comparisons and highlighting that person-specific approachesmay identify unique variables for behavior change leading to betterexplanation, prediction and control of complex behaviors. Overall, ro-bust scientific practices that can support the creation of idiographicmodels are strongly needed.

With idiographic approaches, the aim is to make individualizedpredictions by examining within-person variation over time [6–8]. Anexperimental design and analytic suite of methods, called systemidentification (system ID) [15] is feasibly well matched to this problemas it provides strategies for developing idiographic dynamical models.The purpose of this paper is to present the methodology of system ID forcreating individual dynamical models of behavior to support, amongother things, tailoring variable selection. First, we describe our use caseof an adaptive goal setting and positive reinforcement intervention forincreasing Fitbit-measured steps/day. Next, we describe the logic ofsystem ID, followed by methods that were used to generate

individualized dynamical models. Finally, we contrast findings from theidiographic analyses with those using a nomothetic, multilevel mod-eling (MLM) approach with respect to the match between plausibletailoring variables obtained from the nomothetic model, and each in-dividual’s dynamical model.

1.1. Adaptive approaches for physical inactivity

Physical inactivity is one of the leading behavioral risk factors as-sociated with mortality worldwide, with most adults being in-sufficiently active [16–18]. PA is a complex behavior because bothstatic and time-varying factors can impact an individual's ability toengage in PA over time (such as each day, week, month, etc.)[10,19–21]. Therefore, it is a logical use case for exploring potentialidiographic response patterns to treatments/interventions. In addition,within the realm of mHealth technologies targeting PA, it is increas-ingly being realized that one-size-fits-all approaches might not be en-ough to achieve sustained improvements in PA or to maintain users’interest and usage of the technology [9,10,22,23]. Beyond this, workfrom persuasive technology research also suggests that in addition tofactors such as perceived credibility of the system and interactive fea-tures such as providing feedback on performance, task-related supportfeatures such as tailoring and personalization of the intervention con-tent (e.g., step goal itself) to be more responsive to a user’s changingneeds and context are important in order to develop engaging andpotent technologies that shape PA behavior over time [24].

Adaptive interventions are a class of interventions that recognizethe need for dynamic decision-making and personalization of inter-vention components in addition to the traditional, static tailoring ap-proaches used in most behavior change interventions [2–5]. Adaptiveinterventions use time-varying tailoring variables to inform the inter-vention’s decision framework and re-adapt continually over time as theintervention progresses.

Behavior change techniques of goal setting and positive reinforce-ment lend themselves well to adaptive interventions and are founda-tional to many treatment approaches and mHealth technologies[25–27]. Step goals can be adjusted and communicated daily with re-ward points for goal attainment using smartphones and step-basedwearables. Previous studies have shown that interventions assigningdaily step goals that adapt to a person's own PA levels over time andreward goal achievement via points and financial incentives were moresuccessful in increasing steps/day as compared to interventions with astatic daily goal of 10,000 steps/day [28,29] or as compared to a no-intervention control group [30]. In addition, results from Adams et al.[29] showed that the group receiving adaptive goals exhibited a slowerdecline in steps over the duration of the intervention, suggesting thatadaptive goals were more successful in maintaining the increases in PAover the 4- month intervention period. This work highlights the po-tential for adaptive interventions to foster sustained behavior change,though further work is needed. For example, one area of future workcould be to take into consideration factors that impact a person’s dailysteps beyond the intervention, such as weather, location, day of theweek, social settings, or even psychological factors such as self-efficacy,commitment to being active or perceived stress. Insights on those fac-tors, particularly if created at an idiographic level should, theoretically,support even more targeted and appropriate suggested step goals andpoints. The focus of the current study is to develop individualizedmodels that can inform this target of selecting person-specific time-varying tailoring variables that can be used in adaptive goal-settinginterventions.

1.2. Control systems engineering in behavioral research

Control Systems Engineering is a field that studies how a dynamicalsystem (for example, a model explaining daily PA behavior) can beregulated (by manipulation through an intervention) to achieve the

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

83

Page 3: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

desired outcome (e.g., increase in PA to recommended levels) overtime. The idiographic, multivariate, time-varying and non-linear natureof human behavior lends itself well to be studied using control systemsprinciples [31]. Further, the development of dynamic models requiresintensive longitudinal measurements. Smartphones and wearabletechnologies are well equipped to support such measurements bytracking a wide range of factors passively (e.g., steps, location, weather)as well as actively through user self-report (e.g., perceived stress) in aless burdensome manner. Recently, these methods have been applied inbehavioral contexts, predominantly through secondary analyses of datafrom clinical and behavioral interventions [12,32,33]. One study fo-cused on developing a dynamical model for understanding gestationalweight gain [32]. Another used control engineering methods to developa dynamical model of the relationship between fibromyalgia treatmentand symptoms to inform an adaptive intervention capable of adjustingand assigning optimal treatment dosages over time [33]. These studieshave established the proof-of-concept to apply these methods as theprimary methodology. Two methods from control systems engineering,particularly, system ID and model predictive control together offer themethodology and decision framework for development of adaptive in-terventions.

1.2.1. System identificationSystem ID is a data-driven suite of methods that are used to develop

dynamical systems models from input-output measurements coupledwith prior physical and conceptual knowledge [15]. The primary focusis on modeling variations and responses within longitudinal data on acase-by-case (N-of-1; single-case; idiographic) basis. It explores the re-lationships between the 'inputs', which includes the manipulated vari-ables or intervention components (e.g., daily step goal and points) andtime-varying disturbance variables (e.g., perceived stress, weather),and the predicted 'output' or outcome of interest (e.g., actual steps takenper day) for a given individual over time. This idiographic approachprovides insights on predictors of behavior most relevant to each in-dividual that can be used as person-specific tailoring variables to per-sonalize decision rules and intervention dosages even for those in-dividuals that might be mis-matched to the population model.

System ID extends from experimental design up to parameter esti-mation and validation. First, an appropriate ‘informative’ experimentneeds to be designed in a way that allows the development of a modelthat contains preliminary information about the dynamics of thesystem. Such an experiment is called an open-loop experiment and is thefocus of this work [34]. This experiment is termed open loop becausethe intervention option (step goal) is pre-specified, and does not takeinto account the individual’s measured behavior to determine futuregoals. An open-loop informative experiment randomizes the delivery ofintervention components within-person over time in a way to producesufficient excitation within the system to test how the interventioncomponents affect behavior in different states (e.g., weekdays vs.weekends, when busy vs. not busy). This allows for causal inferences tobe drawn at the dynamic, within-person level, thus providing the em-pirical data for a dynamical behavioral model.

An open-loop informative experiment can be considered the controlsystems version of a pilot study or the Preparation Phase as used in theMultiphase Optimization Strategy or MOST [35,36]. The measuredvariables used in these modeling efforts can be based on a conceptualdynamical model that includes specification of model structure andmathematically-defined, theorized inter-relationships between vari-ables. For example, this work is based on a dynamical conceptual modelof the Social Cognitive Theory (Fig. 4), briefly described in Section 2.6[37–39]. System ID is an iterative procedure whereby execution of theexperiment is followed by data preprocessing, model structure de-termination, parameter estimation, and finally, model validation until asuitable model is obtained [33,40]. Detailed information about thesesteps as carried out in the current study is provided in Section 2.

1.2.2. Model predictive controlModels derived from the open-loop experiment can then be in-

corporated into a 'closed-loop' control system, which consists of the in-tervention's decision framework and algorithms to achieve the desiredbehavioral outcome. The controller algorithm aims to determine anappropriate intervention dose (e.g., daily step goal and points) using acombination of the predictive model, repeated real-time assessments ofthe relevant variables, and additional optimization constraints (e.g.,never assign a goal higher than 5000 steps more than the previous day).The aims of a control system are to: (1) reach the desired goal (e.g., asustained increase in daily PA up to 10,000 steps per day); (2) handlethe effects of measured factors known to influence behavior; and (3)handle the effects of unmeasured factors (i.e. the controller will adjustthe daily step goals if there is a change in PA that the model cannotexplain). An in-depth discussion on control engineering and its appli-cation in behavioral interventions is provided in [31]. It should benoted that the focus of the current study was the first step in this pro-cess; that of building the dynamical models, NOT incorporating them ina model predictive controller. This information is included to equip thereader with conceptual knowledge of how the dynamical models willultimately be incorporated into an adaptive intervention frameworkusing control systems engineering methods.

The eventual goal of this research is to develop an adaptive inter-vention using model predictive control that can predict an ‘ambitious,but doable’ (to avoid negative effects of overly high goals but stillproduce increases in PA) daily step goal and expected reward points foran individual based on what the system knows about them. The currentpilot study is one of the first steps in this process.

1.3. Current study: the Just Walk pilot study

Just Walk is a pilot, open-loop system ID informative experimentthat was designed to develop dynamical models that attempt to un-derstand observed steps/day as the outcome of interest on a case-by-case basis [40]. Previous analyses from this study have reported themean change in PA, which indicated that, on average, participantssignificantly increased their daily steps by 2650 steps/day over14 weeks (t= 8.25, p < 0.01) from the baseline phase to interventioncompletion, based on a linear mixed effects model [41]. This work in-dicates that on average, our intervention can produce meaningful in-creases in steps. In addition, previous work [42] has delineated themathematical equations and other logical proofs expected for appro-priate validation for system ID. The focus of this paper is to expandupon this previous work by describing the methodology of the systemID informative experiment for both, an informatics and behavioralscience audience and demonstrate what is uniquely learned from use ofidiographic dynamical systems modeling as compared to a traditionalnomothetic MLM analysis with regard to plausible tailoring variableselection.

2. Method

2.1. Overview

The study was designed using system ID experimental design prin-ciples that involve pseudo-randomization of the intervention compo-nents (justified below). The total duration of the study was 14weeks.Participants were provided a Fitbit Zip activity tracker along with anAndroid based app called Just Walk. The Fitbit Zip has shown strongvalidity for measurement of steps in free living conditions in numerousstudies [43,44]. Weeks 1–2 were the baseline phase; this was used toassess participants' initial PA level. During this phase, they were askedto use the Fitbit Zip at all times and to maintain their usual routine.Weeks 3–14 were the intervention phase, in which participants receiveda daily step goal and were notified of the available reward points theywould earn if they met their goal. Each participant's daily goals were

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

84

Page 4: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

individualized based on their own baseline PA level. During all14 weeks, participants answered daily morning and evening surveyscovering a range of psychometric measures delivered via the Just Walkapp. The university’s Institutional Review Board approved the inter-vention and data collection procedures.

2.2. Participants and screening process

Participants were generally healthy, insufficiently active, over-weight adults, aged 40–65 years using an Android (v 2.3 or higher)phone, interested in participating in a walking intervention and livinganywhere in the United States. Walking interventions are an importanttarget for this at-risk group as most can safely fit walking into theirlives, but only one-third meet recommended physical activity guide-lines [45–48].

Recruitment was carried out using Facebook, Twitter and universitylistservs. All participants completed a screening survey to assess elig-ibility for the study. The level of PA was assessed using theInternational Physical Activity Questionnaire (I-PAQ) short form [49]and BMI was calculated using self-reported height and weight. In-dividuals with BMI < 25 kg/m2 and activity levels of more than 1000MET-minutes/week were excluded from the study. Participants whohad any medical condition(s) that compromised participation in a PAintervention as assessed by the Physical Activity Readiness Ques-tionnaire (PAR-Q), were pregnant, did not have regular access to theInternet on their phone, and/or were currently participating in anotherexercise or nutrition program were also excluded from the study. Par-ticipants who were deemed eligible were given more detailed in-formation about the study and invited to participate. Those who con-sented and completed a baseline questionnaire were then sent a FitbitZip, the Just Walk app (as an attachment through email) along with thestudy user guide. The user guide provided detailed instructions for theinitial setup. Participants were also asked to download and install theFitbit app (but instructed to avoid opening it throughout the duration ofthe study) so that data from the tracker could be synced frequently withthe Just Walk app. Fig. 1 shows the process of recruitment that wascarried out from June-October 2015.

2.3. Study design

The unique experimental design strategies that are part of system IDallow the researcher to examine changes in the behavior of interest inresponse to varying levels of intervention components (e.g., differentvalues of daily goals, ranging from low to high) and to changes incontext and psychological states.

The study used an orthogonal multisine experimental design thatinvolves pseudo-randomization of the intervention components. Amultisine pseudo-random signal is one that appears to be random butis in fact deterministic and periodic. Such a signal can help producesufficient excitation in the behavior, particularly important for gen-erating dynamic models. These signals can be repeated; thus enabling,through statistical properties, not only model estimation but also morerigorous validation of a dynamical model [50]. The pseudo-rando-mization strategies used in these experiments are carried out at thewithin-person level, allowing inferences to be drawn at the individuallevel and strengthening the confidence in these models. For detailedinformation about the logic of multisine signals, pseudo-randomiza-tion and a mathematical description of the experimental design, referto [34,51].

As system ID methods focus on modeling dynamics within in-dividual systems, power for such experiments is not achieved by thenumber of participants but in other ways such as an appropriate studylength and validation and cross-validation data [40]. The power cal-culation was based on 'cycles', that are set time-periods which definethe multisine signal [34]. This type of a design maximizes the signal-to-noise ratio and helps to assess progressive model fits via the use of both

estimation and validation cycles. Simulations to determine theminimum cycle length required to ensure sufficient excitation and toenable the two intervention components to be delivered orthogonallyindicated that at least 3 independently excited sinusoids per signal percycle were required. To have at least 3 sinusoids per signal, the sam-pling time should be at least 12 days or more (this process and under-lying equations are explained in detail in Martín et al. [51]). Con-sidering that the decision period should be as short as possible, a cyclelength of 14 days each was deemed appropriate. To avoid aliasingspecific goals and points with day of the week, this duration was in-creased to 16 days. Based on existing literature on physical activityinterventions and available resources, a total intervention length of 5cycles (∼12 weeks) was selected as the duration for this pilot work.

2.4. Intervention components

Both intervention components were pseudo-randomly assigned on adaily basis for 12 weeks of the intervention as described below.

2.4.1. Step goalsThe median value of steps/day from the baseline period was con-

sidered as the representation of every individual’s usual level of ac-tivity. Days with fewer than 500 steps were deemed as non-wear daysand excluded from this calculation. Daily step goals were then assignedas a factor of this baseline PA from 'doable' (i.e., baseline median steps/day) to 'ambitious' (i.e., up to 2.5× the baseline median) individuallyfor each person. This approach was used in order to induce sufficientvariability and be able to model individuals’ behavioral responses todifferent goals and expected reward points in different states and con-texts. For most participants (n=11), goals assigned were up to 2x theirbaseline median steps/day. So if a person had a baseline median of4000 steps/day, their daily goal would range between 4000–8000steps/day. However, the factor was varied if the median value was toolow or too high. If a participant’s baseline median value was below3000 steps/day, the maximum goal was increased to 2.5x to avoid as-signing very low goals (n= 6), while if the baseline median was above7500 steps/day, the maximum factor was reduced to 1.75×, to avoidassigning overly high goals (n=3).

2.4.2. Positive reinforcementDaily expected points ranged from 100 ($0.20) to 500 ($1.00) and

were granted if the participant met their assigned goal. Participantsreceived a $5 Amazon Gift Card electronically every time they accu-mulated 2500 points. Participants were notified of the expected pointsfor the day at the same time the goal was assigned every morning.

Factors for both intervention components were chosen in a waythat both signals were orthogonal in frequency. A different experi-mental design was used for every individual to allow for control ofordering effects across participants [34]. Each participant thus re-ceived a unique sequence of daily goals and expected points that re-peated every 16 days for a total of five cycles. Fig. 2 illustrates anexample of the experimental design for one participant. This in-dividual was assigned goals up to 2x their baseline PA of 4317 steps/day. It should be noted that each participant’s goals for the entirestudy duration were based on their initial baseline PA and were notadaptively increased or decreased over time. The aim was to repeatthe intervention sequence multiple times, allowing the five cycles tobe comparable and utilized as estimation and validation sets for dy-namical modeling. This research design and process allowed us toidentify important individual-level time-varying variables that af-fected participants’ PA in addition to goals and points.

2.5. The Just walk app

Just Walk is the Android-based mobile application developed to runthe experiment. The system relies on the Fitbit API to collect the user’s

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

85

Page 5: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

Fig. 1. Just Walk Recruitment Process (adapted from Korinek et al., 2017) [41].

Fig. 2. Example of the experimental design for one participant. Figure displays participant’s actual PA during baseline phase, followed by daily goal and expected point sequences thatrepeated every 16 days for 5 cycles.

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

86

Page 6: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

step data. The password-protected encrypted system includes a back-end database and a web service for monitoring participants’ activity.The app has a simple user interface that displays the user's progresstowards their daily goal, expected points for that day, their step andgoal achievement history, and total accumulated points (Fig. 3).

2.6. Measures

Our past work [38,39] simulating a dynamical model of the SocialCognitive Theory (SCT; Fig. 4) [37] helped to inform the measures andanticipated relationships that drive behavior change. The completedescription of the SCT model is beyond the scope of this paper. But in

Fig. 3. Screenshots of the Just Walk app. Thenumber inside the gold medallion represents theexpected points, the red box displays the stepgoal, and green box displays steps since the latestsync with Fitbit. Fig. 3(a) displays the app widgetthat participants were advised to keep on theirhomescreen. Fig. 3(b) displays the opened appdisplaying progress towards current goal and stepand goal achievement history. Fig. 3(c) displaysan example of a daily morning survey question.(For interpretation of the references to colour inthis figure legend, the reader is referred to theweb version of this article.)

Fig. 4. Fluid analogy of a dynamical model of social cognitive theory.

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

87

Page 7: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

short, it is a mathematical fluid analogy model of the SCT developedusing previous literature, simulations and secondary analyses. Thismodel depicts how various factors interact to impact behavior overtime. This model is based on a daily time-scale. The primary constructsare considered to be ‘inventories’ (self-efficacy, outcome expectancies,behavioral outcomes, self-management skills, and the actual behaviorof interest). The other variables are inputs that are theorized to increaseor decrease behavior, and can be external or internal to the individual.Examples of these are perceived social support, perceived barriers tobeing active, intrapersonal states, environmental context and internaland external cues to action [38].

A number of variables were measured as part of the study to enablethe eventual goal of testing and building a dynamical model of the SCT,but only a subset was used in these preliminary analyses. Incorporatingall of the measured variables in an exhaustive analysis such as the oneutilized for this study would be computationally demanding and isaccompanied by the risk of over-modeling and over-parameterization(over-fitting). We chose to run this analysis with a subset of variablesthat have high potential for being meaningful time-varying tailoringvariables, particularly, interpersonal states and environmental context,namely predicted stress, predicted typicality of the day, predicted bu-syness, and whether the day was a weekday/weekend. This subset waschosen also because these variables were measured daily and prior togoal assignment, and could plausibly be used to adapt the day’s goal ina live system based on what the individual reports. Table 1 lists allvariables used in the modeling efforts: the three intervention signals(i.e., goals, expected, and granted points, henceforth referred to as ‘baseinputs’) and the four plausible tailoring variables. The main outcome(output) for the analyses was actual PA in the form of steps/day asmeasured by the Fitbit Zip. The item for measuring predicted typicalitywas developed by the research team and has not been previously testedfor validity or reliability. The items measuring predicted busyness andstress are single item measures that have been used in previous studies[52,53]. We chose to use one item for each construct in order to balancethe need to reduce participant burden with the desire to collect variousdata to serve the end-goal of testing and building a dynamical model ofthe SCT. Increasing the daily time spent in filling out surveys wouldhave likely reduced adherence, particularly over time.

The system ID analyses described in this paper (Section 3.2) are thepreliminary step in building dynamical models and do not incorporatethe SCT model structure. Additional variables that were measured inthe study [including disturbance variables such as weather, location]will be incorporated in the more advanced modeling efforts that willuse the SCT model structure.

3. Analysis

We conducted both nomothetic and idiographic analyses. As thenomothetic approach, we used multilevel modeling (MLM) as one keybest practice that would commonly be used for analyzing such data. Weemployed system ID methods, specifically, black-box identification

using ARX (Auto Regressive with eXogenous inputs) modeling as ouridiographic approach. The following section describes the MLM ana-lyses.

3.1. Multilevel modeling analyses for ‘on average’ effects

The aim of this analysis was to use MLM to build a parsimonious andpredictive model to describe the aggregate effects of our predictors ofinterest over the course of the intervention phase (excluding the base-line phase) and to compare these findings to those obtained in theidiographic analyses. MLM is necessary when the data is nested (e.g.repeated measures within individuals) and observations cannot be as-sumed to be completely independent. In this study, days representedlevel-one units (∼86 days per individual) and individuals were thelevel-two units (N= 20). All analyses were performed using SAS 9.4PROC MIXED software [54]. The same raw data (and approach forhandling missing data) that were utilized for the system ID analyses(Section 3.2.1) were used for this analysis, but variables were trans-formed differently in order to be more interpretable for the purposes ofMLM. Data transformations are outlined in Table 2.

The unconditional means model (UMM; absent of predictors at anylevel) yielded a level-2 variance of τ02 = 8,325,037 (p=0.0011), andan Intraclass Correlation Coefficient (ICC) of 0.5308, implying that53.08% of the variance in the outcome (actual steps/day) was betweenindividuals, indicating the need for an MLM approach [55]. All modelswere estimated using unstandardized maximum likelihood estimation,and a first order autoregressive variance-covariance structure, AR(1)was used to account for correlated residuals [56]. Initially, a growthmodel with the base inputs was built hierarchically followed by addi-tion of the remaining predictors. Improvement in model fit was calcu-lated using a likelihood ratio chi-square test [57]. Predictors that didnot improve model fit were removed from the subsequent models unlessthere was a reasonable justification, such as the base inputs that werepart of the intervention.

3.2. System ID analyses

The following section describes data preprocessing, model estima-tion and validation, and further analyses tailored to the data that wereused to estimate models for each participant. The black-box identifi-cation procedure presented in this paper is the preliminary step insystem ID analyses of these data. This is an important first step to takewhen little is known about significant input-output relationships. Theprimary aim is to build individualized computational models withoutmathematically incorporating the theoretical or conceptual model. Allanalyses were conducted using MATLAB (MATLAB 9.0, The MathWorks

Table 1Variables used in the analyses

Variable Question Scale/Unit

Step goal – Steps/dayExpected points – 100–500Granted points – 0–500Weekday/weekend 0/1Predicted stress How stressful do you expect

your day to be?1–5 scale (not at all tovery)

Predicted busyness How busy is today going to befor you?

1–4 scale (not at all toextremely)

Predicted typicality How typical do you expect todayto be for a [day of the week]?

1–4 scale (not at all tocompletely)

Table 2Predictors and data transformation used in the MLM analyses

Predictor Data structure/Centering

Level-1 predictors(1) Day Day 0 is the first day in intervention(2) Step goal Cluster centering around baseline PA (median

steps/day)(3) Expected points 0= 100 points(4) Granted points Raw score, shifted by one day(5) Weekday/weekend 0=weekday, 1=weekend(6) Predicted stress Cluster mean centering(7) Predicted busyness(8) Predicted typicality

Level-2 predictors(9) Baseline PA Grand mean centering around average baseline

PA(10) Mean predicted stress Mean value for each individual(11) Mean predicted busyness(12) Mean predicted typicality

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

88

Page 8: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

Inc., Natick, MA, 2016).

3.2.1. Data preprocessingData preprocessing included shifting the actual steps and granted

points forward by one day to reflect temporal precedence in the mod-eling. Missing data were handled using linear interpolation with sa-turation limits corresponding to the scales outlined in Table 1 [42]. Asin MLM, the variables listed in Table 1 (3 base inputs+ 4 time-varyingvariables) were used as predictors for model estimation.

3.2.2. Cross-correlation analysisCross-correlation analyses are a measure of the cross-covariance

between two time-varying variables [58]. Prior to model estimation,cross-correlation analyses were carried out for all input-input andinput-output pairs separately for all participants. For input-input pairs,these analyses help avoid potential multi-collinearity. Multi-collinearityrefers to a phenomenon where the predictor variables are highly cor-related. Including such inputs in the same model can lead to inflatedvariances of the ARX parameters (ARX is a linear least squares esti-mator) [59]. Input-output cross-correlations can also provide insightsinto potential significant dynamics that might exist between a predictorand outcome, such as feedback loops.

Cross-correlation coefficients for two variables were calculated fromtheir sample covariance. Next, 68.3%, 95.4%, and 99.7% standardhypothesis error bounds were calculated based on the null hypothesisthat the two signals are uncorrelated and one of which is Gaussianwhite noise. Hence, coefficient values exceeding their correspondinghypothesis bounds are indicative of a cross-correlation with the like-lihood indicated by the bounds [58].

3.2.3. Model estimationAn ARX model is a linear, lagged input-output dependence. In the

least squares sense, the ARX parameter estimation can be formulated asa linear regression problem [15]. The choice of using ARX models is dueto its appealing properties; first, the solution of the regression problemis globally optimal. Second, it is computationally inexpensive. Third,asymptotically, with large data and sufficiently high ARX model order,the ARX is a consistent estimator [15,42]. Preprocessed data were fittedto the following ARX model, … …( [ ])n n n n n n nARX a b b b k k knu nu1 2 1 2using the difference equation:

⎜ ⎟ ⎜ ⎟

⎜ ⎟ ⎜ ⎟

+ − +…+ − = ⎛⎝

− ⎞⎠

+…+ ⎛⎝

− − + ⎞⎠

+ ⎛⎝

− ⎞⎠

+…+ ⎛⎝

− − + ⎞⎠

⋮+ − +…+ − − +

+

y k a y k a y k n b u k n b u k n n

b u k n b u k n n

b u k n b u k n n

e k

( ) ( 1) ( ) 1

1

( ) ( 1)

( )

na a k nb k b

k nb k b

nu nu k nunbnu nu knu bnu

1 11 1 1 1 1 1 1 1

21 2 2 2 2 2 2 2

1

(1)

na represents the number of inputs …[ ]u u un1 2 u , …n n n n, , , ,a b b bnu1 2 are themodel orders, and …n n n, , ,k k knu1 2 are the input delays. e k( ) is the pre-diction error, while k is a discrete time index (in this case, day).

3.2.4. Model validationThe dataset for most participants was segmented into 5 cycles

(16 days each). All cycle combinations were used as estimation andvalidation sets to control for the effect of time across the different cy-cles. At least two cycles were preserved as the validation set in everycase. This was done in place of the conventional approach, which is tosplit datasets into a 50% validation and 50% estimation set. System IDoffers a metric to quantify the model in terms of the percentage ofvariance in the output that explained by the model, through the use ofthe normalized root mean square error (NMRSE) fit index:

⎜ ⎟= × ⎛⎝

− ∥ − ∥∥ − ∥

⎞⎠

y k y ky k y

model fit(%) 100 1 ( ) ( )( )

2

2 (2)

y k( ) is the measured output, y k( ) is the simulated output, y is the meanof all measured y k( ) values, and ∥ ∥· 2 indicates the l2-norm∥ ∥ =x x x( )T

2def

[42].As seen in the difference Eq. (1) above, with ARX estimation, the

analyst needs to specify a model order. A model order refers to thenumber of lagged inputs and outputs incorporated in the model [15].Since numerous ARX models can be generated for the same set of in-puts, the research team developed a series of systematic procedures tochoose the most appropriate model for each participant. The followinghierarchical model building procedure was carried out for each parti-cipant:

1. A search limit of 3 for all ARX model orders was deemed sufficient.Freigoun et al. [42] argued that fourth order model estimations,besides being computationally expensive, performed worse thanlower order models.

2. For every cycle combination, ARX orders were determined from anexhaustive search. The ARX orders with the maximum average va-lidation fit were chosen.

3. This procedure was carried out for all input combinations. The in-itial model that consisted of the base inputs (goals, expected points,and granted points) was considered the 'base' model, and the fourother inputs were added and estimated systematically (predictedbusyness, predicted stress, predicted typicality of day, andweekday/weekend).

4. The above process led to an ARX model being estimated for allpossible cycle combinations for all possible input combinations, foreach participant. Up to 20 models were estimated for all possibleinput combinations for each participant.

5. The next step was to choose one final model (one particular inputcombination) for each individual.

6. Further analyses showed that the most predictive ARX models thatare also resilient are models whose orders produce the highestaverage validation fit, and whose parameters are estimated suchthat they produce the highest overall fits [42].

7. Finally, 16 different models, one for each input combination weregenerated for all participants. The graphs in Fig. 7 display thehighest overall fits for every input combination. For a detailedmathematical description of these analyses, please refer to [42].

3.2.5. Final model selection procedureModel validation in system ID experiments ultimately corresponds

to the researcher's confidence in the suitability of the estimated modelsfor application [33]. To choose the most appropriate input combinationfor each individual, a protocol was developed by the research teambased on plausibility, and conventional wisdom in behavioral scienceand systems engineering. A penalty approach was applied to penalizeunfavorable model characteristics. Different weights (w) were assignedto four characteristics that affect model consistency and reliability: (a)overall highest fit (w= 2), with a higher penalty for lower fits; (b)cross-correlations between inputs (w=2), with higher penalty for in-puts with high cross-correlation coefficients; (c) distance of the overallhighest fit from the mean fit (mean % fit for all cycle combinations, foreach input combination) (w=1), with a higher penalty for larger dif-ferences; (d) standard deviation of the sample (% fit for all cyclecombinations, for each input combination) (w=1), with higher pen-alty for higher variances.

3.2.6. Personalization optimization criteriaAs in MOST, there is a need to define success quantitatively [35].

We did this via the use of the NMRSE fit index, Eq. (2). Using % modelfit as a benchmark, this step enables a process of data-driven optimi-zation for individualization that conceptually maps to reliability andvalidity. In this case, reliability and validity are estimated for our dy-namical models for predicting human behavior (as indicated by model

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

89

Page 9: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

fit) and, by extension, the selection of tailoring variables. In line withthe “leave-one-out” approach commonly used when cross-validatingmodels such as physical activity estimation via accelerometers [60], weobtained estimates using every cycle from our 5 cycle system ID studyas both estimation and validation data. Next, we engaged in a process ofselecting the “best” model, described above. For selection of the modeland in turn, the tailoring variables to use for each person, we chose touse multiple criteria to assess reliability, including penalizing modelsthat have higher variances, favoring models that have less deviationbetween the highest fitted model compared to the average of all modelfits, and favoring models with minimal cross-correlation between pre-dictors. With general reliability of the overall procedure and an ap-proach for selecting models that further accounts for reliability risksaccomplished, we then turn to good enough validity, which, in thiscontext involves the predictive capacity of the models. As these ana-lyses are a variation of multiple linear regression and the model fitestimate is analogous to r2, we chose Cohen’s conventions of explaining3% of variance as a small effect, 13% as a medium effect, and 26% as alarge effect [61]. While there is no clear definition of good enough forindividualization purposes, as, to the best of our knowledge, we are thefirst to do this, we chose to use the 13% medium effect as our a priorigood enough marker for the final model selected for each participant.Note, however, that further validity testing related to individualizationis possible and a core target of the more definitive optimization trial,the closed loop experiment. Further, we also fully acknowledge that ourapproach is only one of many (see discussion).

3.3. Nomothetic and idiographic comparison

The final step of our work was a comparison of the plausible tai-loring variables that would be selected based on the nomothetic vs. theidiographic approach. For this comparison, we assume that, based onthe analytic frame, the idiographic approaches will by definition, pro-vide more personalized and thus appropriate individualized selection oftailoring variables. We calculated the percentage of time, across parti-cipants, that the input combination identified by MLM is identical tothat identified in the idiographic analyses. By extension, this refers toan appropriate match in tailoring variable selection. We also calculateda percentage of the participants that had what might feasibly be aplausible good enough model in terms of tailoring variable selection. Inthis context, we use good enough when identification of the most im-portant tailoring variable(s) is the same between the MLM and dyna-mical models.

4. Results

4.1. Adherence

Out of the 22 participants who were consented and began the study,1 participant lost their Fitbit, and 1 had an incomplete dataset, leadingto a final sample size of 20 individuals. For the 20 participants thatcompleted the study, there was high adherence to the interventionoverall, with minimal missing data. Out of all participants, only 10 haddays with missing step data; a total of 40 (M=4 ± 4.39) non-wear ordays with fewer than 500 steps. In addition, there was high adherencefor daily morning surveys (∼90%). Baseline descriptive statistics forparticipants are provided in Table 3.

4.2. Multilevel modeling results

4.2.1. Model building stepsFollowing the UMM, a linear growth model with random intercepts

was tested (Model 1) as individuals were expected to vary in thenumber of steps walked per day at the start of the intervention. Modelfit improved significantly over a single intercept model (p < 0.001),revealing that intercepts vary significantly across individuals. A random

slope for time was then added to the model (Model 2) as it was expectedthat the rate of change in steps over time would vary between in-dividuals. Model fit improved significantly (χ2 (2) = 41.00,p < 0.001), indicating that change rates in steps per day vary acrossindividuals. Both quadratic and cubic terms for time were added to themodel as fixed effects to test for non-linearity, and model fit sig-nificantly improved when a cubic term for time was added to the model(Model 4; χ2 (1) = 6, p=0.014), with a significant effect of cubic time,(t(5 0 6) = −2.40, p=0.016), suggesting that an individual’s walkingbehavior follows a curvilinear pattern over the duration of the inter-vention. This was expected due to the design of the goals (they were notadaptively increased over time but repeated over cycles). A level-2predictor, baseline PA was then added to the model (Model 5) to controlfor the high variability in baseline steps that was observed in the study.Model fit improved significantly (χ2 (1) = 26.9, p < 0.001) and theeffect was significant, (t(20) = 7.63, p < 0.001).

The base inputs (all level-1 predictors) of the intervention were thenprogressively added to the model (Models 6–8). Model fit improvedsignificantly after the addition of a fixed effect for goals (χ2 (1) =137.10, p < 0.001), and the effect for goals was significant, (t(1641)= 11.97, p < 0.001). The addition of a fixed effect for expected pointsdid not improve model fit (χ2 (1) = 3.70, p=0.053) and the effect wasnot significant, (t(1601) = 1.94, p=0.0529). Addition of a fixed effectfor granted points also did not improve model fit (χ2 (1) = 3.20,p=0.07) but indicated a significant effect, (t(1722) = 2.00,p=0.0456). All base inputs were maintained in the subsequent modelssince they were manipulated as part of the intervention package. Therest of the predictors were then added to the model as fixed effects(Models 9–12). Addition of weekday/weekend (Model 9) improvedmodel fit, (χ2 (1) = 38.20, p < 0.001), and indicated a significanteffect, (t(1528) = −6.22, p < 0.001). Models failed to converge afterincluding random slopes for almost all level-1 predictors of interest(except linear time), likely due to insufficient power or variabilityneeded to detect those effects and due to the low sample size (N=20)and were not tested in these analyses. No other variables improvedmodel fit, and Model 9 was retained as the final model to be inter-preted. Model 9 is represented using the following equation:

= + + + +

+ + +

+ + + +

steps γ γ (baseline ) γ (time ) γ (time ) γ (time )

γ (goal ) γ (expectedpoints ) γ (grantedpoints )

γ (isWeekend ) U U R

ij 00 01 pa 10 ij 20 ij2

30 ij3

40 ij 50 ij 60 ij

70 ij 0j 1j ij

j

(3)

γ00 is the average intercept, γ01 is the average slope for baseline PA, andγ γ,10 20, γ30, γ40, γ40, γ50, γ γ,60 70 are the average slopes for the respectivepredictors, U0j is the deviation of person j’s intercept from the averageintercept, U1j is the deviation of person j’s slope for time from theaverage slope for time, and Rij is the Level 1 residual.

4.2.2. Final modelResults from the final model indicated that the average individual in

the intervention was expected to walk 6865.17 steps at day 0, which isthe first day of the intervention. On average, holding all other pre-dictors constant, for every 100 steps increase in daily goals, actual steps

Table 3Descriptive statistics (N=20)

Variable n (%) Mean Std. deviation Median Interquartilerange

Age, years – 47.25 6.16 46.00 9.50Females 18 (90) – – – –Ethnicity, White 19 (95) – – – –BMI, kg/m2 – 33.79 6.82 31.65 11.26Baseline PA, median

steps/day– 4863.30 2097.64 4434.00 3280.50

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

90

Page 10: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

increase by 37.8 steps (γ40= 0.37, 95% CI= [0.31, 0.44], p < 0.001).On average, participants were predicted to walk 842.21 steps less onweekends as compared to weekdays (γ70=−842.21, 95% CI[−1107.8, −576.62], p < 0.001). As mentioned earlier, previousanalyses looking at the mean change in PA [41] indicated that on

average, participants significantly increased their daily PA by 2650steps/day (p < 0.01) from baseline to intervention completion. Thecurrent analysis does not include the baseline phase and is looking atpredicted steps only during the intervention. Overall, these results in-dicate that on average, daily step goals were driving changes in walkingbehavior during this intervention, and that weekday/weekend is a keytailoring variable that should be considered in adaptive walking inter-ventions (see Tables 4 and 5).

4.3. System ID results

4.3.1. Cross-correlation analysisAnalyses were conducted for all input-input and input-output

pairs, but only a subset are presented in the paper for simplicity (Figs. 5and 6). In these figures, the x-axis corresponds to time (lag). The y-axiscorresponds to normalized cross-correlation coefficients calculatedfrom the sample covariances of the input pair for each lag. The red andgreen lines represent 68.3% and 99.7% standard hypothesis boundsrespectively. To interpret these graphs, we look at the correlationcoefficients across time. Those cross-correlations that touch or exceedthe 99.7% bounds are likely to be highly correlated. For example, forthe participant in Fig. 6, it is clear that predicted stress-weekday/weekend, predicted busyness-predicted stress, predicted stress-pre-dicted typicality are all highly correlated, and the penalty approachwould make them less likely to be incorporated into the same model.

Box and MacGregor [62] have shown that a significantly cross-correlation at lag 0 indicates the presence of a feedback loop betweenthe signals under scope. Fig. 6 displays input-output cross-correlationsfor a subset of the pairs for one participant, and illustrates an exampleof a potential feedback loop between granted points and actual steps forthis participant. These analyses make it possible to visualize these re-lationships and can be useful in future work that will incorporate theSCT model structure.

Table 4Parameter estimates for Model 9 (fixed effects)

Fixed effects

Effect Estimate Standarderror

Degrees offreedom

t Value Sig.

Intercept 6865.17 439.84 42.4 15.61 <0.0001Day −67.3538 25.3709 436 −2.65 0.0082Day2 1.7434 0.6491 434 2.69 0.0075Day3 −0.01312 0.0049 436 −2.70 0.0071Baseline PA 1.0804 0.1473 20 7.34 <0.0001Goal 0.3748 0.0314 1645 11.95 <0.0001Expected Points 0.7303 0.4233 1624 1.73 0.0847Granted Points 0.6682 0.3587 1722 1.86 0.0627Weekday/

Weekend−842.21 135.40 1528 −6.22 <0.0001

Table 5Parameter estimates for Model 9 (random effects)

Random effects

Parameters Estimate Standard Error Z Value Sig.

Intercept variance (τ02) 2,247,278 835,194 2.69 0.0036Intercept-Slope Covariance −20,892 12,783 −1.63 0.1022Slope variance (τ12) 761.23 298.82 2.55 0.0054AR(1) 0.1207 0.02786 4.33 <0.0001Residual variance (σ2) 6,199,813 218,449 28.38 <0.0001

Fig. 5. Examples of input-input cross-correlation analyses for one participant.

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

91

Page 11: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

4.3.2. Model estimationThe selected models for all participants are shown in Table 6. The

overall average model fit (estimation and validation data) for all par-ticipants combined was 19.2 ± 9.25%. The range was 6.3–46%. Asmentioned earlier, 16 different models, one for each input combinationwere estimated for every participant. Fig. 7 displays the highest overallfits for all 16 input combinations for four of the participants, with thefinal selected input combination marked in red. It also displays modelcharacteristics that were used in the penalty approach (excluding cross-

correlations). The error bars represent the mean and standard devia-tions (± 2 SD) of the % fits for each input combination. The green linerepresents the penalty line. Input combinations with lower penaltyscores are more favorable.

In Fig. 7(a), the selected model for participant A is a four-inputmodel with weekday/weekend as the fourth predictor added to the basemodel. The overall highest fit for this participant was 17.9%, suggestingthat this model explained 17.9% of the variance in actual steps per day.The appropriate interpretation from this individual model is thatwhether or not the present day is a weekday or a weekend significantlyinfluences the amount of PA the individual will engage in. The figurealso illustrates how adding more predictors in the model did not resultin a meaningful improvement in model fit. From Fig. 7(b), the selectedmodel for that participant was a 5-input model, which suggested thatboth predicted typicality of day and whether it is a weekday/weekendwere the factors that influence their steps in addition to the base inputs.

Out of all the chosen models, a 4-input model was more appropriatefor 45% of the participants, and a 5-input model for 40% of all parti-cipants. The 4-input model with weekday/weekend, and a 5-inputmodel with predicted typicality and weekday/weekend were the twomost common input combinations that emerged from the idiographicanalyses (25% and 20% of participants, respectively). Such insights canhelp us understand which tailoring variables should be further exploredfor development of personalized decision rules.

It should be noted that models for 4 of the 20 participants had two-standard deviation bounds that span the 0% fit threshold, suggestingthat, per Eq. (2), models could potentially be no better than a straightline at matching the measured data in these cases. Fig. 7(d) illustratesone such example. These participants will be further explored in nextstep, i.e., semi-physical modeling, where additional variables will beincorporated and other model structures will be tested.

Using Cohen’s convention [61], all 20 participants met the criteriafor explaining at least 3% of the variance, 12 out of 20 met criteria for

Fig. 6. Examples of input-output cross-correlation analyses for one participant.

Table 6Highest overall fit (%) for all participants (N=20)

PID Model (input combination) Highest overall fit (%)

1 Weekendb 17.92 Typical-weekendb 12.83 Busy-typical-weekendb 20.74 Busyb 24.65 Typical-weekendb 22.86 Stress-typicalb 12.97 Weekendb 6.38 Typical-weekendb 16.09 Weekendb 9.310 Weekendb 7.911 Stress-typical-weekendb 16.612 Base modela 8.313 Weekendb 24.214 Typical-weekendb 21.315 Stressb 23.116 Busy-stressb 25.617 Busy-weekendb 31.818 Busyb 46.019 Stress-weekendb 16.020 Typicalb 20.3

a Base model that includes goals, expected points and granted points.b Base model+ specified inputs.

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

92

Page 12: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

at least 13% of explained variance and 2 out of 20 met criteria for alarge effect size of 26% or more. While at this stage it is unclear whatminimal effect sizes would be needed to drive robust individualization,the model estimations provide an indication of the differences betweenpeople in terms of tailoring variables. If a medium effect size is con-sidered the minimum threshold, only four participants had highestoverall model fits below 13%, suggesting good enough model fit for80% of the sample.

Fig. 8 illustrates a time-series plot of a fitted model for one of theparticipants. A 4-input model was fitted, resulting in an overall highest

model fit of 24.2%. It displays the participant's actual steps along withthe model's predicted steps. It also visualizes the daily variation in themanipulated inputs (goals, expected points, and granted points) as wellas other selected predictors (weekday/weekend), and the cycles thatwere used as estimation and validation sets for this model.

4.4. Nomothetic vs. idiographic tailoring variable selection

Results from both, MLM and system ID analyses suggest that overall,in addition to the base inputs, whether it was a weekday or a weekend

Fig. 7. Models with highest overall % fit for all input combinations along with the final selected model (in red) derived using the penalty approach. [wknd=weekend, stress= predictedstress, busy= predicted busyness, typical= predicted typicality]. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of thisarticle.)

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

93

Page 13: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

day seemed to be an important predictor of participants’ PA. But if thisaggregate MLM model that included only those time-varying predictors(besides base inputs) that had a significant main effect overall wereused for informing the tailoring variable selection, we would likely beusing appropriate tailoring for only 25% of the participants (the same 4-input combination was chosen for 5 out of the 20 participants based onsystem ID), and less conservatively, for 65% (13 out of 20 models in-clude weekday/weekend, but in addition to another input). The idio-graphic approach helped identify potential tailoring variables even forthose participants who likely do not fit the population model (for ex-ample, those whose PA was influenced by how busy they predictedtheir day to be, such as the participant in Fig. 7(c). In addition, themodels failed to converge after including random effects for most of ourLevel-1 predictors of interest. Random effects offer a way to get closerto insights about individual differences in responses to interventions ascompared to fixed effects and standard OLS regression. However, it isoften the case that models fail to converge after including more than2–3 random effects [57] emphasizing an important advantage of thesystem ID approach that does not require a high sample size to achievesufficient power, particualrly when the goal is obtaining idiographicinsights, which, by definition, does not require multiple subjects toachieve said power.

5. Discussion

The current informative experiment aimed to pilot test a novel ap-proach to understanding PA behavior through the use of an idiographicsystem ID approach. Results of this work indicate the feasibility of usingsystem ID in a behavioral context, including the application of experi-mental designs from system ID for human behavior, and the feasibilityof developing and validating individualized dynamical models that candescribe, on a case-by-case basis, factors that dynamically relate withPA within the framework of a goal setting and positive reinforcement

intervention. Further, results of this work indicate that, in our sample,there was a match between the tailoring variables selected from a no-mothetic vs. idiographic approach for only one quarter of the partici-pants, which suggests the need for more idiographic approaches tosupport selection of tailoring variables.

Results from this study provide important clues to consequentialinformation that is likely missed when a nomothetic or aggregate ap-proach is used for analysis, particularly based on the wide range ofdifferent best-fit models that were generated across individuals in ourstudy. From the perspective of adaptive interventions, it is essential toremain mindful of the plausible decisions that will be made from thesedata. This information can be used to test decision policies aroundtailoring variables. In our context, these known dynamics can be usedto inform the daily suggested step goals to strive for and reward pointsprovided for achieving said goals on any given day. For example, it maybe appropriate to reduce the step goals on weekends for some in-dividuals to ensure they remain “ambitious but doable” for the targetedday. As our nomothetic vs. idiographic comparison suggests, exactlyhow we adjust the daily step goals depends on other factors that in-fluence the target outcome. As more information is accounted for in thedecision-making process (i.e., which variable, and plausibly, the di-rection and magnitude), the mismatch between decisions made via thenomothetic vs. the idiographic models will become more apparent. Asthe field of mHealth and precision medicine moves towards more per-sonalized and adaptive interventions, work is needed to clearly definestrategies for generating idiographic models.

A plausible critique of this line of thinking is that it is not possible todevelop individualized models at scale. A careful look into the currentdigital economy undermines this statement. For example, Google has apersonalized search option that takes into account a wide range of in-dividualized attributes about you – including your individualized re-sponse patterns and history [63]. These personalized models are alsoprevalent in Facebook, Amazon, and other major digital technology

Fig. 8. Time-series plot of a fitted 4-input model that was selected one of the participants.

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

94

Page 14: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

companies. While our approach is different from them in terms ofprocess, these scaled personalized models establish the feasibility of thelogic of our approach writ large. A second critique to scaling would bethe argument that individuals want personalized responses quickly andthus it is not possible to do learning run-in periods, such as the 3-monthlearning trial we did. This is a fair critique that should be thought aboutcarefully. In our context, we found that many of our participants ac-tually enjoyed receiving the variable step goals from one day to the next[41]. From their perspective, they were not taking part in a “learning”trial but instead were receiving an intervention that was individualizedto their usual activity level. This insight suggests that it might be pos-sible to learn while still providing a personalized intervention, as oc-curred in our case. Again, examples from the digital economy providefurther concurrent validation to the logic of this claim, though morework is needed to clearly understand which types of personalizedlearning are feasible for any given problem domain.

We see at least three broad domains of personalized learning thatwarrant further exploration. The first is a system ID approach we havealready presented. The second approach is from computer science andis the class of methods that stem from “reinforcement learning” [64]. Atits most basic level, reinforcement learning involves the specification ofvery clear measurable success criterion for a given intervention. Forexample, if one sends a text message suggesting a person goes for awalk, success would be that they go for a walk. Within reinforcementlearning, the delivery or not of that intervention is stochastic (e.g. 50%probability that a person will receive the intervention at any givenmoment). If the success criterion is met after an intervention is sent, theprobability of sending the message goes up (e.g., from 50 to 55%) nexttime. If, however, the message is sent and success is not observed, thenthe probability goes down. There are more advanced derivations of thisapproach but, at its core, this is an approach that can be responsive totruly idiographic issues.

A third approach could involve teaching individuals the process ofself-experimentation [65–67]. In this approach, individuals are taughtbasics in formulating hypotheses, testing those hypotheses with N-of-1designs, and then reviewing and iterating on their work based on theresults. This type of approach enables individuals to truly learn whatwork works for them, rather than what works on average and, thus, isanother plausible approach for advancing idiographic understanding ofa problem.

We highlight these three approaches to provide a broader perspec-tive on how science might progress by more formally acknowledgingand taking idiographic effects into account. These approaches fit withearlier calls for “data-driven case studies” as a plausible new approachfor learning about the subtleties on how behavior occurs in a real-worldcontext [68]. As such, we see this work as essential not only for sup-porting adaptive interventions but also as a strategy for advancingbehavioral science to one that can better manage and understand theinherent complexity of behavior [5,69].

While we see great value in idiographic approaches, it is importantto note that, like all methods, there are boundary conditions on whenthey are appropriate and, more importantly not appropriate. In thosecases where personalized learning is not possible, it is advisable to havea warm start by using the insights provided from nomothetic models.Further, it is quite plausible that insights can be transferred betweennomothetic and idiographic models to enable more robust under-standing overall of a given phenomenon [69,70].

5.1. Methodological considerations

To the best of our knowledge, this is the first study to systematicallyapply system ID, particularly via the use of system ID experimentationdesigns and corresponding analyses to the problem domain of behaviorchange. System ID offers a unique experimental design that is wellsuited for use in a goal setting and positive reinforcement intervention.Goal setting is one of the most popular behavior change techniques

available in mHealth apps, especially those linked to activity trackers[25–27], making this relevant for application in a real-world context.Design components such as multisine signal design and the ability topseudo-randomize continuous variables on a daily level make this astrong experimental design to obtain idiographic and dynamic com-putational models. In addition, the current experiment included 20adults living anywhere in the US and was carried out using a com-mercial activity tracker. The research team maintained minimal contactwith participants except for troubleshooting.

With that said, there are several limitations to this study. One of thelimitations of the study was the small sample size with mostly femaleand non-Hispanic, white participants, which limits the generalizabilityof the results. It is possible that the types of insights obtained might notbe the same among males. Difficulty recruiting males is a commonproblem for walking interventions [71]. While no noticeable differencesin engagement/dropout were documented between the males and fe-males in this study, future studies should focus on understanding if theintervention effects and insights from idiographic modeling differ basedon gender and the implications it may have on tailoring variable se-lection and personalization. In addition, more targeted recruitmentstrategies may be needed to increase male adoption and participation.

The intensive measurement over 12 weeks (5 cycles) of the inter-vention provided sufficient power for N-of-1 modeling and modelingthe within-person effects from MLM analyses, but not sufficient to in-terpret between-person effects or possible random effects. Anotherlimitation is that, during this trial, we used only their baseline steps as areference. As such, this intervention was not yet a fully adaptive in-tervention. This was by design as our focus was on examining thefeasibility of using system ID to develop individualized dynamicalmodels, not the creation of an adaptive intervention. In subsequentwork, we will make the intervention fully adaptive longitudinally byusing both a model-predictive controller [72] and also the use of amoving referent strategy whereby the prior 2 cycles will be used toguide definition of the next set of targeted step goals.

As mentioned earlier, a hierarchical model building approach wasused for the system ID analyses, which led to numerous models beingestimated for every participant. Since this procedure is exhaustive andcomputationally demanding, we did not parse out the separate effectsof the base inputs for the purpose of these preliminary analyses. Inaddition, only a subset of the measured variables was used. Futureanalyses will build on these insights to study the likely prediction im-provement by incorporating other variables such as weather, daily taskself-efficacy, daily commitment to being active, etc. to the modelspresented in this paper. In addition, the dichotomous variable‘weekday/weekend’ can also be further refined and tested by breakingit down into separate days of the week. The research team developed apenalty based approach that assigned weights based on model char-acteristics that affected the validity and stability of the models. In ad-dition, we assessed model fit estimates as ‘good enough’ for in-dividualization based on existing conventions such as Cohen’s criteria.The team fully acknowledges that these approaches are one of many,and there are other plausible methods that could be applied to make thefinal model selections (e.g., modifying weightage) and determine ro-bustness of model fits. However, the aim of these analyses was to not tomake generalized claims about determinants of PA, but to explore thisunique approach to understanding PA, and highlight differences thatmight exist when using idiographic vs. nomothetic approaches. As such,from a generalization standpoint, our work helps to progress system IDas an approach for managing idiographic issues, even if the insights donot generalize per say.

5.2. Future directions

There remain many methodological questions that need to be betteraddressed before these techniques can be used in mHealth interven-tions. Black-box modeling is an appropriate first step as it is best suited

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

95

Page 15: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

in situations where little is known about the input-output relationships.Insights from these analyses will be incorporated and used in a proce-dure known as semi-physical or grey-box modeling [15,34]. Theseanalyses will build on the SCT model structure [39] and other hy-pothesized feedback loops that can potentially be mathematically spe-cified and tested (e.g., the hypothesized reciprocal relationship betweenself-efficacy and behavior).

A targeted next step of our work is to run a closed loop experiment.Closed loop experiments take into account information about theperson and, thus, enable the systematic testing and optimization of thedecision policies a targeted controller will make. This closed loopsystem ID experiment is an optimization design that fits well, con-ceptually, in the increasingly popular intervention optimization fra-mework, MOST. As such, one final key future area of research is toclearly specify the particular uses of the various emerging optimizationtrials including closed loop system ID experiments, micro-randomiza-tion trials, sequential multiple assignment randomized trials (SMART),and a between-subject factorial trial, as “screening” experiments withinthe MOST framework.

6. Conclusion

This is one of the first studies that used control systems methodsfrom experimental design to analysis for understanding behaviorchange and, in this context, tailoring variable selection. Along withdescribing preliminary results, this paper aimed to describe the processof applying system ID methodology to a behavioral setting, to en-courage other researchers to explore the application of such methods intheir work. Results highlighted the kind of insights that might be missedwhen a nomothetic approach is used for analysis as compared to idio-graphic for selecting tailoring variables. This approach also highlightedthe individual differences and variations that might exist betweenparticipants in relation to the factors that affect their PA. Since controlsystems are designed to work with systems that have time-varyingcharacteristics and with a single-case approach, it provides a feasibleapproach to accommodate personalization in intervention design and,thus, warrants further exploration within the mHealth research com-munity.

Conflict of interest

The authors report no conflicts of interest regarding this manuscript.

Acknowledgements

Support for this work has been provided by the National ScienceFoundation (NSF) through grant IIS-1449751. The opinions expressedin this article are the authors’ own and do not necessarily reflect theviews of NSF.

Appendix A. Supplementary material

Supplementary data associated with this article can be found, in theonline version, at http://dx.doi.org/10.1016/j.jbi.2018.01.010.

References

[1] National Research Council, Toward Precision Medicine: Building a KnowledgeNetwork for Biomedical Research and a New Taxonomy of Disease, NationalAcademies Press, 2011.

[2] L.M. Collins, S.A. Murphy, K.L. Bierman, A conceptual framework for adaptivepreventive interventions, Prev. Sci. 5 (2004) 185–196, http://dx.doi.org/10.1023/B:PREV.0000037641.26017.00.

[3] I. Nahum-Shani, E.B. Hekler, D. Spruijt-metz, Building health behavior models toguide the developmnet of just-in-time adaptive interventions: a pragmatic frame-work, Heal. Psychol. 34 (2015) 1209–1219, http://dx.doi.org/10.1037/hea0000306.

[4] D. Spruijt-Metz, W. Nilsen, Dynamic models of behavior for just-in-time adaptive

interventions, IEEE Pervasive Comput. 3 (2014) 13–17.[5] D. Spruijt-Metz, E. Hekler, N. Saranummi, S. Intille, I. Korhonen, W. Nilsen,

D.E. Rivera, B. Spring, S. Michie, D.A. Asch, A. Sanna, V.T. Salcedo, R. Kukakfa,M. Pavel, Building new computational models to support health behavior changeand maintenance: new opportunities in behavioral research, Transl. Behav. Med. 5(2015) 335–346, http://dx.doi.org/10.1007/s13142-015-0324-1.

[6] P.C.M. Molenaar, A manifesto on psychology as idiographic science: bringing theperson back into scientific psychology, this time forever, Meas. Interdiscip. Res.Perspect. 2 (2004) 201–218, http://dx.doi.org/10.1207/s15366359mea0204_1.

[7] P.C.M. Molenaar, C.G. Campbell, The new person-specific paradigm in psychology,Curr. Dir. Psychol. Sci. 18 (2009) 112–117.

[8] A.M. Beltz, A.G.C. Wright, B.N. Sprague, P.C.M. Molenaar, Bridging the Nomotheticand Idiographic Approaches to the Analysis of Clinical Data, Assessment, 2016,1073191116648209. 10.1177/1073191116648209.

[9] W.T. Riley, D.E. Rivera, A.A. Atienza, W. Nilsen, S.M. Allison, R. Mermelstein,Health behavior models in the age of mobile interventions: are our theories up tothe task? Transl. Behav. Med. 1 (2011) 53–71, http://dx.doi.org/10.1007/s13142-011-0021-7.

[10] G.F. Dunton, A.A. Atienza, The need for time-intensive information in healthfuleating and physical activity research: a timely topic, J. Am. Diet. Assoc. 109 (2009)30–35, http://dx.doi.org/10.1016/j.jada.2008.10.019.

[11] H. Lei, I. Nahum-shani, K.G. Lynch, D. Oslin, S.A. Murphy, A “SMART” design forbuilding individualized treatment sequences, Annu. Rev. Clin. Psychol. 8 (2012)21–48, http://dx.doi.org/10.1146/annurev-clinpsy-032511-143152.

[12] E.B. Hekler, M.P. Buman, N. Poothakandiyil, D.E. Rivera, J.M. Dzierzewski,A. Aiken Morgan, C.S. McCrae, B.L. Roberts, M. Marsiske, P.R. Giacobbi, Exploringbehavioral markers of long-term physical activity maintenance: a case study ofsystem identification modeling within a behavioral intervention, Heal. Educ. Behav.40 (2013) 51S–62S, http://dx.doi.org/10.1177/1090198113496787.

[13] N. Ram, K. Grimm, Using simple and complex growth models to articulate devel-opmental change: matching theory to method, Int. J. Behav. Dev. 31 (2007)303–316, http://dx.doi.org/10.1177/0165025407077751.

[14] T.W. Buford, M.D. Roberts, T.S. Church, Toward exercise as personalized medicine,Sport. Med. 43 (2013) 157–165, http://dx.doi.org/10.1007/s40279-013-0018-0.

[15] L. Ljung, System Identification: Theory for the User, PTR Prentice Hall Informationand System Sciences Series, 1999.

[16] M. Pratt, O.L. Sarmiento, F. Montes, D. Oglivie, B.H. Marcus, L.G. Perez,R.C. Brownsonfor the Lancet Physical Activity Series Working Group, The im-plications of megatrends in information and communication technology andtransportation for changes in global physical activity, Lancet 380 (2012) 282–293,http://dx.doi.org/10.1016/S0140-6736(12)60736-3.

[17] A.H. Mokdad, J.S. Marks, D.F. Stroup, J.L. Gerberding, Actual causes of death in theUnited States, Jama. 291 (2004) (2000) 1238–1245.

[18] A.E. Bauman, R.S. Reis, J.F. Sallis, J.C. Wells, R.J.F. Loos, B.W. Martin, L.P.A.S.W.Group, Correlates of physical activity: why are some people physically active andothers not? Lancet 380 (2012) 258–271, http://dx.doi.org/10.1016/S0140-6736(12)60735-1.

[19] D.E. Conroy, S. Elavsky, A.L. Hyde, S.E. Doerksen, The dynamic nature of physicalactivity intentions: a within-person perspective on intention-behavior coupling, J.Sport Exerc. Psychol. 33 (2011) 807.

[20] N.E. Sherwood, R.W. Jeffery, The behavioral determinants of exercise: implicationsfor physical activity interventions, Annu. Rev. Nutr. 20 (2000) 21–44.

[21] M.F. Hovell, D.R. Wahlgren, M.A. Adams, The logical and emperical basis for theBehavioral Ecological Model, Emerg. Theor. Heal. Promot. Pract. Res., second ed.,2009, pp. 415–450.

[22] R. Gulotta, J. Forlizzi, R. Yang, M.W. Newman, Fostering Engagement with PersonalInformatics Systems, in: Proc. 2016 ACM Conf. Des. Interact. Syst. – DIS’16, 2016,pp. 286–300. 10.1145/2901790.2901803.

[23] S. Halko, J.A. Kientz, Personality and persuasive technology: An exploratory studyon health-promoting mobile applications, Lect. Notes Comput. Sci. (IncludingSubser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics). 6137 LNCS, 2010, pp.150–161, 10.1007/978-3-642-13226-1_16.

[24] H. Oinas-kukkonen, M. Harjumaa, Persuasive systems design: key issues, processmodel, and system features persuasive systems design: key issues, process model,and system features, Commun. Assoc. Inf. Syst. 24 (2009).

[25] S. Michie, C. Abraham, C. Whittington, J. McAteer, S. Gupta, Effective techniques inhealthy eating and physical activity interventions: a meta-regression, HealthPsychol. 28 (2009) 690–701, http://dx.doi.org/10.1037/a0016136.

[26] E.J. Lyons, Z.H. Lewis, B.G. Mayrsohn, J.L. Rowland, Behavior change techniquesimplemented in electronic lifestyle activity monitors: A systematic content analysis,J. Med. Internet Res. 16 (2014) e192, http://dx.doi.org/10.2196/jmir.3469.

[27] D.E. Conroy, C.H. Yang, J.P. Maher, Behavior change techniques in top-rankedmobile apps for physical activity, Am. J. Prev. Med. 46 (2014) 649–652, http://dx.doi.org/10.1016/j.amepre.2014.01.010.

[28] M.A. Adams, J.F. Sallis, G.J. Norman, M.F. Hovell, E.B. Hekler, E. Perata, Anadaptive physical activity intervention for overweight adults: a randomized con-trolled trial, PLoS One. 8 (2013) e82901, http://dx.doi.org/10.1371/journal.pone.0082901.

[29] M.A. Adams, J.C. Hurley, M. Todd, N. Bhuiyan, C.L. Jarrett, W.J. Tucker,K.E. Hollingshead, S.S. Angadi, Adaptive goal setting and financial incentives: a 2× 2 factorial randomized controlled trial to increase adults’ physical activity, BMCPublic Health 17 (2017) 286, http://dx.doi.org/10.1186/s12889-017-4197-8.

[30] J. Poirier, W.L. Bennett, G.J. Jerome, N.G. Shah, M. Lazo, H.-C. Yeh, J.M. Clark,N.K. Cobb, Effectiveness of an activity tracker-and internet-based adaptive walkingprogram for adults: a randomized controlled trial, J. Med. Internet Res. 18 (2016).

[31] D.E. Rivera, M.D. Pew, L.M. Collins, Using engineering control principles to inform

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

96

Page 16: Journal of Biomedical Informaticspotent technologies that shape PA behavior over time [24]. Adaptive interventions are a class of interventions that recognize the need for dynamic

the design of adaptive interventions: a conceptual introduction, Drug AlcoholDepend. 88 (2007) S31–S40, http://dx.doi.org/10.1016/j.drugalcdep.2006.10.020.

[32] J.S. Savage, D.S. Downs, Y. Dong, D.E. Rivera, Control systems engineering foroptimizing a prenatal weight gain intervention to regulate infant birth weight, Am.J. Public Health. 104 (2014) 1247–1254, http://dx.doi.org/10.2105/AJPH.2014.301959.

[33] S. Deshpande, D.E. Rivera, J.W. Younger, N.N. Nandola, A control systems en-gineering approach for adaptive behavioral interventions: illustration with a fi-bromyalgia intervention, Transl. Behav. Med. 4 (2014) 275–289, http://dx.doi.org/10.1007/s13142-014-0282-z.

[34] C.A. Martín, D.E. Rivera, E.B. Hekler, Design of informative identification experi-ments for behavioral interventions, IFAC-PapersOnLine 48 (2015) 1325–1330,http://dx.doi.org/10.1016/j.ifacol.2015.12.315.

[35] L.M. Collins, K.C. Kugler, M.V. Gwadz, Optimization of multicomponent behavioraland biobehavioral interventions for the prevention and treatment of HIV/AIDS,AIDS Behav. 20 (2016) 197–214, http://dx.doi.org/10.1007/s10461-015-1145-4.

[36] L.M. Collins, S.A. Murphy, V.N. Nair, V.J. Strecher, A strategy for optimizing andevaluating behavioral interventions, Ann. Behav. Med. 30 (2005) 65–73.

[37] A. Bandura, Self-efficacy: toward a unifying theory of behavioral change, Psychol.Rev. 84 (1977) 191.

[38] W.T. Riley, C.A. Martín, D.E. Rivera, E.B. Hekler, M.A. Adams, M.P. Buman, M.Pavel, A.C. King, Development of a dynamic computational model of social cog-nitive theory, Transl. Behav. Med., 2015, pp. 1–13, 10.1007/s13142-015-0356-6 (inpress).

[39] C.A. Martín, D.E. Rivera, W.T. Riley, E.B. Hekler, M.P. Buman, M.A. Adams, A.C.King, A dynamical systems model of social cognitive theory, in: IEEE Am. ControlConf., IEEE, 2014: pp. 2407–2412, 10.1109/ACC.2014.6859463.

[40] C.A. Martín, S. Deshpande, E.B. Hekler, D.E. Rivera, A System IdentificationApproach for Improving Behavioral Interventions Based on Social CognitiveTheory, in: Am. Control Conf. (ACC), 2015, 2015, pp. 5878–5883.

[41] E.V. Korinek, S.S. Phatak, C.A. Martin, M.T. Freigoun, D.E. Rivera, M.A. Adams,P. Klasnja, M.P. Buman, E.B. Hekler, Adaptive step goals and rewards: a long-itudinal growth model of daily steps for a smartphone-based walking intervention,J. Behav. Med. (2017), http://dx.doi.org/10.1007/s10865-017-9878-3.

[42] M.T. Freigoun, C.A. Martín, A.B. Magann, D.E. Rivera, S.S. Phatak, E. V Korinek, E.B. Hekler, System identification of Just Walk: A behavioral mhealth intervention forpromoting physical activity, in: 2017 Am. Control Conf., IEEE, Seattle, WA, 2017,pp. 116–121. 10.23919/ACC.2017.7962940.

[43] T. Ferguson, A.V. Rowlands, T. Olds, C. Maher, The validity of consumer-level,activity monitors in healthy adults worn in free-living conditions: a cross-sectionalstudy, Int. J. Behav. Nutr. Phys. Act. 12 (2015) 42, http://dx.doi.org/10.1186/s12966-015-0201-9.

[44] K.R. Evenson, M.M. Goto, R.D. Furberg, Systematic review of the validity and re-liability of consumer-wearable activity trackers, Int. J. Behav. Nutr. Phys. Act. 12(2015) 159, http://dx.doi.org/10.1186/s12966-015-0314-1.

[45] C. Tudor-Locke, C.L. Craig, Y. Aoyagi, R.C. Bell, K.A. Croteau, I. De Bourdeaudhuij,B. Ewald, A.W. Gardner, Y. Hatano, L.D. Lutes, S.M. Matsudo, F.A. Ramirez-Marrero, L.Q. Rogers, D.A. Rowe, M.D. Schmidt, M.A. Tully, S.N. Blair, How manysteps/day are enough? For older adults and special populations, Int. J. Behav. Nutr.Phys. Act. 8 (2011) 80, http://dx.doi.org/10.1186/1479-5868-8-80.

[46] C. Tudor-Locke, T.L. Hart, T.L. Washington, Expected values for pedometer-de-termined physical activity in older populations, Int. J. Behav. Nutr. Phys. Act. 6(2009) 59, http://dx.doi.org/10.1080/02701367.2009.10599550.

[47] E.M. Murtagh, M.H. Murphy, J. Boone-Heinonen, Walking: the first steps in car-diovascular disease prevention, Curr. Opin. Cardiol. 25 (2010) 490–496, http://dx.doi.org/10.1097/HCO.0b013e32833ce972.

[48] CDC, Adults Need More Physical Activity, 2016,<https://www.cdc.gov/physicalactivity/inactivity-among-adults-50plus/index.html > .

[49] C.L. Craig, A.L. Marshall, M. Sjöström, A.E. Bauman, M.L. Booth, B.E. Ainsworth,M. Pratt, U. Ekelund, A. Yngve, J.F. Sallis, P. Oja, International physical activityquestionnaire: 12-Country reliability and validity, Med. Sci. Sports Exerc. 35 (2003)1381–1395, http://dx.doi.org/10.1249/01.MSS.0000078924.61453.FB.

[50] M. Braun, D. Rivera, A. Stenman, W. Foslien, C. Hrenya, Multi-level Pseudo-Random Signal Design and “Model-on-Demand” Estimation Applied to Nonlinear

Identification of a RTP Wafer Reactor, Am. Control Conf. San Diego, Calif., 1999,pp. 1573–1577, 10.1109/ACC.1999.786090.

[51] C.A. Martín, D.E. Rivera, E.B. Hekler, An identification test monitoring procedurefor MIMO systems based on statistical uncertainty estimation, in: 2015 54th IEEEConf. Decis. Control, 2015, pp. 2719–2724.

[52] A.C. King, E.B. Hekler, L.A. Grieco, S.J. Winter, J.L. Sheats, M.P. Buman,B. Banerjee, T.N. Robinson, J. Cirimele, Harnessing different motivational framesvia mobile phones to promote daily physical activity and reduce sedentary behaviorin aging adults, PLoS One 8 (2013) 2–9, http://dx.doi.org/10.1371/journal.pone.0062613.

[53] E.B. Hekler, M.P. Buman, D. Ahn, G. Dunton, A.A. Atienza, A.C. King, Are dailyfluctuations in perceived environment associated with walking? Psychol. Health. 27(2012) 1009–1020.

[54] R.C. Littell, G.A. Milliken, W.W. Stroup, R.D. Wonfinger, O. Schabenberger, SASSystem for Mixed Models, SAS Institute. Inc., Cary, NC, 1996.

[55] S.W. Raudenbush, A.S. Bryk, Hierarchical Linear Models: Applications and DataAnalysis Methods, vol. 1, Sage, 2002.

[56] J.D. Singer, J.B. Willett, Applied Longitudinal Data Analysis: Modeling Change andEvent Occurrence, Oxford University Press, 2003.

[57] T.A. Snijders, R.J. Bosker, Multilevel Analysis: An Introduction to Basic andAdvanced Multilevel Modeling, SAGE, 2011.

[58] G.E.P. Box, G.M. Jenkins, G.C. Reinsel, G.M. Ljung, Time Series Analysis:Forecasting and Control, John Wiley & Sons, 2015.

[59] C.F. Dormann, J. Elith, S. Bacher, C. Buchmann, G. Carl, G. Carré, J.R.G. Marquéz,B. Gruber, B. Lafourcade, P.J. Leitão, T. Münkemüller, C. Mcclean, P.E. Osborne,B. Reineking, B. Schröder, A.K. Skidmore, D. Zurell, S. Lautenbach, Collinearity: areview of methods to deal with it and a simulation study evaluating their perfor-mance, Ecography (Cop.) 36 (2013) 027–046, http://dx.doi.org/10.1111/j.1600-0587.2012.07348.x.

[60] L. Bao, S. Intille, Activity Recognition from User-annotated Acceleration Data,Pervasive Comput. Springer, 2004, pp. 1–17.

[61] J. Cohen, A power primer, Psychol. Bull. 112 (1992) 155–159, http://dx.doi.org/10.1037/0033-2909.112.1.155.

[62] G.E.P. Box, J.F. MacGregor, The analysis of closed-loop dynamic-stochastic systems,Technometrics 16 (1974) 391–398.

[63] E. Pariser, The filter bubble: How the New Personalized web is Changing What WeRead and How We Think, Penguin, 2011.

[64] R.S. Sutton, A.G. Barto, Reinforcement Learning: An introduction, MIT Press,Cambridge, 1998.

[65] R. Karkar, J. Zia, R. Vilardaga, S.R. Mishra, J. Fogarty, S.A. Munson, J.A. Kientz, Aframework for self-experimentation in personalized health, J. Am. Med. Inform.Assoc. (2015) 1–9, http://dx.doi.org/10.1093/jamia/ocv150.

[66] J. Lee, E. Walker, Understanding Users ’ Creation of Behavior Change Plans withTheory-Based Support, 2015, pp. 2301–2306. 10.1145/2702613.2732870.

[67] R. Kravitz, N. Duan (Eds.), Design and Implementation of N-of-1 Trials: A User’sGuide, AHRQ Publication No. 13(14)-EHC122-EF. Agency for Healthcare Researchand Quality, Rockville, MD, n.d. < http://www.effectivehealthcare.ahrq.gov/N-%0A1-Trials.cfm> .

[68] E.B. Hekler, P. Klasnja, V. Traver, M. Hendriks, Realizing Behavioral Managementof Health, IEEE Pulse., 2013, pp. 29–32.

[69] K. Patrick, E.B. Hekler, D. Estrin, D.C. Mohr, H. Riper, D. Crane, J. Godino,W.T. Riley, The pace of technologic change: implications for digital health behaviorintervention research, Am. J. Prev. Med. 51 (2016) 816–824, http://dx.doi.org/10.1016/j.amepre.2016.05.001.

[70] E.B. Hekler, S. Michie, M. Pavel, D.E. Rivera, L.M. Collins, H.B. Jimison, C. Garnett,S. Parral, D. Spruijt-Metz, Advancing models and theories for digital behaviorchange interventions, Am. J. Prev. Med. 51 (2016) 825–832.

[71] C.E. Foster, G. Brennan, A. Matthews, C. McAdam, C. Fitzsimons, N. Mutrie,Recruiting participants to walking intervention studies: a systematic review, Int. J.Behav. Nutr. Phys. Act. 8 (2011) 137, http://dx.doi.org/10.1186/1479-5868-8-137.

[72] C.A. Martin, D.E. Rivera, E.B. Hekler, A decision framework for an adaptive be-havioral intervention for physical activity using hybrid model predictive control,2016 Am. Control Conf., 2016, pp. 3576–3581. 10.1109/ACC.2016.7525468.

S.S. Phatak et al. Journal of Biomedical Informatics 79 (2018) 82–97

97