skip to main content
research-article
Open access

A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data

Published: 25 January 2023 Publication History

Abstract

Wearables are an important source of big data, as they provide real-time high-resolution data logs of health indicators of individuals. Higher-order associations between pairs of variables is common in wearables data. Representing higher-order association curves as piecewise linear segments in a regression model makes them more interpretable. However, existing methods for identifying the change points for segmented modeling either overfit or have low external validity for wearables data containing repeated measures. Therefore, we propose a human-in-the-loop method for segmented modeling of higher-order pairwise associations between variables in wearables data. Our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate change point estimates for a segmented mixed-effects model, and thereafter employs Brent's constrained optimization procedure to fine-tune the manually provided estimates. We validate our method using three real-world wearables datasets. Our method not only outperforms state-of-the-art modeling methods in terms of prediction performance but also provides more interpretable results. Our study contributes to health data science in terms of developing a new method for interpretable modeling of wearables data. Our analysis uncovers interesting insights on higher-order associations for health researchers.

1 Introduction

Wearables are smart electronic devices that bring the ability to continuously record, monitor, and analyze key health indicators of individuals as they go about their lives. The global market share for wearables is forecasted to be $33 billion by 2025, with an average growth rate of 15% each year [1]. Wearable devices have been instrumental in transforming personalized medicine and individual health monitoring practices. The number of wearables-based research has been steadily increasing each year—a trend that is expected to continue in the future [2]. Wearables are primarily wrist-worn sensors that continuously track elementary wellness indicators such as sleep, heart rate, and activity; however, newer types of wearables, including implantables, chest-worn sensors, head-mounted displays, smart jewelry, and smart clothing, could be capable of capturing a lot more health indicators such as physiological stress, respiration rate, blood alcohol, blood sugar, and other vitals in a noninvasive way.
Many studies employ wearables data for predictive modeling such as fall detection or predicting 30-day readmissions [35], yet there is growing interest in conducting natural and quasi-natural experiments to study human health and its relationship with externalities [2, 6, 7]. Health indicators from wearables can be combined with other data sources such as electronic health records and social media to better understand dynamics between human health and society [810]. Wearables are capable of continuously monitoring and recording health indicators at fine granularity for multiple participants simultaneously. However, the resulting repeated measures data poses data quality and analytical challenges such as missing values, high dimensionality, lagged effects, and clustered errors [11, 12]. Existing glass-box models such as mixed-effects regression are often insufficient for accurately representing higher-order associations between variables in wearables data [13]. For example, including a second-order term in the mixed-effects regression model can show that a second-order effect of an input is statistically significant, but it is difficult to explain the functional nature of the second-order association from the quadratic expression alone. In other words, it would be interesting to learn the extrema (i.e., maxima/minima) of the second-order effect and how the outcome varies for unit change in the input around such a point. Furthermore, establishing such a functional relationship using visualizations alone can be challenging for wearables data with repeated measures per user and heterogeneity across users. Therefore, there is a need for novel interpretable methods for explaining higher-order associative patterns in wearables data.
Segmented regression, also known as broken-stick or piecewise regression, is an explanatory modeling approach where input variable(s) of interest are partitioned or segmented into intervals followed by fitting straight lines for each interval in the regression model. It is commonly used in design science research applications with a hypothesized curvilinear relationship in simple regression models [14, 15]. The most critical challenge in fitting a segmented model is the determination of input value(s) at which segments need to be separated, a problem also known as breakpoint or change point determination. Studies that use segmented regression with mixed effects [16] determine change points using either ad hoc or black-box procedures [13, 17]. These methods either tend to have low external validity or overfit and are therefore unsuitable for making inferences. To overcome these shortcomings, we propose a new method for determining change points for input segments in a mixed-effects regression. Our method is based on the human-in-the-loop (HITL) paradigm as it uses human inputs during the model training process. Our method repurposes the smooth functions generated by a generalized additive mixed model (GAMM) to allow the analyst to set initial estimates of the change points through visual inspection. Following which, a fast robust root-finding algorithm called Brent's method is used to precisely locate each change point iteratively by maximizing the mixed-effects model's Akaike information criteria [18]. In this way, our method takes advantage of human inputs to fit segmented models that can accurately represent higher-order associations between variables in wearables data. We evaluate our method on three real-world datasets. Our method consistently outperforms existing approaches for all three datasets. The models developed using our method enable clear interpretations of four higher-order associations across the datasets. Inferences from our analysis have multiple managerial implications.

2 Background and Related Work

Sensor-based content is among the key characteristics of third-generation business intelligence and analytics applications [19]. Wearables offer the unique opportunity to observe and physiological changes in individuals through measurement of activity, heart rate, body temperature, and health indicators. Different types of wearables are available for commercial use such as implantables, head-mounted displays, smart jewelry, smartwatches, fitness trackers, and smart clothing. Out of these, the chest-worn and wrist-worn fitness trackers are most widely used in research applications [2022]. During the early adoption phase of wearable technology, research was primarily directed toward sensor development and architecture [23, 24]. With more and more commercial products being introduced into the market today, the research focus is shifting on wearable data analytics (WDA) and associated design science research applications [2527].

2.1 Wearable Data Analytics

WDA is the discovery, interpretation, and communication of meaningful patterns from large volumes of data generated by wearable devices [12]. WDA applications can be broadly classified into interpretable modeling and predictive modeling applications. Predictive modeling is the process of learning from existing data to effectively predict future unknown outcomes [28]. In the clinical setting, health vital sign parameters such as electrocardiogram, oxygen saturation, heart rate, respiratory rate, and blood pressure are used to provide preemptive care for patients with cardiovascular diseases, renal diseases, neurological disorders, and cerebrovascular disorders [20]. In the nonclinical setting, wearables-enabled predictive modeling has been employed in problem domains including ambient assisted living [5], human activity recognition [29], reality mining [30], and sports medicine [31]. Data mining, machine learning, and deep learning are the most common approaches used in predictive modeling applications of WDA [21, 25].
Interpretable or explanatory modeling is the process of developing a mathematical representation of patterns in the data for explaining a hypothesized phenomenon [32]. Typically, controlled experiment-based and observational studies employing wearables for data collection use interpretable models to explain their phenomena of interest. Wearables-enabled interpretable modeling has been employed in problem domains including lifestyle modeling [27, 33, 34], environment-wellbeing modeling [13, 35, 36], and psycho-physiological stress modeling [37, 38]. Mixed-effects regression is the most common glass-box model used for interpretable modeling in WDA [27, 35, 36]. In mixed-effects regression, fixed effects or the global coefficients represent the overall effects of inputs on outcomes, and the random effects or varying effects represent how these effects differ across individuals [39].
Few studies employing wearables data report linear pairwise associations [34, 37]; however, many studies observe higher-order associations between variables. For example, Föhr et al. [37] identify a linear association between subjective stress and physical stress measured using heart rate monitors among overweight office workers, and Li et al. [34] find a linear association between the number of steps measured using an activity watch and resting heart rate measured using a heart rate monitor for healthy adults. In addition, Pimentel et al. [40] find associations between two different measures of physical stress (pNN50 and SDNN) to be significant only in a limited range for surgeons in a hospital. Cropley et al. [41] address the challenge of modeling the nonlinear association between work-related rumination and heart rate variability (HRV) by dividing the rumination score into low and high categories, whereas Kraus et al. [13] use domain knowledge to set a change point for segmented modeling. [42] measured heat strain using a heat exposure monitor and reported an inverted U-shaped association curve between heat strain and outdoor temperature for workers. Wearables-based studies such as the ones above either hypothesize or empirically observe higher-order associations. Compared to naïve approaches such as discretization of variables [41] which may lead to loss of information or just reporting first-order and second-order effects in regression models [42], characterizing the higher-order association using segmented modeling leads to better interpretable results.

2.2 Segmented Modeling

Higher-order relationships between inputs and outcomes are common in information systems (IS) [14, 15]. Polynomial regression models account for higher-order relationships, but they are not directly interpretable [43]. In other words, first-order effects and second-order effects cannot be used to quantify the unit change in outcome due to a unit increase in input as in the case of a regression model with only first-order effects. Segmented or piecewise regression is a preferred approach for modeling higher-order relationships, as it is easier to interpret. The primary challenge in using a segmented regression approach is the determination of change points linking the input segments. Change point determination has been studied in different problem contexts including detecting structural change in continuous values of parameters [44], interruption of time series [45], and characterizing higher-order functional relationships [4648]. Common procedures to determine change points include visual inspection of pairwise plots [48], incorporating domain inputs, greedy search [46], and likelihood-based estimation [47]. Fewer procedures exist for change point determination in mixed-effects models as the likelihood function of multilevel models are not directly differentiable, thus making greedy search and likelihood-based estimation more difficult [17, 46]. A maximum-likelihood (ML) estimation of a continuous functional approximation of the piecewise linear function has been proposed as a more robust alternative to subjective/ad hoc assignment of change points based on visualization of pairwise association plots [17]. However, this method estimates multiple change points automatically with no scope for user inputs into the estimation process. For example, manual intervention such as dropping change points at the extremities of an input distribution could avoid overfitting as well as improve interpretability. Moreover, an automated method may fail to execute if there are too many local extrema or due to high sensitivity toward outliers. To summarize, existing procedures for determining change points in studies employing segmented models are either ad hoc or analytically complex, leading to problems such as low external validity, overfitting, or failure in program execution. There is a need for a segmented mixed-effects modeling method that is robust, efficient, and transparent. Such a method can be helpful to better explain higher-order pairwise associations in wearables data.

2.3 HITL Analytics Methods

HITL analytics methods are geared toward enhancing algorithm performance by incorporating human knowledge and inputs into the modeling and program execution process. HITL is an extensive area of research that covers the intersection of computer science, cognitive science, and psychology [49, 50]. HITL can be performed at different stages of an analytics system, from data preprocessing and modeling to system implementation. Human-machine hybrid models have demonstrated superior performance in natural language and computer vision applications [50] and are being actively considered in general analytics applications [49]. For example, one study [51] shows how initial knowledge inputs from domain experts improve downstream performance of automated machine learning systems. Another effort [52] presents an iterative experimentation framework in which users repeatedly make changes to the ML workflow to improve performance. Similarly, HITL also finds applications in model design, training, testing, and model optimization stages and is applicable to health analytics research involving subjective expertise and a higher need for transparency [53, 54]. For instance, one study [55] incorporates physician inputs toward model parameterization for patient-specific IV fluid recommendation in sepsis treatment. An HITL approach is suitable for analyzing wearable data as human expert inputs and observation can spot technical and logical errors in the analysis at an early stage and avoid rework or erroneous conclusions. Although our study does not attempt to specifically contribute to HITL methodology literature, our method is one of the first few attempts to use an HITL approach for accurately determining change points in mixed-effects modeling in the context of wearables data analytics.

3 Our Method

We propose a new method for capturing higher-order associative patterns in wearables data using segmented mixed-effects modeling. Our method uses an HITL approach to determine the change points in the segmented model by combining an algorithmic search process with human inputs for fine-tuning. In other words, our method uses the smooth function estimated by a GAMM to allow the analyst to annotate the change point estimates followed by fine-tuning of the estimates using a constrained optimization procedure. Our method is novel since few studies have used an HITL approach to tune change point parameters for segmented modeling. It also provides a robust mechanism to capture and explain higher-order relationships hypothesized in wearables data. In the rest of this section, we explain our method in detail.
Consider a mixed-effects model commonly used for explaining repeated measures in data from wearables as follows:
\begin{equation} {y}_{ij} = {\beta }_0 + {\gamma }_{0j} + \mathop \sum \limits_{k = 1}^K {\beta }_k{x}_{kij} + \mathop \sum \limits_{m = 1}^M {\gamma }_{mj}{z}_{mij} + {\epsilon }_{ij} \end{equation}
(1)
Equation (1) is a representation of generalized linear mixed models with a linear link function, but any link function is applicable to our method. \({y}_{ij}\) is value of a given health outcome for the \({i}\hbox{th}\) observation and \({j}\hbox{th}\) individual, \({\beta }_0\) is the fixed intercept, \({\rm{{\rm B}}} = \{ {{\beta }_1, \ldots ,{\rm{\ }}{\beta }_K} \}\) are coefficients for K fixed effects \(\{ {{x}_1,{\rm{\ }}{x}_2, \ldots ,{\rm{\ }}{x}_K} \}\), \({{\rm{\Gamma }}}_0 = \{ {{\gamma }_{01},{\gamma }_{02}, \ldots ,{\gamma }_{0j}, \ldots ,{\gamma }_{0J}} \}\) are J random intercepts for each individual, \({\rm{\Gamma \ }} = \{ {{\gamma }_{11}, \ldots ,{\gamma }_{1j}, \ldots ,{\gamma }_{MJ}} \}{\rm{\ }}\) are coefficients for \(M{\rm{\ }}x{\rm{\ }}J\) random effects \(\{ {{z}_1,{\rm{\ }}{z}_2, \ldots ,{\rm{\ }}{z}_M} \}\), and \({\epsilon }_{ij}\) is the residual error.
Suppose there exists an input \({x}_r\) such that its second-order (or higher-order) effects are significant, then Equation (1) can be represented as follows:
\begin{equation} {y}_{ij} = {\beta }_0 + {\gamma }_{0j} + \mathop \sum \limits_{\substack{k = 1\\ k \ne r}}^K {\beta }_k{x}_{kij} + {\rm{\ }}{\beta }_{{r}_1}{x}_{rij}{\rm{\ }} + \cdots + {\rm{\ }}{\beta }_{{r}_t}x_{rij}^t + \mathop \sum \limits_{m = 1}^M {\gamma }_{mj}{z}_{mij} + {\epsilon }_{ij} \end{equation}
(2)
In Equation (2), \({x}_{rij}\) denotes the value of input variable \({x}_r\) for the \({i}\hbox{th}\) observation and \({j}\hbox{th}\) individual.
As described earlier, segmented representations of higher-order effects are more interpretable than polynomials. The input variable \({x}_r\) can be represented as the sum of segments as follows:
\begin{equation} {x}_r = {x}_r\cdot I({x}_r < {\eta }_1) + {\rm{\ }}{x}_r\cdot I({\eta }_1 \le {\rm{\ }}{x}_r < {\eta }_2) + \cdots + {\rm{\ }}{x}_r\cdot I({\eta }_P \le {\rm{\ }}{x}_r) \end{equation}
(3)
In Equation (3), \({\rm{{\rm H}}} = \{ {{\eta }_1,{\eta }_2, \ldots ,{\rm{\ }}{\eta }_P} \}{\rm{\ }}\) is a set of P change points defined for the input variable\({\rm{\ }}{x}_r\), which is broken into P segments \(\{ x_r^{( 1 )} = \ {x}_r\cdot I( {{x}_r < {\eta }_1} ), x_r^{( 2 )} = {x}_r\cdot I( {\eta }_1 \le {\rm{\ }}{x}_r < {\eta }_2 ), \ldots ,{\rm{\ }}x_r^{( P )} = {x}_r\cdot I( {{\eta }_P \le {\rm{\ }}{x}_r} ) \}\). \(I( \varphi )\) is an indicator function equal to 1 if condition \(\varphi\) is true; otherwise, it is 0. Therefore, the scalar product \({x}_r\cdot I( \varphi )\) has value equal to \({x}_r\) when \(\varphi\) is true and is 0 otherwise. The next logical step is to estimate number of change points and their positions.
We propose an HITL method to estimate the change points \({\eta }_{p{\rm{\ }}} \in H,\ P = | H |\) as follows. As the first step, we fit a GAMM [56] with a given input \({x}_r{\rm{\ }}\) as a nonparametric spline as shown next:
\begin{equation} {y}_{ij} = {\beta }_0 + {\gamma }_{0j} + \mathop \sum \limits_{\substack{k = 1\\ k \ne r}}^K {\beta }_k{x}_{kij} + {\rm{\ }}{f}_r{\rm{(}}{x}_{rij}{\rm{)\ }} + \mathop \sum \limits_{m = 1}^M {\gamma }_{mj}{z}_{mij} + {\epsilon }_{ij} \end{equation}
(4)
Although a nonparametric spline can be included for the corresponding random effect of input \({x}_r\), it is computationally more expensive for fitting the corresponding semiparametric model. We empirically tested on multiple datasets and observed the shape of the component smooth function \({f}_r( {{x}_{rij}} )\) to not be sensitive to random effects as smooth functions. Therefore, we consider the smooth function only in the fixed effects. In the next step, we visualize the plot of the smooth function in Equation (4) approximated as a B-spline [57]. Here, a human input is required to identify the order of the curve by inspecting the number of extrema (i.e., minima and maxima) to set the value of P. The setting of P can be based upon visual inspection as well as prior domain knowledge. For example, in Figure 1, the P values for the different scenarios in (a) through (d) are chosen as 1, 2, 1, and \(5,\) respectively. This step also determines whether to opt for a segmented model over a linear model, by inspecting the curvilinear nature of the component smooth function. For instance, although we set P = 1 for scenario illustrated in Figure 1(c), an analyst may also approximate the monotonically increasing curve as a linear function in this case, thus favoring simplicity over slightly better model fit.
Fig. 1.
Fig. 1. Smooth function plots generated using GAMM for Sound-SDNN (a), Activity-SDNN (b), ST2-RR (c), and z-TAC (d).
The value of the maxima and minima are used as starting points in a linear search algorithm in the third step. The third step involves iteratively performing search for change points using Brent's method [58], a linear optimization with box constraints. Brent's method is a hybrid root-finding algorithm combining the bisection method, the secant method, and the inverse quadratic interpolation that make it robust and highly efficient while incorporating box constraints (i.e., range of permitted values) [59]. For each iteration of the optimizer, a mixed model shown with segmented inputs for \({x}_r\) is fit for a particular change point parameter. The algorithm returns the change point parameter corresponding to the model with minimum Akaike information criteria. Since Brent's method is a single parameter estimator, we identify the change points sequentially by repeating the search algorithm for each subsequent change point after fixing values of previously determined change points. Finally, the segmented mixed-effects model is fit as shown next:
\begin{equation} {y}_{ij} = {\beta }_0 + {\gamma }_{0j} + \mathop \sum \limits_{{\rm{\ }}s{\rm{\ }} \in S} \beta _r^{\left( s \right)}{x}_{rij}\cdot I({x}_{rij} \in s{\rm{\ }}) + \mathop \sum \limits_{k = 1,{\rm{\ }}k{\rm{\ }} \ne r{\rm{\ }}}^K {\beta }_k{x}_{kij} + {\rm{\ }}\mathop \sum \limits_{m = 1}^M {\gamma }_{mj}{z}_{mij} + {\epsilon }_{ij} \end{equation}
(5)
In Equation (5), \(S{\rm{\ }}\) is a set of segments constructed using change points \({\rm{{\rm H}}}\) identified for input \({x}_r\). Significance of the effect of input variable, \({x}_r\), at each segment, s, can be determined by inspecting the corresponding fixed-effects coefficient,\({\rm{\ }}\beta _r^{( s )}\), under regular conditions. The algorithm for our method is shown in Table 1.
Table 1.
Input: Mixed-effects model with significant higher-order coefficients for input variable, \({x}_r\)
1:Fit a GAMM with input \({x}_r\) represented as a nonparametric spline (Equation (1)).
2:Inspect the component smooth function plot \({f}_r( . )\) to identify number of change points \(P = | H |\), starting points \(\eta _p^0\) and box constraints \(\{ {\eta _p^{min},\eta _p^{max}} \}\) for corresponding change points \({\eta }_p \in H\).
3:For \(p\ = 1\), compute \({\eta }_1\) as follows:
\(\mathop {{\rm{argmin}}}\nolimits_{\eta _1^{\min } < {\eta }_1 < \eta _1^{\max }} AIC( {M( {{\eta }_1} )} )\)
\(M( {{\eta }_1} ):{\rm{\ }}{y}_{ij} = {\beta }_0 + {\gamma }_{0j} + {\rm{\beta }}_{\rm{r}}^{( 1 )}{x}_r\cdot I( {{x}_r < {\eta }_1} ) + {\rm{\beta }}{{\rm{'}}}_{\rm{r}}x{{\rm{'}}}_r\cdot I( {{\eta }_1 \ge {\rm{\ }}{x}_r} ) + \mathop \sum \nolimits_{k = 1,{\rm{\ }}k{\rm{\ }} \ne r{\rm{\ }}}^K {\beta }_k{x}_{kij}\)
\(+ {\rm{\ }}\mathop \sum \nolimits_{m = 1}^M {\gamma }_{mj}{z}_{mij} + {\epsilon }_{ij}\)
4:For \(1 < p \le P\), fix values of \(\{ {{\eta }_1,\ \ldots ,{\eta }_{p - 1}} \}\) in model to compute \({\eta }_p\) as follows:
\(\mathop {{\rm{argmin}}}\nolimits_{\eta _p^{\min } < {\eta }_p < \eta _p^{\max }} AIC( {M( {{\eta }_p} )} )\)
\(M( {{\eta }_p} ):\ {y}_{ij} = {\beta }_0 + {\gamma }_{0j} + {\rm{\beta }}_{\rm{r}}^{( 1 )}{x}_r\cdot I( {{x}_r < {\eta }_1} ) + \cdots + {\rm{\ \beta }}_{\rm{r}}^{( p )}{x}_r\cdot I( {{\eta }_{p - 1} \le {\rm{\ }}{x}_r < {\eta }_p} ) + {\rm{\beta }}{{\rm{'}}}_{\rm{r}}x{'}_r\cdot I( {{\eta }_p \ge {\rm{\ }}{x}_r} ) + \mathop \sum \nolimits_{k = 1,{\rm{\ }}k{\rm{\ }} \ne r{\rm{\ }}}^K {\beta }_k{x}_{kij} + {\rm{\ }}\mathop \sum \nolimits_{m = 1}^M {\gamma }_{mj}{z}_{mij} + {\epsilon }_{ij}\)
Output: Segmented model with change points H as shown in Equation (5).
Table 1. HITL Method for Segmented Mixed-Effects Modeling

4 Experimental Setup

To demonstrate the utility and effectiveness of our method, we apply it to model high-order pairwise relationships in four different applications across three real-world wearable datasets: WellbuiltforWellbeing, HospitalMonitoring, and BeerCrawl. We describe each of the datasets in the next section followed by analysis and findings.

4.1 Data

4.1.1 WellbuiltforWellbeing.

The Wellbuilt for Wellbeing (WB2) project [60] was a 16-month multiphase field study funded by the U.S. General Services Administration to better understand the influence of the office environment on human health, comfort, and performance. In the study, self-described healthy adult workers involved in a variety of office-based roles for the U.S. government were recruited across four federal office buildings across the country. Participants wore two sensors for 3 days while carrying out their day-to-day activities, a heart and physical activity monitor, and a personal environment quality sensor-based device. The study also included experience sampling mobile surveys to collect individuals’ perceived psychological responses at periodic intervals of 1 to 2 hours. Post processing, the dataset contained around 3,000 hours of wearables data with a wide range of variables from 231 participants. More details about the field study setup and variables can be found in the work of Lindberg et al. [6].
For our study, we analyze two pairwise associations from this dataset: that of ambient sound level and HRV, and that of instantaneous activity and HRV. HRV is the variability between heart beats and is considered as a proxy measure for the physiological wellbeing of a person—that is, the higher the variability, the higher the physical wellbeing [61]. Among different HRV measures, the mean of standard deviation for all successive R-R intervals (SDNN) measured in milliseconds reflects the overall activity in the autonomous nervous system and is widely used as an indicator of better health and wellbeing [62]. Physical activity levels were assessed in g (i.e., 1 unit of gravitational force) from a triaxial accelerometer sensor and sound levels were measured in dBA (i.e., decibel weighted according to human ear hearing) using separate neck-worn sensors. The raw data from multiple wearables were aggregated at 5-minute intervals to be integrated with the heart rate monitor sensor used for computing SDNN [63]. We consider appropriate covariates in our models including person-level fixed effects (i.e., age, gender, BMI, worktype), time of the day, and day of the week after closely examining all variables collected in the Wellbuilt for Wellbeing project.

4.1.2 HospitalMonitoring.

The HospitalMonitoring dataset contains vital signs data recorded from patients undergoing anesthesia at the Royal Adelaide Hospital [64]. It is publicly accessible from the University of Queensland website.1 Data was collected for 32 cases using multiple wearables and stationary sensors including the electrocardiograph, pulse oximeter, capnograph, noninvasive arterial blood pressure monitor, airway flow, pressure monitor, Y-piece spirometer, electroencephalogram (EMG) monitor, and arterial blood pressure monitor [64]. The processed data is aggregated at 1-second intervals and has 51 variables out of which we select a meaningful subset for our analysis.
For our study, we analyze the pairwise association between the ST segment index and airway respiratory rate. In electrocardiography, the ST segment connects the QRS complex and the T wave and its depression or elevation is related to acute cardiovascular conditions including myocardial ischemia, infarction, and arrhythmia [65]. The human respiratory rate is measured by counting the number of breaths per minute, with typical values ranging from 12 to 16 for a healthy adult. Respiration rate has been related to abnormalities in oxygen saturation, aging, and cardiovascular diseases and has been widely adopted as part of early warning systems [66]. In our model, we consider heart rate, oxygen saturation, and perfusion index as covariates after examining collinearity and cross correlations among all features and potential confounding effects.

4.1.3 BeerCrawl.

The BeerCrawl dataset contains blood alcohol content and movement information recorded in a field study by Killian et al. [4]. It is publicly accessible from the University of California Irvine (UCI) dataset repository.2 Transdermal alcohol content (TAC) was measured using an ankle bracelet wearable and the movement data was captured using raw accelerometer readings from mobile phones for 20 students participating in an annual college bar crawl event. The TAC data was sampled every 30 minutes, whereas accelerator readings were available at a more granular level leading to more than 30M samples across participants. TAC has been shown to be a more reliable indicator of sustained alcohol use as compared to self-reporting [67].
For our study, we analyze the pairwise association between the raw z-axis readings from the tri-axial accelerometer and TAC values in the dataset. Prior studies have proposed several features using accelerometer readings that are related to a person's gait, activities, and wellbeing, with z-axis values contributing significantly to their variability [4, 68]. The units of TAC and accelerometer are g/dL and m/s2, respectively. In our model, we consider the x- and y-axis coordinate values as covariates. The accelerometer is mapped to the TAC monitor readings at the minute level.
Table 2 shows a summary of the datasets used in our study. The datasets and respective input-outcome associations were chosen to demonstrate variety of scenarios with higher-order pairwise associations across different problem domains.
Table 2.
DatasetSize (Rows; Columns)AccessibilityOutcomeInputInput Statistics (Mean, SD, Min, Max)
WellbuiltforWellbeing [60](31,557; 7)Not publicSDNN (HRV)Sound (sound level in dBA)(50.24, 8.69,
0.00, 87.80)
    Activity (activity level in g)(0.1738, 0.3187,
0.0000, 3.0000)
HospitalMonitoring [64](83,861; 4)PublicRR (respiration rate)ST2 (ST segment index)(0.0635, 0.3969,
–1.1000, 1.3000)
BeerCrawl [4](8,273; 3)PublicTAC (blood alcohol level)z (z-axis of accelerometer)(0.0503, 0.1840,
–0.6705, 0.6676)
Table 2. Summary of Real-World Wearables Datasets Used in Our Study

4.2 Data Preprocessing and Model Assumptions

Training and test samples were partitioned in a 75:25 ratio for performance evaluation. A variance component structure for the covariance matrix of the random effects coefficients is assumed in the mixed-effects regression models. The input variable of interest was included as a fixed effect as well as random effect in the model. We compared models having uncorrelated residual errors with counterparts with autoregressive error residuals and observed that the model fit did not improve significantly after controlling for temporal correlations. We also compared different error distributions and found the normal distribution to be most suitable. Hence, our model is represented as Equation (1) without any special link function or autocorrelation terms.

4.3 Results

We fit mixed-effects models using the three datasets for explaining the following four input-outcome associations: Sound-SDNN, Activity-SDNN, ST2-RR, and z-TAC. The corresponding component smooth functions for the partial effects from GAMM are shown in Figure 1. The smooth functions for Sound-SDNN and Activity-SDNN approximate second-order and third-order polynomial curves, whereas the smooth functions of ST2-RR and z-TAC indicate a higher-order curve.
As part of the HITL step, we made the following model interventions. Figure 1(a) shows a maxima around the 50 to 60 dBA range, whereas Figure 1(b) shows a maximal range between 0.2 and 1 followed by a decrease in the smooth function until it reaches a minima at around 2.5, after which the function again turns upward. The extrema in Figure 1(c) are more subtle with the confidence interval being most conservative around ST2 = 0.0. Figure 1(d) clearly depicts three maxima and two minima within the main range, whereas a minima exists around z = 0.6, which may be ignored due its extreme right position in the input distribution. Based on these observations of the smooth functions, we chose \(P\ = \ 1\), \(P\ = \ 2\), \(P\ = \ 1\), and \(P\ = \ 5\) as number of change points for associations Sound-SDNN, Activity-SDNN, ST2-RR, and z-TAC, respectively. Through visual inspection, we set initial change point estimates as well as box constraints for each of the pairwise associations.
We compared the prediction performance of mixed-effects models fitted using our method with the following benchmarks: (i) model with inputs as first-order effects (i.e., linear), (ii) model with inputs as first-order and second-order effects (i.e., curvilinear), (iii) segmented inputs using change points identified by the ML method [17], and (iv) segmented inputs using visually identified change points (i.e., visual). The fixed-effects model was used as a baseline, representing the case when only fixed effects are considered in the mixed-effects regression. The model with linear inputs (i.e., first-order effects only) is a benchmark that emphasizes simplicity over better model fit through capture of higher-order associations. The curvilinear model is more commonly used in prior literature as it improves model fit when compared to the linear effects only model, but at the cost of lower interpretability of its higher-order coefficients. The segmented models with change points determined visually or using the fully automated ML approach can be considered as the state-of-the-art approaches for segmented mixed-effects modeling for capturing higher-order associations. Other methods in machine learning or statistical modeling such as multivariate adaptive regression splines or nonparametric analysis are not considered as their objective of capturing higher-order patterns is not primarily toward explanatory modeling but toward making predictions.
We performed repeated fourfold cross validation with 10 iterations (i.e., 40 runs in total) for evaluating the performance of our method against the benchmarks across the three datasets. Table 3 shows the mean and standard deviation (in parentheses) of performance metrics (R-squared, root mean squared error (RMSE), and mean absolute prediction error (MAPE)) of our method and benchmarks across the three datasets and four higher-order association scenarios. A higher value of R-squared and lower values of RMSE and MAPE are preferred. Statistical significance of better performance of our method over each benchmark was tested using the Tukey-Kramer pairwise comparison test on ANOVA fit over the performance metrics of our method and benchmarks. The R-squared values and error estimates for the best-performing models are highlighted. Our method has a statistically significant improvement in the prediction performance over existing higher-order mixed-effects modeling benchmarks for the WellbuiltforWellbeing and HospitalMonitoring datasets. For the BeerCrawl dataset, our method is better than other benchmarks except the visual method, but the performance difference between the visual method and our method is not statistically significant. Although the ML approach is popular for GLMMs with the R package segmented [69], it is not very compatible for wearables data in terms of setting the number of change points and convergence. And although the visual method performs well for the BeerCrawl dataset, the complete dependence on human inputs and rationalization renders it less robust as is evident from its significant lower performance in the other two datasets.
Table 3.
ModelWellbuiltforWellbeing: HRV (Sound)WellbuiltforWellbeing: HRV (Activity)HospitalMonitoring: RRBeerCrawl: TAC
R-sq.RMSE (ms)MAPE (%)R-sq.RMSEMAPER-sq.RMSE (br/min)MAPE (%)R-sq.RMSEMAPE
(ms)(%)(g/dL)(%)
Fixed effects0.5168 (0.01)***17.78 (0.03)***25.81 (0.05)***0.5557 (0.01)***17.54 (0.02)***26.12 (0.07)***0.7335 (0.00)***2.58 (0.00)***5.47 (0.00)***0.6007 (0.01)***0.04 (0.00)***70.38 (0.72)***
Linear0.5237 (0.01)***17.65 (0.03)***25.47 (0.05)***0.6128 (0.01)***16.65 (0.08)***24.44 (0.10)***0.8781 (0.00)***2.36 (0.00)***4.92 (0.00)***0.6003 (0.01)***0.0377 (0.00)***66.54 (0.73)***
Curvilinear0.5876 (0.00)**16.83 (0.04)**23.78 (0.06)***0.6630 (0.01)***15.64 (0.05)***23.12 (0.06)***0.8875 (0.00)***2.36 (0.00)***4.92 (0.00)***0.5995 (0.01)***0.0377 (0.00)***66.56 (0.72)***
Visual0.5891 (0.00)16.81 (0.04)23.74 (0.06)0.6719 (0.01)***15.70 (0.06)***23.01 (0.07)***0.8856 (0.00)***2.35 (0.00)***4.85 (0.00)***0.6419 (0.01)0.0369 (0.00)*63.09 (0.72)
ML0.5860 (0.00)***16.82 (0.04)**23.80 (0.06)***0.6734 (0.01)***15.69 (0.06)***23.01 (0.07)***0.8810 (0.00)***2.35 (0.00)***4.84 (0.00)***0.6011 (0.01)***0.0380 (0.00)***66.57 (0.72)***
Our method0.5900 (0.00)16.80 (0.04)23.72 (0.06)0.7628 (0.01)14.93 (0.09)21.99 (0.10)0.9201 (0.00)2.33 (0.00)4.82 (0.00)0.6412 (0.01)0.0373 (0.00)63.17 (0.70)
Table 3. Model Fit and Predictive Performance Comparison of Segmented Multilevel Models
Statistical significance of the Tukey Kramer test for comparison of performance metrics of benchmark with HITL.
*** = p < .01, ** = p < 0.05, * = p < 0.1.
In addition to better model fit and prediction performance, our method is able to generate better interpretable models. The fixed effects of segmented inputs across all three datasets are shown in Table 4. In the WellbuiltforWellbeing dataset, HRV increases by 0.19 ms/dBA for sound levels less than 51 dBA, indicating that physical wellbeing improves with every unit increase in sound in quiet environments. Higher activity levels such as brisk walking (i.e., activity ≥ 0.9) decreases SDNN by 15.21 ms/g, whereas moderate walking speed (0.9 > Activity ≥ 0.21) is related to a steep increase of 90.49 ms/g in SDNN. Assuming that brisk walking is confounded by an intention to reach meetings on time and therefore is related to higher stress, the steep gradient in the moderate walking range underscores the value of intermittent low-level activity on physiological wellbeing in office spaces. In the HospitalMonitoring dataset, we observe that a unit increase in the ST segment index beyond –0.18 is related to an increase in the respiration rate by 2.28 breaths/minute. This finding underscores the importance of looking for elevated values of the ST segment as reported in the medical literature [65]. It also hints at a possible association between acute cardiovascular events and elevation in the respiration rate in bedridden patients. In the BeerCrawl dataset, the z-axis indicates acceleration perpendicular to the phone screen by participants. Although experimental research is required to understand the precise interpretations of the coefficients identified in our study, our study does uncover significant associative patterns between z-axis values and blood alcohol levels across different z-axis range segments. Higher inebriety is associated with an increase in acceleration at the upper range of z (i.e., 0.41 > z ≥ –0.01) and also associated with a decrease in the lower range of z (i.e., –0.01 > z ≥ –0.15). Table 4 also shows that the preceding interpretable pairwise associative patterns are not evident from linear and curvilinear inputs in the mixed-effects models.
Table 4.
Dataset/OutcomesInputCoefficient (SE)
  SegmentedLinearCurvilinear
WellbuiltforWellbeing/
SDNN
Sound 0.07 (0.02)***0.09 (0.02)***
Sound2  –0.01 (0.00)***
Sound < 510.19 (0.04)***  
Sound ≥ 51–0.01 (0.03)  
Activity 18.19 (1.27)***46.09 (1.68)***
Activity2  –43.39 (1.64)***
Activity < 0.219.00 (1.41)***  
0.9 > Activity ≥ 0.2190.49 (3.87)***  
Activity ≥ 0.9–15.21 (1.46)***  
HospitalMonitoring/RRST2 1.63 (0.85)1.75 (0.85)**
ST22  –0.44 (–4.09)***
ST2 < –0.181.51 (1.92)  
ST2 ≥ –0.182.28 (0.95)**  
BeerCrawl/TACz –0.0073 (0.0104)–0.0061 (0.0098)
z2  –0.026 (0.0102)**
z < –0.44–0.0246 (0.0205)  
–0.2 > z ≥ –0.44–0.0318 (0.0167)  
–0.15 > z ≥ –0.2–0.0340 (0.0390)  
–0.01 > z ≥ –0.15–0.2014 (0.0734)***  
0.41 > z ≥ –0.010.0857 (0.0279)***  
z ≥ 0.410.0030 (0.0182)  
Table 4. Fixed Effects of Segmented, Linear, and Curvilinear Models Across All Three Datasets
*** = p < .01, ** = p < .05.
Figure 2 shows a visual representation of the segmented model coefficients compared to coefficients from linear and curvilinear models. Coincidentally, the shapes of the piecewise relationships for each of the pairwise association resemble the corresponding smooth functions shown in Figure 1, reinstating the importance of our HITL approach to train robust interpretable segmented inputs in the mixed-effects models.
Fig. 2.
Fig. 2. Trajectory of linear, curvilinear, and segmented fixed-effects coefficients for Sound-SDNN (a), Activity-SDNN (b), ST2-RR (c), and z-TAC (d).

5 Discussion and Conclusion

In this study, we presented the problem of developing an interpretable model that captures piecewise pairwise associations between different modalities captured by wearables. Since existing methods for segmented modeling for mixed-effect regression are insufficient to determine robust and verifiable change point, our method is timely with increasing research applications utilizing wearables in a natural experimental setup. Our method involves the inspection of smooth functions of pairwise associations captured using GAMM, followed by using Brent's method to sequentially position change points optimizing model fit. Our method not only uses analytical tools to determine change points but also utilizes user discretion to control the number of change points and its localization. For example, it is often desirable to avoid change points at extremities as data corresponding to these segments may be very sparse, rendering inference unreliable. We apply our method to three different wearables datasets and show that not only is it effective in terms of improving model fit and prediction performance but also significantly enhances model interpretability and ability to derive meaningful inferences.

5.1 Managerial Implications

Our method and analysis have several managerial implications. Our study provides a novel tool to analyze wearables data, thus boosting the value for storing and processing of large amounts of big data generated by wearables. Our HITL-based segmented modeling method can be used in a wide range of wearables applications such as patient monitoring systems, military fitness management programs, smart diet applications, and COVID-19 contact tracing. Our analysis over the three wearables datasets presents interesting pairwise associations. The positive relationship between sound level and physical wellbeing measure below the range of 51 dBA informs workplace design practices on the need for further examination of sound level effects on employee health for different sound level ranges. A higher gradient of an activity-wellbeing relationship in the lower range of activity provides additional empirical evidence on the value of low-intensity/intermittent activities on elevating instantaneous stress and improving wellbeing. The significant association between cardiovascular wellness measure and respiration rate after a certain threshold of the ST segment index solicits clinical researchers to further examine inter-relationships between pulmonary and cardiovascular wellness indices to improve on existing hospital monitoring and early warning systems. Finally, the association between a dimension of raw accelerometer data and extrinsic phenomena such as alcohol consumption stresses the value of looking at raw data in addition to expert-engineered features such as gait variability and the number of continuous steps.

5.2 Contribution to IS Research

Predictive modeling and statistical modeling in analytics go side-by-side as one predicts the future using existing data, focusing on informing us on the question “What will be,” whereas the other explicates hidden patterns and tells us about “What is” with respect to a phenomenon. Both of them are important and require attention to optimize the utility of the generated data. As the number of wearable technology-based applications increases in the future, the quantum of available data to analyze will exponentially increase and warrant more and more advancements in explainable modeling for meaningful interpretations of patterns. In this study, we introduce a new method to address the design challenge of representing nonlinear associative patterns in wearables data. Our contribution is timely in IS research, as the discipline is widening its scope in design science as well as explanatory modeling applications by using novel data sources such as wearables [70]. WDA is a promising area in IS [5, 19], opening a wide range of research applications owing to the following two reasons: the ubiquitous nature of wearables in today's lifestyle, and the promise of wearables to generate rich, personalized, temporal, and highly grained information content. We therefore posit that our contributions through a novel interpretable modeling method for addressing challenges in WDA lays the foundation for promising research in IS using data generated from wearables.

5.3 Limitations

There are some caveats and limitations to our study. We have focused on the design science problem of developing an interpretable modeling method but do not delve into the subject of determining the significance of input variables themselves. In addition, our method by itself does not imply causation, although it can be applied to any explanatory modeling scenario including causal or quasi-causal experimental settings. If curvilinear effects are absent, the segmented modeling approach should be avoided to prevent overfitting. The modeling approach described in this study is useful when higher-order association is predetermined between pairs of repeated measures and there is a need to better explain these associations for making inferences. For high-dimensional large datasets, GAMM can take longer time to fit, and the change point optimization can be tedious for the analyst. A few ways to avoid this problem are to apply feature selection, variable transformation, and outlier detection procedures before examining pairwise associations using our method. Next, our HITL approach involves human inputs and therefore may still be susceptible to human errors and biases, despite the fine-tuning step using the optimization procedure. One way of reducing such potential errors is to consult domain experts post determination of change point from the optimization procedure. Finally, it is worth noting that our method caters to the problem of improving interpretability of glass-box models, at the cost of increased bias and limited predictive power when compared to black-box data mining models [28].

5.4 Conclusion

With the increasing availability of wearables, we can measure and understand different health phenomena at a highly granular level. We propose an HITL method for accurate estimation of change points in segmented mixed-effects regression facilitating the interpretations of pairwise associations of variables in wearables data. Our method is robust and efficient, and the resultant segmented models provide better prediction accuracy than state-of-the-art alternatives for a given problem. Our proposed method is empirically validated, more reliable due to human verification, and provides better interpretable results. Our approach can be generalized to other areas of IS where nonlinear pairwise associations are anticipated.

Footnotes

References

[2]
I. Bardhan, H. Chen, and E. Karahanna. 2022. Connecting systems, data, and people: A multidisciplinary research roadmap for chronic disease management. MIS Quarterly 44 (March 2020), 185–200.
[3]
L. He, H. Liu, Y. Yang, and B. Wang. 2021. A multi-attention collaborative deep learning approach for blood pressure prediction. ACM Transactions on Management Information Systems 13, 2 (Oct. 2021), 1–20.
[4]
J. A. Killian, K. M. Passino, A. Nandi, D. R. Madden, and J. Clapp. 2019. Learning to detect heavy drinking episodes using smartphone accelerometer data. In CEUR Workshop Proceedings. 2429.
[5]
H. Zhu, S. Samtani, R. A. Brown, and H. Chen. 2021. A deep learning approach for recognizing activity of daily living (ADL) for senior care: Exploiting interaction dependency and temporal patterns. MIS Quarterly 45, 2 (2021), 1–69.
[6]
C. M. Lindberg, K. Srinivasan, B. Gilligan, J. Razjouyan, H. Lee, B. Najafi, K. J. Canada, et al. 2018. Effects of office workstation type on physical activity and stress. Occupational and Environmental Medicine 75, 10 (2018), 689–695.
[7]
E. Smets, E. R. Velazquez, G. Schiavone, I. Chakroun, E. D'Hondt, W. de Raedt, J. Cornelis, et al. 2018. Large-scale wearable data reveal digital phenotypes for daily-life stress detection. npj Digital Medicine 1, 1 (Dec. 2018), 1–10.
[8]
W. Zhang and S. Ram. 2020. A comprehensive analysis of triggers and risk factors for asthma based on machine learning and large heterogeneous data sources. MIS Quarterly 44, 1 (2020), 305–349.
[9]
C. C. Yang, G. Leroy, and S. Ananiadou. 2013. Smart health and wellbeing. ACM Transactions on Management Information Systems 4, 4 (2013), Article 15, 8 pages.
[10]
K. Gu, S. Vosoughi, and T. Prioleau. 2021. SymptomID: A framework for rapid symptom identification in pandemics using news reports. ACM Transactions on Management Information Systems 12, 4 (Sept. 2021), 1–17.
[11]
I. Azodo, R. Williams, A. Sheikh, and K. Cresswell. 2020. Opportunities and challenges surrounding the use of data from wearable sensor devices in health care: Qualitative interview study. Journal of Medical Internet Research 22, 10 (Oct. 2020), e19542.
[12]
M. Uddin and S. Syed-Abdul. 2020. Data analytics and applications of the wearable sensors in healthcare: An overview. Sensors (Basel) 20, 5 (March 2020), 1379.
[13]
U. Kraus, A. Schneider, S. Breitner, R. Hampel, R. Ruckerl, M. Pitz, U. Geruschkat, P. Belcredi, K. Radon, and A. Peters. 2013. Individual daytime noise exposure during routine activities and heart rate variability in adults: A repeated measures study. Environmental Health Perspectives 121, 5 (2013), 607–612.
[14]
B. Q. Liu and D. L. Goodhue. 2012. Two worlds of trust for potential e-commerce users: Humans as cognitive misers. Information Systems Research 23, 4 (2012), 1246–1262.
[15]
G. Pant and P. Srinivasan. 2010. Predicting web page status. Information Systems Research 21, 2 (June 2010), 345–364.
[16]
J. D. Singer and J. B. Willett. 2009. Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. Oxford University Press, New York, NY.
[17]
V. M. Muggeo, D. C. Atkins, R. J. Gallop, and S. Dimidjian. 2014. Segmented mixed models with random changepoints: A maximum likelihood approach with application to treatment for depression study. Stat Modelling 14 (2014), 293–313.
[18]
H. Bozdogan. 1987. Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions. Psychometrika 52, 3 (1987), 3450370.
[19]
H. Chen, R. H. L. Chiang, and V. C. Storey. 2012. Business intelligence and analytics: From big data to big impact. MIS Quarterly 36, 4 (2012), 1165–1188.
[20]
M. Chan, D. Estève, J.-Y. Fourniols, C. Escriba, and E. Campo. 2012. Smart wearable systems: Current status and future challenges. Artificial Intelligence in Medicine 56, 3 (2012), 137–156.
[21]
H. Banaee, M. U. Ahmed, and A. Loutfi. 2013. Data mining for wearable sensors in health monitoring systems: A review of recent trends and challenges. Sensors 13 (2013), 17472–17500.
[22]
S. Majumder, T. Mondal, and M. Deen. 2017. Wearable sensors for remote health monitoring. Sensors 17, 1 (2017), 130.
[23]
I. Yamada and G. Lopez. 2012. Wearable sensing systems for healthcare monitoring. In Proceedings of the 2012 Symposium on VLSI Technology (VLSIT’12), 5–10.
[24]
K. Malhi, S. C. Mukhopadhyay, J. Schnepper, M. Haefke, and H. Ewald. 2012. A Zigbee-based wearable physiological parameters monitoring system. IEEE Sensors Journal 12, 3 (2012), 423–430.
[25]
D. Ravi, C. Wong, B. Lo, and G.-Z. Yang. 2017. A deep learning approach to on-node sensor data analytics for mobile or wearable devices. IEEE Journal of Biomedical and Health Informatics 21, 1 (2017), 56–64.
[26]
A. Sano, S. Taylor, A. W. McHill, A. J. K. Phillips,L. K. Barger, E. Klerman, and R. Picard. 2018. Identifying objective physiological markers and modifiable behaviors for self-reported stress and mental health status using wearable sensors and mobile phones: Observational study. Journal of Medical Internet Research 20, 6 (2018), 210–216.
[27]
J. B. Wang, L. A. Cadmus-Bertram, L. Natarajan, M. M. White, H. Madanat, J. F. Nichols, G. X. Ayala, and J. P. Pierce. 2015. Wearable sensor/device (Fitbit One) and SMS text-messaging prompts to increase physical activity in overweight and obese adults: A randomized controlled trial. Telemedicine Journal and e-Health 21, 10 (Oct. 2015), 782–792.
[28]
G. Shmueli. 2010. To explain or to predict? Statistical Science 25, 3 (2010), 289–310.
[29]
H. Zhu, H. Chen, and R. Brown. 2018. A sequence-to-sequence model-based deep learning approach for recognizing activity of daily living for senior care. Journal of Biomedical Informatics 84 (Aug. 2018), 148–158.
[30]
A. Pentland, D. Lazer, D. Brewer, and T. Heibeck. 2009. Using reality mining to improve public health and medicine. Studies in Health Technology and Informatics 149 (2009), 93–102.
[31]
S. Guillén, M. T. Arredondo, and E. Castellano. 2011. A survey of commercial wearable systems for sport application. In Wearable Monitoring Systems. Springer, 165–178.
[32]
L. Breiman. 2001. Statistical modeling: The two cultures. Statistical Science 16, 3 (2001), 199–231.
[33]
M. N. Jarczok, M. Jarczok, D. Mauss, J. Koenig, J. Li, R. M. Herr, and J. F. Thayer. 2013. Autonomic nervous system activity and workplace stressors—A systematic review. Neuroscience and Biobehavioral Reviews 37, 8 (2013), 1810–1823.
[34]
X. Li, J. Dunn, D. Salins, G. Zhou, W. Zhou, S. M. Schussler-Fiorenza Rose, D. Perelman, et al. 2017. Digital health: Tracking physiomes and activity using wearable biosensors reveals useful health-related information. PLoS Biology 15, 1 (2017), e2001402.
[35]
P. MacNaughton, J. Spengler, J. Vallarino, S. Santanam, U. Satish, and J. Allen. 2016. Environmental perceptions and health before and after relocation to a green building. Building and Environment 104 (2016), 138–144.
[36]
J. F. Thayer, B. Verkuil, J. F. Brosschot, K. Kampschroer, A. West, C. Sterling, I. C. Christie, et al. 2010. Effects of the physical work environment on physiological measures of stress. European Journal of Cardiovascular Prevention and Rehabilitation 17, 4 (Aug. 2010), 431–439.
[37]
T. Föhr, A. Tolvanen, R. Myllymaki, E. Jarvela-Reijonen, S. Rantala, R. Korpela, K. Peukhuri, et al. 2015. Subjective stress, objective heart rate variability-based stress, and recovery on workdays among overweight and psychologically distressed individuals: A cross-sectional study. Journal of Occupational Medicine and Toxicology 10 (2015), 39.
[38]
T. Pereira, P. R. Almeida, J. P. S. Cunha, and A. Aguiar. 2017. Heart rate variability metrics for fine-grained stress level assessment. Computer Methods and Programs in Biomedicine 148 (Sept. 2017), 71–80.
[39]
A. Gelman and J. Hill. 2006. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge University Press, New York, NY.
[40]
G. Pimentel, S. Rodrigues, P. A. Silva, A. Vilarinho, R. Vaz, and J. P. Silva Cunha. 2019. A wearable approach for intraoperative physiological stress monitoring of multiple cooperative surgeons. International Journal of Medical Informatics 129 (2019), 60–68.
[41]
M. Cropley, D. Plans, D. Morelli, S. Sutterlin, I. Inceoglu, G. Thomas, and C. Chu. 2017. The association between work-related rumination and heart rate variability: A field study. Frontiers in Human Neuroscience 11 (Jan. 2017), 27.
[42]
J. D. Runkle, C. Cui, C. Fuhrmann, S. Stevens, J. del Pinal, and M. M. Sugg. 2019. Evaluation of wearable sensors for physiologic monitoring of individually experienced temperatures in outdoor workers in southeastern U.S. Environment International 129 (2019), 229–238.
[43]
M. Durban, J. Harezlak, M. P. Wand, and R. J. Carroll. 2005. Simple fitting of subject-specific curves for longitudinal data. Statistics in Medicine 24, 8 (2005), 1153–1167.
[44]
D. W. K. Andrews. 1993. Tests for parameter instability and structural change with unknown change point. Econometrica 61, 4 (1993), 821–856.
[45]
A. Linden. 2015. Conducting interrupted time-series analysis for single- and multiple-group comparisons. Stata Journal 15, 2 (2015), 480–500.
[46]
S. E. Ryan and L. S. Porth. 2007. A Tutorial on the Piecewise Regression Approach Applied to Bedload Transport Data. General Technical Report RMRS-GTR, no. 189. USDA Forest Service.
[47]
V. M. R. Muggeo. 2003. Estimating regression models with unknown break-points. Statistics in Medicine 22, 19 (2003), 3055–3071.
[48]
X. Shuai, Z. Zhou, and R. S. Yost. 2003. Using segmented regression models to fit soil nutrient and soybean grain yield changes due to liming. Journal of Agricultural, Biological, and Environmental Statistics 8, 2 (2003), 240–252.
[49]
A. Doan. 2018. Human-in-the-loop data analysis: A personal perspective. In Proceedings of the Workshop on Human-in-the-Loop Data Analytics (HILDA’18).
[50]
X. Wu, L. Xiao, Y. Sun, J. Zhang, T. Ma, and L. He. 2022. A survey of human-in-the-loop for machine learning. Future Generation Computer Systems 135 (Oct. 2022), 364–381.
[51]
Y. Gil, J. Honaker, S. Gupta, Y. Ma, V. D'Orazio, D. Garijo, S. Gadewar, O. Yang, and N. Jahanshad. 2019. Towards human-guided machine learning. In Proceedings of the IUI International Conference on Intelligent User Interfaces (IUI’19), 614–624.
[52]
D. D. Xin, L. L. Ma, J. J. Liu, S. S. Macke, S. S. Song, and A. A. Parameswaran. 2018. Accelerating human-in-the-loop machine learning: challenges and opportunities. In Proceedings of the 2nd Workshop on Data Management for End-to-End Machine Learning (DEEM’18).
[53]
A. Holzinger. 2016. Interactive machine learning for health informatics: When do we need the human-in-the-loop? Brain Informatics 3, 2 (2016), 119–131.
[54]
K. Zheng, G. Chen, M. Herschel, K. Y. Ngiam, B. C. Ooi, and J. Gao. 2021. PACE: Learning effective task decomposition for human-in-the-loop healthcare delivery. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 2156–2168.
[55]
A. Gupta, M. T. Lash, and S. K. Nachimuthu. 2021. Optimal sepsis patient treatment using human-in-the-loop artificial intelligence. Expert Systems with Applications 169 (May 2021), 114476.
[56]
S. N. Wood. 2004. Stable and efficient multiple smoothing parameter estimation for generalized additive models. Journal of the American Statistical Association 99, 467 (2004), 673–686.
[57]
J. J. Faraway. 2016. Extending the Linear Model with R: Generalized Linear, Mixed Effects and Nonparametric Regression Models. CRC Press, Boca Raton, FL.
[58]
R. P. Brent. 2013. Algorithms for Minimization without Derivatives (2nd ed.). Dover Publications, Eaglewood Cliffs, NJ.
[59]
R. Fletcher. 2013. Practical Methods of Optimization (4th ed.), vol. 53. John Wiley & Sons, West Sussex, England.
[61]
F. Shaffer and J. P. Ginsberg. 2017. An overview of heart rate variability metrics and norms. Frontiers in Public Health 5, 258 (Sept. 2017), 1–17.
[62]
L. Soares-Miranda, J. Sattelmair, P. Chaves, Glen Duncan, D. S. Siscovick, P. K. Stein, and D. Mozaffarian. 2014. Physical activity and heart rate variability in older adults. Circulation 129, 21 (May 2014), 2100–2110.
[63]
K. Srinivasan, F. Currim, S. Ram, M. R. Mehl, C. Lindberg, E. Sternberg, P. Skeath, et al. 2017. A regularization approach for identifying cumulative lagged effects in smart health applications. In Proceedings of the 7th International Conference on Digital Health (DH’17), 99–103.
[64]
D. Liu, M. Görges, and S. A. Jenkins. 2012. University of Queensland vital signs dataset: Development of an accessible repository of anesthesia patient monitoring data for research. Anesthesia and Analgesia 114, 3 (2012), 584–589.
[65]
B. Vogel, B. E. Claessen, S. V. Arnold, D. Chan, D. J. Cohen, E. Giannitsis, C. Michael Gibson, et al. 2019. ST-segment elevation myocardial infarction. Nature Reviews: Disease Primers 5, 1 (June 2019), 1–20.
[66]
G. B. Smith, D. R. Prytherch, P. Meredith, P. E. Schmidt, and P. I. Featherstone. 2013. The ability of the National Early Warning Score (NEWS) to discriminate patients at risk of early cardiac arrest, unanticipated intensive care unit admission, and death. Resuscitation 84, 4 (April 2013), 465–470.
[67]
S. Kianersi, M. Luetke, J. Agley, R. Gassman, C. Ludema, and M. Rosenberg. 2020. Validation of transdermal alcohol concentration data collected using wearable alcohol monitors: A systematic review and meta-analysis. Drug and Alcohol Dependence 216 (Nov. 2020), 108304.
[68]
D. Pradeep Kumar, N. Toosizadeh, J. Mohler, H. Ehsani, C. Mannier, and K. Laksari. 2020. Sensor-based characterization of daily walking: A new paradigm in pre-frailty/frailty assessment. BMC Geriatrics 20, 1 (May 2020), 1–11.
[69]
V. M. R. Muggeo. 2008. Segmented: An R package to fit regression models with broken-line relationships. R News 8 (May 2008), 20–25.
[70]
A. Rai. 2017. Editor's Comments: Diversity of design science research. MIS Quarterly 41 (2017), iii–xviii.

Cited By

View all

Index Terms

  1. A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Management Information Systems
    ACM Transactions on Management Information Systems  Volume 14, Issue 2
    June 2023
    178 pages
    ISSN:2158-656X
    EISSN:2158-6578
    DOI:10.1145/3580448
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 January 2023
    Online AM: 24 September 2022
    Accepted: 03 September 2022
    Revised: 06 July 2022
    Received: 15 January 2022
    Published in TMIS Volume 14, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Smart health
    2. wearables
    3. segmented mixed-effects regression
    4. human-in-the-loop method
    5. explainability
    6. interpretable modeling

    Qualifiers

    • Research-article
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)910
    • Downloads (Last 6 weeks)64
    Reflects downloads up to 18 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Full Access

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media