If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Address reprint requests to Riccardo LoMartire, RPT, MSc, Division of Physiotherapy, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Alfred Nobels allé 23, 141 83 Huddinge, Sweden.
Division of Physiotherapy, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Huddinge, SwedenSchool of Health and Welfare, Dalarna University, Falun, Sweden
Division of Physiotherapy, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Huddinge, SwedenSchool of Health and Welfare, Dalarna University, Falun, SwedenCenter for Clinical Research Dalarna-Uppsala University, Falun, Sweden
The distribution of social insurance benefits is described over a 7-year period.
•
Overall, 17 to 49% of the patients received benefits at any given time.
•
A fifth of the patients accounted for half of the total benefits.
•
Eight predictors of future sickness absence are identified.
Abstract
Chronic pain-related sickness absence is an enormous socioeconomic burden globally. Optimized interventions are reliant on a lucid understanding of the distribution of social insurance benefits and their predictors. This register-based observational study analyzed data for a 7-year period from a population-representative sample of 44,241 chronic pain patients eligible for interdisciplinary treatment (IDT) at specialist clinics. Sequence analysis was used to describe the sickness absence over the complete period and to separate the patients into subgroups based on their social insurance benefits over the final 2 years. The predictive performance of features from various domains was then explored with machine learning-based modeling in a nested cross-validation procedure. Our results showed that patients on sickness absence increased from 17% 5 years before to 48% at the time of the IDT assessment, and then decreased to 38% at the end of follow-up. Patients were divided into 3 classes characterized by low sickness absence, sick leave, and disability pension, with eight predictors of class membership being identified. Sickness absence history was the strongest predictor of future sickness absence, while other predictors included a 2008 policy, age, confidence in recovery, and geographical location. Information on these features could guide personalized intervention in the specialized healthcare.
Perspective
This study describes sickness absence in patients who visited a Swedish pain specialist interdisciplinary treatment clinic during the period 2005 to 2016. Predictors of future sickness absence are also identified that should be considered when adapting IDT programs to the patient's needs.
Global burden of Disease study Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: A systematic analysis for the Global Burden of Disease Study 2016.
Although its heterogeneous manifestations and the overlap with other conditions complicate accurate quantification of its socioeconomic consequences, it reportedly represents between 3 to 10% of gross domestic product in western economies.
OPENminds The individual and societal burden of chronic pain in Europe: the case for strategic prioritisation and action to improve knowledge and availability of appropriate care.
OPENminds The individual and societal burden of chronic pain in Europe: the case for strategic prioritisation and action to improve knowledge and availability of appropriate care.
Indeed, conditions linked to chronic pain constitute the most common causes of prolonged sick leave and disability pension, with musculoskeletal disorders, mental disorders, and injuries consistently being reported among the leading diagnoses.
Sickness absence: causes, consequences, and physicians' sickness certification practice. A systematic literature review by the Swedish Council on Technology Assessment in Health Care (SBU).
Furthermore, it is recognized that social insurance benefits are unevenly distributed so that a minority of affected individuals represent a disproportionate amount of the costs.
For instance, 2 studies of pain-related US worker compensation claims revealed that 5 to 9% of long-term claims accounted for 60 to 85% of the total costs.
It is therefore important to isolate different patterns of sickness absence, so that interventions can be tailored to the most likely trajectory of the patient.
To accomplish this goal, a clear overview of sickness absence and its predictors is necessary. A number of studies have described sickness absence relating to the causes of chronic pain, but research specific to the condition itself is nearly non-existent, which combined with methodological variations, has contributed to a fragmented overview of the topic.
Sickness absence: causes, consequences, and physicians' sickness certification practice. A systematic literature review by the Swedish Council on Technology Assessment in Health Care (SBU).
The understanding of predictors of sickness absence linked to chronic pain is further impeded by a tendency to examine sick leave and disability pension separately, and to limit the field of study to musculoskeletal diagnoses.
Sickness absence due to back pain or depressive episode and the risk of all-cause and diagnosis-specific disability pension: A Swedish cohort study of 4,823,069 individuals.
Sickness absence due to back pain or depressive episode and the risk of all-cause and diagnosis-specific disability pension: A Swedish cohort study of 4,823,069 individuals.
In addition, the analytical tradition of the medical sciences has focused on describing sample-level associations reliant on statistical assumptions behind the data generating process.
Driven by the data itself to select the most appropriate algorithm for a given circumstance, this strategy can more effectively identify complex patterns in large datasets.
In summary, sickness absence linked to chronic pain presents an enormous and unevenly distributed socioeconomic burden. Information on social insurance benefits linked to chronic pain is currently fragmented and, to the detriment of national health policy and patient-tailored intervention, the applicability of its predictors is uncertain for the chronic pain population. We therefore described sickness absence in chronic pain patients, isolated sickness absence trajectories, and identified predictors of the trajectories.
Methods
Design and Participants
This longitudinal register-based observational study was built on microdata from 5 Swedish national registers, linked via unique personal identification numbers held by all Swedish residents.
The study population was defined through the Swedish Quality Registry for Pain Rehabilitation, which contains information on chronic pain patients that are eligible for rehabilitation at any of the 44 specialist interdisciplinary treatment (IDT) clinics across Sweden.
Swedish Quality Registry for Pain Rehabilitation patients represent those with particularly complex chronic pain symptoms that have not responded to primary care interventions, corresponding to roughly 0.5‰ of the Swedish population annually. Patients eligible for IDT during the period 2005 to 2016 with musculoskeletal pain for a minimum of 90 days and aged 18 to 64 years at IDT assessment (t0) were included, while those with a registered ICD-10 neoplasm diagnosis (C00-D49) in the 5 years prior to t0 were excluded.
Information on sickness absence stored in the Micro Data for Analysis of the Social Insurance register was obtained from the Swedish Social Insurance Agency in February 2019 and sociodemographic information stored in the Longitudinal Integration Database for Health Insurance and Labour Market Studies was procured from Statistics Sweden in March 2019.
Meanwhile, data on specialized health care and on dispensed prescription pharmaceuticals stored in the National Patient Register and the Prescribed Drug Register, respectively, were acquired from the National Board of Health and Welfare in February 2020.
Data covered a 7-year period from 5 years before to 2 years after t0 (ie, January 2000–November 2018), with the exception of the National Patient Register and Prescribed Drug Register, which were initialized in 2001 and 2005, respectively. This study was approved by Uppsala's Medical Research Ethics Committee (DNR 2018/036) and all participants signed a written informed consent form.
Sickness Absence
Swedish residents are eligible for sick leave benefits from the age of 16 and are granted for either full-time (100%) or part-time (25%, 50%, or 75%) of ordinary work hours.
Social insurance benefits are reimbursed by the Swedish Social Insurance Agency once they exceed the qualifying period that is most commonly 14 but on occasion up to 21 days for employees, between 3 to 93 days for self-employed workers, and as little as 1 day for students and the unemployed.
Individuals aged 30 to 64 years can also be granted a full or partial permanent disability pension if their working capacity is deemed to be permanently impaired. In this study, sickness absence for all disability pension and sick leave spells exceeding the employee-qualifying period in effect at the time were included. Benefits were limited to ICD-10 diagnoses related to the chronic pain spectrum: musculoskeletal system diseases (M: 00–99), nervous system diseases (G: 43–44, 47, 50–64, 82, 96–97), unclassified pain symptoms (R: 07, 10, 26, 29, 51–52), injuries and complications (S: 12–13, 22, 32, 42–43, 53; T: 85, 88, 91–94), and mental disorders (F: 32–33, 41, 43, 45).
Data was right-censored: at the end of the 2-year follow-up, due to the end of registration from November 2018, and at the assumed retirement of age 65.
Features
The predictive performance on sickness absence was examined for 101 features in the domains of sociodemographics, the chronic pain experience, sickness absence history, and healthcare. Multinomial features were one-hot encoded as binary dummy variables, while ordinal features were label encoded and analyzed as numerical variables. Supplementary tables 4.1 to 4.5 and figures 4.1 to 4.6 detail the feature characteristics.
Sociodemographics included age, sex (male, female), geographical region (Stockholm/Gotland, southern Sweden, central Sweden, northern Sweden), country of birth (Sweden, other European country, non-European country), Swedish language proficiency (high, intermediate, poor), family composition (partner with children, partner without children, single with children, single without children), number of children, education level (elementary school, high school, university), employment status (employed, unemployed, student), unemployment duration, and the past 5-year mean disposable income for the patient and their family.
The chronic pain experience covered pain characteristics and their consequences in everyday life. Pain characteristics included pain duration, past week 11-point numeric rating scale pain intensity, number of anatomical pain locations (0–36), the primary pain location (head, neck/shoulder/upper extremities, lower back, abdomen, hips/lower extremities, widespread pain), and confidence in recovery (high, moderate, low). Everyday consequences were measured using four questionnaires. The Multidimensional Pain Inventory reflected the impact of pain (5 scales: pain severity, interference, life control, affective distress, support), perceived responses of significant others (3 scales: punishing, solicitous, distracting), and frequency of everyday activities (4 scales: household chores, outdoor activities, activities away from home, social activities; one activity index).
SF-36 reflected health status through 2 IRT-based summary scales (physical health, mental health) and 8 conventional scales (physical functioning, social functioning, physical role limitations, emotional role limitations, mental health, vitality, bodily pain, general health).
History of sickness absence included ongoing full or partial sick leave at t0, past-year full or partial sick leave spells, sustained past-year full or partial sick leave spells > 180 days, and cumulative gross sick leave days summed up annually for the 5 years prior to t0. Features distinguished between benefits related to the chronic pain spectrum and to other diagnoses.
Health care covered features related to comorbidities and health care utilization. Comorbidities were indicated from ICD-10 codes as severe mental illness (schizophrenia or bipolar disorder), substance use disorder, and past self-harm. Meanwhile, health care utilization was represented via the number of past-year specialist outpatient healthcare visits, the admitted inpatient healthcare days, and the past-year doctor visits (0, 1–3, >3). It also included a previous IDT evaluation, a previous IDT intervention, and whether the patient was currently enrolled in an IDT program. Based on ATC codes of non-opioid pain medication, opioids, antidepressants, benzodiazepines, hypnotics, mood stabilizers, antipsychotics, and AD/HD medication (supplementary table 4.5), the number of dispensed prescription pharmaceutical classes were summed up for the previous year.
For each of the first 5 classes, both past-year indicators and total defined daily doses were included. Finally, an indicator of whether t0 occurred from 2009 and later was included to reflect a 2008 policy that was implemented to decrease long-term sickness absence of chronic pain-related diagnoses through guaranteed access to evidence-based health care.
Implementation of evidence-based rehabilitation for non-specific back pain and common mental health problems: A process evaluation of a nationwide initiative.
Practically, a sequence is made up of a series of categorical states over a temporal range resolved into units. Sequences are compared through several available algorithms, of which optimal matching is most frequently used.
It computes the cost of transforming sequences into one another by inserting/deleting or substituting elements and returns a dissimilarity matrix of pairwise distances. For a comparison to be possible, the cost of element exchange needs to be specified in advance. With the dissimilarity matrix obtained, sequences can then be clustered into homogenous pattern subgroups based on pre-defined criteria. Unfortunately, cluster analysis is an inherently unstable technique with highly variable solutions depending on the exact algorithm used, which additionally provides no guarantee of cluster meaningfulness. It is, therefore, critical to interpret them in the light of both statistical quality and pertinence.
Predictive models are used to predict an outcome given a feature pattern. In their development, the aim is to isolate systematic trends of interest, while avoiding sample-specific random patterns, so that predictive generalizability is optimized.
In practice, this is achieved by balancing the prediction error between data used in the model development (training data) and data not yet seen by the model (evaluation data), so that the training error neither is markedly smaller (overfit) or markedly larger (underfit) than the evaluation error.
With multiclass outcomes, multinomial logistic regression is a common choice to estimate the probability of class memberships. Non-parametric machine learning algorithms are a powerful alternative that offers intrinsic support for capturing complex non-linearities and higher-order feature interactions, but it requires hyperparameters (settings) to be calibrated before model training for optimal performance.
Succinctly, a support vector machine identifies a decision boundary in multidimensional space that maximizes the margin between outcome classes, with orientation and direction guided by support vectors (data points near the boundary). Gradient tree boosting sequentially trains an ensemble of decision trees (a set of consecutive decisions) and combines them into a composite prediction model. With each iteration, higher weight is placed on misclassifications so that they are boosted (prioritized) in subsequent decision trees, thereby capturing data patterns with a gradually increasing level of detail. Finally, a multilayer perceptron is an artificial neural network that combines linear and non-linear functions to approximate data patterns.
Unlike multinomial logistic regression, non-parametric machine learning algorithms do not return coefficients that allow the association between individual features and the outcome to be observed directly. Instead, permutation can be used to determine their contribution by iteratively shuffling each feature randomly and measuring the effect on model performance.
For non-collinear features, permutation importance is a fair measure; however, the importance of collinear features is underestimated in rough proportion to the strength of the collinearity and may require separate evaluation.
This analysis could be broken down into 3 general steps: sickness absence was initially described across the 7-year period, patients were then separated into subgroups based on their post-t0 patterns, and finally predictors of sickness absence were identified in a prediction modeling procedure. An overview is provided below, and further details are available in the supplementary materials: figures 2.1 to 2.2 illustrate the sickness absence for the full sample and patient subsamples, figures 3.1-3.6 provide support for pattern subgrouping, and figures 4.7 to 4.18 detail the prediction modeling protocol.
Sequence analysis was used to examine the sickness absence related to the chronic pain spectrum (R v4.0.2; package: ‘TraMineR’ v2.2-0.1).
To describe it over the 7-year period, sequences were constructed with a daily resolution, each patient was assigned to 1 of 5 mutually exclusive states, and simultaneously occurring states were prioritized in the following order: full disability pension, full sick leave, partial disability pension, partial sick leave, and no sickness absence. Limited to the period post t0, the dissimilarity matrix was then generated from the sequences through optimal matching with a constant cost of 1 for insertion/deletion and 2 for substitution, after which it was analyzed with monothetic divisive hierarchical clustering to isolate pattern subgroups (R v4.0.2; package: ‘WeightedCluster’ v1.4-1).
Targeting state incidence, total state duration, spell timing and duration, frequency of subsequent patterns, and pattern-related measures, the algorithm searched for solutions with up to ten clusters. The statistical quality of each solution was then evaluated using the average silhouette width (range: -1, 1), the point-biserial correlation (range: -1, 1), and Hubert's C (range: 0, 1) as measures of cluster membership cohesion, the clusters’ capacity to reproduce the distance matrix, and the partition obtained compared to the best possible partition given the distance matrix and the cluster solution, respectively.
Guidelines for the average silhouette width define 0.5 as the threshold for a reasonable structure, while higher and lower estimates indicate better quality for the point-biserial correlation and Hubert's C, respectively.
Once the solution with the highest statistical quality was identified, its robustness was evaluated using 2 strategies. Firstly, the same clustering protocol was repeated in separate analyses of the data randomly partitioned into fifths. Secondly, the complete sample was re-analyzed with alternative clustering algorithms (ie, variations of agglomerative hierarchical clustering and partitioned around medoids algorithms described by Studer et al, 2013).
Finally, the selected solution was modified in accordance with theoretical considerations to ascertain clinically meaningful clusters.
With outcome classes defined, predictive features were identified in a modeling procedure that compared the performance between multinomial logistic regression, a support vector machine, gradient tree boosting, and a multilayer perceptron (Python v3.8.3 libraries: ‘scikit-learn’ v0.23.2; ‘XGBoost’ v1.2.0; ‘Tensorflow-Keras’ v2.4.0).
in: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. Association for Computing Machinery,
2016: 785-794
Model optimization was conducted in a 3-step nested cross-validation procedure to minimize information leakage, with data randomly split into 3 training sets (n × 8/30 each), 1 validation set (n × 3/30), and 1 test set (n × 3/30).
In the first step, the non-parametric machine learning-algorithms’ hyperparameters were calibrated through consecutive grid searches, whereby various model configurations were iteratively trained and evaluated on the first and second training sets, respectively.
The initial searches were broad to provide a general direction for the hyperparameters, while subsequent searches fine-tuned them. In the second step, the feature configuration with the best predictive performance was identified for each algorithm, whereby models were trained on the first and second training sets combined and evaluated on the third training and validation sets. The full 101-feature model was initially trained and the importance of individual features was determined based on their predictive accuracy through a 100-repetition permutation; highly collinear features (r > 0.7) were alternated to avoid an underestimation of their importance. Subsequently, a model with the top 15 features was trained, followed by a stepwise feature elimination until only features that contributed to performance remained. The optimized models were then compared between the algorithms and the best performing model was selected as the final model. In the last step, the generalizability of the final model was confirmed on the test set and its robustness against systematic sample differences examined through sensitivity analyses of 8 separate subsamples: aged ≤ 50 and > 50 years, Swedish and non-Swedish born, t0 < 2009 and ≥ 2009, and selected and non-selected for IDT.
Throughout the modeling procedure, 2 strategies were used to mitigate accuracy overestimation due to outcome class imbalance: the non-parametric machine learning-algorithms’ training misclassification was penalized inversely proportional to the class frequencies and all algorithms’ performance was evaluated through metrics sensitive to class imbalance.
The balanced accuracy and F1 score were chosen for overall performance, while sensitivity, specificity, the positive and negative predictive values, accuracy, and area under the ROC-curve provided a detailed assessment of individual class performance.
All metrics range between 0 to 1, with higher values indicating better performance.
Results
Sample Characteristics
Of 54,338 patients considered for IDT at a Swedish pain specialist clinic in the period from 2005 to 2016, 44,241 (81.4%) were included in the analysis. Excluded patients were either assessed outside the observation period (n = 929), were not within the selected age range (n = 1,960), had a pain duration of less than 90 days (n = 5,770), or had a registered neoplasm diagnosis in the 5 years preceding t0 (n = 1,438). Of the included patients, 70.0% were younger than 50 years of age, 83.1% had completed a secondary school education or higher, and 64.5% were employed. Pain duration had exceeded a year for 89.6%, with 35.3% having widespread pain, and the most common primary diagnosis being fibromyalgia at 13.6%. At t0, 37.1% and 12.8% were on ongoing sick leave or a disability pension related to chronic pain, respectively, while 24.6% and 10.7% had over 180 net days of pain-related sick leave or disability pension in the past year, respectively. Table 1 details the sample characteristics.
Fig 1A presents the sample's sickness absence across the 7-year observation period (ie, 5 years before to 2 years after t0). Participants collecting benefits related to chronic pain increased from 17.4% at the start to 49.3% at 2 months past t0, to then decrease to 38.0% at the end of follow-up (Fig 1AI). In contrast, benefits due to other diagnoses remained comparatively constant across the same period at roughly 10% (Fig 1AII). Sickness absence was unevenly distributed between patients so that 21.7% received no benefits, while 19.9% accounted for 56.2% of the total benefits over the 7 years. A closer inspection of the transitions between sickness absence states revealed the dynamics of category exchange (Fig 1B). In line with the sample state distribution, most transitions took place between the no benefits and sick leave states, while an increasing number of patients transited from sick leave to disability pension over time. Finally, a consistent but non-negligible amount also transitioned from disability pension to no benefits.
Figure 1Sample sickness absence. (A) Chronograms of the sample's daily transversal state distribution; the main (I) and inset (II) plots show benefits related to chronic pain and other diagnoses, respectively. (B) Alluvial diagram of the sample's state transitions across the first day of each year. (C) Chronograms per cluster for low sickness absence (I), sick leave (II), and disability pension (III). (D-F) Cluster-wise sequence index plots of daily individual patient patterns. The dotted vertical line marks the time of the IDT assessment (t0).
Restricting the period to the 2 final years, the complete sample was represented by 15,417 unique sickness absence patterns. Half of the participants remained in the same state the entire period: 32.6% with no benefits, 3.3% with partial sick leave, 4.9% with full sick leave, 4.3% with partial disability pension, and 4.9% with full disability pension; while 4.3% improved from > 180 days of gross sick leave in the first to none in the second year and 0.9% deteriorated in a reverse pattern. Based on the patterns, a 5-cluster structure driven by total state duration was identified as the best solution. The largest cluster corresponded to patients with low sickness absence (n = 25,294), while the remaining 4 clusters were dominated by partial sick leave (n = 5,416), full sick leave (n = 6,924), partial disability pension (n = 2,884), and full disability pension (n = 3,723). The solution had an acceptable statistical quality (overall: average silhouette width = 0.73, point-biserial correlation = 0.91, Hubert's C = 0.005; cluster-wise: average silhouette width = 0.59–0.78) and its robustness was supported by both the split-sample analyses and the alternative clustering algorithms. However, some theoretical inadequacies still warranted modification to improve the clusters’ clinical relevance. Firstly, full and partial sick leave clusters were combined, as they are poorly defined and overlapping in practice. Secondly, patients with permanent disability pension in the 5 years prior to t0 were excluded (n = 9,893), as their unlikeliness to change state would inflate predictive performance. Finally, full and partial disability pension were combined due to their low prevalence and patients with any disability pension following t0 were re-assigned to the disability pension cluster. This resulted in a 3-cluster solution (Fig 1C-1F; Table 2, Table 3) characterized by patients with: either no sickness absence or a fast recovery from benefits (low sickness absence class; n = 21,970), a considerable amount of sick leave who either ended on sick leave or recovered during the second year (sick leave class; n = 9,660), and a substantial amount of sickness absence who transited from sick leave to disability pension (disability pension class; n = 2,718).
Table 2Sample Sickness Absence and Health Care Utilization During the Final 2 Years
Fig 2 illustrates the predictive performance of the 4 algorithms. The non-parametric machine learning techniques performed similarly to each other and consistently outperformed the multinomial logistic regression. The full 101-feature model (Fig 2A) correctly predicted nearly 70% of patients across the outcome classes with the non-parametric algorithms on both the validation and the test set (balanced accuracy: support vector machine = 0.669/0.669; gradient tree boosting = 0.685/0.701; multilayer perceptron = 0.667/0.675). In comparison, the average performance was 60% for the multinomial logistic regression (balanced accuracy: 0.595/0.595), with only 25% of patients on disability pension correctly classified. Domain-specific models revealed that whereas sickness absence history contributed the most to overall performance, features related to sociodemographics and the chronic pain experience mainly improved disability pension predictions (Fig. 2C-F). Ultimately, 1 model per algorithm was identified (Fig 2B) that achieved similar performance to their respective full models (balanced accuracy: multinomial logistic regression = 0.574/0.567; support vector machine = 0.666/0.667; gradient tree boosting = 0.684/0.681; multilayer perceptron = 0.659/0.659). Ongoing chronic pain-related sickness absence at t0, the 2008 policy indicator, and age were identified as important predictors of future sickness absence by all algorithms. The non-parametric algorithms further identified confidence in recovery, gross sick leave in the second year prior to t0, and the southern Sweden geographical location. Meanwhile, gross sick leave in the past year, number of pain locations, and whether the patient was enrolled in a current IDT program was individually identified by the gradient tree boosting, the support vector machine, and the multilayer perceptron, respectively. The gradient tree boosting was selected as the final model since it slightly but consistently outperformed the other algorithms. Sensitivity analyses supported that it was robust to sample restriction (balanced accuracy): aged ≤ 50 and > 50 years (0.694 vs 0.599), Swedish and non-Swedish born (0.680 vs 0.710), t0 < 2009 and ≥ 2009 (0.676 vs 0.633), and selected and non-selected for IDT (0.662 vs 0.692).
Figure 2Predictive performance of the 4 algorithms. Balanced accuracy and class-wise sensitivities per algorithm for the full (A), final (B), and domain-wise models (C-F) when estimated on the training set (trn) and evaluated on trn, the validation set (val), and the test set (test), respectively. The dotted horizontal lines mark 0.33 and 0.70 of balanced accuracy. MLR, multinomial logistic regression. SVM, support vector machine. GTB, gradient tree boosting. ANN, multilayer perceptron artificial neural network.
Of the 8 features included in the final model, half were measures of sickness absence history, whereas the other half consisted of the 2008 policy indicator, age, confidence in recovery, and the southern Sweden geographical region (Fig 3B). Fig. 3C-D illustrate the average change in outcome class probability per feature. Sickness absence history had a strong association with future sickness absence. An increase in gross sick leave from zero to 365 days in either the past year or the second year prior to t0 decreased the probability of low sickness absence with more than 0.30 and 0.10, respectively, while the corresponding estimate was roughly 0.20 for both an ongoing full or partial sick leave spell at t0. Meanwhile, the remaining features were most strongly associated with future disability pension. An increased age from 18 to 64 years, a t0 before the 2008 policy, low confidence in recovery, and the southern Sweden region increased the probability of future disability pension by roughly 0.30, 0.30, 0.20, and 0.10, respectively. Intriguingly, past-year sick leave days displayed a non-linear interaction with age, whereby sick leave days largely influenced both whether patients below 30 years would have low sickness absence and whether patients older than 50 years would receive disability pension (Fig 3D).
Figure 3Gradient tree boosting final model details. (A) Cross-tables of observed versus predicted class memberships. The upper left shows performance by chance for a model estimated on the training set (n = 35,392) and evaluated on the test set (n = 4,425). The upper right, lower left, and lower right show the final model's performance when estimated on the training set and evaluated on the training, validation (n = 4,424), and test sets, respectively. Square brackets denote balanced accuracy. LSA, low sickness absence class. SL, sick leave class. DP, disability pension class. (B) Permutation feature importance for the final model, measured through the mean change in balanced accuracy across 100 repetitions. (C) Partial dependence plot of the marginal change in probability for predicted class membership with minimum and maximum values contrasted (ie, from no to yes for dichotomous features, from low to high for confidence in recovery, and from 0 to 365 days for gross sick leave). (D) Partial dependence plot of marginal change in probability for class membership with the interaction between age and gross sick leave days the past year. The figure illustrates the topography of probabilities, with the numbers on the borders showing thresholds for each color region. For example, a patient at age 50 with a gross past-year sick leave of 200 days has an increased probability of 0.32, 0.37, and 0.25 for LSA, SL, and DP, respectively (squares).
The final model had a balanced accuracy of 68% on the test set compared to 33% by chance, with 64 to 70% of true positives and 82 to 87% of true negatives identified across the outcome classes (Fig 3A; Table 3). However, a consequence of the imbalanced class distribution was that only 1 in 4 positive disability pension predictions were correct (positive predictive value = 0.256). The model was therefore primarily useful in discriminating between patients with low and high future sickness absence; most mispredictions were in the most similar class and the high sickness absence classes were typically misclassified as each other. When sick leave and disability pension classes were merged, performance was markedly increased (balanced accuracy = 0.805; F1 score = 0.794; sensitivity and specificity ≥ 0.794; positive and negative predictive values ≥ 0.701).
Discussion
This study described the sickness absence over a 7-year period in 44,241 chronic pain patients evaluated at specialist IDT clinics and identified predictors of sickness absence. Our results showed that study participants with social insurance benefits increased from 17% 5 years before to 48% at the time of the IDT evaluation to then decrease to 38% 2 years after. Eight predictors of future sickness absence were identified. Sickness absence history was the strongest predictor, while other predictors included the 2008 policy, age, confidence in recovery, and geographical region.
Consistent with previous research, our results show that sickness absence caused by musculoskeletal disorders, mental disorders, and injuries is high in chronic pain patients.
Sickness absence: causes, consequences, and physicians' sickness certification practice. A systematic literature review by the Swedish Council on Technology Assessment in Health Care (SBU).
With data centered at IDT evaluation, nearly a fifth of our sample received benefits at any given time, with the mean annual net sickness absence ranging from 54 to 145 days per patient. In comparison, official Swedish statistics have reported that the general population in the age bracket 16 to 64 years received benefits for 27 to 43 days across the period 2000 to 2016.
These figures are not directly comparable, as they were computed over relative versus absolute time periods, respectively. Nevertheless, our lowest estimate was higher than their upper extreme and the difference in estimates is likely to be conservative as our data was from a period corresponding to lower benefits received by the general population.
The sample trend of monotonously increasing benefits that peaked in the first year post-IDT evaluation could be attributed to specialist IDT referral procedure that prioritizes patients with complex clinical presentations. The subsequent decrease in benefits indicates that the IDT procedure represents a sorting point from temporary sick leave to either no benefits or a disability pension. This pattern was consistent in all sensitivity analysis subsamples (Figure 2.2 in supplementary materials). When comparing the trajectories of the selected and non-selected IDT patients, a more pronounced decrease in benefits was observed in the former, which could represent either a treatment or selection effect. A previous study restricted to a subsample of our data observed patients selected for IDT from 1 year before to 2 years after the intervention and reported a similar pattern in benefits.
However, whereas they observed that the benefits had decreased to the previous-year level already 1 year after evaluation, the benefits in our sample had still not decreased to that level at the end of follow-up. This discrepancy was due to markedly lower pre-evaluation benefits in our study and is unlikely to be caused by methodological differences, as we approximately reproduced their results when restricting the data to their sampling criteria. Instead, it is more likely to be related to the sample time frame.
As previously reported, sickness absence was unevenly distributed so that a minority accounted for most of the social insurance benefits.
To reflect these differences, patients were divided into 3 classes that were used as outcome and resulted in the identification of 8 associated features. Consistent with previous research, sickness absence history was the strongest predictor of future sickness absence and accounted for most of the final model's performance.
Sickness absence due to back pain or depressive episode and the risk of all-cause and diagnosis-specific disability pension: A Swedish cohort study of 4,823,069 individuals.
Interestingly, the performance was specific to past benefits linked to chronic pain, which aligned with previous reports of the strongest association within diagnosis category.
Sickness absence due to back pain or depressive episode and the risk of all-cause and diagnosis-specific disability pension: A Swedish cohort study of 4,823,069 individuals.
Both ongoing sick leave at the IDT evaluation and cumulative sick leave for the 2 preceding years were positively associated with future sickness absence. This is similar to earlier findings, where the sick leave extent at baseline predicted the duration of future sick leave spells and cumulative sick leave was a strong risk factor for future disability pension.
To a lesser extent, age, confidence in recovery, the 2008 policy indicator, and geographical location also predicted future sickness absence. There are several reports of a positive association between age and both sick leave duration and the risk of disability pension.
Our results suggested that the likelihood of low sick leave decreased with age and that age largely drove the risk of disability pension, which was the highest in patients over 50 years of age. Meanwhile, confidence in recovery was inversely associated with disability pension, which aligns with an earlier review that found evidence for high confidence in recovery as a predictor of return-to-work.
Since sickness absence history, age, and confidence in recovery are recurrent prognostic factors of future sickness absence in the literature, it is likely that they are directly informative in predictions for future patients. Regarding the 2008 policy, patients evaluated in the specialist healthcare after 2008 were less likely to receive disability pension, which is in accordance with previous reports; however, the extent to which the change is attributable to the policy alone is uncertain.
Likewise, geographical region was associated with disability pension, whereby patients in southern Sweden were at higher risk. Municipal differences were also previously observed and plausible causes for the discrepancy are differences in demographics, work-related factors, health care services, and policy.
Whereas both the 2008 policy and geographical region were important predictors in the analyzed sample, the former is not informative for future patients and the latter could be of limited value depending on its underlying causes. These findings nonetheless highlight the importance of spatio-temporality in sickness absence predictions. In contrast to the aforementioned results, several previously reported risk factors of sickness absence including sex, education, employment status, and multimorbidity did not improve predictions of the final model.
Lastly, separating patients with high sick leave and disability pension enabled identification of features specifically associated with disability pension, but the final model's accuracy was not sufficient for individual patient predictions in this class. Instead, an acceptable 80% accuracy was obtained by discriminating only between patients with low or high future sickness absence. To optimize personalized intervention, it could be useful to begin conceptualizing such models as decision support tools into clinical practice. Until then, sickness absence history, age, and confidence in recovery are important factors to consider when allocating rehabilitation modules that target absenteeism.
Our results rest upon a rigorous analysis of high-quality data from a large population-representative sample of chronic pain patients in a specialist treatment setting. The results should generalize to similar patients in Sweden, but predictions could be impaired if the underlying circumstances change, as prediction models are inherently non-causal. Five main limitations should be considered when interpreting the results. Firstly, that non-musculoskeletal diagnoses were pooled under the chronic pain spectrum necessitated the assumption that they were a consequence of the condition, which was reasonable when considering the sample characteristics. Secondly, the non-registered sick leave during the qualifying period introduced some bias, as recurrent short spells were misclassified as no benefits. The consequently slightly underreported sick leave was unlikely to influence the predictions, however, as spells below the employee-qualifying period were omitted for all patients. Thirdly, absence of microdata on migration, death, and retirement resulted in slightly underreported sickness absence due to misclassification as no benefits. The Swedish official statistics support that migration contributed the most bias, corresponding to 1.1 to 2.1% of the general population annually, while the annual death rate for ages 18 to 64 and the proportion retired in the ages 61 to 64 each correspond to 0.1%.
Fourthly, the prediction model is contingent on the validity of the outcome classes. Combining full and partial benefits decreased the clustering solution's statistical quality but was theoretically motivated as the degree of benefits are poorly resolved, owing to their basis on ordinary work hours rather than an absolute amount. This has been done previously and was supported by the inherent class ordering of the prediction model.
The visual presentation of the classes further provides an easily apprehensible patient overview, so that the class quality is directly observable (Fig 1D-F). In addition, targeting chronic pain benefits alone resulted in that patients in the low sickness absence class were slightly more likely to collect benefits for other diagnoses, as parallel benefits have a legislated upper limit. Adding this information to the model did not, however, increase its performance. Finally, the predictive analyses were limited to the data available in the registers. This prevented examination of psychosocial occupational stressors and lifestyle factors such as physical activity, body weight, smoking, and alcohol intake, which have previously been identified as important prognostic factors of sickness absence in non-clinical populations.
Similarly, data quality issues provide an alternative explanation for that certain features were not selected in the final model. This included noise owing to missingness between 20 to 30% in some chronic pain experience questionnaires and misclassification in employment status.
Conclusions
Patients evaluated at specialist IDT clinics displayed an overall trend of increased sickness absence from 5 years before to the time of the assessment, that then decreased until the end of follow-up. Eight features were identified that were associated with sickness absence in the 2 years following the IDT evaluation. Sickness absence history was the strongest predictor of future sickness absence, while other predictors included the 2008 policy, age, confidence in recovery, and geographical region. These results could guide personalized intervention in the specialist healthcare.
Acknowledgments
The authors thank the patients, the specialist treatment clinics, and the administrative personnel at the Swedish National Registers for their efforts in making this study possible.
Sickness absence: causes, consequences, and physicians' sickness certification practice. A systematic literature review by the Swedish Council on Technology Assessment in Health Care (SBU).
Implementation of evidence-based rehabilitation for non-specific back pain and common mental health problems: A process evaluation of a nationwide initiative.
The individual and societal burden of chronic pain in Europe: the case for strategic prioritisation and action to improve knowledge and availability of appropriate care.
in: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. Association for Computing Machinery,
2016: 785-794
Sickness absence due to back pain or depressive episode and the risk of all-cause and diagnosis-specific disability pension: A Swedish cohort study of 4,823,069 individuals.
Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: A systematic analysis for the Global Burden of Disease Study 2016.
This study was funded by the Swedish Research Council (Vetenskapsrådet: 2015-02512) and by the Swedish Research Council for Health, Working Life and Welfare (FORTE: 2016-07414 and 2017-00177).