Evaluation of joint awareness after acetabular fracture: Validation of the Forgotten Joint Score according to the COSMIN checklist protocol

2021-03-11ViolaFreigangJohannesWeberKarolinaMuellerChristianPfeiferMichaelWorlicekVolkerAltFlorianMichaelBaumann

World Journal of Orthopedics 2021年2期

Viola Freigang, Johannes Weber, Karolina Mueller, Christian Pfeifer, Michael Worlicek, Volker Alt, Florian Michael Baumann

Viola Freigang, Johannes Weber, Christian Pfeifer, Michael Worlicek, Volker Alt, Florian Michael Baumann, Department of Trauma Surgery, Regensburg University Medical Center, Regensburg 93042, Germany

Karolina Mueller, Center for Clinical Studies, Regensburg University Medical Center,Regensburg 93042, Germany

Abstract BACKGROUND A fracture of the acetabulum is an uncommon, but serious injury. Established outcome tools do not reflect the patient’s perspective after fracture of the hip joint. Originally designed for post-arthroplasty patients, the Forgotten Joint Score (FJS) is a patient-reported outcome measurement (PROM) tool evaluating the diseasespecific health-related quality of life (HR-QoL).AIM To validate the FJS in patients after acetabular fracture.METHODS In a prospective mono-centric cohort study, we evaluated 100 patients at mean 5.2 ± 3.6 years after a fracture of the acetabulum. The validation study followed the complete COSMIN checklist protocol. For calculation of convergent validity, we used the Tegner-Activity Scale, the Western Ontario and McMaster Universities Osteoarthritis Index, the EuroQol-5D, and a subjective rating of change as an anchor variable.RESULTS We confirmed good internal consistency with a Cronbach‘s alpha of 0.95. With an intraclass correlation coefficient of 0.99 (95%CI: 0.97, 0.99), test-retest reliability of the FJS was excellent. Correlation coefficients between the questionnaires were moderate to high ranging from |0.56| to |0.83| (absolute value). No relevant floor or ceiling effects occurred. Standard error of measurement was 3.2 and smallest detectable change (SDC) was 8.8. Thus, changes greater than 8.8 points between two assessments denote a real change in FJS.CONCLUSION The FJS is a valid and reliable tool for evaluation of patient-reported outcome in posttraumatic condition after acetabular fracture. The SDC indicating a real clinical improvement was 8.8 points in the FJS. We could confirm responsiveness of the FJS and found no relevant floor- or ceiling effects.

Key Words: Patient reported outcome measurement; Validation study; Hip joint; Forgotten Joint Score; Acetabulum fracture; COSMIN checklist

INTRODUCTION

A fracture of the acetabulum is an uncommon but severe injury to the hip joint. Initial management of these injuries is crucial regarding long-term outcome[1-7]. Persistent immobilizing pain and post-traumatic osteoarthritis are typical consequences in cases where the integrity of the acetabulum is not adequately reconstructed. Open reduction and internal fixation (ORIF) is the standard treatment for displaced acetabular fractures. Risk factors for a poor radiographic outcome like residual articular surface gaps and steps as well as involvement of the posterior wall and the quadrilateral surface are well known[2,4,8,9]. However, the impact of these radiographic parameters on Patient-reported outcome (PRO) is unclear. Conventional scoring systems like the Merle d’Aubigne-Postel Score are rather surgeon-based neglecting the patient’s perception of the outcome[10].

The “Forgotten Joint Score” (FJS) was developed in 2012 as a PRO measurement tool in patients after arthroplasty of the knee or hip joint[11]. The concept that the patient rates the loss of awareness of the hip joint is revolutionary in patient-reported outcome measurement (PROM). Joint awareness in everyday life is seen as an important criterion in activity of daily living (ADL)[11-13]. The forgotten joint, a condition where the patient has no distracting sensation of the joint, is seen as the ultimate goal resulting in maximum patient satisfaction[11]. Currently, the Merle d’Aubigne-Postel Score and the Western Ontario and McMaster Universities Osteoarthritis Index-Visual Analog Scale (WOMAC-VAS) Score are most frequently used in functional outcome evaluation after acetabular fractures[2,4,8,9,14]. Other scales like the EuroQol-5D (EQ-5D-3L) and the Tegner Activity Scale (TAS) are tools to rate global health-related quality of life and sports activity for characterization of a patient population.

Studies on psychometric properties of outcome tools should meet highest standards regarding methodological quality[15]. The COSMIN checklist is based on an international Delphi study in 2010 reporting on a consensus-based checklist evaluating the quality of studies on psychometric properties of HR-QoL instruments[16]. The COSMIN checklist consists of ten items giving recommen-dations for design, conduction and interpretation of medical validation studies. Relevant characteristics are study design, content validity, structural validity, internal consistency, crosscultural validity/measurement invariance, reliability, measurement error, criterion validity, hypotheses testing for construct validity, and responsiveness.

Purpose of this study was to validate the FJS for mid- and long-term condition after acetabular fractures and to investigate the relation between perioperative parameters and score values of the FJS.

MATERIALS AND METHODS

Validation study

The ethics committee at the University of Regensburg approved this study in December 2015 (Institutional Review Board Number 15-101-0241). We obtained written informed consent from all study participants. The study was carried out in accordance with the Declaration of Helsinki.

In this prospective, mono-centric validating study, we identified 100 consecutive German-speaking patients with a history of an acetabular fracture between 07/2002 and 06/2016.

Inclusion criteria were: (1) Mid- and long-term condition after acetabular fracture; (2) Minimum follow up was 12 mo after trauma; (3) Age between 18 and 70 years; and (4) Sufficient German reading and comprehension capacity.

Exclusion criteria were: (1) Relevant concomitant injuries (e.g., a fracture of the lower extremity or neurovascular injury leading to prolonged immobilization); (2) Preexisting mental disorder; and (3) Lack of consent to participate in this study.

Of initial 296 patients who were treated in a single institution for an acetabular fracture between 07/2002 and 06/2016, 32 patients had died until follow-up, 102 patients were lost to follow-up, 58 patients refused to give their informed consent. Four patients were excluded due to missing data. The patients were asked to complete the questionnaire at follow-up evaluation at mean 5.2 ± 3.6 years after the injury (time point T1). The patients received the questionnaire two weeks after the first questionnaire (time point T2) to evaluate the test-retest reliability and responsiveness. If the patients did not return the questionnaire form within four weeks, the patients were reminded at intervals of two weeks’ time. 55 of 100 patients completed the questionnaire twice on average after 32 ± 71.4 d. Figure 1 shows the patient flow chart.

FJS

The Forgotten Joint score is a self-administrated questionnaire comprising of 12 items concerning on the patient’s ability to forget the hip joint in everyday life[11]. Meanwhile, the loss of awareness of a joint is seen as the ultimate goal leading to maximum patient satisfaction. The FJS was developed in 2012. Several studies have proven high internal consistency, responsiveness, and construct validity in mid- and long term conditions after arthroplasty[11,13,17-19]. Originally, it was developed to evaluate PRO in patients after arthroplasty of the knee or hip.

The patients were asked to return the forms by mail. For evaluation of test-retest reliability, the patients were supposed to complete a second questionnaire after a minimum of two weeks. The patients received a reminding call if they did not answer within two weeks.

WOMAC-VAS

The WOMAC-VAS is a well-established standardized questionnaire to evaluate the disease-specific health-related quality of life of patients with osteoarthritis of the knee and hip comprising of 24 questions each rated on a VAS (0-100 mm)[20]. Following scores were computed: Subscale pain (0-500 mm), subscale stiffness (0-200 mm), subscale functional limitation (0-1700 mm), and a summary score (0-2400 mm). A WOMAC-VAS score of 0 indicates no pain, no stiffness, and full function. The higher the score is, the more problems the patient is facing. The WOMAC-VAS is validated in German[21].

TAS

The TAS is a well-established score reflecting the patient’s highest level of physical activity on a 10 level scale[22]. The TAS is the most commonly reported tool for physical activity in patients with lower extremity disorders. A TAS score of ten reflects the functional capacity of a top-level athlete, a score of 0 reflects the inability to walk. A German adaption is available[23].

Figure 1 Patient flow-chart of study population.

EQ-5D-3L

The EQ-5D-3L is a HR-QoL questionnaire consisting of subscales regarding mobility, self-care, usual activities, pain/discomfort, and anxiety/depression[24,25]. Responses to each dimension were transformed into EQ index ranging from -0.21 (worst health) to 1.00 (best health). Additionally, the EQ-5D reports the patient’s assessment of the current HR-QoL in a 100 mm visual analogue scale. This scale ranges from 0 (the worst health you can imagine) to 100 (the best health you can imagine).

Subjective assessment

We asked the patients to evaluate whether the condition of their hip joint had changed since primary evaluation (much better, somewhat better, unchanged, somewhat worse, much worse). For calculation of the responsiveness, the anchor variable was summarized to ‘better’, ‘unchanged’, or ‘worse’.

Clinical data

We reviewed all digital patient charts to record relevant clinical data. Our institution is a level-one trauma-center specialized on management of pelvic injuries. Standard diagnostics at time of injury was a clinical evaluation and a computed tomography (CT) scan of the pelvis. The initial degree of initial and postoperative dislocation was rated according to the Matta scoring system for evaluation of dislocation after acetabular fracture[8]. There was an indication for conservative treatment in cases with no relevant dislocation (less than 2 mm and no involvement of the dome area) measured in the initial CT scan. We recorded clinical parameter like treatment modality (operativevsnon-operative/fracture classification according to Letournel/surgical approach) and compared mean values of the FJS between the groups.

Statistical analysis

Statistical analysis was performed using the software package SPSS (Version 25, SPSS Inc, Chicago, IL, United States). The level of significance was defined atP＜ 0.05 for all tests. Analyses were exploratory, thus no adjustments ofPvalues for multiple testing were conducted. Questionnaires were computed in accordance with guidelines. If not stated in questionnaire guidelines, missing values were not imputed.

Descriptive data were given as frequencies (n) and percentage (%) for categorical variables, mean ± SD or median and interquartile range (IQR) for continuous variables.

Methodological testing according to the COSMIN checklist

We evaluated the reliability (internal consistency, test-retest reliability, and measurement error), validity (convergent validity and clinical validity), responsiveness to change, and interpretability (data completeness and response distribution) of the FJS based on the COSMIN checklist.

Reliability

Reliability is the degree to which the measurement is free from measurement error[16].

Internal consistency

Internal consistency is the degree of interrelatedness among items[16]. Sufficient internal consistency iukrs assumed for a Cronbach’s α ≥ 0.70[15].

Test-retest reliability

Test-retest reliability is the degree of which an outcome of the same patient in the same health condition remains unchanged over time. As the time interval between first and second testing were heterogeneous (0 to 385 d), intraclass correlation coefficient (ICC, two-way mixed model with absolute agreement, average measures) and its 95%CI was estimated indicating an unchanged health condition regarding their hip function compared to the primary evaluation. For an ICC ≥ 0.70 sufficient testretest reliability was assumed.

Measurement error

The measurement error is the systematic and random error of a patient’s score that is not attributed to true changes in the construct to be measured[16]. For patients indicating no change in the condition of their hip joint, standard error of measurement (SEM) and smallest detectable change (SDC) were computed to estimate how much a score needs to change to be sure that a true change has occurred (sensitivity to change). SEM was computed by multiplying the standard deviation (SD, all assessments of patients with unchanged condition) by the square root of 1 minus reliability (ICC): SD × (1-ICC)1/2. SDC was computed by SEM × 1.96 × 21/2[15].

Validity

Validity is the degree to which a questionnaire measures the construct it purports to measure[16]. There is no gold standard in the measurement of PRO. Therefore, we rated validity as convergent and clinical validity. Convergent validity is the degree to which the score of the FJS is comparable with the scores of other functional questionnaires (TAS, WOMAC-VAS, EQ Index, EQ VAS). We measured convergent validity by Spearmen’s rank correlation. With a correlation coefficients |≥ 0.60| (absolute value), convergent validity was rated positive. Based on previous results, it is expected that FJS correlates negative and high ≤ -0.60 with WOMAC-VAS subscales and WOMACVAS summary score[11], correlates positive and moderate between 0.30 and 0.50 with EQ Index, EQ VAS and TAS[13]. We measured clinical validity of FJS by known-groups comparison usingU-test to assess differences between patients with and without articular displacement. Just the first FJS measurement of patients without total hip arthroplasty was assessed.

Responsiveness

Responsiveness is the ability of a questionnaire to detect a change over time in the construct to be measured[16]. Changes in FJS were assessed by Wilcoxon tests separate for patients indicating improvement, no change or worsening of hip functioning. Moreover, Cohan’s d effect size (ES) was computed for each patient group by the mean difference between measurement at T1 and T2 by the SD of measurement A: mean difference measure A and B/SD measure A. ES values of 0.2, 0.5, and 0.8 indicate small, moderate and large changes/ responsiveness[15,16]. To assess whether patients with different hip function development (improvement, no change, worsening) differed in baseline FJS, Kruskal-Wallis H-Test was computed.

Interpretability

Interpretability is the ability to transform a qualitative effect into a quantitative score[16]. Interpretability of the FJS was assessed by presenting data completeness and response distribution (floor and ceiling effects). Extreme outcome values might not be represented adequately if more than 15% of patients score lowest (0) value (floor effect) or highest (100) (ceiling effect) levels in the FJS. In cases of floor- or ceiling effects, the questionnaire might also not be able to reflect changes[15].

RESULTS

Demographic data

Table 1 shows mean score values at initial assessment and demographic data. The cohort comprised of 79% men and 21% women. All patients had sustained an acetabular fracture 1 to 14 years before. 22% of these patients were treated conservatively; 78% required surgical treatment. Of 78 patients with surgical treatment, 55 patients required open reduction and internal plate fixation. In 37 cases, an anterior approach (25 × ilio-inguinal and 12 × intrapelvic approach) was necessary. 18 patients required posterior fixationviaa Kocher-Langenbeck approach. We carried out operative management according to the Arbeitsgemeinschaft für Osteosynthesefragen (AO) principles (Figure 2). According to these recommendations, we aimed for anatomic reduction of the articular surface and internal fixation of the fragments. Aftercare was equal for all patients. We advised the patients to partial weight-bearing of 15 kg for 6 weeks and a step-wise increase of load of 10 kg per week until the patient’s normal weight was reached. For patients with an involvement of the posterior wall, internal rotation of the hip was limited for 6 weeks postoperative. 54/100 patients had an initial articular surface step of more than 3 mm and 46 patients had an initial articular surface step of 3 mm or less. Regarding long-term complications, 4 of 100 patients required a total hip arthroplasty (THA) due to rapid progressive joint degeneration after the acetabular fracture. There were 2/54 patients in the displacement group and 2/46 patients in the other group requiring THA until follow-up evaluation. The time between accident (T0) and follow-up (T1) was 5.2 ± 3.6 years (range 0.2-13.5). The mean age at follow up (T1) was 57.9 ± 17.6 years (range 22-88). The second questionnaire (T2) was completed on average 31.2 ± 71.4 d (range 0-385) after the first.

Reliability

The internal consistency of the questionnaire is satisfying, with Cronbach’s alpha = 0.95 for measurement at T1 (n= 83) and 0.97 for measurement at T2 (n= 50). Even with deleting an item, Cronbach’s alpha was high raging between 0.95 and 0.97.

The test-retest reliability was excellent with an ICC = 0.99 (95%CI = 0.97-0.99). We included only patients indicating an unchanged functional condition regarding their hip joint (n= 24).

SEM was 3.2 and SDC was 8.8. Thus, changes ＞ 8.8 points between two assessments denote real change in FJS and changes ≤ 8.8 denote for measurement errors and subject variability.

Validity

In both assessments, convergent validity between FJS and WOMAC-VAS, TAS, EQ Index as well as EQ VAS could be confirmed. Correlation coefficients were moderate to high ranging from |0.56| to |0.83| (absolute value). The higher the WOMAC-VAS (pain, stiffness, functional limitation and summary score), the lower the forgetting of the joint. The higher the TAS, EQ index and EQ VAS, the higher the forgetting of the joint (Table 2).

Clinical validity was assessed by comparing FJS scores between patients with articular displacement (＞ 3 mm,n= 54) and patients without articular displacement (≤ 3 mm,n= 46). Patients with (median = 3 8.6, IQR = 12.5/72.9) and without (median = 3 7.5, IQR = 16.6/67.8) articular displacement did not differ in FJS at follow-up evaluation T1 (Z= 0.005,P= 0.996).

We also evaluated clinical validity by comparing the different fracture patterns according to the Letournel classification. We found highest median values for anterior column and posterior hemi-transverse fractures (median = 12.5,n= 11), and lowest levels of the FJS in T-type fractures (median = 72.9,n= 3). However, there were only three patients with a T-type fracture within the study population. There was no significant difference in the FJS between the fracture types according to the Letournel classification (P= 0.795).

There was also no significant difference between the surgical approaches (P= 0.477). Lowest FJS levels (median = 15.6,n= 24) were found in patients undergoing ORIFviaan ilio-inguinal approach whereas patients with a Kocher-Langenbeck approach reported highest FJS levels (median 33.3,n= 18).

There was a significant difference in FJS (P= 0.019) comparing patients after conservative treatment (median 68.7,n= 22) to surgically managed patients (median 33.3,n= 78).

Table 1 Patient characteristics at initial assessment (T1)

Table 2 Detailed data on correlation of functional scores indicating a positive rating for convergent validity (r = Spearmen’s rank correlation coefficient) level of significance P ＜ 0.05

Responsiveness

Patients with different hip function development did not significantly differ in FJS at initial assessment (H2,55= 0.336,P= 0.845). In patients indicating improvement of hip function, the FJS significantly increased (mean difference = 9.4,Z= 3.465,P= 0.001,n= 21). This increase was higher than the SDC of 8.8 and had a small ES of 0.28. In patients indicating worsening of hip function, the FJS significantly decreased (mean difference = 17.1, Z = -2.402,P= 0.016,n= 10). This decrease was higher than the SDC of 8.8 and had a moderate ES of 0.59. Patients indicating no change in hip function did not show differences in FJS (mean difference = 1.1, Z = -0.328,P= 0.743,n= 24, ES = 0.03). Table 3 presents responsiveness data of the FJS.

Interpretability

The proportion of missing FJS item responses was 3.9% (47/1200) at first assessment and 1.2% (8/660) at second assessment. For four patients, the FJS at first assessment could not be computed as they had more than four missing responses and thus, these patients were excluded from data analyses.

There was no relevant floor effect [minimum score of 0: first assessment T1:n= 1 (1%), second assessment T2:n= 1 (1%)] and no relevant ceiling effect [maximum score of 100: first assessment T1:n= 1 (1%), second assessment T2:n= 2 (2%)] for the FJS.

DlSCUSSlON

The main result of this study is that the FJS is a valid and reliable tool for evaluation of PRO in posttraumatic condition after acetabular fracture. The SDC indicating a real clinical improvement after a change of 8.8 points between FJS assessments. We could confirm responsiveness of the FJS and found no relevant floor- or ceiling effects. This is the first study validating FJS in fractures of the hip joint according to the COSMIN checklist.

Table 3 Responsiveness of the Forgotten Joint Score based on the subjective rating of change

Figure 2 Initial X-rays of a patient with a both column fracture of the right acetabulum and follow-up X-rays 3.6 years after injury.

Study design and patient population

An acetabular fracture is an uncommon but serious injury to the hip joint[2,6,8,9,14,26]. The surgeon-based view on clinical and functional outcomes after acetabular fracture is well documented in literature[1,4,5,26-29]. Operative management has become the standard treatment for displaced fractures[2,6,8,26,28,30]. Even though technical advancements have led to major improvements in management of these injuries, there is still a relevant number of cases leading to poor clinical outcomes[5,6]. The presented outcome studies mainly reflect short-term outcomes with a focus on risk factors leading to early failure with posttraumatic rapid-progressive joint degeneration like quality of reduction or stability of the osteosynthetic fixation[2,4,5,7]. However, there is a variety of symptoms sensed by the patient and the patient’s perspective are neglected by most studies on QoL after acetabular fracture[1,2,5,7,8,31]. We chose to include patients with a mid-term result as well as long-term condition in this validating study to represent as many conditions as possible. Demographic data of our patient population is comparable to other studies on acetabular fractures with an average age of 57.9 years at time of injury[2,32,33]. Again, our population represents the broad spectrum regarding age distribution seen in acetabular fracture patients[33]. The FJS was originally designed for patients after arthroplasty of the knee and hip[11]. Therefore, it is compiled for an older, physically less active patient population. However, recent studies have proven reliability in younger patients[12,13,17-19,34]. The FJS measures the patient’s ability to forget the joint in everyday life, which is seen as ultimate goal resulting in maximum patient satisfaction[11]. This concept seems to be a valid construct not just post arthroplasty. We found a mean FJS of 41.6 points in our patient population. Recently, Baumannet al[34]published a validation study on the FJS in long-term results of patients in a posttraumatic condition after tibial head fracture in 77 skiers. They found an increased joint awareness of 70.7 points in the FJS 9-13 years after injury. The mean score in the present study was 41 at first assessment indicating that the level of forgetting the joint after acetabular fracture is lower than in patients after tibial head fracture (70.7)[13], anterior cruciate ligament repair (70.1)[12]or even total hip arthroplasty (59.8)[11]. We included patients of all age groups and operative and non-operative treatment. The distribution of our population’s Letournel classification of acetabular fractures reflects also typical fracture patterns. Therefore, we assume generalizability for this type of injury.

Validation

There is ongoing discussion about methodical standards in validation studies[15,16,35]. The validation process in the present study was processed following the COSMIN checklist[16]. Along with the prospective multicenter design, the study meets high methodological standards with a level of evidence Ib.

The FJS has been validated in English and has been adapted in French, Dutch, Danish, Japanese, and German language[11,13,17,18,35,36]. All of these publications gave a positive rating on internal consistency with a Cronbach’s alpha of 0.95-0.97. We measured a Cronbach’s alpha of at least 0.95 in our study. According to Terweeet al[15]internal consistency can be confirmed if Cronbach’s alpha is greater than 0.70 and below 0.95. Greater values reflect higher correlations among the items and might be an indication for a redundancy of two or more items[15]. Cronbach’s alpha is also dependent on the number issues a questionnaire contains; higher values for scores with a higher number of items. The FJS consists of only 12 items. Therefore, it seems that the concept of the FJS with measurement of the joint awareness for every question could be the reason for a high level of internal consistency.

Prior studies reported a test-retest reliability of the FJS between 0.80 and 0.97[13,18,36]. We found perfect test-retest reliability with an ICC of 0.99. We intended to include also patients with a long-term result years after injury. In these cases, we assumed a stable medical condition making the ICC robust. The long period of time between time points T1 and T2 in some cases leading to a SD of 71.4 days is a potential source of bias. However, the anchor-based method to include only patients with a subjective steady medical condition should lead to reliable results from a statistical point of view.

We expected the WOMAC-VAS, TAS, and EQ-5D-3L to be appropriate for evaluation of construct validity, because these scores are widely used and a German language version of all scores was validated for sports related injuries and arthroplasty patients[23,24,37]. We decided to use the WOMAC-VAS score as tool for evaluation of validity because it was used in the original publication of the FJS as well[11]. The FJS showed good correlation to the total score of the WOMAC-VAS as well as to the WOMAC-VAS subscales.

To investigate if a risk factors for rapid-progressive joint degeneration like a residual joint surface step of more than 3 mm might have an impact on patientcentered HR-QoL, we collected data from the initial CT scan in cases of conservative treatment and the post-operative CT scan in operatively treated patients. We expected that a complex fracture pattern or a residual articular step would lead to a lower level of forgetting the joint because of inflammation of the joint. However, we did not find any impact of the fracture pattern according to Letournel or an articular surface step of more than 3 mm in the CT scan on the FJS at time of the validation study. This is probably due to the long period mean time between injury and mean follow-up of more than 5 years. It is likely that the patients with a clinically relevant joint step develop joint degeneration within the first two years after injury (n= 4/100). The other patients with an initial articular step might have sustained a consolidation in functional joint kinetics to a sub-clinical degree. From our perspective, this is an interesting finding and is worth further investigating.

However, we did find a difference between surgical and conservative management. Patients after conservative management reported higher levels in the FJS than surgically treated patients. We recommended conservative management with partial weight bearing only in cases with no dislocation of the fragments. We suspect that this reflects a potential selection bias for conservatively treated patients.

Another important issue for the use of PROMs for further prospective trials is responsiveness to change in medical condition. With a lack of gold standard, defining responsiveness is difficult. According to the COSMIN checklist, we assessed responsiveness in patients by Wilcoxon tests and ES[16]. The FJS score has shown good responsiveness in post-arthroplasty conditions longer than 12 mo postoperatively[11,13,38]. In this study, patients indicating no change in function of the hip, FJS scores did not significantly differ, and the mean differences was with 1.1 below the SDC of 8.8. In patients indicating a change in medical condition of the hip, FJS scores significantly differed between both assessments and the mean differences were above the computed SDC. Therefore, we could give a positive rating for responsiveness.

According to the guidelines of the original FJS publication in 2012, we refused to calculate the FJS if more than four items were missing. The overall proportion of missing items of the FJS was 4% at first assessment. Concordant to all prior studies on the FJS, there was no relevant floor- or ceiling effect[11,12,17,18,35,36,38].

Limitations

The findings of this study have to be seen in the light of some limitations. First of all, the limited number of cases. Acetabular fractures are uncommon and treatment is concentrated to large trauma centers. We assume that our population can be seen as reference population for most centers, since our age distribution is relatively broad and we included conservatively and operatively treated patients to reflect clinical reality. Another limitation is that the time span between first and second evaluation showed quite some variability. Although we put major efforts on guidance of the patients to assure highest methodical quality, there is a natural variation in the recorded data. Finally, calculation of ICC was based on only 24 patients. This was the number of patients who indicated no relevant clinical change in medical condition of their hip joint between T1 and T2. Further studies are needed to confirm the results based on longitudinal data.

CONCLUSION

This is the first study validating FJS in fractures of the hip joint according to the COSMIN checklist. The FJS is a valid and reliable tool for evaluation of PRO in posttraumatic condition after acetabular fracture. With a Cronbach’s alpha of 0.95, internal consistency of the questionnaire was good. Test-retest reliability was excellent with an ICC of 0.99. Based on the anchor variable, the SDC indicating a real clinical improvement was 8.8 points in the FJS. We could confirm responsiveness of the FJS and found no relevant floor- or ceiling effects. Clinicians are suggested to use the FJS for evaluation of PRO after fractures of the acetabulum.

ARTICLE HIGHLIGHTS

Research background

Patient-reported outcome (PRO) measurement is gaining more and more importance in clinical decision-making. Evaluation of psychometric properties of PRO tools is essential to assure validity.

Research motivation

A fracture of the acetabulum is an uncommon but serious injury. Outcome evaluation tools in patients after acetabular fractures are outdated. However, research based on large registries are dependent on valid outcome tools to allow the comparability.

Research objectives

Aim of the study was to validate the Forgotten Joint Score (FJS) according the COSMIN checklist. The FJS is a novel PRO tool to disease-specific quality of life in musculo-skeletal disorders.

Research methods

The COSMIN checklist is a standard protocol to assure methodical quality of validation studies. The COSMIN checklist consists of ten items giving recommendations for design, conduction and interpretation of medical validation studies. Relevant characteristics are study design, content validity, structural validity, internal consistency, cross-cultural validity/measurement invariance, reliability, measurement error, criterion validity, hypotheses testing for construct validity, and responsiveness.

Research results

We found the FJS to be a valid and reliable tool for evaluation of PRO in posttraumatic condition after an acetabular fracture. With a Cronbach’s alpha of 0.95, internal consistency of the questionnaire was good. Test-retest reliability was excellent with an ICC of 0.99. Based on the anchor variable, the smallest detectable change indicating a real clinical improvement was 8.8 points in the FJS. We could confirm responsiveness of the FJS and found no relevant floor- or ceiling effects.

Research conclusions

Clinicians are suggested to use the FJS for evaluation of PRO after fractures of the acetabulum.

Research perspectives

Further studies are needed to confirm the study results, especially concerning longitudinal data. Based on the study results, the FJS can now be used for further clinical studies on post-traumatic conditions after fractures of the acetabulum.

ACKNOWLEDGEMENTS

We thank all participants for contributing to this study. The authors wish to thank the German Pelvic Multicenter Study Group (http://www.dgu-online.de) for their support in the conception of the study.

World Journal of Orthopedics

2021年2期