Hostname: page-component-745bb68f8f-v2bm5 Total loading time: 0 Render date: 2025-02-06T11:03:15.590Z Has data issue: false hasContentIssue false

PATIENT-REPORTED OUTCOMES IN RARE LYSOSOMAL STORAGE DISEASES: KEY INFORMANT INTERVIEWS AND A SYSTEMATIC REVIEW PROTOCOL

Published online by Cambridge University Press:  28 December 2016

Patricia A. Miller
Affiliation:
School of Rehabilitation Science, McMaster Universitypmiller@mcmaster.ca
Sohail M. Mulla
Affiliation:
Clinical Epidemiology and Biostatistics, McMaster University
Thomasin Adams-Webber
Affiliation:
The Hospital for Sick Children
Yasmin Sivji
Affiliation:
Department of Medicine, McMaster University
Gordon H. Guyatt
Affiliation:
Clinical Epidemiology and Biostatistics, McMaster University
Bradley C. Johnston
Affiliation:
The Hospital for Sick Children
Rights & Permissions [Opens in a new window]

Abstract

Objectives: To investigate the use, challenges and opportunities associated with using patient-reported outcomes (PROs) in studies with patients with rare lysosomal storage diseases (LSDs), we conducted interviews with researchers and health technology assessment (HTA) experts, and developed the methods for a systematic review of the literature. The purpose of the review is to identify the psychometrically sound generic and disease-specific PROs used in studies with patients with five LSDs of interest: Fabry, Gaucher (Type I), Niemann-Pick (Type B) and Pompe diseases, and mucopolysaccharidosis (Types I and II).

Methods: Researchers and HTA experts who responded to an email invitation participated in a telephone interview. We used qualitative content analysis to analyze the anonymized transcripts. We conducted a comprehensive literature search for studies that used PROs to investigate burden of disease or to assess the impact of interventions across the five LSDs of interest.

Results: Interviews with seven researchers and six HTA experts representing eight countries revealed five themes. These were: (i) the importance of using psychometrically sound PROs in studies with rare diseases, (ii) the paucity of disease-specific PROs, (iii) the importance of having PRO data for economic analyses, (iv) practical and psychometric limitations of existing PROs, and (v) suggestions for new PROs. The systematic review has been completed.

Conclusions: The interviews highlight current challenges and opportunities experienced by researchers and HTA experts involved in work with rare LSDs. The ongoing systematic review will highlight the experience, opportunities, and limitations of PROs in LSDs and provide suggestions for future research.

Type
Methods
Copyright
Copyright © Cambridge University Press 2016 

BACKGROUND

Clinical research evaluating medical treatments and health interventions traditionally use clinical outcomes such as survival, cure, physiological, biological, or laboratory-based measures and these measures are typically clinician-reported. These measures can only provide indirect evidence regarding outcomes of importance to patients (Reference Guyatt, Montori, Devereaux, Schunemann and Bhandari1;Reference Johnston, Donen and Pooni2). There is an increasing use of self-reported measures that can measure the overall quality of life as well as the symptoms and side effects of treatment that may or may not affect quality of life (Reference Fayers and Machin3). Patient-reported outcomes (PROs) report the status of the patient's health condition directly from the patient without any interpretation by clinicians or anyone else (4). PROs enable patients to provide information regarding the consequences of their disease and the impact of health technologies on their health and functioning and are often the outcomes of most significance for patients. PROs are increasingly used in clinical trials such as those addressing cardiovascular disease (Reference Rahimi, Malhotra, Banning and Jenkinson5) and rheumatoid arthritis (Reference Kalyoncu, Dougados, Daures and Gossec6). However, their use in studies of lysosomal storage diseases (LSDs), one category of rare diseases, is not well established and their potential is uncertain.

To support health technology assessments (HTA), it is recommended that researchers include both generic and disease-specific instruments when relative effectiveness assessments are considered, and a generic multi-attribute health utility instruments when an economic evaluation is required (7). While it is recognized that the patient is the best source of information about what she or he is experiencing, and it is crucial to measure PROs in a standardized manner using instruments that have sound psychometric properties (Reference Reeve, Wyrwich and Wu8), there are several challenges that confront both researchers and HTA experts within the domain of research involving patients with rare diseases. There is limited knowledge regarding the psychometric properties of available PROs used to evaluate quality of life among patients with rare diseases (7;Reference Magasi, Ryan and Revicki9). Furthermore, there is an ongoing discussion regarding the role of PROs addressing subjective well-being or quality of life and health utility (Reference Richardson, Chen, Khan and Iezzi10). It was these issues that prompted us to explore both the current state of the literature with regard to the PROs used with patients with five LSDs of interest, and to interview both researchers and HTA experts in this area.

We undertook a project involving both qualitative and quantitative methodologies, including a series of interviews with key stakeholders and a systematic review, to explore the use of PROs in studies involving patients with rare diseases, and in particular, LSDs. Of the rare diseases, we focused on LSDs because there have been therapies available for approximately thirty years, providing an opportunity to examine the usefulness of PROs for detecting treatment effects. The five LSDs or LSD sub-types were: Fabry disease, Gaucher disease, mucopolysaccharidosis (MPS) (Types I & II), Niemann-Pick disease (Type B), and Pompe disease. In the qualitative study, we interviewed researchers and HTA experts to explore their views and experiences regarding the use of PROs in studies with patients with rare diseases, including those with the LSDs of interest. The purpose of the systematic review is to investigate the psychometric properties of PROs used in descriptive and experimental studies involving the five LSDs of interest. Both studies were initiated at approximately the same time, and the initial findings of the literature review (i.e., eligible articles) informed the recruiting of the participants for the qualitative study (see Figure 1).

Figure 1. Relationship between quantitative and qualitative methodologies.

This study could be of interest to clinicians, HTA experts, and researchers working in the field of LSDs, and other rare diseases, and the results may influence the types of outcomes that are developed and used in future research.

Qualitative Study

The specific research objectives of this study were to explore the barriers and facilitators that can influence the inclusion and use of PROs in research with patients with rare diseases. This study was approved by the Hamilton Integrated Research Ethics Board.

Through email, we invited international researchers and HTA experts to participate in a telephone interview conducted in English. We identified the researchers through the initial literature review and sought individuals who were first authors on articles in the field of LSDs that included the use of a PRO. We emailed the author of the studies using the contact information provided in the publication. We identified HTA experts in Europe and North America by reviewing the list of presenters and participants at the Health Technology Assessment international (HTAi) meetings in 2013 and 2014.

The research team developed the semi-structured interview guide. Open-ended questions prompted the participants to: (i) describe their experiences using PROs in their research or to inform HTAs, (ii) discuss the importance or the benefits of using PROs, (iii) describe some of the challenges they may have experienced in using PROs, and (iv) make suggestions regarding ways to improve or optimize the use of PROs in research or HTAs. See Supplementary Materials 1 and 2 (HTA experts and researchers, respectively) for interview scripts.

One researcher (P.M.) conducted all telephone interviews; another researcher (B.C.J.) was present for the first four interviews. The interviews, undertaken between October 2013 and February 2014, lasted approximately 1 hour, and were audiotaped. At the start of the interview, we obtained the participant's consent and collected demographic information. Two researchers (P.M., B.C.J.) used qualitative content analysis (Reference Hsieh and Shannon11) to analyze the audiotaped interviews that had been transcribed. For the analysis reported herein we anonymized the data (removed the name of participants and organizations). Qualitative content analysis involves the subjective interpretation of text data through coding and the identification of patterns or themes (Reference Hsieh and Shannon11). We applied conventional content analysis in which there was no application of preconceived categories or theoretical frameworks and the results are based in the actual data (Reference Hsieh and Shannon11). Both researchers independently reviewed the transcripts and identified preliminary codes, categories, and themes. The two researchers then met to come to consensus on the categories and themes, and selected representative quotes to highlight those themes.

We extended invitations to ten researchers and seven agreed to participate. There were three women and four men. Their research focused on Fabry disease (n = 2), Gaucher disease (n = 2), Pompe disease (n = 1), MPSII (n = 1), and both Fabry and Gaucher diseases (n = 1). The researchers resided in the United States (n = 3), Denmark (n = 1), Holland (n = 1), Israel (n = 1), and Italy (n = 1).

We extended twenty-six email invitations to HTA experts and four agreed to participate and five declined. We asked the five experts who declined our invitation to identify another potential interviewee (i.e., snow-ball sampling) (Reference Creswell12). This enabled us to identify two willing participants, totaling six HTA experts. There were two women and four men. The HTA experts resided in USA (n = 2), Germany (n = 2), Canada (n = 1), and Spain (n = 1).

RESULTS

We identified five themes, illustrated below with selected quotes.

  1. 1. PROs need to be valid and responsive. The participants stressed the importance of using psychometrically sound instruments. “So they have to be, the tools have to be concise, you know validated yet concise. So you know having a validated instrument across age ranges.” (Researcher 5). “So I think the reproducibility and validity of the patient reported outcome measure is important. . . .You may want some sort of global measure that's been validated in some way, the SF-36, the SF-12, or you may want something that is very tailored to the specific condition that you're looking at.” (HTA Expert 5).

  2. 2. PRO data are used for economic analyses. The participants, especially the HTA experts, noted that it is important to have certain PRO data to conduct economic analyses. “You know certainly as I look ahead to the evidence reviewer focus, it [SF-36] is also one of the validated instruments that is tied to data on patient utilities . . . having utility data is of course critical for accurate cost effectiveness analysis and there aren't that many instruments out there that do have the utility information.” (HTA Expert 2). “So patient outcomes are very important, they deal with quality of life issues, and of course, one of the metrics in the financial assessment, the economic evaluation of the new product, is related to quality adjusted life years and so quality of life measures are important.” (HTA Expert 3).

  3. 3. Psychometric limitations exist for current PROs. Participants noted concerns that existing PROs were not always valid or responsive. “There is also a little problem with the existing patient reported outcomes for chronic, slowly progressive Diseases like Fabry Disease and Gaucher Disease. Because the EQ-5D and SF-36 for example they are not very responsive to changes in these patients because the changes are minimal.” (Researcher 3). “Not all of the tools are necessarily validated in the patient population that's being studied and that can create some problem.” (HTA Expert 6).

  4. 4. Practical limitations exist for current PROs. Participants identified challenges to using PROs related to feasibility and response rates. “I primarily use the SF-36 although again it does have some drawbacks in terms of its length and getting patients to invest the time in completing it . . . it becomes somewhat burdensome for patients to want to fill out.” (Researcher 4). “But for some reason the data are not very complete on quality of life endpoints so we frequently see, for example, a 50 percent dropout or 50 percent of people who have not answered those questionnaires . . . and this is really hard to use in the final decision because of the potential reporting bias . . .so that's a big problem.” (HTA Expert 1).

  5. 5. Suggestions for new PROs. Participants, especially the researchers, offered a variety of suggestions to improve the accessibility and responsiveness of PROs. “So ideally if patients had access to electronic data entry every week . . .I think that would give a much more accurate representation of what's really happening to them over a longer period of time.” (Researcher 4). “I hope in the near future there will be a very disease-specific quality of life measure. I really think that it could add to our knowledge and also to the treatment of patient.” (Researcher 3).

DISCUSSION

The findings of this study indicate the importance that both researchers and HTA experts place upon using PROs with sound psychometric properties to monitor changes in the patients’ quality of life, with a particular focus on the instrument's validity and responsiveness. Participants identified the need for new disease-specific instruments which likely reflects the knowledge that disease-specific instruments are known to be more responsive than generic instruments (Reference Wiebe, Guyatt, Weaver, Matijevic and Sidwell13). Participants confirmed the importance of health utility instruments for those conducting health technology assessments. As well, participants noted the importance of having PROs that are both feasible to complete and readily accessible to patients (i.e., online) to optimize the collection of PROs.

SYSTEMATIC REVIEW

We have undertaken a systematic review to explore the use of PROs and the psychometric properties of PROs used in descriptive and experimental studies involving patients with the following five LSDs of interest: Gaucher (Type I), Fabry, Pompe, and Niemann-Pick (Type I) diseases and mucopolysaccharidosis Types I and II (MPS I and MPS II, respectively).

The specific objectives of the systematic review are to: (i) describe the burden of illness as assessed by PROs in descriptive studies, (ii) describe the nature and responsiveness of PROs versus non-PROs such as biological and physiological end points in interventional studies, and (iii) investigate the extent to which the PROs used have demonstrated psychometric properties.

The literature search strategy and preliminary results are provided in Supplementary Material 3.

Eligibility Criteria

Articles were selected based on sample, study design, and inclusion of an eligible PRO. Table 1 outlines the eligibility criteria. Eligible PROs of interest were those that tapped into the patient experience, including symptoms experienced, overall sense of well-being or quality of life. We included tested (i.e., those with published psychometric properties) and untested (ad hoc) PROs (i.e., without published evidence of psychometric properties). We excluded quantitative studies that did not address quality of life using PROs (e.g., measuring a patient's response to a noxious stimulus), as well as studies addressing prediction or describing the qualitative experience of patients with any of the LSDs of interest.

Table 1. Inclusion Criteria for the Systematic Review

Study Selection

Using a pilot-tested, standardized form, pairs of reviewers worked, independently and in duplicate, to screen titles and abstracts of identified citations to identify the full texts of articles that either reviewer judged as potentially eligible. Subsequently, using another pilot-tested, standardized form, pairs of reviewers applied eligibility criteria to the full text of potentially eligible studies. Supplementary Material 4 includes our full-text screening form.

Data Extraction

Subsequent to the screening processes, the reviewers extracted data independently and in duplicate from each eligible study using standardized forms. To improve consistency, abstraction forms were accompanied by a detailed instruction manual. Before beginning data abstraction, we conducted calibration exercises to further ensure consistency between reviewers. Specifically, we assigned each review team the same two articles from which they extracted information on at least two occasions. A third, more experienced reviewer checked the abstractions and provided individualized feedback to each team throughout the data extraction process.

Extracted data included type of study, demographic information, details of the intervention, PROs used (both tested and ad hoc), results from both PROs and other outcomes, and specific details about the study methodology (e.g., cross-sectional, cohort, case-control), and results from articles that addressed the development or testing of PROs in LSDs.

We resolved any disagreements, at all stages of the screening and extraction, by means of discussion or, when disagreement remained after discussion, with the help of a third reviewer.

Data Analysis

We summarized the PROs (generic and disease-specific) and the ad hoc PROs used, as well as summarizing identified studies that reported the psychometric properties of PROs developed or tested among patients with our LSDs of interest. We presented the findings of both the descriptive and interventional studies separately. When possible, we used the minimally important difference (MID) to quantify the burden of disease (descriptive studies) or the responsiveness (interventional studies) of PROs.

Definition and Assessment of PROs Using the MID

The MID provides a measure of the smallest change in the PRO of interest that patients perceive as important, either beneficial or harmful, and which would lead the patient or clinician to consider a change in management (Reference Schunemann and Guyatt14).

We used the MID to quantify the burden of disease and magnitude of change for frequently reported PROs. We defined frequently reported PROs as those reported more than five times in our systematic review of five LSDs.

We used the same instrument-specific MIDs for the descriptive and interventional studies. Two primary approaches for estimating an MID are distribution- and anchor-based methods. Distribution-based methods rely on the distribution around the mean scores of the measure of interest (e.g., standard deviation[SD], standard error) (Reference Guyatt, Osoba, Wu, Wyrwich and Norman15). In the anchor-based approach, investigators examine the relation between the target PRO and an independent measure that is itself interpretable, the anchor. An appropriate anchor will be relevant to patients (e.g., measures of symptoms, disease severity, or response to treatment) (Reference Guyatt, Juniper, Walter, Griffith and Goldstein16). When available, we used published anchor-based MIDs (Reference Guyatt, Osoba, Wu, Wyrwich and Norman15). When anchor-based MIDs were not available, we calculated the distribution-based MID using published normative data (Reference Norman, Sloan and Wyrwich17). We considered normative data as that from a substantial sample of representative individuals in the general population, typically sampled by those who developed the outcome measure of interest (e.g., development of normative standards for the SF-36 by the developers of the SF-36). To calculate the MID, we divided the SD of the normative mean scores by two. When authors provided no normative or control data, we calculated the MID using the largest available sample of patients with the target LSD who were not receiving ERT from any study that used the PRO of interest. We divided the SD of the mean PRO score by two to determine the MID.

Application of the MID: Descriptive Studies

We applied the MID to quantify the disease burden if studies reported mean scores for patients, mean scores for comparator group, and the results demonstrated significant differences in scores between patient and comparator groups; or, failing that if authors did not compare study samples with normative populations, but included a very large sample (i.e., 200 patients).

To quantify the burden of disease in descriptive studies, we first identified the difference in mean PRO scores between the study sample and a comparator group, which could be a normative sample, or a sample of healthy controls (i.e., a sample smaller or less clearly representative than the normative sample). When the authors did not provide comparator data, we used the following approaches, in order of decreasing preference, to identify the comparator group: (i) we searched the literature for a large sample of healthy people for whom PRO scores were available (normative data); (ii) when normative data were not available, we used mean scores from healthy controls reported in a study included in our systematic review; (iii) when we could not identify a healthy control sample for scales measuring disability and pain, we assumed a sample of disability-free or pain-free individuals who would obtain the full score.

We then used the MID to determine the extent of the burden by dividing the difference between the patient and comparator means by the MID. We labeled the result a “decrement.”

In studies in which the authors quantified the difference between the patients’ PRO scores and those of the controls or normative sample, we quantified the burden of disease by calculating the number of MID units for the respective PRO in context of SD units. For example, we determined one SD to be equivalent to two MIDs (Reference Norman, Sloan and Wyrwich17).

Application of the MID: Interventional studies

To determine the magnitude of change or difference in experimental studies, we first identified the statistically significant between-group difference in PRO scores for controlled studies or the statistically significant within-group difference in PRO scores for observational before-after studies using the last reported time-point. In controlled studies, we identified whether the difference was an improvement or deterioration in health status associated with the intervention. In observational before-after studies, we identified whether the change was an improvement or deterioration in health status. Subsequently, we used the MID to determine the magnitude of difference or change by dividing the mean difference by the MID.

We considered a difference that was less than half of the anchor-based or distribution-based MID as a trivial effect; when the difference was between half of one MID and included one MID, we considered this a small effect; when it was greater than one MID but less than two MIDs, a moderate effect. We considered a large effect twice the MID or greater.

Assessment of Variability in PRO Results and Disease Burden

When authors of descriptive studies reported both mean and SDs, we determined the variability within the scores and burden of disease of frequently used PROs, using the following criteria: (i) Score variability: we considered large variations in reported scores if the SD was greater than 60 percent of the mean. (ii) Disease burden variability: we considered a large variation in the degree of decrement if patients who reported scores either one SD above or below the mean had a disease burden difference of two or more categories.

PRO-Specific Information

Below is the process we followed for: (i) selection of comparative data when authors of descriptive studies did not present normative or control data; and, (ii) identification of MID for specific PROs.

  1. 1. Medical Outcomes Study Short Form (36) Health Survey (SF-36) (Reference Ware18). Comparator: When authors did not include normative or control data for comparison, we used normative values reported by Sullivan and colleagues (Reference Sullivan, Karlsson and Ware19) for the domain scores. This was a sample of 8930 healthy Swedish people, of whom 52 percent were female, and whose mean age was 43 years. For the physical and mental summary scores, we used control data reported by Schermuly and colleagues (Reference Schermuly, Muller and Muller20). This study enrolled a sample of 20 healthy people from the United States, of whom 55 percent were female, and whose mean age was 37 years. MID: We used the anchor-based MID for the eight domain scores and summary scores reported by Kosinski and colleagues (Reference Kosinski, Zhao, Dedhiya, Osterhaus and Jr21). The sample included 693 Americans, of whom 76 percent were female, and 84 percent were over 45 years old, with rheumatoid arthritis (RA).

  2. 2. EuroQOL 5 Dimensions (EQ-5D) (Reference Szende, Oppe and Devlin22). (i) EQ-5D utility. Comparator: When authors did not include normative data or control data for comparison, we used normative data reported by Kind and colleagues (Reference Kind, Hardman and Macran23). The authors reported EQ-5D utility data for healthy adults living in the United Kingdom. The study enrolled 3,392 participants, of whom 57 percent were female. The authors did not report the mean age of the study participants, but noted that 41 percent of the women and 34 percent of the men were younger than 60 years. MID: We used the anchor-based MID for the EQ-5D utility score reported by Waters and colleagues (Reference Walters and Brazier24). The mean scores from ten patient groups including those with leg ulcers, back pain, early RA, irritable bowel syndrome, acute myocardial infarction, and COPD were used to calculate the MID (n = 532; no demographic data reported). (ii) EQ-5D Visual analogue scale (VAS). Comparator: When authors did not provide normative data or control data for comparison, we used data reported by Kind and colleagues (Reference Kind, Hardman and Macran23). MID: We calculated the distribution-based MID for the EQ-5D VAS using healthy adults in the United Kingdom. The study enrolled 3,378 participants, of whom 57 percent were female. The authors did not report the mean age of the study, but noted that 41 percent of the women and 34 percent of the men were younger than 60 years (Reference Kind, Hardman and Macran23).

  3. 3. Brief Pain Inventory (BPI) (Reference Cleeland and Ryan25). (i) BPI summary scores for pain severity and pain interference. Comparator: When authors did not provide normative or control data, we used the mean scores of the healthy control group (n = 20; 55 percent female; mean age 37 years) reported by Schermuly and colleagues (Reference Schermuly, Muller and Muller20). MID: We calculated the distribution-based MID for the pain severity and the pain interference scores using the healthy control group reported by Schermuly and colleagues (Reference Schermuly, Muller and Muller20). (ii) BPI item scores. Comparator: We were unable to find a normative population or a sample of healthy controls to determine the extent of burden. So, we assumed a sample of pain-free individuals who would obtain the score of 0. MID: According to the Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials, the MID for patients with chronic pain on the 11-point numeric rating scale is 1 (Reference Dworkin, Turk and Wyrwich26).

  4. 4. Short-Form McGill Pain Questionnaire (SF-MPQ) (Reference Melzack27). Comparator: We assumed a sample of disability-free or pain-free individuals who would obtain the full score, 0. MID: We calculated a distribution-based MID for the total score of SF-MPQ using a sample of enzyme-naïve Chinese patients with Fabry Disease from the same family (n = 16; 50 percent female; mean age 34 years) reported by Ro and colleagues (Reference Ro, Chen and Chang28).

  5. 5. Fatigue Severity Scale (Reference Krupp, LaRocca, Muir-Nash and Steinberg29). Comparator: When authors did not include normative or control data, we used the mean scores for the healthy control sample reported in Merkies and colleagues (Reference Merkies, Schmitz, Samijn, van der Meche and van Doorn30). MID: We calculated a distribution-based MID for the total score using 113 healthy adults (48 percent female; mean age 54 years) reported by Merkies and colleagues (Reference Merkies, Schmitz, Samijn, van der Meche and van Doorn30).

  6. 6. Child Health Assessment Questionnaire (CHAQ) Disability Index (CHAQ DIS), CHAQ VAS Pain, and CHAQ VAS Well-Being (Reference Singh, Athreya, Fries and Goldsmith31). Comparator: When authors did not include normative or control data, we used the mean scores for the healthy control sample reported by Nugent and colleagues (Reference Nugent, Ruperto and Grainger32). MID: We used an anchor-based MID for the CHAQ DIS reported by Dempster and colleagues (Reference Dempster, Porepa, Young and Feldman33), who enrolled a sample of 131 children with inflammatory diseases (e.g., juvenile arthritis, psoriatic arthritis, reactive arthritis, etc.) (69 percent female, mean age: 10 years). We calculated distribution-based MIDs for the CHAQ VAS pain and VAS well-being scores using a sample of healthy children (n = 221; 50 percent female; mean age 10 years) (Reference Merkies, Schmitz, Samijn, van der Meche and van Doorn30). When authors reported CHAQ DIS scores combined with scores from the Health Assessment Questionnaire (HAQ) (CHAQ/HAQ DIS). When the CHAQ/HAQ DIS scores were reported together, we used an anchor-based MID for the HAQ DIS from Kosinski and colleagues (Reference Kosinski, Zhao, Dedhiya, Osterhaus and Jr21), who enrolled 693 Americans, of whom 52 percent were female, and 84 percent were older than 45 years with RA.

  7. 7. Rotterdam Handicap Scale (RHS) (Reference Merkies, Schmitz, Van Der Meche, Samijn and Van Doorn34). Comparator: We assumed a sample of disability-free or pain-free individuals that would obtain the full score, specifically 36. This approach is confirmed by Hagemans and colleagues (Reference Hagemans, Laforet and Hop35), who reports that the RHS score for a healthy person would be 36. MID: We calculated a distribution-based MID for the RHS total score using the sample of enzyme-naïve patients with PD (n = 257; 53 percent female; mean age 48 years) reported by Hagemans and colleagues (Reference Hagemans, Laforet and Hop35).

The systematic review has been completed (Reference Johnston, Miller and Agarwal36).

SUPPLEMENTARY MATERIAL

Supplementary Material 1: https://doi.org/10.1017/S0266462316000568

Supplementary Material 2: https://doi.org/10.1017/S0266462316000568

Supplementary Material 3: https://doi.org/10.1017/S0266462316000568

Supplementary Material 4: https://doi.org/10.1017/S0266462316000568

CONFLICTS OF INTEREST

P.A.M. and T.A.W. and Y.S. report receiving personal fees from Genzyme Corporation, a Sanofi company, during the conduct of the study. S.M.M., G.H.G., T.A.W. and B.C.J. report receiving grants from Genzyme Corporation during the conduct of the study.

References

REFERENCES

1. Guyatt, G, Montori, V, Devereaux, PJ, Schunemann, H, Bhandari, M. Patients at the center: In our practice, and in our use of language. ACP J Club. 2004;140:A11A12.Google Scholar
2. Johnston, BC, Donen, R, Pooni, A, et al. Conceptual framework for health-related quality of life assessment in acute gastroenteritis. J Pediatr Gastroenterol Nutr. 2013;56:280289.Google Scholar
3. Fayers, P, Machin, D. Quality of life: The assessment, analysis and interpretation of patient-reported outcomes. New York: John Wiley & Sons; 2013.Google Scholar
4. US Food and Drug Administration. Guidance for industry: Patient-reported outcome measures. Use in medical product development to support labeling claims; 2009. http://www.fda.gov/downloads/drugs/guidances/ucm193282.pdf (accessed February 4, 2016).Google Scholar
5. Rahimi, K, Malhotra, A, Banning, AP, Jenkinson, C. Outcome selection and role of patient reported outcomes in contemporary cardiovascular trials: Systematic review. BMJ. 2010;341:c5707.Google Scholar
6. Kalyoncu, U, Dougados, M, Daures, JP, Gossec, L. Reporting of patient-reported outcomes in recent trials in rheumatoid arthritis: A systematic literature review. Ann Rheum Dis. 2009;68:183190.Google Scholar
7. European Network for Health Technology Assessment. Endpoints used for relative effectiveness assessment of pharmaceuticals: Health-related quality of life and utility measures; 2013. http://www.eunethta.eu/sites/5026.fedimbo.belgium.be/files/Health-related%20quality%20of%20life.pdf (accessed February 4, 2016).Google Scholar
8. Reeve, BB, Wyrwich, KW, Wu, AW, et al. ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research. Qual Life Res. 2013;22:18891905.Google Scholar
9. Magasi, S, Ryan, G, Revicki, D, et al. Content validity of patient-reported outcome measures: Perspectives from a PROMIS meeting. Qual Life Res. 2012;21:739746.Google Scholar
10. Richardson, J, Chen, G, Khan, MA, Iezzi, A. Subjective wellbeing versus utility: Incommensurable or mismeasured construct. Working Paper 04-14; 2014. http://www.aqol.com.au/papers/workingpaper04-14.pdf (accessed February 4, 2016).Google Scholar
11. Hsieh, HF, Shannon, SE. Three approaches to qualitative content analysis. Qual Health Res. 2005;15:12771288.Google Scholar
12. Creswell, JW. Qualitative inquiry and research design: Choosing among the five traditions. Thousand Oaks: Sage Publications, Inc; 1998.Google Scholar
13. Wiebe, S, Guyatt, G, Weaver, B, Matijevic, S, Sidwell, C. Comparative responsiveness of generic and specific quality-of-life instruments. J Clin Epidemiol. 2003;56:5260.Google Scholar
14. Schunemann, HJ, Guyatt, GH. Commentary–goodbye M(C)ID! Hello MID, where do you come from? Health Serv Res. 2005;40:593597.CrossRefGoogle Scholar
15. Guyatt, GH, Osoba, D, Wu, AW, Wyrwich, KW, Norman, GR. Methods to explain the clinical significance of health status measures. Mayo Clin Proc. 2002;77:371383.CrossRefGoogle ScholarPubMed
16. Guyatt, GH, Juniper, EF, Walter, SD, Griffith, LE, Goldstein, RS. Interpreting treatment effects in randomised trials. BMJ. 1998 Feb 28;316:690693.Google Scholar
17. Norman, GR, Sloan, JA, Wyrwich, KW. Interpretation of changes in health-related quality of life: The remarkable universality of half a standard deviation. Med Care. 2003;41:582592.Google Scholar
18. Ware, JE Jr. SF-36 health survey update. Spine. 2000;25:31303139.CrossRefGoogle ScholarPubMed
19. Sullivan, M, Karlsson, J, Ware, JE Jr. The Swedish SF-36 Health Survey–I. Evaluation of data quality, scaling assumptions, reliability and construct validity across general populations in Sweden. Soc Sci Med. 1995;41:13491358.Google Scholar
20. Schermuly, I, Muller, MJ, Muller, KM, et al. Neuropsychiatric symptoms and brain structural alterations in Fabry disease. Eur J Neurol. 2011;18:347353.CrossRefGoogle ScholarPubMed
21. Kosinski, M, Zhao, SZ, Dedhiya, S, Osterhaus, JT, Jr, Ware JE. Determining minimally important changes in generic and disease-specific health-related quality of life questionnaires in clinical trials of rheumatoid arthritis. Arthritis Rheum. 2000;43:14781487.Google Scholar
22. Szende, A, Oppe, M, Devlin, NJ, editors. EQ-5D value sets: Inventory, comparative review and user guide. Dordrecht, The Netherlands: Springer; 2007.Google Scholar
23. Kind, P, Hardman, G, Macran, S. UK population norms for EQ-5D; 1999. http://www.york.ac.uk/media/che/documents/papers/discussionpapers/CHE%20Discussion%20Paper%20172.pdf (accessed February 4, 2016).Google Scholar
24. Walters, SJ, Brazier, JE. Comparison of the minimally important difference for two health state utility measures: EQ-5D and SF-6D. Qual Life Res. 2005;14:15231532.Google Scholar
25. Cleeland, CS, Ryan, KM. Pain assessment: Global use of the Brief Pain Inventory. Ann Acad Med Singapore. 1994;23:129138.Google Scholar
26. Dworkin, RH, Turk, DC, Wyrwich, KW, et al. Interpreting the clinical importance of treatment outcomes in chronic pain clinical trials: IMMPACT recommendations. J Pain. 2008;9:105121.CrossRefGoogle ScholarPubMed
27. Melzack, R. The short-form McGill Pain Questionnaire. Pain. 1987;30:191197.Google Scholar
28. Ro, LS, Chen, CM, Chang, HS, et al. Contribution of clinical screening to carrier detection in a large Chinese family with Fabry disease due to a novel alpha-galactosidase A gene deletion. Eur J Neurol. 2007;14:493497.CrossRefGoogle Scholar
29. Krupp, LB, LaRocca, NG, Muir-Nash, J, Steinberg, AD. The fatigue severity scale. Application to patients with multiple sclerosis and systemic lupus erythematosus. Arch Neurol. 1989;46:11211123.Google Scholar
30. Merkies, IS, Schmitz, PI, Samijn, JP, van der Meche, FG, van Doorn, PA. Fatigue in immune-mediated polyneuropathies. European Inflammatory Neuropathy Cause and Treatment (INCAT) Group. Neurology. 1999;53:16481654.CrossRefGoogle ScholarPubMed
31. Singh, G, Athreya, B, Fries, JF, Goldsmith, DP. Measurement of health status in children with juvenile rheumatoid arthritis. Arthritis Rheum. 1994;37:17611769.Google Scholar
32. Nugent, J, Ruperto, N, Grainger, J, et al. The British version of the Childhood Health Assessment Questionnaire (CHAQ) and the Child Health Questionnaire (CHQ). Clin Exp Rheumatol. 2001;19 (Suppl 23):S163S167.Google Scholar
33. Dempster, H, Porepa, M, Young, N, Feldman, BM. The clinical meaning of functional outcome scores in children with juvenile arthritis. Arthritis Rheum. 2001;44:17681774.3.0.CO;2-Q>CrossRefGoogle ScholarPubMed
34. Merkies, IS, Schmitz, PI, Van Der Meche, FG, Samijn, JP, Van Doorn, PA. Psychometric evaluation of a new handicap scale in immune-mediated polyneuropathies. Muscle Nerve. 2002;25:370377.CrossRefGoogle ScholarPubMed
35. Hagemans, ML, Laforet, P, Hop, WJ, et al. Impact of late-onset Pompe disease on participation in daily life activities: Evaluation of the Rotterdam Handicap Scale. Neuromuscul Disord. 2007; 17:537543.Google Scholar
36. Johnston, BC, Miller, PA, Agarwal, A, et al. Limited responsiveness related to the minimal important difference of patient-reported outcomes in rare diseases. J Clin Epidemiol. 2016; Jul 2. pii: S0895-4356(16)30187-1. doi: 10.1016/j.jclinepi.2016.06.010. [Epub ahead of print]Google Scholar
Figure 0

Figure 1. Relationship between quantitative and qualitative methodologies.

Figure 1

Table 1. Inclusion Criteria for the Systematic Review

Supplementary material: File

Miller supplementary material

Miller supplementary material 1

Download Miller supplementary material(File)
File 107.6 KB
Supplementary material: File

Miller supplementary material

Miller supplementary material 2

Download Miller supplementary material(File)
File 108.6 KB
Supplementary material: File

Miller supplementary material

Miller supplementary material 3

Download Miller supplementary material(File)
File 35.8 KB
Supplementary material: File

Miller supplementary material

Miller supplementary material 4

Download Miller supplementary material(File)
File 25.1 KB