Published online by Cambridge University Press: 01 August 2004
Objectives: This study reviewed the European guidelines of the Committee for Proprietary Medicinal Products (CPMP) on how health-related quality of life (HRQOL) research should be conducted in clinical trials. Published product-level information was also reviewed to investigate the actual role of HRQOL data in the European regulatory process.
Methods: All disease-specific notes for guidance and concept papers on clinical investigations, development and evaluation of human medicinal products, as well as the European Public Assessment Reports (EPAR) of all approved drugs published on the European Agency for the Evaluation of Medicinal Products (EMEA) Web site were evaluated for their HRQOL recommendations.
Results: Only twenty of the fifty CPMP guidance notes for clinical investigation of pharmaceutical products in specific disease areas included a reference to HRQOL. Most of the recommendations were generic and vague, and the terminology used was inconsistent across documents. The EPAR provided nonspecific information about HRQOL and contradictory conclusions on the effect of a drug on HRQOL sometimes occurred in different documents. The criteria used by the CPMP to assess the HRQOL data could not be identified due to an ad hoc approach to the inclusion of data in the EPAR.
Conclusions: A more systematic approach is needed on the way health outcomes data are considered, reviewed, and interpreted by the regulatory authorities. For this to be achieved, CPMP should develop general guidelines on the importance of HRQOL and how research should be conducted if data are to be included in the registration process.
Outcomes research is increasingly conducted alongside clinical trials (38). The importance of health-related quality of life (HRQOL) measures is well recognized by the scientific community, the pharmaceutical industry, regulators, and payers (32;38;43). HRQOL measures provide an understanding of the impact of disease and its treatment on the patient's daily life and functioning, and they represent a patient-perceived indicator of the benefit of medical interventions (46). Therefore, the number of clinical trials incorporating the measurement of HRQOL has increased substantially in recent years (35). In oncology research, for example, HRQOL has been identified as the second most frequently used outcome after survival (32). Additionally, HRQOL may provide relevant information to third-party payers and doctors making decisions regarding the use of a therapy and the impact that the medical treatment has on patients' health status and well-being (44;46).
Currently, marketing authorization agencies do not require HRQOL evidence for the approval of new drugs. Given the increasing importance associated with this information being included on drug labels, the industry is investing more toward providing HRQOL information (4). European pricing and reimbursement authorities, on the other hand, formally require, or will consider, HRQOL data to decide whether a medicine actually provides “value for money.” Following the example of Australia and Canada, several European countries, such as the United Kingdom, Sweden, and the Netherlands, have started, or are about to start, using health outcomes and economic criteria in decision making on the public reimbursement of pharmaceuticals. To facilitate the process, reimbursement authorities, such as the National Institute of Clinical Excellence in the United Kingdom, have published guidelines for the preparation of the submissions made by the pharmaceutical industry (39). Most of these guidelines distinguish and separately discuss HRQOL assessment and the assessment of utilities among the possible measures of clinical effectiveness of treatments.
However, there are many methodological issues and controversies surrounding the investigation of health outcomes that provide real challenges for the conduct of research, analysis, review, interpretation, and the use of results. Issues specific to HRQOL evaluations include the definition of clinically meaningful changes in HRQOL scores, the handling of missing and censored data, and the integration of the length of life within HRQOL outcomes (38). Other methodological concerns include (i) the type of instruments to be used (generic versus disease specific) (2;34), as the selection of HRQOL measures for clinical trials requires attention to the appropriateness, psychometric characteristics, and practicality of the available instruments (42); (ii) whether the overall results from one instrument should be presented or whether the presentation of results from the individual domains of the instrument is valid; and (iii) whether it is the patient, the doctor, or the general population who should complete the questionnaires (47). Finally, the interpretation of HRQOL data may be different, depending on the person who uses the information; for example, a clinician, a health policy-maker, a regulatory authority, or a pharmaceutical company (48).
As a result of increased interest in HRQOL, there has been an effort to provide guidance to researchers on the conduct of HRQOL research. Since 1997, two initiatives that recently merged and were renamed ERIQA (European Regulatory Issues on Quality Assessment), were launched in Europe. Their aims were to bring together HRQOL researchers, the pharmaceutical industry, and representatives of regulatory agencies to discuss the role and value of HRQOL within the specific context of drug registration and reimbursement, and to agree on recommendations for HRQOL assessment in clinical research and the integration of this information into the regulatory process. A checklist for designing, conducting, and reporting HRQOL studies in clinical trials was published by ERIQA (4). At the same time, similar initiatives were launched in the United States by the Food and Drug Administration in collaboration with the Health Outcomes Committee of the Pharmaceutical Research and Manufacturers Association (PhRMA-HOWG), the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) and the International Society for Quality of Life Research (ISOQOL) (30;41). PhRMA-HOWG, ISPOR, ISOQOL, and ERIQA have now created the Patient Reported Outcomes Harmonization Group with the mission to “harmonize patient reported outcomes issues used in drug development and evaluation.”
The aim of this present study is to review the guidelines of the Committee for Proprietary Medicinal Products (CPMP) (40), which is part of the European Agency for the Evaluation of Medicinal Products (EMEA) on how HRQOL research should be conducted in clinical trials. Additionally, all existing published product-level information will be reviewed to investigate the actual role of HRQOL data within the regulatory process. Although this area is widely investigated for the U.S. environment, an understanding of the use of HRQOL data by the European regulatory authorities has not been extensively researched.
In this study, all disease-specific notes for guidance documents and concept papers on the clinical investigation, development, and evaluation of human medicinal products, both adopted and drafted, as well as International Conference on Harmonisation (ICH) guidelines, posted on the EMEA Web site on 13 March 2003, were identified. These documents were then reviewed to identify references to HRQOL recommendations. The keywords used in the search were health status, well-being, quality of life, HRQOL, QOL, HQL, questionnaire, and scale. The content of the documents that contained HRQOL recommendations was then evaluated by considering explicit statements on the use of HRQOL measures.
To investigate how the EMEA considers HRQOL data submitted for a pharmaceutical product in relation to adopted recommendations, the European Public Assessment Reports (EPAR) of all approved drugs that were publicly available on the EMEA Web site on 7 March 2003 were reviewed to identify those that included HRQOL data. The same keywords mentioned above were used. The EPAR that contained HRQOL data were investigated further.
The EPAR reflect the scientific conclusions reached by the CPMP at the end of the centralized evaluation process and summarize the basis for the CPMP opinion in favor of granting a marketing authorization for a specific medicinal product. The EMEA makes these summaries available to the public by publishing them on the EMEA Web site [http://www.emea.eu.int/index/indexh1.htm] after deletion of commercially confidential information. The documents that are included under the EPAR heading are the abstract, authorized presentations, the product information leaflet, the summary of product characteristics (SPC), labeling, scientific discussion, steps taken after assessment, steps taken after authorization.
The EPAR documents that were relevant to the aims of this study and, therefore, were reviewed are as follows:
No formal regulatory guidelines currently exist in Europe for HRQOL evaluation in relation to the marketing authorization of pharmaceutical products. However, recommendations have been published by CPMP on the use of HRQOL in clinical investigations within specific disease areas. These guidelines intend to serve as a starting point for discussions on the inclusion of patient reported outcomes data in drug labeling for licensing and promotional purposes (1). Although European standards on this matter still need to be developed, as will be discussed in this study, approval for labeling claims regarding HRQOL has been given for some drugs by national European agencies. These guidelines were identified and are discussed in more detail in this section. Additionally, the EPAR that contained specific information or refer to HRQOL were also identified and are presented.
Of a total of fifty notes for guidance, notes to consider for clinical investigation and ICH guidelines that were identified on the EMEA Web site, twenty guidelines and one ICH document included reference to, or recommendations on, HRQOL. Apolone et al. (1) previously identified thirteen CPMP guidelines and one ICH document that contained statements about HRQOL. However, since 1999, when their research was conducted, seven additional notes for guidance that referred to HRQOL have been published. The disease-specific guidelines are summarized in Table 1.
All of the guidelines that are summarized in Table 1, apart from the guidelines for the evaluation of antiarrhythmics and drugs for HIV infection, considered HRQOL measures as an efficacy outcome in clinical trials. The two guidelines that are the exception referred to the potential use of HRQOL measures for the assessment of the risk-benefit ratio of the new agent. In eleven of these notes for guidance, HRQOL was recommended as a secondary end point; in three of the cases, HRQOL was recommended both as a primary and secondary objective; and finally, in six of the cases, a specification on the type of end point that HRQOL should constitute within clinical trials was not given. The importance of selecting the appropriate end points in clinical trials will be elaborated upon in the discussion section.
The disease-specific guidelines that recommended HRQOL as an efficacy outcome have been separated into three broad categories:
In addition to these twenty disease-specific notes for guidance, one general guidance document that referred to HRQOL was also identified. The ICH ‘Note for guidance on statistical principles for clinical trials’ was adopted in February 1998 by the EMEA. It refers to HRQOL by stating that “… Measurements relating to quality of life and health economics are further potential primary variables…”.
Between January 1995 and March 2003, the EMEA and the European Commission gave Community Marketing authorization to 218 medicinal products for human use. The EPAR documentation of 10 of these products was not available on the EMEA Web site; therefore, 208 medicines were included in our research. From these drugs, only fifty-two contained at least one of our search terms in their EPAR documentation. However, five of these were excluded from the analysis as they were not relevant to our research questions. Three of these five medicines were excluded because the EPAR documents only mentioned that HRQOL is important for the particular disease or that HRQOL should be further evaluated. Two of the five excluded drugs measured physician satisfaction rather than patient satisfaction. Altogether, forty-seven drugs with HRQOL data in their EPAR were identified and included in the review. In Table 2, more detailed data on these EPAR are presented; however, the drugs have been grouped by their generic name and indication when the same substance has been indicated for different diseases; therefore, the results from the evaluation of thirty-three substances are illustrated.
In summary, 24 percent of the thirty-three substances that contained HRQOL data in their EPAR were indicated for the treatment of cancer (eight substances); 9 percent were indicated for the treatment of erectile dysfunction (three substances), hepatitis, HIV, and diabetes. The remaining 42 percent (fourteen drugs) covered Fabry disease, Kaposi's sarcoma, lymphomatous meningitis, rhinitis, AIDS-related Kaposi's sarcoma, facial hirsutism, renal failure, epilepsy, growth failure, obesity, atrial fibrillation, Gaucher disease, acromegaly, and schizophrenia.
The scientific discussion documents for all the thirty-three substances included information or referred to the effect that the drug under examination has on the HRQOL of patients. Nine SPCs also included reference to HRQOL, as well as three abstracts. Finally, none of the substances had a reference to HRQOL in their package leaflet (Table 2).
In nineteen of the thirty-three cases, the treatment under review had some positive effect on the HRQOL of the patients; in eleven of the cases the drug did not have a beneficial effect over the comparator in terms of improving HRQOL, while for three substances the results from clinical trials on their effect on HRQOL were not mentioned in the EPAR. However, for only eight of the thirty-three substances was there a clarification on whether the HRQOL changes observed with treatment were statistically significant or clinically meaningful (Table 2). In twenty of the thirty-three cases the name of at least one of the HRQOL instruments that were used in the clinical trials was mentioned (Table 2).
Measurement of HRQOL has been extensively used as a standard of therapeutic efficacy (16). Usually it represents a patient-centered approach to assessing functional status and well-being that integrates the impact of both medical treatment and disease. As the evaluation of HRQOL end points in drug development continues to grow, interest in using these outcomes for labeling and promotional purposes increases (3;4;42). However, inclusion of HRQOL information in drug labels and promotion is still limited, although it could be valuable for prescribers and payers.
The challenge facing the industry, regulatory agencies, and the scientific community is to develop robust HRQOL instruments and to reach consensus on the criteria for evaluating the scientific validity and clinical meaningfulness of data that will ultimately be used for labeling, reimbursement decisions, and promotion (36). In Europe, after the drug has been authorized, the inclusion of HRQOL claims in the label could be crucial for pricing and reimbursement negotiations and for differentiating a drug from other competing treatments. The only general EMEA guidance that deals with HRQOL measurement and the use of such data is the ICH guidance on statistical principles that refers to the potential use of measurements relating to HRQOL and health economics as further primary variables in clinical trials. However, in most of the EMEA disease-specific guidelines for clinical investigation of new pharmacological agents that referred to HRQOL, HRQOL was recommended as a secondary end point, while in three disease-specific guidelines (rheumatoid arthritis, osteoarthritis, and chronic obstructive pulmonary disease), HRQOL measures were recommended both as primary and secondary end points. The choice of the primary end points of the clinical trial should be clinically relevant to the focus of the trial, reflect the therapeutic goals expected for the investigated agent and the clinical benefit the applicant wishes to claim in the future SPC (12). Examples of the most important objectives in the treatment of each disease and, therefore, the important end points that should be used in the clinical trials, were given within the recommendations but were not accompanied by references to supporting published data. However, the recommendation stated that the sponsor should cite published data that support the validity of the measurements that are used in the clinical trials to demonstrate the effect of the investigated drug in the relevant disease. With regard to the secondary end points, the recommendations stated that they should be supportive measurements related to the primary objective of the clinical trial or measurements of effects related to the secondary objective, but that their use should also be justified by referencing published data that support their validity.
The choice of the appropriate end points is important, because the regulators use the results from the primary end point analyses as the basis for labeling claims, and although they reluctantly use the results from secondary end points, they can consider them as supportive evidence. Therefore, if a comprehensive HRQOL marketing campaign is desired, trials should be designed accordingly (36).
In general, most of the recommendations that referred to the measurement of HRQOL were generic and vague, using nonstandard terminology that was also inconsistent across the documents. Moreover, there was no consensus on which instruments should be used for the assessment of HRQOL. In some guidelines, the use of generic instruments was recommended; in others, the use of disease-specific questionnaires was stressed; whereas in still others, the use of both instruments was mentioned. Some of the guidelines stated “disease-specific and generic instruments for measuring HRQOL can be used…” (23), or “the use of justified HRQOL instruments…” (26), or a “well-known general HRQOL scale…” (16), while other guidelines do not give any specific guidance on how HRQOL should be assessed. For example, in the guidance on the antiarrhythmics (7) the only statement on HRQOL was: “the use of antiarrhythmic drugs may be limited by major and minor drug-related side effects impacting on drug tolerance or HRQOL.” One would have thought, through experience over the years and as health outcomes research progresses, that the guidelines would become more systematic regarding the terminology they use and the guidance they provide regarding what the regulators consider to be robust HRQOL data to support a labeling claim. However, to date, this finding does not seem to be the case.
On the other hand, there are two examples of recommendations that are clear on the importance of HRQOL: the important characteristics that the chosen HRQOL instrument should possess, and how the HRQOL data might be used within the regulatory process. These guidelines refer to urinary incontinence (23) and cardiac failure (14).
The guidance on the clinical investigation of medicinal products for the treatment of urinary incontinence states that “the primary aim for developing new drugs for urinary incontinence should be to obtain a subjective improvement or cure of symptoms for the patient” and that “HRQOL data should be considered an extension of an evaluation of efficacy, which can provide meaningful information to the prescriber and the patient.”
The guidance on cardiac failure is the only one that includes details on the criteria to be satisfied by the chosen HRQOL instrument and the way the research on HRQOL should be conducted. The Minnesota Living With Heart Failure Questionnaire is included in the guidance as a disease-specific HRQOL instrument that is widely used in cardiac failure, but the guidance does not restrict the researchers to the use of this instrument.
Among all drugs with HRQOL data included in their EPAR (Table 2), only those treatments for cancer, HIV, and obesity were included in the disease areas for which CPMP recommendations explicitly mention that HRQOL can be used as an end point in the clinical trials supporting the efficacy or safety of the investigated agent. The cancer guideline (21) did not provide any specific recommendations on the HRQOL instrument that should be used. However, in most cases the EORTC QLQ-C30 questionnaire was used, which is the most widely used cancer-specific HRQOL measurement (29;37). The other HRQOL instruments that were used in cancer trials were the Washington University Head and Neck Questionnaire (UW-QOL) and SF-36 alongside the Profile of Mood States questionnaire (POMS). UW-QOL has been widely used as a head- and neck-specific quality of life questionnaire. It is short and simple to process, and its minimum clinically important difference has been defined (45). Trask et al. (49) also suggested that the combination of POMS and SF-36 is a valid approach for assessing the effect that emotional distress caused by cancer-specific worries has on the health functioning of patients.
In the case of weight control, the CPMP guideline was particularly generic (10), and no information on the questionnaire that was actually used in the clinical trials was given in the EPAR of orlistat. The difficulty, however, in interpreting the HRQOL data was clearly stated in the scientific discussion document that stated “although subjective and difficult to interpret, HRQOL seemed to improve…”.
Finally, the CPMP guideline that refers to HIV infection (26) was particularly vague on how HRQOL should be assessed, acknowledging, however, the potential value of these data in assessing the risk benefit ratio of the agent. The two HRQOL questionnaires that were mentioned were the Medical Outcomes Study HIV Health Survey (MOS-HIV) and the Karnofsky scale performance status. MOS-HIV is a brief, comprehensive HRQOL measure used extensively in HIV (50). The Karnofsky scale of performance status is a measure of health status that is widely used for HIV-infected persons, although few studies have documented its validity for HIV (28).
The remainder of the products evaluated are for the treatment of diseases for which either CPMP recommendations do not include HRQOL data, such as diabetes and schizophrenia, or for which recommendations have yet to be developed, such as rhinitis. This finding shows that, if HRQOL data are submitted for drugs for which relevant CPMP recommendations do not explicitly mention HRQOL end points, the EMEA still reviews them. However, this finding raises the question that, if HRQOL data are important enough to be included in the EPAR, then why has the importance of HRQOL not been mentioned in the CPMP guidance for that specific disease?
The HRQOL data included in the scientific discussion of different drugs varied in terms of the detail they included. In twenty of the thirty-three evaluations (Table 2), the name of the HRQOL instrument used was not specified. Usually only one disease-specific HRQOL questionnaire was used. Less frequently, only one generic questionnaire was used, or a generic and a disease-specific questionnaire were used together, or two generic questionnaires were used, or finally, two disease-specific questionnaires were used together (Table 2).
The debate on the type of HRQOL instruments that should be used if these data are to be used in health-care decision-making is still on-going. Generic questionnaires can be used in various populations regardless of the disease, and they permit comparisons across patient populations. However, it is argued that their major limitation is lack of responsiveness. Disease-specific questionnaires, on the other hand, are designed for a particular patient group and focus on aspects that are specific to the condition and deemed to be clinically important. Therefore, they should more appropriately reflect patient change in response to treatment (5). Some experts support the use of different types of HRQOL instruments in one clinical trial, as they play complementary roles (31;33). Dowie (6), on the other hand, argues that a generic measure is “intended to cover the full range of health outcomes,” whereas the disease-specific measure is, by definition, intended to cover a narrower range. On the basis of this distinction of intention, it is argued that, in most cases, the generic questionnaire should be used alone and never should be used together with a disease-specific instrument. Therefore, the decision of the type of instrument that will be used in a study should be driven by the scope of the study and the future use of the data. For example, if the objective is to compare drugs indicated for the same disease, then a disease-specific instrument should be used where available.
Of all the products with HRQOL data in their EPAR documentation, only nine discussed these data in the SPC (Table 2). In general, HRQOL data were presented briefly in the SPCs without mentioning specific results. A conclusion cannot be reached on whether the fact that a significant difference in patients' HRQOL is observed between the product being evaluated and the comparator during clinical trials is important enough to warrant the incorporation of this information in the SPC. There are examples (Table 2) where a significant difference between the HRQOL data for the two products was not observed and the data were still included in the SPC, and there are examples when the opposite has occurred. For example, the SPC of docetaxel states: “In both arms, quality of life measured by the EORTC questionnaire was comparable and stable during treatment and follow-up”; the SPC of thyrotropin alfa mentioned: “Quality of life was significantly reduced after thyroid hormone withdrawal, but maintained following either dosage regimen of Thyrogen”; and in the SPC of cytarabine: “No statistically significant differences were noted in secondary endpoints such as duration of response, … quality of life and overall survival.” On the other hand, there are examples when significant differences in HRQOL were mentioned in the scientific discussion, but the SPC did not refer to these data (Table 2).
By reviewing the EPAR that referred to HRQOL and were published for the same drug, some discrepancies were identified between the different documents. An example that demonstrates this is the quote in the SPC of agalsidase beta that states that “Quality of life scores slightly improved during the first year of treatment,” while the scientific discussion states that “no statistically significant differences in HRQOL were observed when analyzing the tertiary endpoints.” Moreover, the scientific discussion of thyrotropin alfa states that “Data on HRQOL were collected and showed differences in favor of Thyrogen on most items,” while the SPC states that “HRQOL was significantly reduced after thyroid hormone withdrawal but maintained following either dosage regimen of Thyrogen.”
None of the package leaflets contained information related to HRQOL that was included in the scientific discussion, and only three drugs (Table 2) contained HRQOL information in their abstract. In one case, the results from the clinical trials that were included in the scientific discussion and the SPC showed no difference when the HRQOL of the two groups was compared. However, despite this finding, a statement on potential HRQOL improvement with cytarabine was included in the abstract. The abstract stated that “DepoCyte has a more convenient schedule of administration compared to conventional ara-C, as it reduces the need for multiple injections and this may impact quality of life favorably.”
Only twenty of the fifty CPMP notes of guidance for clinical investigation of pharmaceutical products in specific disease areas include reference to, or recommendations on, HRQOL. Most of these recommendations are generic and vague. Moreover, terminology and recommendations are not consistent across documents.
There is evidence to suggest that HRQOL data are routinely discussed in the regulatory review process if the company submits this information, and this finding is evidenced by HRQOL data starting to appear in a product's EPAR. However, EPAR provide little and nonspecific information about the HRQOL outcomes of products, their meaning and the way they were derived. Moreover, in some instances, contradictory conclusions on the effect of the drug on HRQOL are presented in different EPAR documents for the same substance. Finally, the criteria used by the CPMP to assess the HRQOL data could not be identified, as the decision to include these data in the EPAR appears to be taken on an ad hoc basis.
Further research is needed to better understand and establish the value of HRQOL data in the decision-making process within specific disease areas. To ensure consistency, there is a need for a more systematic approach across therapy areas in considering health outcomes and how they are reviewed and interpreted by regulatory authorities. For this to be achieved, the CMPM should develop clear general guidelines on the importance of HRQOL and on the required criteria underlying appropriate HRQOL instruments if these data are to be included in the drug registration process. Recognizing that the role of HRQOL is more important in some disease areas than others, specific disease-specific guidelines should also be issued on how and when HRQOL assessment should be integrated into studies for registration purposes. However, this strategy will only be possible if experts on outcomes research are involved in the preparation of recommendations for HRQOL assessment within clinical trials and if regulatory agencies, particularly the EMEA, have appropriate expertise on the principles of outcomes research. This approach will allow for the proper review, assessment, and interpretation of such data and ensure the meaningful use of HRQOL information.
We are grateful to Dr. Adam Oliver, London School of Economics and Political Science for his useful comments on an earlier draft of this manuscript.