Developing an International Standard Set of Patient-Reported Outcome Measures for Psychotic Disorders
Abstract
Objective:
The objective of this project was to develop a set of patient-reported outcome measures for adolescents and adults who meet criteria for a psychotic disorder.
Methods:
A research team and an international consensus working group, including service users, clinicians, and researchers, worked together in an iterative process by using a modified Delphi consensus technique that included videoconferencing calls, online surveys, and focus groups. The research team conducted systematic literature searches to identify outcomes, outcome measures, and risk adjustment factors. After identifying outcomes important to service users, the consensus working group selected outcome measures, risk adjustment factors, and the final set of outcome measures. International stakeholder groups consisting of >100 professionals and service users reviewed and commented on the final set.
Results:
The consensus working group identified four outcome domains: symptoms, recovery, functioning, and treatment. The domains encompassed 14 outcomes of importance to service users. The research team identified 131 measures from the literature. The consensus working group selected nine measures in an outcome set that takes approximately 35 minutes to complete.
Conclusions:
A set of patient-reported outcome measures for use in routine clinical practice was identified. The set is free to service users, is available in at least two languages, and reflects outcomes important to users. Clinicians can use the set to improve clinical decision making, and administrators and researchers can use it to learn from comparing program outcomes.
HIGHLIGHTS
Patient-reported outcomes reflect the outcomes important to patients, and measures of these outcomes can be used for comparisons across programs and countries.
An international working group used systematic searches to identify and assess the quality of psychometric evidence supporting patient-reported outcome measures for psychosis and associated risk adjustment factors.
Service users, clinicians, and researchers were involved in a consensus process to reduce the measures to a standard set that can be completed in about 35 minutes.
The patient-reported outcomes are assessed by nine measures that cover symptoms, recovery, functioning, and treatment.
A patient-reported outcome (PRO), as measured by patient-reported outcome measures (PROMs), is any aspect of a patient's health status that comes directly from the patient (1). PROMs can be used to improve clinical care (2, 3); to inform clinicians and patients about treatment progress; to create, compare, and aggregate scores at a high level to inform policy; and to inform approval of drugs and devices (4, 5). Research and application of PROMs in health care, particularly in the management of chronic disorders, have increased over the past 20 years (6). The use of PROMs is a focus of patient-centered care (7). Notwithstanding the challenges to implementing large-scale PROMs systems in health care, factors that increase the rate of PROM collection include provider training in the use of PROMs, use of software to register and work with PROMs in daily practice, administrative surveillance of collection rates, and the presence of local clinical champions (3). International application of PROMs requires high standards for translation (8).
We identified two examples of large-scale implementation of recommended PROMs to mental health services in specific programs in the United Kingdom (9) and in routine outcome measurement in Australia (10). However, a Cochrane review of routine monitoring that used PROMs for improving treatment of common mental disorders among adults found insufficient evidence to support routine monitoring and identified the need for “more research of better quality,” including measuring a range of relevant outcomes (11).
The outcomes important to patients can be identified through focus groups, in-depth interviews, target population surveys, qualitative synthesis of the literature, and content analysis of available data sources (12). Ideally, PROMs display strong psychometric properties, including a conceptual and measurement model, reliability, validity, responsiveness, interpretability, alternative modes of administration, and cross-cultural and linguistic adaptations (13, 14). Practical implementation of measures warrants consideration and includes identification of the goals for collecting PROs, selection of patients, and determination of the setting and timing of assessments (15).
Psychotic disorders, including schizophrenia and bipolar disorder, represent significant burdens to service users, families, and health systems worldwide (16). Although schizophrenia is a low-prevalence disorder, with an estimated population prevalence of <1% (17), it is associated with adverse mental and general medical health outcomes, a high degree of disability, high health care costs, and a 15-year reduction in life expectancy (18–23). Bipolar disorder has a slightly higher prevalence, at approximately 1%, with psychiatric as well as general medical burden resulting from its early onset, severity, and chronicity (21–25). Although clinical recovery is defined as the remission of symptoms and the return of functioning (26), the meaning of personal recovery to service users is broader and a process that encompasses many aspects of life and promotion of an individual’s strength and potential (27). Evidence supports recovery as a feasible outcome for schizophrenia spectrum disorders and bipolar disorder (28, 29). At the health systems level, there has been a shift in focus toward a recovery orientation and personalized care (30, 31). The broad impact of psychotic disorders has spurred investigators to examine a range of PROs in schizophrenia and patient-reported quality of life (32) and functional outcomes (33) in bipolar disorder.
Through use of a consensus-building process, the International Consortium for Health Outcomes Measurement (ICHOM) developed and implemented standard sets of PROMs for use in routine clinical practice for various medical conditions. The process is supported by identification of the evidence and by systematic, critical evaluation of measures and their psychometric properties. ICHOM organized a working group of psychosis experts, including clinicians, researchers, and service users, to identify a set of PROMs to monitor individual treatment outcomes or to compare outcomes of similar mental health services, with a view to establishing the value of these services. The value of health care can be defined as the patient outcomes relative to the costs for obtaining those outcomes (34). Outcome assessment can be guided by a set of PROs for a specific disorder, as exemplified by the set for depression and anxiety developed by ICHOM (35). The specific aims of this study were to develop a standard minimum set of PROMs for psychotic disorders that can be used anywhere in the world and to identify a set of risk adjustment factors to enhance utility of comparisons across treatment modalities, institutions, and systems.
Methods
The research team included a chair, project manager, research fellow, and five research associates. The working group (N=19 members) included service users (N=3), and its members were selected to represent diverse professions and geographic areas. Ten areas of expertise were represented: psychometrists, psychiatrists, mental health nurses, psychologists, health economists, epidemiologists, national clinical quality programs, health service researchers, social workers, and service users, with members from 11 countries (Australia, Canada, Denmark, Greece, India, Israel, Japan, Mexico, the Netherlands, the United Kingdom, and the United States). No institutional review board approval or informed consent was necessary for this study.
Service User Focus Groups
Before commencement of the working group meetings, two focus groups were held with four service users to identify outcomes of highest importance to them. Service user recruitment occurred through recommendations from patient organizations and from working group members.
Systematic Literature Review to Identify Outcomes
Between January and March 2019, systematic literature searches were conducted to identify outcomes related to schizophrenia and bipolar disorder type I in adolescent and adult populations. The databases MEDLINE, PubMed, and PsycInfo were searched for publications between January 2013 and January 2019 (the search strategies are detailed in an online supplement to this article).
The Cochrane highly sensitive search strategy for identifying randomized trials (36) was first used to identify outcomes for schizophrenia spectrum and bipolar I disorders. Additional searches that excluded randomized trials were conducted in PsycInfo and MEDLINE. This included qualitative research that examined service users’ perspectives on outcomes of importance and the impact of schizophrenia or bipolar disorder on service users’ lives. Supplementary sources from working group members and reference lists in identified papers were used to find additional PROs and PROMs.
Consensus Process
A modified Delphi consensus process was used to select the outcome set (37). The process involved reaching consensus in five main areas: scope, to determine which psychotic disorders, patient populations, and treatments to include; outcomes, to identify a minimum number of outcomes for inclusion in the set; measures, to assess each outcome; definitions and time points, to assess outcomes; and risk adjustment factors, to enable comparisons among providers implementing the set. The research team prepared and distributed presentations for review before each videoconferencing meeting. The five main areas were discussed during the meetings, and feedback from the working group was incorporated. Members rated and provided feedback on each item under review in the five main areas in online surveys.
On the basis of ICHOM processes outlined a priori, inclusion in the set required that at least 80% of the working group voted an item as “essential” (score of 7–9) in the first or second Delphi round (the scores in the Qualtrics survey ranged from 1, not recommended, to 9, essential). When consensus was not reached by voting, the item was discussed and revisited in the next meeting and survey. Outcomes were excluded if at least 80% of the working group voted an item as “not recommended” (score 1–3). The working group voted on all inconclusive outcomes in the third and final survey round, in which response options were “include” or “exclude” and inclusion required only 70% consensus.
Identification of Potential Outcome Measures From the Systematic Literature Review
Publications identified in the systematic literature review were the source of potential outcome measures. After development of a comprehensive measures list, a search filter was used to facilitate measure selection (38) that identified studies in PubMed reporting psychometric properties of measures. The research team excluded measures that had a cost associated with use.
Assessment of PROMs
The COnsensus-based Standards for the selection of health outcomes Measurement Instruments (COSMIN) checklist was used to assess the psychometric properties of measures, including reliability (test-retest and internal consistency), validity (content validity, face validity, and construct validity), and responsiveness (sensitivity to change) (14). In addition, the working group considered the feasibility of collecting the measures, including length of time to administer, training needed, and international applicability.
Breakout Sessions to Narrow the List of Measures
After the COSMIN checklist was applied, four breakout sessions were held to review and reduce the number of potential measures. The sessions were held with a small number of working group members with lived experience or professional expertise in the areas of functioning, personal recovery, symptoms, and treatment. Participants narrowed the list of potential measures and established a proposal for the wider working group to discuss and endorse. Any measures that took >20 minutes to complete were removed.
Risk Adjustment Factors
A preliminary list of risk adjustment factors and definitions was developed on the basis of risk adjustment factors identified from the systematic literature searches, national registries, and review of existing ICHOM standard sets. Factors were identified according to evidence of their effect on the outcomes selected. Demographic and clinical factors were assessed on relevance and feasibility of measurement. Further, to harmonize across ICHOM mental health standard sets, ICHOM mental health working group chairs developed a list of factors to reduce burden of implementation for service users with more than one diagnosis. The harmonized list was presented to each mental health consensus working group. Factors voted for inclusion reached 70% consensus.
Open Review and Patient Validation
After development, the set was distributed to international stakeholder groups, including professionals, adult service users, and caregivers in two separate stakeholder surveys to obtain feedback on outcomes, measures, risk adjustment factors, and timing of outcome collection. Respondents were recruited via networks identified by the research team and working group members through e-mail and social media and national and international patient organizations. Service users and caregivers were asked to rate the importance of each outcome using a 9-point Likert scale and were provided space to suggest additional outcomes.
Results
Scope
The working group selected both schizophrenia spectrum disorders (as classified in ICD-11 and DSM-5) and bipolar disorder type I (as classified in ICD-11). The set is limited to the adolescent (ages ≥12 years) and adult populations.
Service User Focus Groups
The core outcomes identified as important to service users included improvement in positive and negative symptoms, general medical health, and medication side effects; personal recovery; and prevention of relapse.
Identifying PROs
The literature searches identified 9,118 references, of which 758 were eligible for full-text review (see diagram in online supplement). Additional sources were recommended by working group members. After removal of duplicates and measures with associated costs and in languages other than English, a total of 131 measures were examined that assess symptoms, personal recovery, functioning, and treatment in psychosis (see online supplement).
Domain, Outcome, and Measure Selection by Consensus Working Group
The development process used by the consensus working group is illustrated in Figure 1. The working group identified four outcome domains (symptoms, recovery, functioning, and treatment) that encompassed 14 outcomes (Table 1) (39–47). Symptoms included depressive symptoms, suicidal ideation and behavior, positive symptoms, negative symptoms (schizophrenia), mania or hypomania (bipolar I), sleep quality, and relapse rate as measured by hospitalizations. Personal recovery included quality-of-life measures. Functioning included global, social, and role functioning and general medical health. Medication side effects were included under treatment outcomes. Each core outcome identified by service users in the focus groups was included in the final set.
Domain and outcome | Definition | Outcome measurea | Administration timing | Data source |
---|---|---|---|---|
Symptoms | ||||
Depressive symptoms | Mood or emotional state that is marked by feelings such as depressed mood, hopelessness, worthlessness, or guilt and a reduced ability to enjoy life | PHQ-9 (41) | Baseline and every 6 months | Patient |
Suicidal ideation and behavior | Suicidal ideation, suicidal thoughts or behaviors, suicide attempts, most often accompanied by intense feelings of hopelessness or depression or by self-destructive behaviors | PHQ-9 (41) | Baseline and every 6 months | Patient |
Positive symptoms | Change in thoughts or perceptions, including hallucinations, delusions, or disorganized thought | MCSI (40) | Baseline and every 6 months | Patient |
Negative symptomsb | A withdrawal or lack of function not expected in a healthy person, including blunting of affect, poverty of speech and thought, apathy, anhedonia, reduced social drive, loss of motivation, lack of social interest, and inattention to social or cognitive input | ReQoL-20 (39) | Baseline and every 6 months | Patient |
Mania, hypomaniac | Abnormally elevated mood state characterized by symptoms such as inappropriate elation, increased irritability, severe insomnia, grandiose notions, increased speed or volume of speech, disconnected and racing thoughts, increased energy and activity level, and inappropriate social behavior | ASRM (42) | Baseline and every 6 months | Patient |
Sleep quality | Sleep problems resulting in decreased quality, including difficulty falling asleep, difficulty staying asleep, early morning awakening, and not feeling rested on waking up | PROMIS-Sleep (43) | Baseline and every 6 months | Patient |
Relapse rate | Reemergence of symptoms or disorder after partial or complete recovery | Hospitalizations | Baseline and every 6 months | Clinician or patient |
Recovery | ||||
Personal recovery | A very personal process of changing one’s attitudes, values, feelings, goals, skills, or roles. A way of living a satisfying, hopeful, and contributing life, according to the CHIME domains: connectedness, hope and optimism, identity, meaning and purpose, and empowerment | ReQoL-20 (39) | Baseline and every 6 months | Patient |
Quality of life | Individuals’ perception of their position in life in the context of the culture and value systems in which they live and in relation to their goals, including independence, experiencing a fuller range of emotions, and life satisfaction | ReQoL-20 (39) | Baseline and every 6 months | Patient |
Functioning | ||||
Global functioning | Individuals’ social, occupational, and psychological functioning | WHODAS 2.0 (adult) (44); KIDSCREEN-10 (adolescent) (45) | Baseline and every 6 months | Patient |
Social functioning | Individuals’ interactions with their environment, the quality of those interactions, and their ability to fulfill their role within such environments as work, social activities, and relationships with partners, families, or friends | WHODAS 2.0 (adult) (44); KIDSCREEN-10 (adolescent) (45) | Baseline and every 6 months | Patient |
Role functioning | Ability to perform occupational activities or performance of functional tasks that support participation in the academic aspects of school | WHODAS 2.0 (adult) (44); KIDSCREEN-10 (adolescent) (45) | Baseline and every 6 months | Patient |
Physical health | Measure of general medical health and well-being and overall satisfaction with general medical health | PHQ-15 (46) | Baseline and every 6 months | Patient |
Treatment side effects | Effects of the prescribed medication, whether therapeutic or adverse, besides the intended treatment effect | GASS (47) | Baseline and every 6 months | Patient |
Domain, outcome, definition, measure, timing of administration, and data sources identified by the working group in developing a standard set of patient-reported outcome measures for psychotic disorders
A total of 57 measures were presented to the working group for a vote (see online supplement). Outcome measures were reviewed in their entirety, with consideration given to psychometric properties, previous use in the specified population, the number and wording of items, and time taken to complete (Table 2) (39–50). Response rates were 80%, 85%, 80%, and 80% for the first through fourth modified Delphi processes, respectively.
Measure, abbreviation, and relevant study | No of items | Validitya | Reliabilitya | Sensitivity to changeb |
---|---|---|---|---|
9-item Patient Health Questionnaire (PHQ-9) (41, 48, 49) | 9 | ++ | + | ** |
Modified Colorado Symptom Index (MCSI) (40) | 14 | + | ++ | × |
20-item version of Recovering Quality of Life (ReQoL-20) (39, 50) | 20 | NAc | ++ | * |
Altman Self-Rating Mania Scale (ASRM) (42) | 5 | NA | ++ | × |
PROMIS Short Form V1.0 Sleep Disturbance 4a (PROMIS-Sleep) (43) | 4 | ++ | + | × |
12-item version of the WHO Disability Assessment Schedule 2.0 (WHODAS 2.0 [adult]) (44) | 12 | + | + | × |
10-item KIDSCREEN (adolescent) (45) | 10 | ++ | + | × |
15-item Patient Health Questionnaire (PHQ-15) (46, 49) | 15 | + | ++ | * |
Glasgow Antipsychotic Side-Effect Scale (GASS) (47) | 22 | ++ | ++ | × |
Psychometric properties of nine measures identified for the set of patient-reported outcome measures for psychotic disorders
Service users provided feedback regarding item wording and appropriateness and their opinion about the ability of the measure to capture the outcome for individuals with a psychotic disorder. This feedback was summarized in tables and presented alongside feedback from other working group members. A total of 147 measures were initially mapped to the 14 outcomes, and 39 measures were presented via short lists in breakout sessions.
Evaluation of Measures
Measures were assessed on psychometric properties, including acceptable interrater reliability (r≥0.70) (51), internal consistency (Cronbach’s α≥0.70) (52), and evidence of sensitivity to change, reflected by change in scores measured over time, consistent with a priori hypotheses about anticipated treatment outcome (Table 2). Measures rated as strong had acceptable interrater reliability, internal consistency, and evidence of sensitivity to change. Measures rated as mixed had only a single evaluation identified, mixed evidence from several sources, or strong evidence only for certain items. Measures rated as weak had below-threshold evidence. The working group selected nine measures in an outcome set that takes approximately 35 minutes to complete. The selected measures are freely available in English to users.
Time Point Recommendations
The outcome assessment time line was proposed by the working group to best achieve a balance between the clinically relevant times when outcomes may be expected to change and pragmatic concerns in data collection (53). The working group recommended assessment of outcomes before treatment as a baseline and then every 6 months and assessment of risk adjustment factors at baseline and annually thereafter.
Risk Adjustment Factors
The preliminary list of risk adjustment factors included demographic factors, such as age and socioeconomic status; clinical factors, such as comorbid conditions (54) and hospitalizations; and intervention factors, such as the setting. Harmonization of risk adjustment factors across mental health sets resulted in the addition of two factors: trauma, as assessed by adverse childhood experiences, and contact with law enforcement (Table 3).
Risk adjustment area, patient population, and measure | Supporting informationa | Administration timing | Data source |
---|---|---|---|
Demographic factor | |||
All patients | |||
Year of birth | NA | Baseline | Patient reported |
Sex | Sex at birth | Baseline | Patient reported |
Gender identity | NA | Baseline | Patient reported |
Sexual orientation | NA | Baseline | Patient reported |
Socioeconomic status | Adults, highest level of education completed; adolescents, highest level of education completed by parents (proxy to be used) | Baseline, transition to adult services, and annually if still in education | Patient reported |
Work or education status | NA | Baseline and annually | Patient reported |
Housing status | NA | Baseline and annually | Patient reported |
Living arrangement | NA | Baseline and annually | Patient reported |
Ethnic minority group or marginalization | NA | Baseline | Patient reported |
Adult patients and adolescent patients where appropriate | |||
Contact with law enforcement | To be administered to adolescents only when appropriate to do so and for whom this measure would not cause unnecessary distress. Baseline, ever been convicted (lifetime); annually, ever been convicted (in past 12 months) | Baseline and annually | Patient reported |
Clinical factor | |||
All patients | |||
Comorbid conditions | Based on the Self-Administered Comorbidity Questionnaire (54) | Baseline and annually | Patient reported |
Hospitalizations | Number of lifetime hospitalizations related to the target condition | Baseline | Administrative data |
Adult patients and adolescent patients where appropriate | |||
Adverse life experiences | To be administered to adolescents only when appropriate to do so and for whom this measure would not cause unnecessary distress | Baseline and transition to adult services | Patient reported |
Intervention factor | |||
All patients | |||
Intervention setting | NA | Baseline and annually | Clinical |
Intervention type | NA | Baseline and annually | Clinical |
Risk adjustment factors identified by the working group in developing a standard set of patient-reported outcome measures for psychotic disorders
Open Review and Patient Validation
Ninety-five professionals living in Australia, Canada, Chile, Nigeria, Sweden, the United Kingdom, and the United States responded to the open review survey. Service users and caregivers (N=25) were from Australia, the United Kingdom, and the United States. Overall endorsement of the set and its elements exceeded the required 70%. Of the 106 participants with professional experience, support for outcome domains ranged from 77% to 93%, and support for outcome measures ranged from 61% to 84%. Of the 25 participants with lived experience, 92% agreed that the measures are useful to collect, and 91% stated that the list captured all important outcomes. Endorsement of included outcomes ranged from 72% to 92%.
Discussion
The research team identified numerous PROMs for both schizophrenia spectrum disorders and bipolar disorder. The process involved a review to identify the measures and a subsequent review to assess the measures’ psychometric properties (55), a process similar to that used by other reviewers of PROMs. The consensus process was successful in reducing the number of measures to a pragmatic set for use in routine practice. Service users were integral to the development of the set, from initial identification of core outcomes that mattered most to them to assessment of measures’ face validity and comprehensibility of items. The open review and patient validation phase helped ensure the interpretability and cultural sensitivity of the set.
In addition to the well-recognized symptoms associated with psychosis, sleep problems are common among people with schizophrenia spectrum disorders and bipolar I disorder; they have a negative impact on functioning and well-being and are associated with a reduced ability or opportunity to participate in valued activities (56). Sleep quality, as assessed by the PROMIS-Sleep measure (43), is included in the set.
Personal recovery was very important to service users. The personal recovery measure in the outcome set has good psychometric properties and has been used in published research on populations of mental health service users (39, 57, 58). We did not find a positive symptom measure that has been developed and tested exclusively in samples of people with schizophrenia. However, the modified Colorado Symptom Index has been used in large-scale studies of populations with severe mental illness (40). We did not identify a self-report measure for negative symptoms. Because of its length and the availability of only one language version, the Clinical Assessment Interview for Negative Symptoms (59), a 30-item measure, was not recommended for inclusion in the outcome set. As a best alternative, the working group suggested a PROM, the Recovering Quality of Life (39). Historically, a clinician-rated outcome measure (CROM), the Quality of Life Scale (60), has been used as a negative symptom measure. There was less research to support decision making on adolescent measures.
Limitations of this work include a low number of service users in the working group and no service users with lived experience of bipolar I disorder. The inclusion of service users in developing PROMs is important yet remains challenging. Few studies include them at all stages of development (61). In this study, service users were recruited after the design stage and before the decision to include PROMs for bipolar disorder. At project commencement, two service user–only focus groups were held. Additionally, we paid specific attention to recovery measure studies that involved service users in their development and evaluation. A limitation of the final set is redundancy in some items across measures. For example, sleep is assessed in the PROMIS-Sleep measure and in the depression and mania measures. This commonly encountered issue could be addressed in future research by using statistical methods to address overlap across the entire outcome set. Consistent with a systematic review of PROMs and CROMs for assessing youth outcomes (62), we found broader outcome measures developed for adolescents, including measures for quality of life. Targeted measures for symptoms and treatment side effects were often developed for adults and rarely tested with or adapted for adolescents. This research gap highlights the need to validate outcome measures for the adolescent population.
Important psychometric properties were not included in our selection criteria. These properties included meaningful change thresholds (63) and severity thresholds, such as mild, moderate, and severe, which can be linked to treatment decisions (64). The properties were present to varying degrees in the selected measures, but they were not used as selection criteria. A critical evaluation of each measure was therefore beyond the scope of this project. We did not address alternative modes of PRO administration, an important consideration in implementation. However, a meta-analytic review concluded substantial evidence indicating the equivalence of computer- and paper-administered PROs (65).
Conclusions
A standardized set of nine PROMs was identified in this study. The set can be used to support measurement-based care and, in combination with risk adjustment factors, to compare program outcomes. Finally, the set can be used to support development of value-based health care for people with psychotic disorders.
1
2 : The applications of PROs in clinical practice: what are they, do they work, and why? Qual Life Res 2009; 18:115–123Crossref, Medline, Google Scholar
3 : Factors associated with increased collection of patient-reported outcomes within a large health care system. JAMA Netw Open 2020; 3:e202764Crossref, Medline, Google Scholar
4 : Incorporating the patient’s perspective into drug development and communication: an ad hoc task force report of the Patient-Reported Outcomes (PRO) Harmonization Group meeting at the Food and Drug Administration, February 16, 2001. Value Health 2003; 6:522–531Crossref, Medline, Google Scholar
5 : The National Institutes of Health Patient-Reported Outcomes Measurement Information System (PROMIS): a view from the UK. Patient Relat Outcome Meas 2018; 9:345–352Crossref, Medline, Google Scholar
6 : The Patient-Reported Outcomes Measurement Information System (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005–2008. J Clin Epidemiol 2010; 63:1179–1194Crossref, Medline, Google Scholar
7 : Incorporating patient-reported outcomes into health care to engage patients and enhance care. Health Aff 2016; 35:575–582Crossref, Google Scholar
8 : Patient-Reported Outcome (PRO) Consortium translation process: consensus development of updated best practices. J Patient Rep Outcomes 2017; 2:12Crossref, Medline, Google Scholar
9 : Transparency about the outcomes of mental health services (IAPT approach): an analysis of public data. Lancet 2018; 391:679–686Crossref, Medline, Google Scholar
10 : Routine outcome measurement in Australia. Int Rev Psychiatry 2015; 27:264–275Crossref, Medline, Google Scholar
11 : Routine use of patient reported outcome measures (PROMs) for improving treatment of common mental health disorders in adults. Cochrane Database Syst Rev 2016; 7:CD011119Medline, Google Scholar
12 : Patient-reported outcomes: instrument development and selection issues. Value Health 2007; 10(suppl 2):S86–S93Crossref, Medline, Google Scholar
13 : Development of EMPRO: a tool for the standardized assessment of patient-reported outcome measures. Value Health 2008; 11:700–708Crossref, Medline, Google Scholar
14 : The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) and how to select an outcome measurement instrument. Braz J Phys Ther 2016; 20:105–113Crossref, Medline, Google Scholar
15 : Implementing patient-reported outcomes assessment in clinical practice: a review of the options and considerations. Qual Life Res 2012; 21:1305–1314Crossref, Medline, Google Scholar
16 : Estimating the true global burden of mental illness. Lancet Psychiatry 2016; 3:171–178Crossref, Medline, Google Scholar
17 : Meta-analyses of the incidence and prevalence of schizophrenia: conceptual and methodological issues. Int J Methods Psychiatr Res 2008; 17:55–61Crossref, Medline, Google Scholar
18 : Global epidemiology and burden of schizophrenia: findings from the Global Burden of Disease Study 2016. Schizophr Bull 2018; 44:1195–1203Crossref, Medline, Google Scholar
19 : Years of potential life lost and life expectancy in schizophrenia: a systematic review and meta-analysis. Lancet Psychiatry 2017; 4:295–301Crossref, Medline, Google Scholar
20 : International incidence of psychotic disorders, 2002–17: a systematic review and meta-analysis. Lancet Public Health 2019; 4:e229–e244Crossref, Medline, Google Scholar
21 : The Lancet Psychiatry Commission: a blueprint for protecting physical health in people with mental illness. Lancet Psychiatry 2019; 6:675–712Crossref, Medline, Google Scholar
22 : Prevalence, incidence and mortality from cardiovascular disease in patients with pooled and specific severe mental illness: a large-scale meta-analysis of 3,211,768 patients and 113,383,368 controls. World Psychiatry 2017; 16:163–180Crossref, Medline, Google Scholar
23 : Risk of metabolic syndrome and its components in people with schizophrenia and related psychotic disorders, bipolar disorder and major depressive disorder: a systematic review and meta-analysis. World Psychiatry 2015; 14:339–347Crossref, Medline, Google Scholar
24 : The prevalence and burden of bipolar disorder: findings from the Global Burden of Disease Study 2013. Bipolar Disord 2016; 18:440–450Crossref, Medline, Google Scholar
25 : Review and meta-analysis of epidemiologic studies of adult bipolar disorder. J Clin Psychiatry 2017; 78:e1259–e1269Crossref, Medline, Google Scholar
26 : A systematic review and meta-analysis of recovery in schizophrenia. Schizophr Bull 2013; 39:1296–1306Crossref, Medline, Google Scholar
27 : Conceptual framework for personal recovery in mental health: systematic review and narrative synthesis. Br J Psychiatry 2011; 199:445–452Crossref, Medline, Google Scholar
28 : Comparison of early intervention services vs treatment as usual for early-phase psychosis: a systematic review, meta-analysis, and meta-regression. JAMA Psychiatry 2018; 75:555–565Crossref, Medline, Google Scholar
29 Psychological approaches to understanding and promoting recovery in psychosis and bipolar disorder: a mixed-methods approach. Programme Grants Appl Res 2016; 4; doi: 10.3310/pgfar04050Crossref, Google Scholar
30 : REFOCUS-PULSAR recovery-oriented practice training in specialist mental health care: a stepped-wedge cluster randomised controlled trial. Lancet Psychiatry 2019; 6:103–114Crossref, Medline, Google Scholar
31 : Canadian practice guidelines for comprehensive community treatment for schizophrenia and schizophrenia spectrum disorders. Can J Psychiatry 2017; 62:662–672Crossref, Medline, Google Scholar
32 : A review of the health-related quality of life literature in bipolar disorder. Qual Life Res 2001; 10:105–115Crossref, Medline, Google Scholar
33 : Functional outcome assessment in bipolar disorder: a systematic literature review. Bipolar Disord 2019; 21:194–214Crossref, Medline, Google Scholar
34 : Standardizing patient outcomes measurement. N Engl J Med 2016; 374:504–506Crossref, Medline, Google Scholar
35 : Standardization of health outcomes assessment for depression and anxiety: recommendations from the ICHOM Depression and Anxiety Working Group. Qual Life Res 2017; 26:3211–3225Crossref, Medline, Google Scholar
36 : Updated guidance for trusted systematic reviews: a new edition of the Cochrane Handbook for Systematic Reviews of Interventions. Cochrane Database Syst Rev 2019; 10:ED000142Medline, Google Scholar
37 : Cost comparison of laparoscopic colectomy versus open colectomy in colon cancer. Curr Med Res Opin 2017; 33:1215–1221Crossref, Medline, Google Scholar
38 : Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Qual Life Res 2009; 18:1115–1123Crossref, Medline, Google Scholar
39 : Recovering Quality of Life (ReQoL): a new generic self-reported outcome measure for use with people experiencing mental health difficulties. Br J Psychiatry 2018; 212:42–49Crossref, Medline, Google Scholar
40 : Reliability and validity of a modified Colorado Symptom Index in a national homeless sample. Ment Health Serv Res 2001; 3:141–153Crossref, Medline, Google Scholar
41 : The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med 2001; 16:606–613Crossref, Medline, Google Scholar
42 : The Altman Self-Rating Mania Scale. Biol Psychiatry 1997; 42:948–955Crossref, Medline, Google Scholar
43 : Development and validation of patient-reported outcome measures for sleep disturbance and sleep-related impairments. Sleep 2010; 33:781–792Crossref, Medline, Google Scholar
44 : Measuring Health and Disability: Manual for WHO Disability Assessment Schedule (WHODAS 2.0). Geneva, World Health Organization, 2010Google Scholar
45 : Reliability, construct and criterion validity of the KIDSCREEN-10 score: a short measure for children and adolescents’ well-being and health-related quality of life. Qual Life Res 2010; 19:1487–1500Crossref, Medline, Google Scholar
46 : The PHQ-15: validity of a new measure for evaluating the severity of somatic symptoms. Psychosom Med 2002; 64:258–266Crossref, Medline, Google Scholar
47 : A new self-rating scale for detecting atypical or second-generation antipsychotic side effects. J Psychopharmacol 2008; 22:238–243Crossref, Medline, Google Scholar
48 : Validation of the PHQ-9 in a psychiatric sample. J Affect Disord 2016; 193:267–273Crossref, Medline, Google Scholar
49 : The Patient Health Questionnaire somatic, anxiety, and depressive symptom scales: a systematic review. Gen Hosp Psychiatry 2010; 32:345–359Crossref, Medline, Google Scholar
50 : Development and Validation of the Recovering Quality of Life (ReQoL) Outcome Measures. EEPRU Research Report 050. Sheffield, United Kingdom, Policy Research Unit in Economic Evaluation of Health and Care Interventions, Universities of Sheffield and York, 2017Google Scholar
51 : ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research. Qual Life Res 2013; 22:1889–1905Crossref, Medline, Google Scholar
52 : Psychometric Theory. New York, McGraw-Hill, 1978Google Scholar
53 : An international collaborative standardizing a comprehensive patient-centered outcomes measurement set for colorectal cancer. JAMA Oncol 2017; 3:686–694Crossref, Medline, Google Scholar
54 : The Self-Administered Comorbidity Questionnaire: a new method to assess comorbidity for clinical and health services research. Arthritis Rheum 2003; 49:156–163Crossref, Medline, Google Scholar
55 : COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res 2018; 27:1147–1157Crossref, Medline, Google Scholar
56 : Experiences, perspectives and priorities of people with schizophrenia spectrum disorders regarding sleep disturbance and its treatment: a qualitative study. BMC Psychiatry 2017; 17:158Crossref, Medline, Google Scholar
57 : The importance of content and face validity in instrument development: lessons learnt from service users when developing the Recovering Quality of Life measure (ReQoL). Qual Life Res 2018; 27:1893–1902Crossref, Medline, Google Scholar
58 : Integrating qualitative and quantitative data in the development of outcome measures: the case of the Recovering Quality of Life (ReQoL) measures in mental health populations. Int J Environ Res Public Health 2018; 15:15Crossref, Google Scholar
59 : Screening for negative symptoms: preliminary results from the self-report version of the Clinical Assessment Interview for Negative Symptoms. Schizophr Res 2012; 135:139–143Crossref, Medline, Google Scholar
60 : Confirmatory factor analysis of the Quality of Life Scale and new proposed factor structure for the Quality of Life Scale–Revised. Schizophr Res 2017; 181:117–123Crossref, Medline, Google Scholar
61 : Patient involvement in the development of patient-reported outcome measures: a scoping review. Health Expect 2017; 20:11–23Crossref, Medline, Google Scholar
62 : A systematic review of mental health outcome measures for young people aged 12 to 25 years. BMC Psychiatry 2015; 15:279Crossref, Medline, Google Scholar
63 : Clinical significance: a statistical approach to defining meaningful change in psychotherapy research. J Consult Clin Psychol 1991; 59:12–19Crossref, Medline, Google Scholar
64 : Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials 1989; 10:407–415Crossref, Medline, Google Scholar
65 : Equivalence of electronic and paper-and-pencil administration of patient-reported outcome measures: a meta-analytic review. Value Health 2008; 11:322–333Crossref, Medline, Google Scholar