Abstract

A growing body of evidence from preclinical, translational, and clinical studies supports a bidirectional relationship within the gut-brain axis that contributes to neurobiological symptoms including anxiety, depression, fatigue, stress, and sleep disturbance. These symptoms have a significant impact on health-related quality of life and functional ability in individuals with inflammatory bowel disease. Clinical studies that generate high-quality evidence on pharmacological and nonpharmacological (eg, psychosocial, behavioral) interventions are needed to ultimately improve access to safe and effective therapies that have a meaningful impact on patients and to guide medical and regulatory decisions. This review outlines a framework for designing and conducting randomized controlled trials for interventions that target neurobiological symptoms in patients with inflammatory bowel disease based on the most recent guidance published within the past 5 years from policy makers, clinicians specialized in inflammatory bowel disease, patient-reported outcomes methodologists, health economists, patient advocates, industry representatives, ethicists, and clinical trial experts.

Introduction

The pathogenesis of inflammatory bowel disease (IBD) involves genetic susceptibility, environmental factors, gut microbiome dysbiosis, and immune dysregulation, although the exact mechanism remains to be elicited.1 A growing body of evidence from preclinical, translational, and clinical studies in patients with IBD suggest that dynamic bidirectional interactions within the gut-brain axis contribute to neurobiological symptoms such as stress,2 depression,3, 4 anxiety,3 fatigue,5, 6 and sleep disturbance.7 These neurobiological symptoms have significant negative impacts on an individual’s functional ability, emotional well-being, and quality of life and contribute to the overall disease burden in IBD with an increased risk for disease relapse and health care utilization.8-11

Brain-to-gut signaling through neuroendocrine transmission between the enteric, autonomic, and central nervous system in IBD is supported by preclinical and clinical observations detailing the effects of psychological distress on proinflammatory cytokines (eg, tumor necrosis factor-alpha, interleukins),12 intestinal permeability,13, 14 and the intestinal microbiota composition in patients with IBD.15-17 Gut-to-brain signaling in IBD is supported by evidence from preclinical and clinical observations that intestinal inflammation, intestinal microbiota composition, and structural changes to the central nervous system in patients with IBD affect neurocognitive function and psychological well-being, including mood disorders (eg, anxiety, depression),3, 18 alteration of circadian rhythm (eg, sleep disturbances),6, 19 fatigue,5 and pain perception.3

Pragmatic clinical research that investigates the extent to which pharmacological and nonpharmacological interventions impact the overall disease burden of IBD and neurobiological symptoms from brain-gut interactions is urgently needed.11, 20, 21 Clinical studies should be designed in a way that isolates the effect(s) of the intervention from other influences, such as “spontaneous change in the course of the disease, placebo effect, or biased observation.” 22 Although meaningful changes in quality of life and neurobiological symptoms can be patient-reported (eg, have heterogeneous expression and interpretation) and can be influenced by the natural variability of the disease (particularly with the relapsing and remitting nature of IBD), the goal of clinical trial design is to construct efficient and effective studies within these research constraints to support the development of treatments that are better tailored to patients’ needs.23

Patient-reported outcomes (PRO) are now at the forefront of determining treatment efficacy in clinical studies because it is important to include the perspectives of those living with a condition in the determination of meaningful endpoints.23, 24 The U.S. Food & Drug Administration (FDA) mandates that “symptoms or other unobservable concepts known only to the patient can only be measured by PRO measures,” which are reported “directly from the patient (i.e., study subject) about the status of a patient’s health condition without amendment or interpretation of the patient’s response by a clinician or anyone else.” 25

This architectural article aims to provide a framework for approaching clinical trials of pharmacological and biopsychosocial therapies that target modifiable neurobiological factors in the gut-brain axis in IBD to potentially aid investigators, pharmaceutical companies, and granting agencies to “help ensure that patients’ voices are central to informing shared decision making, labeling claims, clinical guidelines, and health policy, making patient-centered care a reality.” 24 The content of this article reflects the most updated guidance from the International Council for Harmonisation for contemporary clinical study design, which is endorsed by the FDA23 and the European Medicines Agency26; the international consensus-based Guidelines for Inclusion of Patient-Reported Outcomes in Clinical Trial Protocols24; and the most recent guidance for clinical trial designs and PRO instruments published within the past 5 years.22, 23, 25, 27-32

STRUCTURAL DESIGN, INTERVENTIONS, AND CONTROL GROUPS

Study Design

Single-arm trials, placebo-controlled trials, crossover trials, active comparator trials, and comparisons with no treatment are examples of clinical trial designs used to test the efficacy of an intervention depending on the research objectives. 23 Interventions in clinical trials that target neurobiological symptoms in IBD can be broadly classified into 2 categories: pharmacological and nonpharmacological. Fig. 1 illustrates the factors to consider in the design of a clinical trial for interventions of neurobiological symptoms in individuals with IBD depending on the research objective, the specific endpoints of interest, and the target population.

Framework for clinical trials for interventions targeting neurobiological symptoms in individuals with IBD. Outlines 8 key elements to consider in the design of clinical trials for interventions that target neurobiological symptoms in individuals with IBD. The investigators’ research objectives and hypotheses impact the methodological approach, target population, instruments for data collection, endpoint determination, and data analysis.
Figure 1.

Framework for clinical trials for interventions targeting neurobiological symptoms in individuals with IBD. Outlines 8 key elements to consider in the design of clinical trials for interventions that target neurobiological symptoms in individuals with IBD. The investigators’ research objectives and hypotheses impact the methodological approach, target population, instruments for data collection, endpoint determination, and data analysis.

Interventions

The majority of pharmacological interventions in clinical trials that target neurobiological symptoms in IBD have involved antidepressants33 (eg, duloxetine, fluoxetine, buproprion34), although some trials have investigated the impact of biologic therapy on neurobiological symptoms as secondary outcomes.35 In contrast, many forms of nonpharmacological therapies have been studied including behavioral and cognitive therapies,34, 36-39 dietary supplementation (eg, ferric maltol, omega-3, Agaricus blazei Murrill-based mushroom extract),40 and complementary and alternative medicine (eg, acupuncture, yoga, exercise regimens).40-42 The most common form of behavioral/cognitive intervention that has been studied to date is cognitive-behavioral therapy (CBT), in which maladaptive thinking such as negative illness beliefs (eg, “My illness has complete control over my life”) and dysfunctional attitudes (eg, “I have to be perfect to be happy”) are reduced through interventions such as behavioral activation, response prevention, imagination, and reframing.43 Other forms of psychotherapeutic interventions studied to date include problem-solving, stress management, meditation, guided imagery, hypnotherapy, and yoga with mixed results in patients with symptomatically quiescent IBD.36, 40, 42

More recently, 2 newer forms of psychotherapeutic interventions with positive effects on stress and fatigue in individuals with IBD were published: acceptance and commitment therapy, in which an individual is encouraged to accept the existence of negative thoughts or feelings (ie, to foster “psychological flexibility” rather than to eliminate these thoughts or feelings as with CBT),44 and stepped-care intervention with behavioral therapy and thereafter pharmacologic therapy (ie, using bupropion, an antidepressant) for the subset of patients with persistent neurological symptoms.34 These results have not yet been replicated but are promising avenues for further research.

Control Groups and Comparators

Standard-of-care and waitlist control groups

The Council for International Organizations of Medical Sciences recognizes that “the use of placebo is usually uncontroversial in the absence of an established effective intervention” and that researchers do not “delay or withhold an established effective intervention from participants.” 45 In clinical trials that target neurobiological symptoms in patients with IBD (eg, fatigue, anxiety, depression, and stress), the most common comparison groups for psychotherapeutic and behavioral interventions have been (1) a standard-of-care control group, also called best medical management, usual care, or treatment as usual,8, 36, 37, 40, 42, 44, 46 and (2) a waitlist control group, in which participants receive the standard of care with the option of receiving the intervention at the end of the study period (Fig. 1).8, 36, 39, 40, 42, 43

Goren et al39 recently published in 2021 an example of a randomized parallel-group physician-blinded controlled trial with a waitlist control group: 139 patients with mild-to-moderate Crohn disease activity were randomized to participate in a 3-month program of cognitive-behavioral and mindfulness-based stress reduction using daily exercise taught weekly via one-on-one videoconferences with social workers (n = 70 intervention arm) or to a waitlist control group (n = 69). Waitlisted participants received treatment as usual until the end of the study period (completion of the last follow-up outcomes assessments at 3 months) and then had the opportunity to participate in the stress-reduction program.39

Although some researchers may regard waitlist control groups as being more equitable for study participants than standard-of-care control groups in psychological and behavioral intervention studies, drawbacks that should be considered include program administration costs/time (higher/more in waitlist control groups than in standard-of-care control groups) and the potential risk of overestimating the active treatment effects. Waitlisted patients may be less inclined to engage in self-help behavior they would otherwise engage in if they did not have the expectation of the intervention at the end of the study, and the anticipation of the intervention may confound the outcomes (report less improvement than standard-of-care control groups).47

Attention, sham, and placebo control groups

Several psychotherapeutic trials in IBD have used attention control groups as a form of sham comparator where participants in the control group receive the same frequency and duration of interpersonal interaction with research personnel and are engaged in comparable control activities (eg, educational lectures on IBD) as the intervention group (eg, mind-body psychotherapy; Fig. 1).37, 48, 49 Attention control groups in behavioral intervention trials can in theory reduce biases from the Hawthorn effect, where participants’ responses are consciously or unconsciously influenced by attention from investigators (“showering of medical attention”).46, 50, 51 Other forms of placebo or sham control groups are uncommon, but 1 trial on the effects on fatigue in patients with IBD compared electroacupuncture with a sham procedure (needles were inserted in “non-acupuncture” locations on the body without an electric current).41

Placebo controls are most common in randomized controlled trials (RCTs) of pharmacotherapies to evaluate their impact on anxiety,33 depression,33 and fatigue in patients with IBD (Fig. 1).40 Placebos are inert substances with a form and administration indistinguishable from the active drug.45 Active substances that have been studied include adalimumab,35 antidepressants,33 and nutritional supplements.40 Active comparator trials, in which an investigational intervention is compared to an established effective intervention in the presence of credible uncertainty about its superiority (referred to as “clinical equipoise”),45 where neurobiological symptoms in patients with IBD were assessed as the primary outcome, have not been published to date, likely because of the lack of high-quality evidence-based studies already published for interventions with confirmed treatment efficacy on anxiety, depression, fatigue, stress, and sleep disturbance in the IBD population.8, 36, 52, 53

PATIENT SELECTION AND SAMPLE SIZE DETERMINATION

Target Population, Inclusion and Exclusion Criteria

Defining the target population in a study is important for endpoint determination and result analysis and interpretation, with implications for the relevance, reliability, and generalizability of a study’s observations.32 To illustrate this point, the FDA offered an example where “patients with late-stage disease may have different symptoms or perspectives than patients in the early stage.”31 Because patient selection for a particular study depends on the research objective and resource constraints, input should be sought from patients living with the condition, statisticians, and psychometricians when designing a clinical study.31, 32

Prototypical inclusion and exclusion criteria that could be considered when designing clinical trials for interventions on neurobiological symptoms in individuals with IBD are illustrated in Table 1. Factors to consider include IBD phenotype and disease activity (based on symptoms alone or with objective inflammatory burden), neurobiological symptom burden (eg, cutoffs on PROs; mild, moderate, or high symptomatic ratings), and psychiatric history (including medications and treatments).24, 36, 37 Depending on the research objective and the attributes associated with the endpoints of interest, the inclusion and exclusion criteria need to reflect “the characteristics of the target population, the disease or condition under study, and the intended use of study results” so that statements made about the qualitative/quantitative data are meaningful and relevant to the target population (Table 1).32

Table 1.

Prototypical Inclusion and Exclusion Criteria to Consider for Clinical Trials That Target Neurobiological Symptoms in Individuals With IBD

FactorsInclusion ConsiderationsExclusion Considerations
Age (y)≥18< 18
IBD phenotype, disease activity, and medicationsIBD subtypes: Crohn disease, UC;
prior IBD-related surgeries (eg, colectomy, small-bowel resection);
diagnostic criteria by clinical, endoscopic, and histologic confirmation
Indeterminate colitis; restriction of IBD disease activity (symptomatic, biochemical, endoscopic); changes in IBD medications in the past 3 months; concurrent corticosteroid use (potential psychological and endocrinological effects)
Psychological status and psychiatric historyPast participation in psychotherapeutic interventions; stable doses of antidepressants (dose or type) within last 3 mo; clinically significant baseline neurobiological symptoms (eg, cutoff scores on baseline psychological screening)Concurrent participation in psychotherapeutic intervention;
major psychiatric disorders per Diagnostic and Statistical Manual of Mental Disorders, 5th ed.; active alcohol, opioid, or illicit drug dependency
Medical and social historyAll relevant race/ethnicity, level of education, socioeconomic statusComorbid major medical illness that would require >3 medical visits/y, including severe cardiac or pulmonary disease, renal disease requiring dialysis, decompensated cirrhosis, and cancer; pregnant at time of enrollment
Literacy, access, and cognitive abilitiesInternet access and access to computer or mobile smartphone/tablet (eg, web-based assessments)Difficulty understanding English; numeracy skills (eg, quantitative scale assessments); physical or logistical limitations for in-person interventions/testing; cognitive impairments (dementia, intellectual disability)
FactorsInclusion ConsiderationsExclusion Considerations
Age (y)≥18< 18
IBD phenotype, disease activity, and medicationsIBD subtypes: Crohn disease, UC;
prior IBD-related surgeries (eg, colectomy, small-bowel resection);
diagnostic criteria by clinical, endoscopic, and histologic confirmation
Indeterminate colitis; restriction of IBD disease activity (symptomatic, biochemical, endoscopic); changes in IBD medications in the past 3 months; concurrent corticosteroid use (potential psychological and endocrinological effects)
Psychological status and psychiatric historyPast participation in psychotherapeutic interventions; stable doses of antidepressants (dose or type) within last 3 mo; clinically significant baseline neurobiological symptoms (eg, cutoff scores on baseline psychological screening)Concurrent participation in psychotherapeutic intervention;
major psychiatric disorders per Diagnostic and Statistical Manual of Mental Disorders, 5th ed.; active alcohol, opioid, or illicit drug dependency
Medical and social historyAll relevant race/ethnicity, level of education, socioeconomic statusComorbid major medical illness that would require >3 medical visits/y, including severe cardiac or pulmonary disease, renal disease requiring dialysis, decompensated cirrhosis, and cancer; pregnant at time of enrollment
Literacy, access, and cognitive abilitiesInternet access and access to computer or mobile smartphone/tablet (eg, web-based assessments)Difficulty understanding English; numeracy skills (eg, quantitative scale assessments); physical or logistical limitations for in-person interventions/testing; cognitive impairments (dementia, intellectual disability)
Table 1.

Prototypical Inclusion and Exclusion Criteria to Consider for Clinical Trials That Target Neurobiological Symptoms in Individuals With IBD

FactorsInclusion ConsiderationsExclusion Considerations
Age (y)≥18< 18
IBD phenotype, disease activity, and medicationsIBD subtypes: Crohn disease, UC;
prior IBD-related surgeries (eg, colectomy, small-bowel resection);
diagnostic criteria by clinical, endoscopic, and histologic confirmation
Indeterminate colitis; restriction of IBD disease activity (symptomatic, biochemical, endoscopic); changes in IBD medications in the past 3 months; concurrent corticosteroid use (potential psychological and endocrinological effects)
Psychological status and psychiatric historyPast participation in psychotherapeutic interventions; stable doses of antidepressants (dose or type) within last 3 mo; clinically significant baseline neurobiological symptoms (eg, cutoff scores on baseline psychological screening)Concurrent participation in psychotherapeutic intervention;
major psychiatric disorders per Diagnostic and Statistical Manual of Mental Disorders, 5th ed.; active alcohol, opioid, or illicit drug dependency
Medical and social historyAll relevant race/ethnicity, level of education, socioeconomic statusComorbid major medical illness that would require >3 medical visits/y, including severe cardiac or pulmonary disease, renal disease requiring dialysis, decompensated cirrhosis, and cancer; pregnant at time of enrollment
Literacy, access, and cognitive abilitiesInternet access and access to computer or mobile smartphone/tablet (eg, web-based assessments)Difficulty understanding English; numeracy skills (eg, quantitative scale assessments); physical or logistical limitations for in-person interventions/testing; cognitive impairments (dementia, intellectual disability)
FactorsInclusion ConsiderationsExclusion Considerations
Age (y)≥18< 18
IBD phenotype, disease activity, and medicationsIBD subtypes: Crohn disease, UC;
prior IBD-related surgeries (eg, colectomy, small-bowel resection);
diagnostic criteria by clinical, endoscopic, and histologic confirmation
Indeterminate colitis; restriction of IBD disease activity (symptomatic, biochemical, endoscopic); changes in IBD medications in the past 3 months; concurrent corticosteroid use (potential psychological and endocrinological effects)
Psychological status and psychiatric historyPast participation in psychotherapeutic interventions; stable doses of antidepressants (dose or type) within last 3 mo; clinically significant baseline neurobiological symptoms (eg, cutoff scores on baseline psychological screening)Concurrent participation in psychotherapeutic intervention;
major psychiatric disorders per Diagnostic and Statistical Manual of Mental Disorders, 5th ed.; active alcohol, opioid, or illicit drug dependency
Medical and social historyAll relevant race/ethnicity, level of education, socioeconomic statusComorbid major medical illness that would require >3 medical visits/y, including severe cardiac or pulmonary disease, renal disease requiring dialysis, decompensated cirrhosis, and cancer; pregnant at time of enrollment
Literacy, access, and cognitive abilitiesInternet access and access to computer or mobile smartphone/tablet (eg, web-based assessments)Difficulty understanding English; numeracy skills (eg, quantitative scale assessments); physical or logistical limitations for in-person interventions/testing; cognitive impairments (dementia, intellectual disability)

Because there is a poor correlation between symptomatic disease activity (eg, the Harvey-Bradshaw Index, the Simple Clinical Colitis Activity Index) and objective assessments of inflammation (eg, biochemical and endoscopic),54 the inclusion of biochemical and/or endoscopic evaluations (eg, fecal calprotectin, serum C-reactive protein, ileocolonoscopy) in the study design for enrollment, allocation, and outcomes assessment may offer additional insights into the impact of pharmacological and nonpharmacological interventions on the psychological comorbidities of somatization, anxiety, depression, fatigue, and stress39,54 (Fig. 1 and Table 1).

In individuals with IBD, the IBD subtype (primarily Crohn disease or ulcerative colitis [UC]) and disease activity (active vs quiescent) are some of the most prominent factors that delineate patterns of health care utilization, medical management, and neurobiological symptom burden.5, 8, 53, 55 The majority of RCTs conducted to date that have investigated psychotherapeutic interventions for neurobiological symptoms have been limited to patients with symptomatic remission.8, 36, 40, 42 This trend does not fully reflect the heterogeneity of disease activity and renders a knowledge gap about the extent to which adjunctive interventions that target neurobiological symptoms impact patients with symptomatically active disease alongside the use of dedicated pharmacologic therapies directed at reducing the inflammatory burden.2, 53, 55 The additional challenge to studying patients with active disease also needs to be recognized, however, because the use of concomitant medical therapies may confound outcomes assessment.37

Depending on the research objective, acceptable methods to delineate the effects of “attributes that are associated with the endpoints of interest” and approximate heterogeneity in the target population include (1) a subpopulation study, where the study sample is restricted to participants with characteristics of interest (eg, active vs quiescent disease), or (2) a subgroup analysis of a broader target population with a prespecified sample size determination to facilitate “weighing to account for the over-sampling (or under-sampling) of certain subpopulations if the sample is obtained with known sampling probabilities.” 32

An example of a recent subpopulation study (quiescent disease activity) is a feasibility RCT published in 2019 in which the effects of telephone-delivered CBT (intervention) on fatigue (measured by the IBD-Fatigue scale56) were compared between patients with clinical-biochemical quiescent IBD (measured by the Harvey-Bradshaw Index or the Simple Clinical Colitis Activity Index and the fecal calprotectin level within 3 months of study enrollment) and control patients who received a 4-page information sheet on fatigue in IBD (ie, without interaction with therapists).57

An example of subgroup analysis (UC vs Crohn disease) is the INSPIRE study by Boye et al58 that compared the impact of stress management psychotherapy on the quality of life and risk of relapse in patients with IBD. A total of 58 patients with UC and 56 with Crohn disease were randomized to participate in 6 to 9 weekly sessions of psychoeducation and CBT (intervention) or continue treatment as usual (control patients). Mixed linear model analyses were performed separately for UC and Crohn disease to delineate the intervention’s effects on the 2 IBD subtypes over the course of 18 months of follow-up. Patients with UC, as compared to Crohn disease, seemed to have more improvements in the disease course and health-related quality of life. However, this analysis was performed posthoc rather than being prespecified in the study design.58 Where there is interest in subgroup analysis (eg, UC vs Crohn disease) within the target population (eg, IBD), subgroup analysis should be determined a priori and the sample size should be calculated to ensure adequate representation from the subgroups.32

Sample Size Calculations

Sample size calculations allow investigators to determine the number of study participants to enroll to provide adequate clinically relevant data for statistical power23; however, enrolling too many patients can be unnecessarily costly or time-consuming.59 The number of patients should be sufficient to make statistical conclusions with adequate precision to detect treatment effect and to control the probabilities for false conclusion.23 A type I error rate (α) of 0.05 for 2-sided tests (ie, a 5% probability of falsely concluding that the intervention is different from the control when no difference exists) and 80% power (calculated as 1-β, where β is the probability of committing a type II error; ie, an 80% probability of identifying a treatment effect when indeed a true treatment effect exists and a 20% chance that a significant difference is missed) are most commonly used for sample size calculations in clinical trials.28 Sample size calculations should also take into account dropout and nonresponse rates to arrive at a recruitment target (depending on how missing data are accounted for).32 Past studies of psychosocial and pharmacological interventions targeting neurobiological symptoms in patients with IBD have had attrition rates of 0% to 50%, depending on the mode of intervention (eg, time commitment, travel requirements), duration of study (more participants are lost to follow-up as the study lengthens), and study setting (eg, in person, telephone, or videoconference; real-time or asynchronous; one-on-one or groups).21, 33, 34, 37, 60

When a PRO is the primary outcome, the target sample size is generally based on an a priori sample size calculation for that endpoint.24 The minimal clinically important difference (MCID) of the primary outcome can aid in the determination of the sample size (the number of patients needed in each treatment group to detect meaningful clinically significant differences).61 The MCID is a patient-centered concept that determines the magnitude of improvement meaningful to patients, because small differences (especially in large studies) may be statistically significant but lack clinical relevance from the perspectives of patients and clinicians.30, 62 The most common approaches for determining MCIDs for PRO measures are consensus, anchor, and distribution-based methods, although the details of MCID determination are beyond the scope of this article.61, 63 The FDA recommends anchor-based methods to establish a range of score change that represents meaningful within-patient changes (rather than group-level data).30 An example of anchor-based and distribution-based methods for determining the MCIDs for health-related quality-of-life outcome measures in IBD have been outlined for the GEMINI II and GEMINI III vedolizumab registration trials.64

SELECTION OF PRO ENDPOINTS

Clinical benefit is defined as a positive clinically meaningful change on how an individual feels, functions, or survives; these elements are measured in research studies using clinical outcome assessments reported by patients, clinicians, or observers or are assessed by performance measures.25 The scope of this article focuses on PROs for anxiety, depression, sleep quality, stress, and fatigue, which are important contributing factors to health-related quality of life and disability among patients with IBD5, 9, 12, 15, 16, 52 and are regarded as “long-term treatment targets irrespective of other objective markers of inflammation” according to the recently updated STRIDE-II study.20

The National Institutes of Health Patient-Reported Outcomes Measurement Information System (PROMIS) has developed more than 300 valid, responsive, and general (not disease-specific) PRO instruments for measuring the symptoms, function, well-being, and general health of individuals living with chronic conditions and in the general population (detailed information available at https://www.healthmeasures.net/explore-measurement-systems/promis).65 One major advantage of using PROMIS measures is that they were developed using the modern standards for systematic and rigorous PRO development methodologies, including literature reviews, qualitative item reviews, focus groups, cognitive interviewing, and psychometric testing. All PROMIS measures are interpreted using a T-score of 50 and an SD of 10 calibrated to the mean of the U.S. general population.65, 66 Higher scores indicate more symptoms within a domain. PROMIS scores < 55 are considered within normal limits, scores from 55 to 60 as mild, scores from 60 to 70 as moderate, and scores >70 as severe symptoms of the domain measured.66, 67 The MCID is considered as having a T-score of 2.5.68

Although PROMIS measures for depression, anxiety, fatigue, and sleep disturbances are available (Table 2) and have been studied for construct validity and responsiveness in a large sample of patients with IBD in the United States (10,634 patients with IBD: 6689 with Crohn disease and 3945 with UC/indeterminate colitis),69, 70 they have not been widely used in research studies among patients with IBD. Moreover, the FDA has not endorsed specific PROs for determining treatment efficacy in neurobiological domains in patients with IBD but rather encourages “early discussions with the FDA about outcome measure selection.”29, 31, 71 For example, in 2019, the FDA accepted the use of the 10-item PROMIS Short Form Fatigue as a determination of efficacy studies of adults with rheumatoid arthritis after it was first entered into the FDA’s Clinical Outcomes Assessment qualification program in 2016.27 Examples of the most commonly used PROs for neurobiological symptoms in observational and interventional studies in patients with IBD are illustrated below and in Table 2.

Table 2.

Validated PRO Measures for Depression, Anxiety, Stress, Fatigue, and Sleep Most Commonly Used in Studies of Individuals With IBD

PRO MeasurementDescription (estimated time to complete)InterpretationEstimated Scores in Patients With IBDRemarks
Depression
PROMIS Depression658 items, each scored 1-5, total score 38.2-81.3 (transformed scores into T-scores); recall period: past wk (2 min)Higher scores indicate more depressive symptoms. T-score = 50 and SD = 10 represent the general population; T-score > 50 is abnormal, 2.5 considered as MCID65, 68IBD T-score = 54 (SD = 10, n = 10,634); UC = 54 (SD = 10, n = 6689); CD = 54 (SD = 10, n = 3945)69Developed by the NIH; validated in IBD69, 77
HADS-Depression767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more depressive symptoms; cut points 8, 11, and 15 for mild, moderate, and severe depressionIBD mean = 5.4 (95% CI, 4.9-5.8; n = 7589)94; CD = 6.09 (95% CI ± 0.89; n = 1299); UC = 5.5 (95% CI ± 1.08; n = 1694)53Most commonly used in patients with IBD94
BDI-II8121 items, each scored 0-3, summed score 0-63; recall period: past 2 wk (5-10 min)Higher scores indicate more depressive symptoms; cut points 14, 20, and 29 for mild, moderate, and severe depressionIBD mean = 9.1 (95% CI, 8.6-9.5; n = 398)94Second most commonly used scale in patients with IBD94; emphasis on somatic symptoms (eg, loss of appetite, fatigue, loss of energy) may overestimate depression in medically ill patients77
PHQ-9789 items, each scored 0-3, summed score 0-27; recall period: past 2 wk (2-5 min)Higher scores indicate more depressive symptoms; cut points 5, 10, and 15 for mild, moderate, and severe depressionIBD = 7.6 (95% CI, 6.3-8.8; n = 926)94Contents overlap with common symptoms of inflammatory disorders (eg, lack of energy, sleep disturbance, appetite changes)77
Anxiety
PROMIS Anxiety658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more anxiety symptoms; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68IBD T-score = 52 (SD = 10, n = 10,634); UC = 52 (SD = 10, n = 6689); CD = 52 (SD = 9, n = 3945)69Developed by the NIH77; validated in patients with IBD and has psychometric properties compatible with HADS-A69, 77
HADS-A767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more anxiety; cut points 8, 11, and 15 for mild, moderate, and severe anxietyIBD mean = 7.5 (95% CI, 7.1-7.9; n = 8088)94; CD = 7.0 (95% CI ± 0.85; n = 1818); UC = 6.4 (95% CI ± 0.5; n = 1423)53Most commonly used in patients with IBD94
STAI8040 items (20 for trait, 20 for state), each scored 1-4, summed subscore range 20-80; recall period: “in general,” scored as “almost never,” “sometimes,” “often,” or “almost always” (10 min)Higher scores indicate more anxiety; cut point ≥40 for clinically significant anxiety (state is situation-specific; trait is personality trait)IBD mean = 43.0 (95% CI, 40.1-45.8; n = 857)94Second most commonly used scale in patients with IBD 94; copyright and requires purchase for use
GAD-7797 items, each scored 0-3, summed score 0-21; recall period: past 2 wk (2-5 min)Higher scores indicate more anxiety; cut points 5, 10, and 15 for mild, moderate, and severe anxietyMedian 3, interquartile range 1-7 (n = 242)77Commonly used in primary care setting but rarely in IBD populations57; lower sensitivity in IBD patients as compared to HADS-A and PROMIS Anxiety77
Stress
PROMIS Stress658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more stress; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68Not establishedDeveloped by the NIH but not tested in the IBD population
PSS-107210 items, each scored 0-4, summed score 0-40; recall period: past 4 wk (3 min)Higher scores indicate more perceived stress; cutoffs 13 and 27 for moderate and high stressIBD mean = 20.3 (SD = 7.2, n = 134)95A 14-item PSS has been used in prior studies of patients with IBD55 but has weaker psychometric properties than the 10-item version75, 95
Fatigue
PROMIS Short Form Fatigue658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more fatigue; T-score > 50 abnormal, 2.5 MCID68IBD T-score = 56 (SD = 11, n = 10,634); UC = 56 (SD = 11, n = 6689); CD = 54 (SD = 11, n = 3945)69Developed by the NIH and has compatible psychometric performance to the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
FACIT-F8413 items, each scored 0-4, summed score 0-52; recall period: past 7 d (3 min)Lower scores indicate more fatigue; cutoff ≤ 30 for clinically significant fatigue; ≥4 point increase indicates clinically significant improvement5IBD = 38.9 ± 11.0 (n = 209); CD = 38.6 ± 11.3 (n = 111); UC = 39.4 ± 10.6 (n = 96)82Most widely used; validated in patients with IBD and correlates with serum inflammatory markers5, 82
MFI8420 items, each scored 1-5, for 5 dimensions of fatigue (4 questions each with dimensional summed score 4-20); recall period: “lately” (5-10 min)Higher scores indicate greater levels of fatigue; cutoff of ≥13 indicates significant fatigueNot establishedMultifactorial description of fatigue (general, physical, mental, activity, motivation)84; widely used and validated in chronic diseases but not in IBD5, 40
IBD-F scale5630 items: 25 items, each scored 0-4, subsection scores 0-20 (quantifies fatigue) and 0-120 (impact of fatigue); 5 unscored open-ended questions; recall period: past 30 d (10 min)Higher scores indicate higher fatigue symptoms and impactNot establishedDeveloped in consultation with patients with IBD and designed as an IBD-specific fatigue scale but has not been widely used5, 57
Sleep Quality
PROMIS Sleep Disturbance 8a Short Form708 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more sleep disturbance; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCIDIBD T-score = 52 (SD = 9, n = 10,634); UC = 53 (SD = 9, n = 6689); CD = 52 (SD = 8, n = 3945)69Developed by the NIH and has psychometric performance compatible with the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
PSQI8919 items, each scored 0-3, summed score 0-21 (7 domains: sleep duration, night awakenings, sleep latency, daytime dysfunction, sleep efficiency, sleep quality, use of sleep medications); recall period: past 30 d; (5-10 min)Higher scores indicate worse sleep quality; >5 points clinically significant sleep disturbanceCD: mean = 7.13 (SD = 4.33, n = 405); UC: mean = 6.36 (SD = 4.22, n = 255)6Most commonly used in patients with IBD7, 90
ESS918 items, each scored 0-3, summed score 0-24; recall period: “in recent times” (2 min)Higher scores indicate more daytime sleepiness; cutoffs 11, 13, and 16 for mild, moderate, and severe excessive daytime sleepinessNot establishedUsed in a few observational studies in patients with IBD in conjunction with PSQI,88 but less correlated with psychological symptoms and sleep diaries than PSQI92
PRO MeasurementDescription (estimated time to complete)InterpretationEstimated Scores in Patients With IBDRemarks
Depression
PROMIS Depression658 items, each scored 1-5, total score 38.2-81.3 (transformed scores into T-scores); recall period: past wk (2 min)Higher scores indicate more depressive symptoms. T-score = 50 and SD = 10 represent the general population; T-score > 50 is abnormal, 2.5 considered as MCID65, 68IBD T-score = 54 (SD = 10, n = 10,634); UC = 54 (SD = 10, n = 6689); CD = 54 (SD = 10, n = 3945)69Developed by the NIH; validated in IBD69, 77
HADS-Depression767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more depressive symptoms; cut points 8, 11, and 15 for mild, moderate, and severe depressionIBD mean = 5.4 (95% CI, 4.9-5.8; n = 7589)94; CD = 6.09 (95% CI ± 0.89; n = 1299); UC = 5.5 (95% CI ± 1.08; n = 1694)53Most commonly used in patients with IBD94
BDI-II8121 items, each scored 0-3, summed score 0-63; recall period: past 2 wk (5-10 min)Higher scores indicate more depressive symptoms; cut points 14, 20, and 29 for mild, moderate, and severe depressionIBD mean = 9.1 (95% CI, 8.6-9.5; n = 398)94Second most commonly used scale in patients with IBD94; emphasis on somatic symptoms (eg, loss of appetite, fatigue, loss of energy) may overestimate depression in medically ill patients77
PHQ-9789 items, each scored 0-3, summed score 0-27; recall period: past 2 wk (2-5 min)Higher scores indicate more depressive symptoms; cut points 5, 10, and 15 for mild, moderate, and severe depressionIBD = 7.6 (95% CI, 6.3-8.8; n = 926)94Contents overlap with common symptoms of inflammatory disorders (eg, lack of energy, sleep disturbance, appetite changes)77
Anxiety
PROMIS Anxiety658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more anxiety symptoms; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68IBD T-score = 52 (SD = 10, n = 10,634); UC = 52 (SD = 10, n = 6689); CD = 52 (SD = 9, n = 3945)69Developed by the NIH77; validated in patients with IBD and has psychometric properties compatible with HADS-A69, 77
HADS-A767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more anxiety; cut points 8, 11, and 15 for mild, moderate, and severe anxietyIBD mean = 7.5 (95% CI, 7.1-7.9; n = 8088)94; CD = 7.0 (95% CI ± 0.85; n = 1818); UC = 6.4 (95% CI ± 0.5; n = 1423)53Most commonly used in patients with IBD94
STAI8040 items (20 for trait, 20 for state), each scored 1-4, summed subscore range 20-80; recall period: “in general,” scored as “almost never,” “sometimes,” “often,” or “almost always” (10 min)Higher scores indicate more anxiety; cut point ≥40 for clinically significant anxiety (state is situation-specific; trait is personality trait)IBD mean = 43.0 (95% CI, 40.1-45.8; n = 857)94Second most commonly used scale in patients with IBD 94; copyright and requires purchase for use
GAD-7797 items, each scored 0-3, summed score 0-21; recall period: past 2 wk (2-5 min)Higher scores indicate more anxiety; cut points 5, 10, and 15 for mild, moderate, and severe anxietyMedian 3, interquartile range 1-7 (n = 242)77Commonly used in primary care setting but rarely in IBD populations57; lower sensitivity in IBD patients as compared to HADS-A and PROMIS Anxiety77
Stress
PROMIS Stress658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more stress; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68Not establishedDeveloped by the NIH but not tested in the IBD population
PSS-107210 items, each scored 0-4, summed score 0-40; recall period: past 4 wk (3 min)Higher scores indicate more perceived stress; cutoffs 13 and 27 for moderate and high stressIBD mean = 20.3 (SD = 7.2, n = 134)95A 14-item PSS has been used in prior studies of patients with IBD55 but has weaker psychometric properties than the 10-item version75, 95
Fatigue
PROMIS Short Form Fatigue658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more fatigue; T-score > 50 abnormal, 2.5 MCID68IBD T-score = 56 (SD = 11, n = 10,634); UC = 56 (SD = 11, n = 6689); CD = 54 (SD = 11, n = 3945)69Developed by the NIH and has compatible psychometric performance to the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
FACIT-F8413 items, each scored 0-4, summed score 0-52; recall period: past 7 d (3 min)Lower scores indicate more fatigue; cutoff ≤ 30 for clinically significant fatigue; ≥4 point increase indicates clinically significant improvement5IBD = 38.9 ± 11.0 (n = 209); CD = 38.6 ± 11.3 (n = 111); UC = 39.4 ± 10.6 (n = 96)82Most widely used; validated in patients with IBD and correlates with serum inflammatory markers5, 82
MFI8420 items, each scored 1-5, for 5 dimensions of fatigue (4 questions each with dimensional summed score 4-20); recall period: “lately” (5-10 min)Higher scores indicate greater levels of fatigue; cutoff of ≥13 indicates significant fatigueNot establishedMultifactorial description of fatigue (general, physical, mental, activity, motivation)84; widely used and validated in chronic diseases but not in IBD5, 40
IBD-F scale5630 items: 25 items, each scored 0-4, subsection scores 0-20 (quantifies fatigue) and 0-120 (impact of fatigue); 5 unscored open-ended questions; recall period: past 30 d (10 min)Higher scores indicate higher fatigue symptoms and impactNot establishedDeveloped in consultation with patients with IBD and designed as an IBD-specific fatigue scale but has not been widely used5, 57
Sleep Quality
PROMIS Sleep Disturbance 8a Short Form708 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more sleep disturbance; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCIDIBD T-score = 52 (SD = 9, n = 10,634); UC = 53 (SD = 9, n = 6689); CD = 52 (SD = 8, n = 3945)69Developed by the NIH and has psychometric performance compatible with the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
PSQI8919 items, each scored 0-3, summed score 0-21 (7 domains: sleep duration, night awakenings, sleep latency, daytime dysfunction, sleep efficiency, sleep quality, use of sleep medications); recall period: past 30 d; (5-10 min)Higher scores indicate worse sleep quality; >5 points clinically significant sleep disturbanceCD: mean = 7.13 (SD = 4.33, n = 405); UC: mean = 6.36 (SD = 4.22, n = 255)6Most commonly used in patients with IBD7, 90
ESS918 items, each scored 0-3, summed score 0-24; recall period: “in recent times” (2 min)Higher scores indicate more daytime sleepiness; cutoffs 11, 13, and 16 for mild, moderate, and severe excessive daytime sleepinessNot establishedUsed in a few observational studies in patients with IBD in conjunction with PSQI,88 but less correlated with psychological symptoms and sleep diaries than PSQI92

BDI-II indicates the Beck Depression Inventory-II; CD, Crohn disease; CI, confidence interval; ESS, Epworth Sleepiness Scale; FACIT-F, Functional Assessment of Chronic Illness Therapy-Fatigue; GAD-7, Generalized Anxiety Disorder-7; HADS, Hospital Anxiety and Depression Scale; IBD-F, Inflammatory Bowel Disease-Fatigue; MFI, Multidimensional Fatigue Inventory; NIH, National Institutes of Health; PROMIS, Patient-Reported Outcomes Measurement Information System; PHQ-9, Patient-Health Questionnaire-9; PSQI, Pittsburgh Sleep Quality Index; PSS-10, 10-Item Perceived Stress Scale; STAI, State and Trait Anxiety Inventory.

Table 2.

Validated PRO Measures for Depression, Anxiety, Stress, Fatigue, and Sleep Most Commonly Used in Studies of Individuals With IBD

PRO MeasurementDescription (estimated time to complete)InterpretationEstimated Scores in Patients With IBDRemarks
Depression
PROMIS Depression658 items, each scored 1-5, total score 38.2-81.3 (transformed scores into T-scores); recall period: past wk (2 min)Higher scores indicate more depressive symptoms. T-score = 50 and SD = 10 represent the general population; T-score > 50 is abnormal, 2.5 considered as MCID65, 68IBD T-score = 54 (SD = 10, n = 10,634); UC = 54 (SD = 10, n = 6689); CD = 54 (SD = 10, n = 3945)69Developed by the NIH; validated in IBD69, 77
HADS-Depression767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more depressive symptoms; cut points 8, 11, and 15 for mild, moderate, and severe depressionIBD mean = 5.4 (95% CI, 4.9-5.8; n = 7589)94; CD = 6.09 (95% CI ± 0.89; n = 1299); UC = 5.5 (95% CI ± 1.08; n = 1694)53Most commonly used in patients with IBD94
BDI-II8121 items, each scored 0-3, summed score 0-63; recall period: past 2 wk (5-10 min)Higher scores indicate more depressive symptoms; cut points 14, 20, and 29 for mild, moderate, and severe depressionIBD mean = 9.1 (95% CI, 8.6-9.5; n = 398)94Second most commonly used scale in patients with IBD94; emphasis on somatic symptoms (eg, loss of appetite, fatigue, loss of energy) may overestimate depression in medically ill patients77
PHQ-9789 items, each scored 0-3, summed score 0-27; recall period: past 2 wk (2-5 min)Higher scores indicate more depressive symptoms; cut points 5, 10, and 15 for mild, moderate, and severe depressionIBD = 7.6 (95% CI, 6.3-8.8; n = 926)94Contents overlap with common symptoms of inflammatory disorders (eg, lack of energy, sleep disturbance, appetite changes)77
Anxiety
PROMIS Anxiety658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more anxiety symptoms; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68IBD T-score = 52 (SD = 10, n = 10,634); UC = 52 (SD = 10, n = 6689); CD = 52 (SD = 9, n = 3945)69Developed by the NIH77; validated in patients with IBD and has psychometric properties compatible with HADS-A69, 77
HADS-A767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more anxiety; cut points 8, 11, and 15 for mild, moderate, and severe anxietyIBD mean = 7.5 (95% CI, 7.1-7.9; n = 8088)94; CD = 7.0 (95% CI ± 0.85; n = 1818); UC = 6.4 (95% CI ± 0.5; n = 1423)53Most commonly used in patients with IBD94
STAI8040 items (20 for trait, 20 for state), each scored 1-4, summed subscore range 20-80; recall period: “in general,” scored as “almost never,” “sometimes,” “often,” or “almost always” (10 min)Higher scores indicate more anxiety; cut point ≥40 for clinically significant anxiety (state is situation-specific; trait is personality trait)IBD mean = 43.0 (95% CI, 40.1-45.8; n = 857)94Second most commonly used scale in patients with IBD 94; copyright and requires purchase for use
GAD-7797 items, each scored 0-3, summed score 0-21; recall period: past 2 wk (2-5 min)Higher scores indicate more anxiety; cut points 5, 10, and 15 for mild, moderate, and severe anxietyMedian 3, interquartile range 1-7 (n = 242)77Commonly used in primary care setting but rarely in IBD populations57; lower sensitivity in IBD patients as compared to HADS-A and PROMIS Anxiety77
Stress
PROMIS Stress658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more stress; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68Not establishedDeveloped by the NIH but not tested in the IBD population
PSS-107210 items, each scored 0-4, summed score 0-40; recall period: past 4 wk (3 min)Higher scores indicate more perceived stress; cutoffs 13 and 27 for moderate and high stressIBD mean = 20.3 (SD = 7.2, n = 134)95A 14-item PSS has been used in prior studies of patients with IBD55 but has weaker psychometric properties than the 10-item version75, 95
Fatigue
PROMIS Short Form Fatigue658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more fatigue; T-score > 50 abnormal, 2.5 MCID68IBD T-score = 56 (SD = 11, n = 10,634); UC = 56 (SD = 11, n = 6689); CD = 54 (SD = 11, n = 3945)69Developed by the NIH and has compatible psychometric performance to the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
FACIT-F8413 items, each scored 0-4, summed score 0-52; recall period: past 7 d (3 min)Lower scores indicate more fatigue; cutoff ≤ 30 for clinically significant fatigue; ≥4 point increase indicates clinically significant improvement5IBD = 38.9 ± 11.0 (n = 209); CD = 38.6 ± 11.3 (n = 111); UC = 39.4 ± 10.6 (n = 96)82Most widely used; validated in patients with IBD and correlates with serum inflammatory markers5, 82
MFI8420 items, each scored 1-5, for 5 dimensions of fatigue (4 questions each with dimensional summed score 4-20); recall period: “lately” (5-10 min)Higher scores indicate greater levels of fatigue; cutoff of ≥13 indicates significant fatigueNot establishedMultifactorial description of fatigue (general, physical, mental, activity, motivation)84; widely used and validated in chronic diseases but not in IBD5, 40
IBD-F scale5630 items: 25 items, each scored 0-4, subsection scores 0-20 (quantifies fatigue) and 0-120 (impact of fatigue); 5 unscored open-ended questions; recall period: past 30 d (10 min)Higher scores indicate higher fatigue symptoms and impactNot establishedDeveloped in consultation with patients with IBD and designed as an IBD-specific fatigue scale but has not been widely used5, 57
Sleep Quality
PROMIS Sleep Disturbance 8a Short Form708 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more sleep disturbance; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCIDIBD T-score = 52 (SD = 9, n = 10,634); UC = 53 (SD = 9, n = 6689); CD = 52 (SD = 8, n = 3945)69Developed by the NIH and has psychometric performance compatible with the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
PSQI8919 items, each scored 0-3, summed score 0-21 (7 domains: sleep duration, night awakenings, sleep latency, daytime dysfunction, sleep efficiency, sleep quality, use of sleep medications); recall period: past 30 d; (5-10 min)Higher scores indicate worse sleep quality; >5 points clinically significant sleep disturbanceCD: mean = 7.13 (SD = 4.33, n = 405); UC: mean = 6.36 (SD = 4.22, n = 255)6Most commonly used in patients with IBD7, 90
ESS918 items, each scored 0-3, summed score 0-24; recall period: “in recent times” (2 min)Higher scores indicate more daytime sleepiness; cutoffs 11, 13, and 16 for mild, moderate, and severe excessive daytime sleepinessNot establishedUsed in a few observational studies in patients with IBD in conjunction with PSQI,88 but less correlated with psychological symptoms and sleep diaries than PSQI92
PRO MeasurementDescription (estimated time to complete)InterpretationEstimated Scores in Patients With IBDRemarks
Depression
PROMIS Depression658 items, each scored 1-5, total score 38.2-81.3 (transformed scores into T-scores); recall period: past wk (2 min)Higher scores indicate more depressive symptoms. T-score = 50 and SD = 10 represent the general population; T-score > 50 is abnormal, 2.5 considered as MCID65, 68IBD T-score = 54 (SD = 10, n = 10,634); UC = 54 (SD = 10, n = 6689); CD = 54 (SD = 10, n = 3945)69Developed by the NIH; validated in IBD69, 77
HADS-Depression767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more depressive symptoms; cut points 8, 11, and 15 for mild, moderate, and severe depressionIBD mean = 5.4 (95% CI, 4.9-5.8; n = 7589)94; CD = 6.09 (95% CI ± 0.89; n = 1299); UC = 5.5 (95% CI ± 1.08; n = 1694)53Most commonly used in patients with IBD94
BDI-II8121 items, each scored 0-3, summed score 0-63; recall period: past 2 wk (5-10 min)Higher scores indicate more depressive symptoms; cut points 14, 20, and 29 for mild, moderate, and severe depressionIBD mean = 9.1 (95% CI, 8.6-9.5; n = 398)94Second most commonly used scale in patients with IBD94; emphasis on somatic symptoms (eg, loss of appetite, fatigue, loss of energy) may overestimate depression in medically ill patients77
PHQ-9789 items, each scored 0-3, summed score 0-27; recall period: past 2 wk (2-5 min)Higher scores indicate more depressive symptoms; cut points 5, 10, and 15 for mild, moderate, and severe depressionIBD = 7.6 (95% CI, 6.3-8.8; n = 926)94Contents overlap with common symptoms of inflammatory disorders (eg, lack of energy, sleep disturbance, appetite changes)77
Anxiety
PROMIS Anxiety658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more anxiety symptoms; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68IBD T-score = 52 (SD = 10, n = 10,634); UC = 52 (SD = 10, n = 6689); CD = 52 (SD = 9, n = 3945)69Developed by the NIH77; validated in patients with IBD and has psychometric properties compatible with HADS-A69, 77
HADS-A767 items, each scored 0-3, summed score 0-21; recall period: past 7 d (2 min)Higher scores indicate more anxiety; cut points 8, 11, and 15 for mild, moderate, and severe anxietyIBD mean = 7.5 (95% CI, 7.1-7.9; n = 8088)94; CD = 7.0 (95% CI ± 0.85; n = 1818); UC = 6.4 (95% CI ± 0.5; n = 1423)53Most commonly used in patients with IBD94
STAI8040 items (20 for trait, 20 for state), each scored 1-4, summed subscore range 20-80; recall period: “in general,” scored as “almost never,” “sometimes,” “often,” or “almost always” (10 min)Higher scores indicate more anxiety; cut point ≥40 for clinically significant anxiety (state is situation-specific; trait is personality trait)IBD mean = 43.0 (95% CI, 40.1-45.8; n = 857)94Second most commonly used scale in patients with IBD 94; copyright and requires purchase for use
GAD-7797 items, each scored 0-3, summed score 0-21; recall period: past 2 wk (2-5 min)Higher scores indicate more anxiety; cut points 5, 10, and 15 for mild, moderate, and severe anxietyMedian 3, interquartile range 1-7 (n = 242)77Commonly used in primary care setting but rarely in IBD populations57; lower sensitivity in IBD patients as compared to HADS-A and PROMIS Anxiety77
Stress
PROMIS Stress658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more stress; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCID65, 68Not establishedDeveloped by the NIH but not tested in the IBD population
PSS-107210 items, each scored 0-4, summed score 0-40; recall period: past 4 wk (3 min)Higher scores indicate more perceived stress; cutoffs 13 and 27 for moderate and high stressIBD mean = 20.3 (SD = 7.2, n = 134)95A 14-item PSS has been used in prior studies of patients with IBD55 but has weaker psychometric properties than the 10-item version75, 95
Fatigue
PROMIS Short Form Fatigue658 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more fatigue; T-score > 50 abnormal, 2.5 MCID68IBD T-score = 56 (SD = 11, n = 10,634); UC = 56 (SD = 11, n = 6689); CD = 54 (SD = 11, n = 3945)69Developed by the NIH and has compatible psychometric performance to the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
FACIT-F8413 items, each scored 0-4, summed score 0-52; recall period: past 7 d (3 min)Lower scores indicate more fatigue; cutoff ≤ 30 for clinically significant fatigue; ≥4 point increase indicates clinically significant improvement5IBD = 38.9 ± 11.0 (n = 209); CD = 38.6 ± 11.3 (n = 111); UC = 39.4 ± 10.6 (n = 96)82Most widely used; validated in patients with IBD and correlates with serum inflammatory markers5, 82
MFI8420 items, each scored 1-5, for 5 dimensions of fatigue (4 questions each with dimensional summed score 4-20); recall period: “lately” (5-10 min)Higher scores indicate greater levels of fatigue; cutoff of ≥13 indicates significant fatigueNot establishedMultifactorial description of fatigue (general, physical, mental, activity, motivation)84; widely used and validated in chronic diseases but not in IBD5, 40
IBD-F scale5630 items: 25 items, each scored 0-4, subsection scores 0-20 (quantifies fatigue) and 0-120 (impact of fatigue); 5 unscored open-ended questions; recall period: past 30 d (10 min)Higher scores indicate higher fatigue symptoms and impactNot establishedDeveloped in consultation with patients with IBD and designed as an IBD-specific fatigue scale but has not been widely used5, 57
Sleep Quality
PROMIS Sleep Disturbance 8a Short Form708 items, each scored 1-5, total score transformed scores into T-scores; recall period: past wk (2 min)Higher scores indicate more sleep disturbance; T-score = 50 and SD = 10 represent general population; T-score > 50 abnormal, 2.5 MCIDIBD T-score = 52 (SD = 9, n = 10,634); UC = 53 (SD = 9, n = 6689); CD = 52 (SD = 8, n = 3945)69Developed by the NIH and has psychometric performance compatible with the PSQI70; validated in IBD69; infrequently used in studies7, 86, 90
PSQI8919 items, each scored 0-3, summed score 0-21 (7 domains: sleep duration, night awakenings, sleep latency, daytime dysfunction, sleep efficiency, sleep quality, use of sleep medications); recall period: past 30 d; (5-10 min)Higher scores indicate worse sleep quality; >5 points clinically significant sleep disturbanceCD: mean = 7.13 (SD = 4.33, n = 405); UC: mean = 6.36 (SD = 4.22, n = 255)6Most commonly used in patients with IBD7, 90
ESS918 items, each scored 0-3, summed score 0-24; recall period: “in recent times” (2 min)Higher scores indicate more daytime sleepiness; cutoffs 11, 13, and 16 for mild, moderate, and severe excessive daytime sleepinessNot establishedUsed in a few observational studies in patients with IBD in conjunction with PSQI,88 but less correlated with psychological symptoms and sleep diaries than PSQI92

BDI-II indicates the Beck Depression Inventory-II; CD, Crohn disease; CI, confidence interval; ESS, Epworth Sleepiness Scale; FACIT-F, Functional Assessment of Chronic Illness Therapy-Fatigue; GAD-7, Generalized Anxiety Disorder-7; HADS, Hospital Anxiety and Depression Scale; IBD-F, Inflammatory Bowel Disease-Fatigue; MFI, Multidimensional Fatigue Inventory; NIH, National Institutes of Health; PROMIS, Patient-Reported Outcomes Measurement Information System; PHQ-9, Patient-Health Questionnaire-9; PSQI, Pittsburgh Sleep Quality Index; PSS-10, 10-Item Perceived Stress Scale; STAI, State and Trait Anxiety Inventory.

Stress

Early-life stress has been found to be a risk factor in the later development of IBD, and ongoing stressors exacerbate inflammation and relapse.12 Psychosocial stress has a mediating role in circulating levels of inflammatory cytokines, hypothalamic-pituitary axis, intestinal permeability, and alterations in the fecal and gut mucosal microbiota.12, 16 The Perceived Stress Scale (PSS-10) is a standardized, valid, and reliable psychometric measurement of perceived stress in psychophysical health research and among patients with IBD.2, 55, 72 The PSS-10 contains 10 items (each scored from 0 to 4, summed score range 0 to 40), with higher scores corresponding to higher levels of perceived stress (Table 2).73 Sample items in the PSS-10 are:

In the last month, how often have you been upset because of something that happened unexpectedly?

In the last month, how often have you felt nervous and “stressed?”

The response scale is as follows: 0 = never, 1 = almost never, 2 = sometimes, 3 = fairly often, and 4 = very often.

Anxiety and Depression

Patients with IBD, particularly those with symptomatically active disease, have higher rates of both anxiety and depression as compared to healthy control patients and individuals with chronic health conditions.53 Diagnoses of depression and anxiety may precede IBD diagnoses by as much as 3 to 5 years, based on data from 3 large studies in North America and the United Kingdom (the Nurses’ Health Study in the United States,74 a primary care database in the United Kingdom,4 and a population-based administrative provincial health database in Manitoba, Canada75). However, longitudinal prospective studies are lacking to support the causal relationship between depression/anxiety and IBD.12

The Hospital Anxiety and Depression Scale (HADS)76 is one of the most commonly used valid and reliable instruments to screen for anxiety and depression symptoms in studies of patients with IBD.12, 42, 53, 77 It contains 14 items (each scored from 0 to 3; summed subscale ranges from 0 to 21 for depression and anxiety), with higher scores indicating more anxiety or depression symptoms (Table 2).76 Sample items from HADS are as follows:

I feel life is not worth living.

Worrying thoughts constantly go through my mind.

I feel restless as if I have to be on the move.

The response scale is as follows: 0 = no, not at all; 1 = no, not much; 2 = yes, sometimes; and 3 = yes, definitely.

Other commonly used validated instruments for depression and anxiety are the Patient-Health Questionnaire,78 the Generalized Anxiety Disorder-7,79 the State and Trait Anxiety Inventory,80 and the Beck Depression Inventory-II.81 However, the content areas in the Patient-Health Questionnaire overlap with common symptoms of inflammatory disorders (eg, lack of energy, sleep disturbance, appetite changes) and the Beck Depression Inventory-II can overestimate depression in medically ill populations because of its emphasis on somatic symptoms (eg, weight loss, appetite).53 In comparison, the HADS minimizes the overlapping of physical symptoms with depressive/anxiety symptoms and requires one-quarter of the time for patients to complete the questions as compared to the other instruments.36, 42, 53 Both the HADS and the State and Trait Anxiety Inventory are protected by copyright, and there is a licensing fee for their authorized use.

Fatigue

Fatigue is a multidimensional symptom with physiologic, emotional, behavioral, and cognitive components that have been defined as “an overwhelming, sustained sense of exhaustion and decreased capacity for physical and mental work.”82 Fatigue is one of the most common disabling symptoms reported by more than 80% of adults with IBD.5

The Functional Assessment of Chronic Illness Therapy-Fatigue (FACIT-F) is the most commonly used valid and reliable instrument to measure fatigue in studies of patients with IBD.40, 83 Although originally developed for measuring fatigue in patients with anemia receiving oncology care, the FACIT-F has been validated in U.S. patients with IBD and has shown high internal consistency, reliability, and construct validity with strong correlations with symptomatic disease activity (using the Harvey-Bradshaw Index for Crohn disease and the Simple Clinical Colitis Activity Index for UC) and with biochemical surrogate markers of disease activity (C-reactive protein, erythrocyte sedimentation rate, and hematocrit).82 The FACIT-F contains 13 items (each scored from 0 to 4, summed range 0 to 52), with higher scores indicating better functioning and less fatigue (Table 2).82 Sample items from FACIT-F are as follows:

I have trouble finishing things because I am tired.

I have to limit my social activity because I am tired.

The response scale is as follows: 0 = not at all, 1 = a little bit, 2 = somewhat, 3 = quite a bit, and 4 = very much.

The Multidimensional Fatigue Inventory (20 items, each scored from 1 to 7)84 is another commonly used scale to assess for fatigue, but it has not been validated for use in patients with IBD.56 Another less commonly used measurement is the IBD-Fatigue scale (25 items scored from 0 to 4, and 5 unscored open-ended questions), which was developed with input from patients with IBD but has not been widely used (Table 2).38, 56

Sleep Disturbance

Sleep quality impairments are common in patients with IBD, even during clinical remission (up to 77%).10 Sleep impairment can activate inflammatory pathways and has been found to be associated with histologic inflammation,85 endoscopic inflammation,85 risk of Crohn disease relapse,86 hospitalization,10 and surgery.10 Moreover, sleep disturbances and excessive daytime sleepiness in patients with IBD may be driven by factors beyond the measure of objective sleep parameters such as polysomnography,87 actigraphy (motion-based watch-like devices that record sleep patterns and circadian rhythms),88 and melatonin metabolites.88

The Pittsburgh Sleep Quality Index (PSQI)89 is the most commonly used valid and reliable assessment tool in research studies in IBD.6, 7, 90 The PSQI contains 19 items (each scored from 0 to 3, summed score range 0 to 21), with higher scores indicating more sleep dysfunction (in terms of sleep duration, disturbance, latency, efficiency, and quality; medication use; and daytime sleepiness; Table 2).89 Sample items from the PSQI are as follows:

During the past month, how often have you taken medicine (prescribed or over the counter) to help you sleep?

If you have a roommate or bed partner, ask him/her how often in the past month you have had loud snoring.

The response scale is as follows: 0 = not during the past month, 1 = less than once a week, 2 = once or twice a week, and 3 = three or more times a week.

An alternative validated PRO instrument for sleep quality that has been used in patients with IBD5, 34, 88 is the Epworth Sleepiness Scale (8 items, each scored from 0 to 3, summed range 0 to 24), which measures daytime sleepiness (Table 2).91 However, the Epworth Sleepiness Scale is less correlated with psychological symptoms and sleep diary assessments as compared to the PSQI.92

DATA ANALYSIS

Assessment of Endpoints

In general, PRO measures are ordinal or continuous in nature and endpoints for determining treatment efficacy can be framed as (1) a continuous variable, a comparison of means to estimate the difference in treatment effects between groups, or (2) a dichotomous variable, a responder analysis of the proportion of patients within each arm who reach a predefined threshold that “constitutes clinically meaningful within-patient change” (Figure 1).24, 30, 61 A dichotomous responder analysis that compares the proportion of patients in each treatment/control arm who experience an improvement at least as large as the MCID is less susceptible to the effects of highly skewed outliers than a comparison of means.61, 62 The FDA does not recommend the analysis of endpoints from a percentage change from baseline unless “the target response is complete resolution of signs and symptoms” because of poor statistical measurement properties (eg, floor and ceiling effects, higher symptom burden for trial entry).30

An example of 1 study that analyzed endpoints as continuous and dichotomous variables is summarized as follows: A parallel-group multicenter RCT evaluated the impact of eight 1-hour weekly CBT sessions on an intervention group (n = 59) compared to a waitlist control group (n = 59) on anxiety and depressive symptoms.43 In an intention-to-treat analysis (with multiple imputation using chained equations for missing data), outcomes were assessed as a continuous variable for the change in HADS scores from baseline to follow-up at 3.5 months and were compared between the intervention group and the control group using analyses of covariance and reported as the difference and its 95% confidence interval. In a secondary analysis, the proportion of patients with HADS scores >10 (cutoff for moderate to severe anxiety/depressive symptoms76) was a dichotomous variable that was compared between the intervention and control groups at baseline and at follow-up, reported as effect size and its 95% confidence interval.

Time-to-event endpoints can also represent clinical outcomes of interests in interventional studies in patients with IBD, such as time to disease relapse/flare, escalation of IBD therapy, emergency department visits, hospitalizations, surgery, and death.33

Intention-to-Treat, per-Protocol, and Complete-Case Data Analysis

Intention-to-treat analysis, where baseline and outcomes data are analyzed for “all patients randomized according to assigned treatment arm, regardless of adherence,” is most often preferred to preserve the benefits of randomization and minimize bias.28, 93 Strict intention-to-treat analysis is often difficult to achieve in pragmatic clinical trials, primarily because of nonadherence to the trial protocol (eg, withdrawal from the study after enrollment/randomization, failure to attend psychotherapy sessions) and missing outcomes (eg, no-show at follow-up visits, nonresponse on assessment surveys).93 The imputation of missing data is commonly performed to conform to intention-to-treat analysis,93 and examples from 2 recently published RCTs of psychotherapeutic interventions in patients with IBD include a simple imputation method of replacing the missing final outcome variable with the last known value before the participant was lost to follow-up (“last observation carried forward”)44 and a more complex multiple imputation using chained equations (pooled estimates from multiple imputed data sets).43

Nonadherence to the assigned intervention may underestimate its potential benefit in the intention-to-treat analysis.93 The FDA accepts secondary analysis using per-protocol and complete-case protocol (nonrandomized observational comparisons) as sensitivity analysis to determine the impact of missing data,28 which can be common in RCTs for voluntary behavioral interventions for psychological comorbidities in IBD.34, 44, 60 Complete-case analysis includes the outcomes of all participants who have follow-up assessments, whereas per-protocol analysis includes only the participants who fulfill the trial protocol and outcome assessments.93

A recently published RCT illustrates the various data analysis methods discussed above. This RCT investigated whether 8 weekly acceptance and commitment therapy sessions reduced stress in patients with clinical-biochemical quiescent IBD as compared to control patients (who received treatment as usual).44 A total of 122 patients were enrolled and were randomized 1:1 to the intervention (n = 61) and control (n = 61) groups. After enrollment and randomization, 21 patients withdrew from the study and 22 failed to attend 1 of the follow-up visits. The primary analysis was complete-case intention-to-treat analysis that included 79 participants for whom data were available at weeks 0 (baseline), 8 (end of treatment), and 20 (follow-up). An exploratory analysis with 101 participants with partial or full follow-up data was performed using the last-observation-carried-forward to impute missing data for the 22 participants. A per-protocol analysis of psychological variables was also performed for 33 participants who attended at least 5 out of 8 or more acceptance and commitment therapy sessions.44

APPROACHES TO MINIMIZE BIAS

Randomization

Randomization, blinding or masking, and control groups can help discriminate between the effects of the intervention effects and the natural history of the disease while minimizing measurement or reporting bias.23 Types of control groups include placebo, active control, no treatment, and sham.23 The FDA favors the randomized allocation of study participants as the preferred means of ensuring the comparability of test groups and minimizing the possibility of selection bias in clinical trials.26 Randomization helps reduce bias associated with treatment selection, and although randomization cannot guarantee balance for all participant characteristics between treatment groups, there is an inference of balance for factors that may be unknown or unmeasured.43, 59 The stratification of participants for known, measurable confounding variables (eg, IBD disease activity36, 37, 45, 53) can decrease between-treatment group variations and increase the reliability of the results.32 However, overstratification can limit the feasibility of trials because a very large sample size would then be needed for the randomization of participants into each stratum.59

Blinding/Masking

Blinding (concealing of participant assignment) can prevent and reduce bias in the reporting or interpretation of psychological or physical responses by study participants, investigators, and/or assessors.23, 37 The blinding of study participants is particularly important for PROs because the knowledge of treatment assignment can affect participant behavior and responses, which has been referred to as the Hawthorne effect.51 Nonetheless, blinding can be challenging, impractical, or unethical in certain types of trials.23, 45, 46 In open-label studies (investigators and participants are not blinded to the treatment allocation), the FDA recommends “the use of pre-specified decision rules for aspects of study conduct, such as treatment assignment, patient management, safety reporting, and response variable ascertainment” to reduce the consequences of the lack of blinding.23 Although it is often not possible to blind participants to their allocated group, blinding investigators can reduce bias at the stage of randomization,44 clinical outcome assesments,39, 44, 49 and outcome adjudication.49

The National Institutes of Health published the Behavioral and Social Science Research (BSSR) Involving Humans clinical trials template in 2019, which can be used as a resource for researchers preparing behavioral-based interventions in humans (https://grants.nih.gov/policy/clinical-trials/protocol-template.htm).

Conclusions

In summary, RCTs with pharmacological and nonpharmacological interventions with primary endpoints assessed by PRO measurements are needed to effectively address neurobiological symptoms prevalent among individuals with IBD, particularly those with active disease. It is possible that a combination of conventional anti-inflammatory therapy for IBD and adjunctive pharmacotherapy/psychological treatments directed at the gut-brain axis can modulate neurobiological symptoms common in patients with IBD, including stress, anxiety, depression, fatigue, and sleep disturbances. The MCID of validated instruments can aid in the interpretation of statistical significance and clinical relevance. Trial designs with randomization, masking, and control groups can reduce bias and increase the reliability of the results.

Conflicts of interest: Dr. Sandborn reports research grants from AbbVie, Abivax, Arena Pharmaceuticals, Boehringer Ingelheim, Celgene, Genentech, Gilead Sciences, GlaxoSmithKline, Janssen, Lilly, Pfizer, Prometheus Biosciences, Seres Therapeutics, Shire, Takeda, and Theravance Biopharma; consulting fees from AbbVie, Abivax, Admirx, Alfasigma, Alimentiv (previously Robarts Clinical Trials, owned by Alimentiv Health Trust), Alivio Therapeutics, Allakos, Amgen, Applied Molecular Transport, Arena Pharmaceuticals, Bausch Health (Salix), Beigene, Bellatrix Pharmaceuticals, Boehringer Ingelheim, Boston Pharmaceuticals, Bristol Myers Squibb, Celgene, Celltrion, Cellularity, Cosmo Pharmaceuticals, Escalier Biosciences, Equillium, Forbion, Genentech/Roche, Gilead Sciences, Glenmark Pharmaceuticals, Gossamer Bio, Immunic (Vital Therapies), Index Pharmaceuticals, Intact Therapeutics, Janssen, Kyverna Therapeutics, Landos Biopharma, Lilly, Oppilan Pharma, Otsuka, Pandion Therapeutics, Pfizer, Progenity, Prometheus Biosciences, Prometheus Laboratories, Protagonists Therapeutics, Provention Bio, Reistone Biopharma, Seres Therapeutics, Shanghai Pharma Biotherapeutics, Shire, Shoreline Biosciences, Sublimity Therapeutics, Surrozen, Takeda, Theravance Biopharma, Thetis Pharmaceuticals, Tillotts Pharma, UCB, Vendata Biosciences, Ventyx Biosciences, Vimalan Biosciences, Vivelix Pharmaceuticals, Vivreon Biosciences, and Zealand Pharma; stock or stock options from Allakos, BeiGene, Gossamer Bio, Oppilan Pharma, Prometheus Biosciences, Prometheus Laboratories Progenity, Shoreline Biosciences, Ventyx Biosciences, Vimalan Biosciences, and Vivreon Biosciences; and is an employee at Shoreline Biosciences. Dr. Singh is supported by the National Institutes of Health/National Institute of Diabetes and Digestive and Kidney Diseases. Dr. Tse has no conflicts to disclose.

References

1.

Xavier
RJ
,
Podolsky
DK
.
Unravelling the pathogenesis of inflammatory bowel disease
.
Nature.
2007
;
448
:
427
434
.

2.

Targownik
LE
,
Sexton
KA
,
Bernstein
MT
, et al.
The relationship among perceived stress, symptoms, and inflammation in persons with inflammatory bowel disease
.
Am J Gastroenterol.
2015
;
110
:
1001
1012
.

3.

Thomann
AK
,
Griebe
M
,
Thomann
PA
, et al.
Intrinsic neural network dysfunction in quiescent Crohn’s disease
.
Sci Rep.
2017
;
7
:
11579
.

4.

Frolkis
AD
,
Vallerand
IA
,
Shaheen
AA
, et al.
Depression increases the risk of inflammatory bowel disease, which may be mitigated by the use of antidepressants in the treatment of depression
.
Gut.
2019
;
68
:
1606
1612
.

5.

Borren
NZ
,
van der Woude
CJ
,
Ananthakrishnan
AN
.
Fatigue in IBD: epidemiology, pathophysiology and management
.
Nat Rev Gastroenterol Hepatol.
2019
;
16
:
247
259
.

6.

Hashash
JG
,
Ramos-Rivers
C
,
Youk
A
, et al.
Quality of sleep and coexistent psychopathology have significant impact on fatigue burden in patients with inflammatory bowel disease
.
J Clin Gastroenterol.
2018
;
52
:
423
430
.

7.

Qazi
T
,
Farraye
FA
.
Sleep and inflammatory bowel disease: an important bi-directional relationship
.
Inflamm Bowel Dis.
2019
;
25
:
843
852
.

8.

Kredentser
MS
,
Graff
LA
,
Bernstein
CN
.
Psychological comorbidity and intervention in inflammatory bowel disease
.
J Clin Gastroenterol.
2021
;
55
:
30
35
.

9.

Gracie
DJ
,
Guthrie
EA
,
Hamlin
PJ
, et al.
Bi-directionality of brain-gut interactions in patients with inflammatory bowel disease
.
Gastroenterology.
2018
;
154
:
1635
1646.e3
.

10.

Sofia
MA
,
Lipowska
AM
,
Zmeter
N
, et al.
Poor sleep quality in Crohn’s disease is associated with disease activity and risk for hospitalization or surgery
.
Inflamm Bowel Dis.
2019
;
26
:
1251
1259
.

11.

Szigethy
E
,
Murphy
SM
,
Ehrlich
OG
, et al.
Mental health costs of inflammatory bowel diseases
.
Inflamm Bowel Dis.
2021
;
27
:
40
48
.

12.

Abautret-Daly
Á
,
Dempsey
E
,
Parra-Blanco
A
, et al.
Gut-brain actions underlying comorbid anxiety and depression associated with inflammatory bowel disease
.
Acta Neuropsychiatr.
2018
;
30
:
275
296
.

13.

Varghese
AK
,
Verdú
EF
,
Bercik
P
, et al.
Antidepressants attenuate increased susceptibility to colitis in a murine model of depression
.
Gastroenterology.
2006
;
130
:
1743
1753
.

14.

Amara
J
,
Saliba
Y
,
Hajal
J
, et al.
Circadian rhythm disruption aggravates DSS-induced colitis in mice with fecal calprotectin as a marker of colitis severity
.
Dig Dis Sci.
2019
;
64
:
3122
3133
.

15.

Gracie
DJ
,
Hamlin
PJ
,
Ford
AC
.
The influence of the brain-gut axis in inflammatory bowel disease and possible implications for treatment
.
Lancet Gastroenterol Hepatol.
2019
;
4
:
632
642
.

16.

Peppas
S
,
Pansieri
C
,
Piovani
D
, et al.
The brain-gut axis: psychological functioning and inflammatory bowel diseases
.
J Clin Med.
2021
;
10:377
.

17.

Thomann
AK
,
Mak
JWY
,
Zhang
JW
, et al.
Review article: bugs, inflammation and mood—a microbiota-based approach to psychiatric symptoms in inflammatory bowel diseases
.
Aliment Pharmacol Ther.
2020
;
52
:
247
266
.

18.

Bercik
P
,
Verdu
EF
,
Foster
JA
, et al.
Chronic gastrointestinal inflammation induces anxiety-like behavior and alters central nervous system biochemistry in mice
.
Gastroenterology.
2010
;
139
:
2102
2112.e1
.

19.

Weintraub
Y
,
Cohen
S
,
Chapnik
N
, et al.
Clock gene disruption is an initial manifestation of inflammatory bowel diseases
.
Clin Gastroenterol Hepatol.
2020
;
18
:
115
122.e1
.

20.

Turner
D
,
Ricciuto
A
,
Lewis
A
, et al. ;
International Organization for the Study of IBD
.
STRIDE-II: an update on the Selecting Therapeutic Targets in Inflammatory Bowel Disease (STRIDE) Initiative of the International Organization for the Study of IBD (IOIBD): determining therapeutic goals for treat-to-target strategies in IBD
.
Gastroenterology.
2021
;
160
:
1570
1583
.

21.

Craven
MR
,
Quinton
S
,
Taft
TH
.
Inflammatory bowel disease patient experiences with psychotherapy in the community
.
J Clin Psychol Med Settings.
2019
;
26
:
183
193
.

22.

U.S. Food & Drug Administration.
CFR—
Code of Federal Regulations Title 21
. Published April 1, 2020. Accessed May 28,
2021
. https://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfcfr/CFRSearch.cfm?fr=314.126.

23.

U.S. Food & Drug Administration.
E8(R1) General considerations for clinical studies
. Published August
2019
. Accessed May 28, 2021. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/e8r1-general-considerations-clinical-studies.

24.

Calvert
M
,
Kyte
D
,
Mercieca-Bebber
R
, et al. ;
the SPIRIT-PRO Group
.
Guidelines for inclusion of patient-reported outcomes in clinical trial protocols: the SPIRIT-PRO extension
.
JAMA.
2018
;
319
:
483
494
.

25.

U.S. Food & Drug Administration.
BEST (Biomarkers, EndpointS, and other Tools) Resource.
Silver Spring, MD, and
Bethesda, MD
: U.
S. Food & Drug Administration and National Institutes of Health
;
2016
.

26.

European
Medicines Agency.
ICH E8 General considerations for clinical trials
.
1998
. Accessed May 28, 2021. https://www.ema.europa.eu/en/ich-e8-general-considerations-clinical-studies.

28.

U.S. Food & Drug Administration.
Multiple endpoints in clinical trials guidance for industry.
Published January
2017
. Accessed May 28, 2021. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/multiple-endpoints-clinical-trials-guidance-industry.

29.

U.S. Food & Drug Administration.
Patient-focused drug development guidance:
methods to identify what is important to patients and select, develop or modify fit-for-purpose clinical outcomes assessments
.
Published October
2018
. Accessed May 28, 2021. https://www.fda.gov/drugs/news-events-human-drugs/patient-focused-drug-development-guidance-methods-identify-what-important-patients-and-select.

30.

U.S. Food & Drug Administration.
Public workshop on patient-focused drug development: guidance 4—
incorporating clinical outcome assessments into endpoints for regulatory decision making
.
Published December
2019
. Accessed May 28, 2021. https://www.fda.gov/drugs/development-approval-process-drugs/public-workshop-patient-focused-drug-development-guidance-4-incorporating-clinical-outcome.

31.

U.S.
Food & Drug Administration.
Principles for selecting, developing, modifying, and adapting patient-reported outcome instruments for use in medical device evaluation
. Published August
2020
. Accessed May 28, 2021. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/principles-selecting-developing-modifying-and-adapting-patient-reported-outcome-instruments-use.

32.

U.S. Food & Drug Administration.
Patient-focused drug development: collecting comprehensive and representative input
. Published June
2018
. Accessed May 28, 2021. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/patient-focused-drug-development-collecting-comprehensive-and-representative-input.

33.

Mikocka-Walus
A
,
Prady
SL
,
Pollok
J
, et al.
Adjuvant therapy with antidepressants for the management of inflammatory bowel disease
.
Cochrane Database Syst Rev.
2019
;
4:CD012680
.

34.

Hashash
JG
,
Knisely
MR
,
Germain
A
, et al.
Brief behavioral therapy and bupropion for sleep and fatigue in young adults with Crohn’s disease: an exploratory open trial study
.
Clin Gastroenterol Hepatol.
Published online September 30,
2020
. doi: 10.1016/j.cgh.2020.09.047.

35.

Panaccione
R
,
Colombel
J-F
,
Bossuyt
P
, et al.
Tight control with adalimumab-based treatment is associated with improved quality of life outcomes in patients with moderate to severely active Crohn’s disease: data from CALM
.
J Crohn’s Colitis.
2018
;
12
:
S078
S079
.

36.

Gracie
DJ
,
Irvine
AJ
,
Sood
R
, et al.
Effect of psychological therapy on disease activity, psychological comorbidity, and quality of life in inflammatory bowel disease: a systematic review and meta-analysis
.
Lancet Gastroenterol Hepatol.
2017
;
2
:
189
199
.

37.

McCombie
AM
,
Mulder
RT
,
Gearry
RB
.
Psychotherapy for inflammatory bowel disease: a review and update
.
J Crohns Colitis.
2013
;
7
:
935
949
.

38.

O’Connor
A
,
Ratnakumaran
R
,
Warren
L
, et al.
Randomized controlled trial: a pilot study of a psychoeducational intervention for fatigue in patients with quiescent inflammatory bowel disease
.
Ther Adv Chronic Dis.
2019
;
10
:
2040622319838439
.

39.

Goren
G
,
Schwartz
D
,
Friger
M
, et al.
Randomized controlled trial of cognitive-behavioral and mindfulness-based stress reduction on the quality of life of patients with Crohn disease
.
Inflamm Bowel Dis.
2022; 28:393–408.

40.

Farrell
D
,
Artom
M
,
Czuber-Dochan
W
, et al.
Interventions for fatigue in inflammatory bowel disease
.
Cochrane Database Syst Rev.
2020
;
4
:
CD012005
.

41.

Horta
D
,
Lira
A
,
Sanchez-Lloansi
M
, et al.
A prospective pilot randomized study: electroacupuncture vs. sham procedure for the treatment of fatigue in patients with quiescent inflammatory bowel disease
.
Inflamm Bowel Dis.
2020
;
26
:
484
492
.

42.

Davis
SP
,
Bolin
LP
,
Crane
PB
, et al.
Non-pharmacological interventions for anxiety and depression in adults with inflammatory bowel disease: a systematic review and meta-analysis
.
Front Psychol.
2020
;
11
:
538741
.

43.

Bennebroek Evertsz’
F
,
Sprangers
MAG
,
Sitnikova
K
, et al.
Effectiveness of cognitive-behavioral therapy on quality of life, anxiety, and depressive symptoms among patients with inflammatory bowel disease: a multicenter randomized controlled trial
.
J Consult Clin Psychol.
2017
;
85
:
918
925
.

44.

Wynne
B
,
McHugh
L
,
Gao
W
, et al.
Acceptance and commitment therapy reduces psychological stress in patients with inflammatory bowel diseases
.
Gastroenterology.
2019
;
156
:
935
945.e1
.

45.

Council for International Organizations of Medical Sciences.
International Ethical Guidelines for Health-Related Research Involving Humans.
Geneva, Switzerland: Council for International Organizations of Medical Sciences;
2016
.

46.

Brody
T
.
Placebo arm as part of clinical trial design
. In:
Brody
T
, ed.
Clinical Trials, 2nd ed
.
Boston, MA
:
Academic Press
;
2016
:
155
172
.

47.

Cunningham
JA
,
Kypri
K
,
McCambridge
J
.
Exploratory randomized controlled trial evaluating the impact of a waiting list control design
.
BMC Med Res Methodol.
2013
;
13
:
150
.

48.

Aycock
DM
,
Hayat
MJ
,
Helvig
A
, et al.
Essential considerations in developing attention control groups in behavioral research
.
Res Nurs Health.
2018
;
41
:
320
328
.

49.

Jedel
S
,
Hoffman
A
,
Merriman
P
, et al.
A randomized controlled trial of mindfulness-based stress reduction to prevent flare-up in patients with inactive ulcerative colitis
.
Digestion.
2014
;
89
:
142
155
.

50.

Campbell
G.
FDA guidance: design considerations for pivotal clinical investigations for medical devices
.
FDA Stakeholder Webinar.
December 4
,
2013
. Accessed May 28, 2021. https://www.fda.gov/files/medical%20devices/published/FDA-Guidance--Design-Considerations-for-Pivotal-Clinical-Investigations-for-Medical-Devices.pdf.

51.

McCambridge
J
,
Witton
J
,
Elbourne
DR
.
Systematic review of the Hawthorne effect: new concepts are needed to study research participation effects
.
J Clin Epidemiol.
2014
;
67
:
267
277
.

52.

Collins
SM
.
Interrogating the gut-brain axis in the context of inflammatory bowel disease: a translational approach
.
Inflamm Bowel Dis.
2020
;
26
:
493
501
.

53.

Mikocka-Walus
A
,
Knowles
SR
,
Keefer
L
, et al.
Controversies revisited: a systematic review of the comorbidity of depression and anxiety with inflammatory bowel diseases
.
Inflamm Bowel Dis.
2016
;
22
:
752
762
.

54.

Gracie
DJ
,
Williams
CJ
,
Sood
R
, et al.
Poor correlation between clinical disease activity and mucosal inflammation, and the role of psychological comorbidity, in inflammatory bowel disease
.
Am J Gastroenterol.
2016
;
111
:
541
551
.

55.

Graff
LA
,
Walker
JR
,
Lix
L
, et al.
The relationship of inflammatory bowel disease type and activity to psychological functioning and quality of life
.
Clin Gastroenterol Hepatol.
2006
;
4
:
1491
1501
.

56.

Czuber-Dochan
W
,
Norton
C
,
Bassett
P
, et al.
Development and psychometric testing of inflammatory bowel disease fatigue (IBD-F) patient self-assessment scale
.
J Crohns Colitis.
2014
;
8
:
1398
1406
.

57.

Artom
M
,
Czuber-Dochan
W
,
Sturt
J
, et al.
Cognitive-behavioural therapy for the management of inflammatory bowel disease-fatigue: a feasibility randomised controlled trial
.
Pilot Feasibility Stud.
2019
;
5
:
145
.

58.

Boye
B
,
Lundin
KE
,
Jantschek
G
, et al.
INSPIRE study: does stress management improve the course of inflammatory bowel disease and disease-specific quality of life in distressed patients with ulcerative colitis or Crohn’s disease? A randomized controlled trial
.
Inflamm Bowel Dis.
2011
;
17
:
1863
1873
.

59.

Evans
SR
.
Fundamentals of clinical trial design
.
J Exp Stroke Transl Med.
2010
;
3
:
19
27
.

60.

Mikocka-Walus
A
,
Pittet
V
,
Rossel
JB
, et al. ;
Swiss IBD Cohort Study Group
.
Symptoms of depression and anxiety are independently associated with clinical recurrence of inflammatory bowel disease
.
Clin Gastroenterol Hepatol.
2016
;
14
:
829
835.e1
.

61.

McGlothlin
AE
,
Lewis
RJ
.
Minimal clinically important difference: defining what really matters to patients
.
JAMA.
2014
;
312
:
1342
1343
.

62.

Jaeschke
R
,
Singer
J
,
Guyatt
GH
.
Measurement of health status. Ascertaining the minimal clinically important difference
.
Control Clin Trials.
1989
;
10
:
407
415
.

63.

Mouelhi
Y
,
Jouve
E
,
Castelli
C
, et al.
How is the minimal clinically important difference established in health-related quality of life instruments? Review of anchors and methods
.
Health Qual Life Outcomes.
2020
;
18
:
136
.

64.

Canadian Agency for Drugs and Technologies in Health.
Entyvio (vedolizumab). Appendix 5: validity of outcome measures
.
Published December
2016
. Accessed May 28, 2021. https://www.ncbi.nlm.nih.gov/books/NBK424360/.

65.

Cella
D
,
Yount
S
,
Rothrock
N
, et al. ;
PROMIS Cooperative Group
.
The Patient-Reported Outcomes Measurement Information System (PROMIS): progress of an NIH Roadmap cooperative group during its first two years
.
Med Care.
2007
;
45
:
S3
S11
.

67.

Shi
Q
,
Mendoza
TR
,
Cleeland
CS
.
Interpreting patient-reported outcome scores for clinical research and practice: definition, determination, and application of cutpoints
.
Med Care.
2019
;
57
:
S8
S12
.

68.

Long
MD
,
Martin
CF
,
Chen
W
, et al.
Changes in patient-reported outcomes with vedolizumab therapy in patients with inflammatory bowel diseases (IBD): results from the IBD Partners patient powered research network
.
Crohn’s Colitis 360.
2019
;
1:otz020
.

69.

Kappelman
MD
,
Long
MD
,
Martin
C
, et al.
Evaluation of the patient-reported outcomes measurement information system in a large cohort of patients with inflammatory bowel diseases
.
Clin Gastroenterol Hepatol.
2014
;
12
:
1315
–13
23.e2
.

70.

Yu
L
,
Buysse
DJ
,
Germain
A
, et al.
Development of short forms from the PROMIS™ sleep disturbance and sleep-related impairment item banks
.
Behav Sleep Med.
2011
;
10
:
6
24
.

71.

Slagle
AF
.
A Regulatory Perspective on Disease-Specific Outcome Measures: How does PROMIS Fit?
In:
Office of New Drugs Study Endpoints Center for Drug Evaluation and Research
ed.
International Society for Pharmacoeconomics and Outcomes Research (ISPOR): U.S. Food and Drug Administration
,
2015
. https://www.ispor.org/docs/default-source/presentations/332.pdf?sfvrsn=fbe4e09b_1.

72.

Cohen
S
,
Perceived stress in a probability sample of the United States. In: Spacapan S, Oskamp S, eds
.
The social psychology of health. Sage Publications, Inc.;
1988
:
31
67
.

73.

Cohen
S
,
Kamarck
T
,
Mermelstein
R
.
A global measure of perceived stress
.
J Health Soc Behav.
1983
;
24
:
385
396
.

74.

Ananthakrishnan
AN
,
Khalili
H
,
Pan
A
, et al.
Association between depressive symptoms and incidence of Crohn’s disease and ulcerative colitis: results from the Nurses’ Health Study
.
Clin Gastroenterol Hepatol.
2013
;
11
:
57
62
.

75.

Marrie
RA
,
Walld
R
,
Bolton
JM
, et al.
Rising incidence of psychiatric disorders before diagnosis of immune-mediated inflammatory disease
.
Epidemiol Psychiatr Sci.
2019
;
28
:
333
342
.

76.

Zigmond
AS
,
Snaith
RP
.
The Hospital Anxiety and Depression Scale
.
Acta Psychiatr Scand.
1983
;
67
:
361
370
.

77.

Bernstein
CN
,
Zhang
L
,
Lix
LM
, et al.
The validity and reliability of screening measures for depression and anxiety disorders in inflammatory bowel disease
.
Inflamm Bowel Dis.
2018
;
24
:
1867
1875
.

78.

Kroenke
K
,
Spitzer
RL
,
Williams
JB
.
The PHQ-9: validity of a brief depression severity measure
.
J Gen Intern Med.
2001
;
16
:
606
613
.

79.

Spitzer
RL
,
Kroenke
K
,
Williams
JB
, et al.
A brief measure for assessing generalized anxiety disorder: the GAD-7
.
Arch Intern Med.
2006
;
166
:
1092
1097
.

80.

Spielberger
CD.
State-Trait Anxiety Inventory: A Comprehensive Bibliography.
Palo Alto, CA: Consulting Psychologists Press
;
1989
:102. https://books.google.com/books/about/State_trait_Anxiety_Inventory.html?id=I8o8GwAACAAJ.

81.

Beck
AT
,
Steer
RA
,
Brown
G.
Manual for the Beck Depression Inventory–II.
San Antonio, TX: Psychological Corporation
;
1996
.

82.

Tinsley
A
,
Macklin
EA
,
Korzenik
JR
, et al.
Validation of the Functional Assessment of Chronic Illness Therapy-Fatigue (FACIT-F) in patients with inflammatory bowel disease
.
Aliment Pharmacol Ther.
2011
;
34
:
1328
1336
.

83.

Webster
K
,
Cella
D
,
Yost
K
.
The Functional Assessment of Chronic Illness Therapy (FACIT) measurement system: properties, applications, and interpretation
.
Health Qual Life Outcomes.
2003
;
1
:
79
.

84.

Smets
EM
,
Garssen
B
,
Bonke
B
, et al.
The Multidimensional Fatigue Inventory (MFI): psychometric qualities of an instrument to assess fatigue
.
J Psychosom Res.
1995
;
39
:
315
325
.

85.

Ali
T
,
Madhoun
MF
,
Orr
WC
, et al.
Assessment of the relationship between quality of sleep and disease activity in inflammatory bowel disease patients
.
Inflamm Bowel Dis.
2013
;
19
:
2440
2443
.

86.

Ananthakrishnan
AN
,
Long
MD
,
Martin
CF
, et al.
Sleep disturbance and risk of active disease in patients with Crohn’s disease and ulcerative colitis
.
Clin Gastroenterol Hepatol.
2013
;
11
:
965
971
.

87.

Paixão
DL
,
Poyares
D
,
de Paula
MS
, et al.
Evaluation of home polysomnography findings, quality of sleep, and fatigue in inflammatory bowel disease: a case series
.
J Clin Sleep Med.
2019
;
15
:
39
45
.

88.

Iskandar
HN
,
Linan
EE
,
Patel
A
, et al.
Self-reported sleep disturbance in Crohn’s disease is not confirmed by objective sleep measures
.
Sci Rep.
2020
;
10
:
1980
.

89.

Buysse
DJ
,
Reynolds
CF
3
,
Monk
TH
, et al.
The Pittsburgh Sleep Quality Index: a new instrument for psychiatric practice and research
.
Psychiatry Res.
1989
;
28
:
193
213
.

90.

Kinnucan
JA
,
Rubin
DT
,
Ali
T
.
Sleep and inflammatory bowel disease: exploring the relationship between sleep disturbances and inflammation
.
Gastroenterol Hepatol (N Y).
2013
;
9
:
718
727
.

91.

Johns
MW
.
A new method for measuring daytime sleepiness: the Epworth Sleepiness Scale
.
Sleep.
1991
;
14
:
540
545
.

92.

Buysse
DJ
,
Hall
ML
,
Strollo
PJ
, et al.
Relationships between the Pittsburgh Sleep Quality Index (PSQI), Epworth Sleepiness Scale (ESS), and clinical/polysomnographic measures in a community sample
.
J Clin Sleep Med.
2008
;
4
:
563
571
.

93.

Moher
D
,
Hopewell
S
,
Schulz
KF
, et al.
CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials
.
Bmj.
2010
;
340
:
c869
.

94.

Neuendorf
R
,
Harding
A
,
Stello
N
, et al.
Depression and anxiety in patients with inflammatory bowel disease: a systematic review
.
J Psychosom Res.
2016
;
87
:
70
80
.

95.

Moradkhani
A
,
Beckman
LJ
,
Tabibian
JH
.
Health-related quality of life in inflammatory bowel disease: psychosocial, clinical, socioeconomic, and demographic predictors
.
J Crohns Colitis.
2013
;
7
:
467
473
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/pages/standard-publication-reuse-rights)