Assessing Histological Inflammatory Activity in Patients With Ulcerative Colitis: A Diagnostic Accuracy Study Testing Fecal Biomarkers Lactoferrin and Calprotectin

Abstract Background and Aims Histological remission has arisen as the optimal treatment outcome in ulcerative colitis (UC). The aim of this retrospective study was to explore the diagnostic performance of the noninvasive fecal biomarkers calprotectin (FC) and lactoferrin (FL) compared to the histological indices Nancy Index (NI) and Riley Index (RI). Methods This study is a retrospective diagnostic accuracy study based on secondary analysis of patient data from 2002 to 2017 extracted from medical registries of our clinics in Essen-Mitte, Germany. Patients with UC underwent a colonoscopy, with biopsies taken from the rectum and the sigmoid scored by 2 experienced pathologists according to NI and RI and provided a stool sample within 7 days pre- or post-colonoscopy. Diagnostic accuracy of recommended cutoffs for FC (>50 μg/g) and FL (≥7.25 μg/g) were tested against our reference standard (NI ≥2) in terms of specificity, sensitivity, positive predictive value, negative predictive value, and accuracy (effectiveness). Results The number of patients with UC recruited was n = 226, aged 45.2 (SD 13.3). Histological indices were highly correlated (r = 0.980, P < 0.001). Fecal biomarkers correlated moderately with NI (FC: r = 0.383, P < 0.001; FL: r = 0.420, P < 0.001) and RI (FC: r = 0.395, P < 0.001; FL: r = 0.424, P < 0.001). Fecal biomarker concentrations were increased in patients with active histological disease (NI ≥2), median [IQR], FC 69.72 [20.07–254.38], FL 18.59 [6.06–44.42], compared to those with inactive disease (NI ≤1), FC 12.35 [3.89 – 32.16], FL 3.14 [0.75–11.05], z = −6.60, P < 0.001. Fecal biomarker concentrations differed significantly across NI grades 0–4 (FC: H4 = 45.2; FL: H4 = 47.5, both P < 0.001). Patients with grade 0 had significantly lower concentrations of fecal biomarkers than those with grade 3 (median; FC 10.94 vs 72.22; FL 2.30 vs 29.10; both P < 0.001) or grade 4 (FC 10.94 vs 67.00; FL 2.30 vs 27.64; both P < 0.001), as well as grade 2 for FC only (10.94 vs 56.22, P = 0.001). Concentrations were also lower in patients with grade 1 compared to those with grade 3 (FC 17.49 vs 72.22; FL 4.24 vs. 29.10; both P ≤ 0.001) or grade 4 (FC 17.49 vs 67.00; FL 4.24 vs 27.64; both P < 0.001). Receiver operating characteristics area under the curve showed moderate diagnostic accuracy for both FC 0.76 (95% confidence interval [CI] 0.70–0.83) and FL 0.73 (95% CI 0.66–0.80). Optimized cutoffs for both FC (≥34.29) and FL (≥5.85 μg/g) had slightly improved accuracy, compared with the manufacturer’s cutoffs (FC: 69.9% vs 65.9%; FL: 71.7% vs 69.0%). Conclusions Fecal biomarkers calprotectin and lactoferrin correlate with histological disease activity and differentiate between patients in histological remission from those with evidence of moderate to severe disease activity. Their noninvasiveness, in addition to being inexpensive, supports their use in the clinical monitoring of patients with UC.


INTRODUCTION
Inflammatory bowel disease (IBD) refers to chronic diseases characterized by idiopathic, recurrent, and remitting inflammation of the gastrointestinal tract. The two diagnosable forms, ulcerative colitis (UC) and Crohn's disease (CD), affect more than 0.3% of populations in westernized countries. 1 In UC, inflammation is confined to the colon, leading to characterizing symptoms of abdominal pain, diarrhea, and rectal bleeding. Healing of the intestinal mucosa has been associated with long-term remission and minimizing the need for colectomy. 2 Mucosal healing may be directly assessed visually via endoscopy and hence endoscopy remains the gold standard for routine monitoring of UC. 3 More recently, however, histological remission has arisen as a superior treatment target in UC, given its greater performance in predicting future relapse or sustained remission in patients with UC, compared with endoscopy. [4][5][6][7] Thus, it has been recommended that endoscopic assessment be complemented by a validated histological index for clinical monitoring of patients with UC. 8 Histological scoring systems quantify the degree of inflammation, integrity of the epithelium, and alterations to the architecture of colonic crypts observed in tissue samples. 9 While a number of scoring systems have been developed, only the Nancy Index (NI) has been validated for use in clinical practice and in clinical trials with patients with UC, 10 although the Riley Index (RI) is more commonly applied in clinical practice. 11 A significant limitation to the assessment of histological inflammation, however, is the need to collect tissue samples (biopsies) via colonoscopy. This invasive procedure is expensive and burdensome to patients. [12][13][14] Valid, reliable, and lessinvasive diagnostic methods are therefore crucial for routine assessment of disease status in patients with UC.
Fecal biomarkers provide a promising alternative for this purpose given their relative ease of collection and analysis, lower cost, and greater acceptance by patients, in comparison to colonoscopy. 15 In our previous work, we showed 3 neutrophil-derived proteins (fecal calprotectin [FC], fecal lactoferrin [FL], and PMN elastase) were able to discern between active and remitting UC 16 and that elevated levels are predictive of a future flare in patients with UC. 17 FC, a calcium and zinc-binding protein found within the cytosol of innate immune cells (eg, neutrophils), correlates strongly with endoscopic disease activity 17,18 and has the utility of being able to detect early relapse (ie, prior to clinical symptoms) in patients with quiescent UC. 17,19,20 Furthermore, 3 histological scoring systems the Geboes Score, NI, and Robarts Histopathology Index are predictive of FC levels in patients with UC. 21 FL, an iron-binding glycoprotein found in polymorphonuclear neutrophils, has also been shown to be increased in active UC. [22][23][24] As an inflammatory response is initiated prior to the onset of clinical relapse, this offers a window of opportunity to predict and prevent potential flares in patients with UC by detecting the presence of these biomarkers in the feces. 25 Numerous studies support this, demonstrating an increase in both FC and FL in severe (or relapsing) UC. [26][27][28][29] Furthermore, FC and FL both remain stable for up to 7 days when held at 4°C, further favoring their use for routine monitoring of disease activity in UC. 30,31 A limitation, however, to the use of fecal biomarkers in diagnosing histological disease activity in UC is the lack of defined cutoff values for grading the presence of histological disease activity or severity in practice. The aim of this retrospective diagnostic accuracy study was therefore to (1) examine associations between fecal biomarkers (FC and FL) and histological indices (NI and RI) and (2) evaluate the diagnostic performance of fecal biomarkers against NI as the reference standard for grading the degree of histological inflammation in UC.

METHODS
The reporting of this study is in accordance with the 2015 Standards for Reporting of Diagnostic Accuracy Studies guidelines. 32,33 Ethical approval was granted by the ethical committee of the Faculty of Medicine, University Duisburg-Essen (No. 18-8268-BO). This study was conducted within the Clinic for Naturopathy and Integrative Medicine, Clinics of Essen-Mitte, Germany, in affiliation with the Faculty of Medicine, University Duisburg-Essen.

Study Design
This was a retrospective diagnostic accuracy study based on the secondary analysis of clinical patient data extracted from medical registries between 2002 and 2017.

Study Procedure
All inpatients and/or outpatients presenting to the Department of Internal and Integrative Medicine, Clinics of Essen-Mitte, Germany for UC treatment between 2002 and 2017 were invited to participate. Two research coordinators (A.O. and Dorit Schroeder) approached patients who were provided information about our department's research. Interested patients gave their written consent to the use of their medical data for use in prospective research at the clinics in Essen-Mitte and were scheduled for a colonoscopy and stool sample collection. For this retrospective analysis, the physical and/ or electronic medical records (iMedOne, Deutsche Telekom Assessing Histological InflammatoryA ctivity in PatientsW ith Ulcerative Colitis AG; Bonn, Germany) of consenting patients were reviewed by A.O. and Dorit Schroeder to identify patients meeting eligibility criteria for inclusion. Data extracted from eligible patients included age, gender, smoking behavior, UC diagnosis (ie, ulcerative proctitis, left-sided colitis, or extensive colitis), current medications, fecal biomarker concentrations (FC and FL), and data for both histological indices (NI and RI).

Eligibility Criteria
We included patients aged 18-75 years, with a prior diagnosis of UC verified by endoscopy, histopathology, and the presence of clinical symptoms, eg, rectal bleeding or diarrhea, who had undergone a colonoscopy with 6 biopsies taken (3 each taken from the sigma and rectum) with a stool sample harvested and frozen at −20°C within the 7 days before, or after, colonoscopy to facilitate the procedure for the patients, and who had results for both fecal biomarkers (FC and FL) available. We excluded patients with preexisting or newly diagnosed mental and/or physical illness(es), alcohol or drug abuse, complete colectomy, tumor diagnosis in the past 5 years, as well as pregnant or breast-feeding women, and those following a special diet.

Index Test: Fecal Biomarkers
As described in our previous papers, 16,17 and briefly reiterated here, fecal samples were collected in the morning of the appointment in the study center by patients using a disposable plastic bucket-type device which avoids contamination with the toilet water artifact and simplifies laboratory sampling. Samples were then frozen at −30°C and sent within 24 hours to our laboratory (Labor L+S AG, Bad Bocklet-Großenbrach, Germany) for immediate analysis. Fecal samples were tested for total concentrations of FC and FL using an enzyme-linked immunosorbent assay (ELISA) according to the respective manufacturers' instructions (IDK Calprotectin ELISA from Immundiagnostik, Bensheim, Germany and LACTOFERRIN SCAN kit from TechLab, Blacksburg, VA). Results were calculated from a standard curve and reported in micrograms per gram. The following manufacturer's recommended cutoffs were applied to define a positive result for FC (>50 μg/g) and FL (≥7.25 μg/g).

Reference Standard: Histological Indices
Biopsies collected during colonoscopy were evaluated by two experienced pathologists who, blinded to the results of the fecal biomarkers, independently scored either the NI 10 or the RI. 11 In the case of divergent results, both pathologists discussed and established a final result. The RI grades the severity of histological disease activity based on 6 criteria: (1) acute inflammatory cell infiltrate, (2) crypt abscesses, (3) mucin depletion, (4) surface epithelial integrity, (5) chronic inflammatory cell infiltrate, and (6) crypt architectural irregularities. 11 Each feature is graded as either none, mild, moderate, or severe to yield an overall score ranging from 0 to 18. The NI also grades the severity of histological disease activity, but on only 3 criteria: ulceration, acute inflammatory cells infiltrate (ie, the presence of neutrophils in either the lamina propria or epithelium), and chronic inflammatory infiltrate. 34 Each criterion is sequentially assessed to yield an overall grade (grade 0: absence of significant histological disease activity; grade 1: chronic inflammatory infiltrate with no acute inflammatory infiltrate; grade 2: mildly active disease; grade 3: moderately active disease; and grade 4: severely active disease). Grades 2-4 are consistent with histologically active disease and predictive of future relapse in patients with UC. 35

Data Analyses
Data were exported to SPSS Statistics for Windows, version 26.0 (IBM Corp., Armonk, NY) for analysis. Statistical significance was set at 0.05. Descriptive statistics were computed to describe participating patients in terms of age, gender, smoking behavior, diagnosis (proctitis, left-sided colitis, or extensive colitis), and medication use. Fecal biomarkers were compared to histological indices using nonparametric Spearman's rank correlations, as data for fecal biomarkers were highly skewed. We then compared fecal biomarkers concentrations between patients with evident histological disease activity (ie, NI ≥2) and those in histological remission (NI ≤1) using the nonparametric Wilcoxon-Mann-Whitney test, and across each grade of the NI (grades 0-4) using the nonparametric Kruskal-Wallis test. We explored the performance of our index tests (ie, fecal biomarkers FC and FL) in diagnosing our reference standard for histological disease activity (ie, NI ≥2), using the manufacturer's recommended cutoffs (ie, FC >50 μg/g, FL ≥7.25 μg/g) by creating two-by-two tables and calculating measures of specificity, sensitivity, positive predictive value (PPV), negative predictive value (NPV), and accuracy (effectiveness). Corresponding confidence intervals (CIs) were calculated using the Clopper-Pearson method for specificity, sensitivity, and accuracy, and logit method for predictive values (PPV and NPV). 36 Given the breadth of cutoffs used for FC in the literature, we also planned to test FC at concentrations more than 100 μg/g and more than 150 μg/g, although due to small cell sizes only results for more than 100 μg/g were reported. Receiver operating characteristics (ROC) curves were then computed for both FC and FL to define the area under the curve (AUC) with corresponding 95% CI, as well as optimized cutoffs based on Youden's Index ([Sensitivity + Specificity] -1), which yields the highest attainable sensitivity and sensitivity for a given cutoff. Diagnostic performance measures for these optimized cutoffs were also reported (specificity, sensitivity, PPV, NPV, and accuracy) for purposes of comparison to the manufacturer's recommended cutoffs.

Participants
Complete data were available for n = 226 patients. Table 1 presents the key characteristics of participants who were aged on average 45.2 (SD 13.3) years and mostly female (60.2%), nonsmokers (90.3%).
The diagnostic performance of FC and FL is illustrated in Table 4. The higher specificity and PPV for FC, compared with FL, arose from the relatively low number of false-positive diagnoses. In comparison, FL had a higher sensitivity, compared with FC, due to the relatively lower number of falsenegative diagnoses (Table 4). A cutoff of more than 100 μg/g for FC showed improved specificity (92.8%) and PPV (90.6%),  Another option for data illustration here. *P < 0.001.

Assessing Histological InflammatoryA ctivity in PatientsW ith Ulcerative Colitis
at the expense of a greater number of false negatives-reflected in the reduced sensitivity of 40.6%. Compared with the manufacturer's recommended cutoffs, all parameters, except specificity, improved when using the optimized cutoffs for FC (≥34.29 μg/g) and FL (≥5.85 μg/g) ( Table 5). Figure 2 illustrates the ROC curves for FC and FL, respectively, and AUC of 0.76 (95% CI 0.70-0.83) and 0.73 (95% CI 0.66-0.80), with 0.7-0.8 considered "good" in terms of diagnostic accuracy. 37

DISCUSSION
In this study, we first aimed to examine the relationship between fecal biomarkers FC and FL and histopathological indices NI and RI. While histological indices were perfectly correlated with one another, ranked correlations between fecal biomarkers and histological indices were moderate, albeit statistically significant.
Second, our study presents novel evidence for differentiation of FC and FL concentrations across grades 0-4 of the NI. We found that both FC and FL were able to differentiate between patients graded according to the NI with histological remission (grades 0-1) from those with evidence of moderate to severe histological disease activity (grades 3-4). FC and FL concentrations were notably higher in patients with grade 3 and grade 4 disease activity. However, concentrations were comparable between grade 3 and grade 4, suggesting that higher concentrations of both FC and FL are reflective of moderate to severe disease, rather than mild disease activity (ie, grade 2). The marked increase in both FC and FL in patients with grade 3 disease activity was unsurprising as grade 3 is designated where multiple clusters of neutrophils in the lamina propria and/or epithelium are present. 34 As these fecal markers are released from neutrophils as part of our innate inflammatory response, these findings support the clinical appropriateness of using these biomarkers in the assessment of histological disease activity. Furthermore, FC and FL concentrations were slightly lower in patients with grade 4 disease activity-defined the presence of ulceration, which might be reflective of the initial clearing of these fecal markers from the intestinal lumen.
We further aimed to evaluate the diagnostic performance of fecal biomarkers against the NI which has been validated for use in the assessment of histological activity in UC in clinical practice and clinical trials. First, both FC and FL were both significantly increased in patients with evidence of acute inflammatory cells infiltrate (ie, NI ≥2)-a finding confirmed by Magro et al 21 for FC levels among a sample of 377 patients with UC. Post hoc analyses showed a marked difference in FC and FL levels between NI grades 0-1 and 2-4, further supporting the appropriateness of these fecal biomarkers in assessing histological inflammatory activity in UC. These are supported by earlier research showing patients with UC with active histological inflammation, as measured by neutrophil infiltration, had significantly higher median levels of FC, compared to those without. 38,39 Our research is, however, the first to show this marked increase occurs also in levels of FL.
What is more, the diagnostic performance was achieved despite testing in a rather mild-to-moderate severity UC patient group with only 6.8% of patients were currently treated with biologics.
patients in histological remission who tested negative to either calprotectin or lactoferrin (ie, specificity) was 80.7% and 65.1%, respectively. Arguably, it is the specificity of this test that is of greater value in clinical practice, as a high specificity reflects a higher proportion of patients without acute inflammatory cells infiltrate (ie, in histological remission) being diagnosed as such using the index biomarker. In this sense, FC with a specificity of 80.7% provides an inexpensive biomarker to allow clinicians to first "rule out" the presence of histological inflammation and avoid further invasive histological investigations via colonoscopy. Using the optimized cutoffs for FC and FL, specificity in both cases was lower than that obtained when using the manufacturers' cutoffs, suggesting the manufacturer-recommended cutoffs of more than 50 μg/g for FC and at least 7.25 μg/g are superior for this purpose, at the expense of more false-negative diagnoses.
Overall, existing studies testing the diagnostic performance of fecal biomarkers against histological scoring indices are scarce. Both Theede et al 39 and Guardiola et al 38 tested the diagnostic performance of FC in predicting histological remission in patients with endoscopically quiescent UC, with reported AUCs ranging from 7.54 to 7.55, although sample sizes in these studies were comparatively small, reflected in the large CIs reported. Magro et al 40 tested the diagnostic performance of FC and FL in predicting histological disease activity in patients with UC after receiving    Earlier studies compared tested the diagnostic accuracy of FC for defining endoscopically active disease using cutoffs ranging from ≥48 to ≥250 μg/g, with inconsistent sensitivities and specificities reported. 17,19,28,29,[41][42][43][44] Other studies have specifically  examined the diagnostic accuracy of FC in predicting relapse, on the basis of clinical and endoscopic findings, in patients with UC, from which it has been concluded that the ideal cutoff for predicting relapse remains to be defined. 45 Tested cutoffs above which FC is predictive of relapse in UC using in existing studies have ranged from 120 to 300 μg/g. 45,46 Comparatively, few studies have tested the diagnostic accuracy of FL for detecting endoscopically active disease. In our prior work, we showed cutoffs of more than 7.05 and at least 7.25 μg/g for FL to have a sensitivity of 92.6% and 75%, and a specificity of 66.7% and 62.5%, respectively, using the same IBD-SCAN method described in the present study. 17,44 Other studies adopted cutoffs for FL of at least 140 and more than 150 μg/g found to have a sensitivity of 67% and 62%, and a specificity of 68% and 65%, respectively, for predicting relapse in patients with UC. 29,47 Thus, suggesting a superior performance of FL as a marker of histological disease activity, as opposed to endoscopically active disease, when comparing these results to those reported in the present study.
Our results are not without their limitations. The number of biopsies collected from each patient was limited to 3 each from both the rectum and sigma which might lead to a sampling error. The primary strength of this study is the relatively large sample size compared to other studies in this field consisting of samples with less than 100 patients. Furthermore, as fecal biomarker concentrations are known to vary over the course of the day in calprotectin, we requested patients in this study to provide a morning stool sample only. FC and FL are also not the only fecal markers available that may be used in the assessment of intestinal inflammation, although they are arguably the most researched and used in clinical practice. For example S100A12 (calgranulin C) has shown greater specificity in predicting relapse in IBD compared to FC. In addition HMGB1 was correlated with disease activitity in pediatric and adult patients with IBS and Fecal Immunochemical Test has shown to be predictive of relapse in patients with UC. 49,50 Further the generalizability of our results is restricted to testing FC and FL concentrations using the specific laboratory testing kits described in the present study. Internationally, there are at least 6 different commercial testing kits available for measuring FC and FL concentrations, all offered from different manufacturers with some based on monoclonal ELISA and others on multiclonal ELISA.
As histological remission is predictive of relapse in patients with UC, even those in endoscopic remission, the assessment of histological disease activity in the clinical monitoring of UC is invaluable. Particularly as the presence of histological disease activity has been associated with the incidence of colorectal neoplasia and colectomy. [51][52][53] Biomarkers of histological disease activity, such as fecal biomarkers, may be performed onsite permitting an inexpensive, noninvasive, and rapid measure of histological inflammatory activity in cases where information on the specific location and behavior of disease is not necessary. This being the primary limitation to fecal markers that once cannot concurrently screen for other lesions, as with colonoscopy, to support early detection of other intestinal conditions (eg, bowel cancer). A secondary limitation is that, as general markers of neutrophil infiltration into the intestinal mucosa, levels will also increase in response to intestinal infections and inflammation, such as undiagnosed celiac disease or infectious diarrhea. 16

CONCLUSIONS
Fecal biomarkers calprotectin and lactoferrin correlated with histological inflammation and differentiated between patients graded according to the NI with histological remission from those with evidence of moderate to severe histological disease activity. In addition, the fecal biomarkers are noninvasive and inexpensive which support their use in the clinical monitoring of patients with UC.