Abstract
Objective
This study aimed to compare the predictive accuracy of five commonly used clinical scoring systems - albumin, international normalised ratio,altered mental status, systolic blood pressure, age (AIMS65), Charlson Comorbidity Index ≥2, in-hospital onset, albumin <2.5 g/dL, altered mental status, Eastern Cooperative Oncology Group (ECOG) performance status ≥2, and steroid use (CHAMPS), age, blood tests, and comorbidities (ABC), Glasgow-Blatchford score (GBS), and Complete Rockall score (CRS)- in estimating in-hospital mortality among patients presenting with non-variceal upper gastrointestinal bleeding (UGIB).
Materials and Methods
This retrospective, single-center observational study included 917 adult patients diagnosed with non-variceal UGIB between January 2020 and January 2025. Clinical data were extracted from electronic medical records. Each patient’s risk scores (AIMS65, CHAMPS, ABC, GBS, and CRS) were calculated based on admission data. The predictive performance of each scoring system for in-hospital mortality was assessed using receiver operating characteristic curve analysis, and area under the curve (AUC) values were compared using the DeLong test.
Results
The overall in-hospital mortality rate was 5.2%. AIMS65 demonstrated the highest predictive performance (AUC: 0.815, 95% confidence interval: 0.788-0.840), significantly outperforming GBS (AUC: 0.631, p<0.001) and showing comparable accuracy to CHAMPS (AUC: 0.801, p=0.493). The CHAMPS score also showed good discriminatory power, particularly in high-risk patients. The ABC score (AUC: 0.708) and CRS (AUC: 0.702) demonstrated moderate predictive ability, while GBS had the lowest accuracy.
Conclusion
Among the five evaluated scoring systems, AIMS65 exhibited the best performance in predicting in-hospital mortality in non-variceal UGIB patients, followed closely by CHAMPS.
Introduction
Acute upper gastrointestinal bleeding (UGIB) is a frequently encountered and potentially life-threatening clinical condition in emergency departments and hospitals [1]. Despite advances in pharmacological and endoscopic therapies, the estimated mortality rate for UGIB remains between 2% and 10% [2, 3]. Non-variceal causes-such as peptic ulcers, gastritis, and Mallory-Weiss tears-account for a significant proportion of UGIB cases. Early risk stratification in these patients is critically important for reducing both mortality and morbidity [4].
Risk scoring systems have been developed to support clinical decision-making, predict patient prognosis, and guide appropriate treatment strategies [4]. Currently, several risk scores are commonly used in clinical practice, including the Glasgow-Blatchford score (GBS); albumin, international normalised ratio,altered mental status, systolic blood pressure, age (AIMS65) age, blood tests, and comorbidities (ABC score); Charlson Comorbidity Index (CCI) ≥2, in-hospital onset, albumin <2.5 g/dL, altered mental status, Eastern Cooperative Oncology Group (ECOG) performance status ≥2, and steroid use (CHAMPS score); and the Complete Rockall score (CRS) [5-8]. However, there is ongoing debate regarding the relative accuracy and predictive value of these scoring systems for in-hospital mortality [5, 9].
The aim of this study is to compare the predictive performance of the CHAMPS, GBS, AIMS65, ABC, and CRS scores in estimating in-hospital mortality among patients with non-variceal UGIB. The findings are expected to provide clinically relevant guidance for physicians in managing these patients more effectively.
Materials and Methods
Ethics, Study Design, and Data Collection
This study was approved by the University of Health Sciences Türkiye, Bakırköy Dr. Sadi Konuk Training and Research Hospital Ethics Committee on May 21, 2025 (protocol number: 2025/157, decision number: 2025-10-07, date: 21.05.2025). The research was conducted in accordance with the ethical principles of the Declaration of Helsinki and international data protection standards [10]. Due to the retrospective nature of the study, the requirement for additional informed consent was waived by the ethics committee. However, all patients provided written informed consent regarding the diagnosis and treatment of UGIB as part of standard clinical care upon admission.
Data Handling and Confidentiality
Clinical data were obtained in encrypted form from the hospital’s electronic medical record system and stored in a secure database accessible only to the research team. During the analysis phase, all personal identifiers were anonymized, and only clinical parameters were evaluated. The data processing procedures strictly adhered to the standards of the General Data Protection Regulation to ensure patient privacy [11]. The methodological design and findings of the study were reported in accordance with the Strengthening the Reporting of Observational Studies in Epidemiology guidelines for observational research [12].
Study Design
This single-center, retrospective observational study was conducted in the emergency department of a tertiary care training and research hospital, involving patients diagnosed with non-variceal UGIB. The study site is a high-volume referral center, with approximately 400,000 emergency department visits annually, continuous 24-hour endoscopy availability, and frequent referrals from surrounding healthcare facilities for suspected UGIB cases.
Study Population
This retrospective study included adult patients (aged ≥18 years) who presented to the emergency department of University of Health Sciences Türkiye, Bakırköy
Dr. Sadi Konuk Training and Research Hospital between January 1, 2020, and January 1, 2025, and were diagnosed with non-variceal UGIB. Diagnosis of UGIB was confirmed endoscopically and supported by at least one of the following clinical criteria: (1) presence of hematemesis or melena; or (2) a drop of ≥2 g/dL in hemoglobin levels compared to previous values. Exclusion criteria included (1) bleeding secondary to endoscopic mucosal resection, and (2) cases with insufficient data to calculate risk scores. These criteria were applied to ensure a homogeneous study population and enhance the reliability of the findings.
Data Collection and Definitions
All cases presenting to the hospital during the specified study period were retrospectively reviewed using the hospital’s electronic medical record system. Medical records of patients diagnosed with non-variceal UGIB were examined in detail, and the relevant data were recorded using a pre-designed standardized data collection form. This form included demographic characteristics (age, sex, and comorbidities), presenting symptoms to the emergency department (hematemesis, melena, syncope, and altered mental status), and the setting of presentation (in-hospital vs. out-of-hospital onset).
The etiology of bleeding was classified as gastric ulcer, duodenal ulcer, or other causes. Vital signs at presentation (systolic blood pressure and pulse rate) and laboratory parameters [hemoglobin, albumin, creatinine, blood urea nitrogen, and international normalized ratio (INR)] were recorded. Additionally, data were collected on the patients’ medication history (use of anticoagulants, antiplatelet agents, nonsteroidal anti-inflammatory drugs, corticosteroids, and antisecretory agents), physical performance status (Eastern Cooperative Oncology Group Performance Status), comorbidity burden (CCI), and operative risk assessment (American Society of Anesthesiologists score).
Rebleeding was defined as the occurrence of fresh hematemesis, melena, or hemodynamic instability within seven days of the initial presentation, is confirmed endoscopically to have originated from the same source as the initial bleeding. The primary outcome of the study was all-cause in-hospital mortality.
Statistical Analysis
Data analyses were performed using SPSS Statistics for Windows, version 23.0 (SPSS Inc., Chicago, IL, USA) and MedCalc version 16.8.4 (MedCalc Software, Mariakerke, Belgium). The normality of distribution for continuous variables was assessed using the Kolmogorov-Smirnov test and histograms. Descriptive statistics were reported as mean ± standard deviation for normally distributed variables, and as median and interquartile range for non-normally distributed variables. Categorical variables were expressed as counts and percentages (%). For group comparisons, Student’s t-test was used for normally distributed continuous variables, while the Mann-Whitney U test was employed for non-normally distributed variables. The Pearson chi-square test was used to compare categorical variables.
The predictive performance of each risk scoring system was assessed using receiver operating characteristic curve analysis. The area under the curve was calculated for each score, and comparisons between scores were made using the DeLong test. Based on previous literature, the cut-off values for low-risk classification were defined as follows: ABC score ≤3, AIMS65 ≤1, CHAMPS = 0, CRS ≤1, and GBS ≤1. High-risk thresholds were set at ABC score ≥8, AIMS65 ≥2, CHAMPS ≥3, CRS ≥5, and GBS ≥5 [4-9]. The performance of the prediction scores was evaluated in terms of sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and weighted accuracy. A p<0.05 was considered statistically significant.
Results
A total of 917 consecutive adult patients who met the eligibility criteria were included in the study (Figure 1). The mean age of the patients was 64.1±20.8 years, and 73.8% (n=677) were male. The rebleeding rate was 7.5% (n=69), and the in-hospital mortality rate was 5.2% (n=48). The mean age of patients who did not survive was significantly higher than that of survivors (78.9±9.7 vs. 63.4±21.1 years; p<0.001). The baseline characteristics of the study population are presented in Table 1.
The ABC score, AIMS65, CHAMPS, CRS, and GBS classified 57.3%, 72.1%, 24.5%, 12.4%, and 4.2% of patients, respectively, as low risk. The in-hospital mortality rates among these low-risk groups were 4.0%, 1.7%, 0.4%, 0.9%, and 2.3%, respectively. Conversely, the same scoring systems classified 10.6%, 27.9%, 11.3%, 47.3%, and 88.6% of patients, respectively, as high-risk. In-hospital mortality rates among the high-risk groups were calculated as 16.5% for the ABC score, 16.4% for AIMS65, 33.7% for CHAMPS, 9.2% for CRS, and 6.6% for GBS.
The sensitivity, specificity, PPV, and NPV of each scoring system in predicting in-hospital mortality are presented in Table 2. Among patients with non-variceal UGIB, the AIMS65 score demonstrated good predictive performance for in-hospital mortality, with an AUC of 0.815 (95% confidence interval (CI): 0.788-0.840). The performance of the AIMS65 score was significantly superior to that of the GBS (AUC: 0.631, 95% CI: 0.599-0.663; p<0.001), and comparable to the CHAMPS score (AUC: 0.801, 95% CI: 0.773-0.872; p=0.493). It also showed statistically better discrimination than both the ABC score (AUC: 0.708, 95% CI: 0.678-0.738; p=0.026) and the CRS (AUC: 0.702, 95% CI: 0.671-0.731; p=0.018).
Discussion
This study aimed to evaluate and compare the predictive performance of five widely used clinical risk scoring systems, -CHAMPS, AIMS65, ABC score, GBS, and CRS-in estimating in-hospital mortality among patients presenting with non-variceal UGIB.
Emergency departments in Türkiye are often severely overcrowded [13], with approximately one million emergency surgical procedures performed annually [14]. In such high-volume and resource-constrained settings, clinical risk scoring systems play a pivotal role in optimizing triage and management decisions [15, 16]. Moreover, these tools have proven particularly valuable during global crises such as pandemics, when infection control is paramount. By identifying patients at low risk, they help prevent unnecessary hospital admissions and support more efficient allocation of healthcare resources [17]. Our findings provide important insights into the relative strengths and limitations of these scoring systems in early mortality risk stratification-a process that is essential for guiding timely and appropriate patient management in emergency care settings.
Among the evaluated scoring systems, the AIMS65 demonstrated the highest predictive value for in-hospital mortality, with an AUC of 0.815 (95% CI: 0.788-0.840), indicating good discriminatory performance. This finding is consistent with previous literature suggesting that AIMS65 is a reliable tool for predicting mortality in patients with UGIB [18]. Its simple structure, reliance on readily available clinical and laboratory parameters, and consistent performance across diverse patient populations make it particularly practical for use in routine clinical settings. Notably, AIMS65 outperformed GBS significantly, while showing comparable predictive ability to the CHAMPS, ABC, and CRS scores [4].
The CHAMPS score also demonstrated strong predictive capability, particularly within the high-risk classification group, which had a 32.4% mortality rate. Although the CHAMPS score lacks a universally accepted high-risk threshold, a cut-off of ≥3 was selected based on prior evidence suggesting, increased mortality with the accumulation of multiple adverse features [4, 7]. This threshold also aligned with the mortality distribution in our cohort and allowed meaningful stratification. Further validation in diverse settings is needed. By incorporating variables such as ECOG performance status, albumin level, and steroid use, the CHAMPS score may offer enhanced prognostic accuracy, especially in elderly patients or those with significant comorbidities [4, 7]. However, its lower sensitivity compared to AIMS65 (72.9% vs. 77.1%) may limit its utility as a standalone tool during the initial triage process. The ABC score and CRS showed moderate discriminatory ability, with AUCs of 0.708 and 0.702 respectively. Although both scores were able to identify high-risk patients associated with higher mortality rates, their lower sensitivity and specificity values suggest that their predictive effectiveness may be limited when used independently [19, 20]. Nonetheless, when applied in conjunction with more robust tools such as AIMS65 or CHAMPS, they may provide additional value, particularly in complex clinical scenarios.
The ABC score, which incorporates age, basic laboratory results, and comorbidity burden, has been proposed as a simplified tool for mortality risk stratification in gastrointestinal bleeding [21]. Its moderate performance in this study (AUC: 0.708) is consistent with international data, highlighting its utility in settings where rapid decision-making is required. Although it did not out-perform AIMS65 or CHAMPS, its reliance on objective parameters and ease of use may make it a practical alternative in centers lacking comprehensive clinical assessment resources. Further validation across different healthcare systems could help define its role in UGIB management pathways.
Although the GBS is widely used in the assessment of UGIB, it demonstrated poor performance in predicting in-hospital mortality in this study. This finding aligns with previous research indicating that GBS is more effective in predicting the need for clinical interventions such as blood transfusion or endoscopy rather than mortality itself [22]. Its high sensitivity (89.6%) coupled with low specificity (20.9%) suggests a tendency to overestimate mortality risk. In clinical practice, scoring systems with a high NPV, such as AIMS65 and CHAMPS, are particularly useful for identifying low-risk patients who may be suitable for conservative management. On the other hand, although their PPVs are relatively low, these scores can aid in the early identification of high-risk patients who may require intensive monitoring or intervention.
Study Limitations
This study has several limitations. First, its retrospective and single-center design may introduce selection and information bias, thereby limiting the generalizability of the findings. Since the study was conducted in a well-resourced tertiary care hospital with 24-hour endoscopy access, the results may not be fully applicable to rural or resource-limited settings. Future multicenter studies are needed to confirm these findings in more diverse healthcare environments. Second, the study focused exclusively on non-variceal UGIB cases, which restricts the applicability of the results to patients with variceal bleeding. Finally, although the scoring systems were calculated based on data obtained at the time of admission, dynamic changes in patients’ clinical conditions and physician judgment during management may have influenced the outcomes.
Conclusion
In conclusion, this study demonstrates that the AIMS65 score has the highest predictive value for in-hospital mortality among patients with non-variceal UGIB, with the CHAMPS score offering comparable utility. The use of these scoring systems for early risk stratification can support clinical decision-making and facilitate more efficient allocation of healthcare resources in the management of UGIB patients.


