Research Article - (2026) Volume 4, Issue 1
Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review
2Centro Universitário Lusíada, Brazil
3Center of Medical Sciences, Universidade Federal da Paraíba, Paraíba, Brazil https://orcid.org/0009- 0000-4207-3994, Italy
4Department of Medicine, Estácio de Sá University, Rio de Janeiro, Brazil
5Departamento de Anestesiologia, Universidade de São Paulo, São Paulo, Brazil
6Instituto de Cirurgia do Lago, Brasilia, https://orcid.org/0009-0008-6045-9768, Brazil
7Faculdade de Medicina da Universidade Federal do Ceara, Brazil
8Faculdade Pernambucana de Saude, Brazil
9SANIT (servico de anestesiologia de itajai), Brazil
10Departamento de Cirurgia, Universidade Federal de Santa Catarina, Santa Catarina, Brazil https:// orcid.org/0000-0003-4741-0245, Brazil
Received Date: Jan 22, 2026 / Accepted Date: Feb 12, 2026 / Published Date: Feb 17, 2026
Copyright: ©2026 Antonio Andrea Camastra. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Camastra, A. A., Escobar, M. R., Oliveira, D. M., Novaes, L. G. C., Donato, A. B. D., et al. (2026). Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review. J Future Med Healthcare Innovation, 4(1), 01-20.
Abstract
Background: Artificial intelligence (AI) is transforming medicine by enabling real-time data analysis and improved decision-making. In anaesthesiology, AI tools are increasingly used for perioperative risk assessment and intraoperative monitoring, but evidence on their real-world performance and safety remains limited.
Methods: We conducted a systematic review and meta-analysis following PRISMA guidelines, including studies from 2010 to May 2025 that evaluated AI applications—machine learning (ML), deep learning, neural networks, and fuzzy logic— in adult patients undergoing general or regional anaesthesia. Primary outcomes were perioperative complications (e.g., hypotension, hypoxia, bradycardia, delirium, vomiting, cardiac arrest, mortality, acute kidney injury [AKI]); secondary outcomes included haemodynamic stability, ICU admission, and length of stay. Risk of bias was assessed using RoB 2 and ROBINS-I, and random-effects models were applied.
Results: Eighteen studies with diverse surgical settings and sample sizes (60 to >450,000 patients) were included. ML models consistently outperformed conventional statistical methods. Ensemble algorithms, such as XGBoost and random forests, achieved AUROC values of 0.942 and 0.96, respectively. Deep learning models, including Max-Pooling Convolutional Neural Networks, predicted mortality with AUROC 0.867. Hypotension Prediction Index (HPI) trials showed 88% sensitivity, 87% specificity, and a 77% reduction in hypotension burden. Hybrid models integrating waveform and electronic health record data reported AUROCs of 0.807 for mortality and 0.766 for AKI.
Conclusions: AI-based monitoring, especially ML and biomarker-guided strategies, offers substantial improvements in perioperative risk stratification and haemodynamic management. Wider clinical adoption requires external validation, explainable AI frameworks, and rigorously designed randomized controlled trials demonstrating meaningful patient outcome benefits.
Keywords
Machine Learning, Hypotension Prediction Index Intraoperative Hypotension, Acute Kidney Injury, Perioperative Prediction, Ensemble Models and Deep Learning
PROSPERO registration n: CRD420251060430.
Key Message
1. AI-based perioperative monitoring systems, encompassing machine learning and biomarker-guided methodologies, exhibit enhanced diagnostic precision relative to conventional statistical techniques in various surgical environments.
2. Randomised controlled evidence demonstrates the clinical efficacy of AI-assisted haemodynamic management, resulting in substantial decreases in the occurrence, severity, and duration of intraoperative hypotension.
3. The transition of AI from predictive accuracy to enhanced patient-centered outcomes is dependent on extensive, multicenter randomised trials, external validation, and transparent AI frameworks.
Introduction
Artificial intelligence (AI) has been instrumental in changing the global healthcare scene over the last 20 years, paving the way for new developments in clinical care, diagnosis, individualised treatment, and healthcare data management. Because anaesthesiology requires the real-time interpretation and processing of vast volumes of physiological and pharmacological data, it is a particularly promising field for the adoption of AI-based technologies. Therefore, perioperative safety and the standard of anaesthesia care could be significantly improved by integrating AI-assisted predictive and decision support tools.
The precise administration of fast-acting medications, the timely adaptation to significant physiological changes, and the continuous, high-precision monitoring of multiple vital parameters are all necessary for modern anaesthesiology. However, many clinical decisions are still based on doctors' subjective experiences and established practices rather than standardised, predictive approaches, even with major advancements in medical technology. In this regard, moving from reactive to proactive decision-making models that can foresee crucial events before they materialise clinically is made possible by artificial intelligence and its subfields, such as machine learning (ML), deep learning (DL), and artificial neural networks [1-4].
Predicting intraoperative and postoperative complications, including bradycardia, hypotension, hypoxia, postoperative delirium, postoperative nausea and vomiting (PONV), intensive care admission, and mortality, is a primary objective of artificial intelligence in anaesthesiology. Many of these incidents happen unexpectedly, necessitating quick action to avoid negative outcomes. AI systems analyse preoperative clinical and anamnestic data, as well as real-time multi-parameter signals from automated anaesthetic delivery systems, electroencephalograms (EEGs), and physiological monitors. Finding patterns that are hidden from view, estimating the likelihood of future occurrences, and proposing prompt intervention techniques are the objectives.
Numerous observational and randomised clinical studies have assessed the efficacy of these technologies at different points during the perioperative period during the past ten years. For instance, the Hypotension Prediction Index (HPI) software was created with the help of artificial intelligence (AI) and has been thoroughly investigated for its capacity to anticipate intraoperative hypotension episodes up to several minutes beforehand [5-9]. This enables medical professionals to use fluidic interventions or pharmacological modulation to stop the event from happening. The automated administration of intravenous anaesthetics, including propofol and remifentanil, has also been optimised through the use of AI algorithms. This has improved the depth of anaesthesia and haemodynamic stability while allowing for more customisation than conventional target-controlled infusion (TCI) systems.
More effective control of arterial pressure, depth of anaesthesia, fluid administration, and mechanical ventilation has been made possible by intelligent controllers built on fuzzy logic or neuro- adaptive predictive models [10]. By minimising inter-individual variability in anaesthetic response and maximising the balance between efficacy and safety, these tools have been demonstrated to increase the precision and consistency of therapeutic strategies. Additionally, AI-powered clinical information management tools, like predictive electronic anaesthesia records, have enhanced perioperative data collection and analysis, opening up new avenues for quality improvement and clinical audit.
The current body of scientific literature is disjointed and frequently contradictory, despite the excitement these innovations have generated. Few studies have shown a significant impact on critical clinical outcomes like length of hospital stay, major complication rates, or mortality, despite the fact that many have reported encouraging results in terms of predictive accuracy and improved intraoperative parameters. Moreover, it is challenging to reach firm conclusions regarding the applicability of these technologies in routine clinical practice due to variations in the AI models used, the clinical contexts examined, and the methodological calibre of publications.
In light of this, the goal of this systematic review and meta- analysis is to present a comprehensive summary of the data that is currently available regarding the application of AI in perioperative anaesthesia. Through a comparative analysis of data from observational and randomised clinical trials, the review will assess the safety and effectiveness of AI-based tools for intraoperative monitoring and complication prediction. The results will be stratified by the technology (deep learning (DL), fuzzy logic, neural networks, or machine learning (ML), the type of complication targeted, the operative setting (major versus minor surgery; elective versus urgent), and the methodological quality of the included studies.
The PICOT framework, which is advised for creating structured clinical questions, was used to define the inclusion criteria that were chosen for this review. Adult patients undergoing regional or general anaesthesia make up the target population. The use of artificial intelligence (AI)-based technologies for intraoperative monitoring or perioperative adverse event prediction is the intervention (I). Standard, non-AI-assisted monitoring techniques are used in comparison (C). Predictive accuracy in relation to particular complications (such as hypotension, hypoxia, post- operative vomiting syndrome, delirium, and death) is the primary outcome (O), and clinical variables like length of hospital stay, ICU admission rate, and intraoperative stability are the secondary outcomes. Since 2010 is when AI-based technologies were first introduced and used in anaesthesiology, both prospective observational studies and randomised controlled trials (RCTs) and relevant retrospective studies published since then were included.
This meta-analysis differs from earlier synthesis attempts in that it focusses on studies published after 2010 and has stricter inclusion criteria. Methodological quality, which is evaluated with particular instruments like ROBINS-I for observational studies and RoB 2 for randomised controlled trials (RCTs), is emphasised. Additionally, subgroup analyses will be carried out based on the kind of algorithm employed, the perioperative phase of application (pre-, intra-, or postoperative), and the kind of complication that is being targeted. There are also moral, legal, and financial concerns with the broad use of AI in anaesthesia. Thorough consideration must be given to issues like algorithm transparency, medico-legal liability for mistakes, sensitive data protection, and guaranteeing fair access to these advancements. Training medical staff to enable anaesthetists to consciously incorporate the new technologies into their daily clinical practice is another essential component. Therefore, educational institutions and healthcare facilities ought to support professional development courses that address fundamental AI concepts, data analysis, and human-machine interaction.
Establishing a strong body of evidence to direct the clinical application of AI tools in anaesthesiology and to guide future research aimed at creating dependable, secure, and genuinely practical predictive models for daily use is the ultimate goal of this work. Technical innovation, strong proof of clinical efficacy, safety, and acceptability, as well as long-term economic viability, are necessary for the adoption of such technologies.
Methods
To maintain openness, reproducibility, and methodological rigour, this systematic review with meta-analysis closely adhered to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Because the protocol was pre- registered with the PROSPERO (International Prospective Register of Systematic Reviews) database, the authors were bound to adhere to a predetermined plan, reducing the possibility of selective bias and boosting the accuracy of the findings.
Type of Studies Included
Randomised controlled trials (RCTs), retrospective studies and prospective observational studies on the application of AI-based technologies in anaesthesiology that were carried out in adult human populations were included in the review. The recent adoption of AI technologies in clinical practice, which has reduced the quantity of RCTs available, served as the impetus for the decision to incorporate observational studies. This does not, however, rule out the possibility of gathering pertinent data from carefully planned non-randomized research.
Inclusion Criteria
Every study that satisfied the following requirements was deemed qualified:
- To guarantee the inclusion of current research in line with recent advancements in AI technologies, the publication date should be between January 1, 2010, and December 31, 2024.
- Research design: observational studies with a control group and prospective RCTs.
- Adult subjects (≥18 years old) undergoing general or loco- regional anaesthesia in a surgical or procedural setting comprise the population.
- Intervention: the application of models, algorithms, or predictive tools based on artificial intelligence, such as expert systems, machine learning, deep learning, artificial neural networks, or fuzzy logic algorithms.
- The ability to predict or prevent perioperative complications, such as intraoperative hypotension, hypoxia, adverse cardiovascular events, bradycardia, postoperative delirium, PONV, cardiac arrest, and AKI, is the main outcome.
- Long hospital stays, the need to be admitted to an intensive care unit, haemodynamic stability, and the effectiveness of intraoperative physiological monitoring are examples of secondary outcomes.
Exclusion Criteria
Research that satisfied these requirements was not included: - research done on children (less than 18 years old) or in animal models.
- Research that lacked a clear and documented AI-based intervention.
- Simulation studies, case reports, editorials, letters to the editor, and conference abstracts without peer-reviewed full texts are all examples.
- To guarantee linguistic homogeneity in data extraction and interpretation, studies published in languages other than English were disqualified.
|
Inclusion criteria |
Exclusion criteria |
|
Adult subjects (≥18 years old) |
Research done on children (less than 18 years old) or in animal models |
|
observational or RCTs |
Research that lacked a clear and documented AI-based intervention |
|
publication date between January 1, 2010- present. |
not observational or RCTs |
|
All AI technologies or system |
studies published in languages other than English |
Table. Inclusion and Exclusion Criteria
Sources of Information and Search Methodology
Three primary biomedical databases—PubMed/MEDLINE, Embase, and the Cochrane Central Register of Controlled Trials (CENTRAL)—were thoroughly searched. The search approach was predetermined and comprised MeSH (Medical Subject Headings) terms and keywords related to the fields of artificial intelligence, perioperative complications, and anaesthesia. Terms like "artificial intelligence," "machine learning," "deep learning," "anaesthesia," "perioperative complications," "monitoring," "prediction," and others were examples of Boolean combinations. Only human subjects’ studies published between 2010 and May 2025 were included in the results.
Search Strategy
Study Selection Process
Two reviewers (A.A.C., M. R. E.) independently screened all identified abstracts and titles. Using the predetermined criteria, the full texts of potentially eligible articles were retrieved and evaluated for inclusion. A third reviewer (T. R. E.) was consulted or discussed in order to settle disagreements. (Table 1.) (Table 1.)
|
Stage |
Number of Records |
Notes |
|
Records identified |
2772 |
Using the defined search strategy (see below) |
|
Records after duplicates removed |
2586 |
Duplicate records removed |
|
Records screened (title and abstract) |
2586 |
Screening based on predefined PICOTT criteria |
|
Records excluded |
2552 |
Excluded for:
|
|
Full-text articles assessed for retrieval |
34 |
Full-text retrieval attempted |
|
Full-text articles not retrieved |
0 |
Full texts not accessible |
|
Full-text articles assessed for eligibility |
34 |
Evaluated according to inclusion/exclusion criteria |
|
Full-text articles excluded |
16 |
Common reasons for exclusion:
|
|
Studies included in the systematic review |
18 |
All were randomized controlled trials on AI in perioperative anaesthesia |
|
Studies included in the meta-analysis |
18 |
All provided extractable quantitative outcome data |
Data extraction
A standardised collection form that was tested in a pilot phase was used to extract pertinent data from each includedstudy.
This form was used to extract pertinent data from each included study. The variables that were gathered included the author's name, the year of publication, the study design, the sample that was analysed, the population's demographics, the type and characteristics of the AI algorithm that was used, the clinical setting, the type of surgery or anaesthesia that was performed, the primary and secondary outcomes, the main results, and the authors' main conclusions. Two authors independently collected the data in duplicate.
Risk of Bias Assessment
Validated instruments were used to evaluate the included studies' methodological quality:
- RoB 2.0 for RCTs, in accordance with the Cochrane Collaboration's guidelines. Randomisation, treatment deviation, attrition, outcome measurement, and the selection of reported data were among the biases evaluated.
Non-randomized observational studies were conducted using ROBINS-I, which takes into account biases in participant selection, confounding factors, intervention classification, missing data, and analysis techniques.
Two authors conducted each evaluation independently, and any disagreements over the results were discussed. Each study's overall quality was assigned a risk of bias rating of "low," "moderate," or "high."
Statistical Analysis and Data Synthesis
Meta-analysis was used to synthesise quantitative data with homogeneous populations, interventions, and outcomes. To account for variation amongst studies, a random effects model was used. With 95% confidence intervals, effect sizes were presented as odds ratios (OR), risk ratios (RR), or mean differences (MD). Cochran's Q statistic was used to evaluate study heterogeneity, and the I2 index was used to quantify it based on three criteria: 0–25% (low), 25–50% (moderate), and >50% (high).
To investigate the efficacy of AI, subgroup analyses were designed according to the following criteria: the algorithm type (e.g., HPI, fuzzy logic, deep learning, or generic predictive models); the type of complication (e.g., neurological, cardiovascular, or respiratory); and the surgical context (major vs. minor surgery; elective vs. emergency).
Studies with a high risk of bias were excluded through sensitivity analyses.
Affiliation and Peer Review
A multidisciplinary team unaffiliated with academic institutions carried out the work, and during the protocol phase, the review was not exposed to external peer review. However, after the manuscript is finished, it will be submitted for peer review and publication.
Results
There were 18 studies in all, and they were a mix of randomised controlled trials (RCTs), prospective observational studies, and retrospective analyses. The studies took place in many different parts of the world, including Europe (the Netherlands, Italy, Germany, and Spain), North America (the United States), and Asia (Korea and China). This shows that people all over the world are interested in using machine learning (ML) and biomarkers to predict and manage perioperative events. (Table 2.)
|
Study (Au- thor, Year) |
Setting |
Sample Size |
Study Type |
Primary Outcome |
Model(s) Used |
Best Model Performance |
Input Fea- tures |
Unique Fea- tures |
|
Schenk et al., |
Amster- |
60 (54 ana- |
Randomized |
Time-weight- |
Hypotension |
88% sensitivi- |
Continuous |
Machine learn- |
|
2021 |
dam UMC, |
lyzed) |
Controlled |
ed average |
Prediction |
ty, 87% speci- |
arterial wave- |
ing-based early |
|
|
Netherlands |
|
Sub-Study |
(TWA) of |
Index (HPI) |
ficity for IOH; |
form; MAP |
warning system; |
|
|
– PACU |
|
|
postoperative |
|
77% reduction |
<65 mm Hg; |
integration with |
|
|
|
|
|
hypotension |
|
in IOH depth/ |
FloTracIQ |
HemoSphere |
|
|
|
|
|
(POH) |
|
duration |
sensor data |
monitor |
|
Fritz et al., |
Single center |
95,907 |
Retrospective |
30-day |
MPCNN |
AUROC: |
>70 static and |
Real-time dy- |
|
2019 |
(USA) |
|
|
postoperative |
(CNN + |
0.867, |
time-series |
namic predic- |
|
|
|
|
|
mortality |
LSTM), |
AUPRC: 0.097 |
variables |
tion, intraopera- |
|
|
|
|
|
|
DNN, SVM, |
|
|
tive monitoring |
|
|
|
|
|
|
RF, LR |
|
|
|
|
Lee et al., |
Multi-center |
454,404 |
Retrospective |
30-day |
XGBoost, |
AUROC: |
12–18 |
External vali- |
|
2022 |
(Korea) |
|
|
postoperative |
LR, RF, DNN |
0.942, |
preoperative |
dation across |
|
|
|
|
|
mortality |
|
AUPRC: 0.175 |
variables |
4 hospitals, |
|
|
|
|
|
|
|
(XGBoost) |
|
compact model |
|
Te et al., 2024 |
Single center |
23,305 |
Retrospective |
PIHI Index |
Extra Tree |
R²: 0.9047, |
14 variables: |
Continuous |
|
|
(China) |
|
|
(post-in- |
Regressor, |
MAPE: |
demographics |
outcome, ICV |
|
|
|
|
|
tubation |
XGBoost, |
20.86% (ETR) |
+ drug dosages |
index creation, |
|
|
|
|
|
instability) |
MLP, SVR, |
|
|
SMOTETomek |
|
|
|
|
|
|
MLR |
|
|
balancing |
|
Fritz et al., |
Barnes-Jew- |
5071 pa- |
Single-centre |
30-day |
ML models |
Death predic- |
Electronic |
Clinicians |
|
2024 |
ish Hospital, |
tients |
Randomised |
all-cause |
for death & |
tion: AUROC |
health records: |
randomized to |
|
|
Saint Louis, |
|
Controlled |
postoperative |
AKI |
0.807; AKI |
demographics, |
ML-assisted vs |
|
|
MO, USA |
|
Trial |
mortality and |
|
prediction: |
labs, pre-op |
unassisted re- |
|
|
|
|
|
AKI within 7 |
|
AUROC 0.766 |
data, anaesthe- |
view; live EHR |
|
|
|
|
|
days |
|
|
sia record |
integration; |
|
|
|
|
|
|
|
|
|
Likert-based |
|
|
|
|
|
|
|
|
|
risk rating scale |
|
Frassanito et |
IRCCS |
60 patients |
Single-centre |
TWA-MAP |
HPI + |
82% reduction |
HPI alerts, |
Real-time HPI |
|
al., 2023 |
Policlinico |
|
RCT |
<65 mm Hg; |
Modified |
in IOH events; |
SVV, dP/ |
alert protocol; |
|
|
Gemelli, |
|
|
IOH after |
Goal-Direct- |
TWA-MAP |
dtmax, Eadyn, |
faster interven- |
|
|
Rome, Italy |
|
|
induction; |
ed Therapy |
<65 mm Hg: |
MAP, BIS, flu- |
tion; tailored |
|
|
|
|
|
severe hypo- |
|
0.14 vs 0.77 |
id/vasopressor |
hemodynamic |
|
|
|
|
|
tension |
|
mm Hg |
response |
algorithm |
|
Kang et al., |
Soonchunhy- |
222 patients |
Retrospective |
Hypotension |
Naïve Bayes, |
Random For- |
89 features |
High-res |
|
2020 |
ang Univer- |
|
ML study |
after anesthe- |
Logistic |
est: AUROC |
from EHR |
time-syn- |
|
|
sity Bucheon |
|
|
sia induction |
Regression, |
0.842 (CI: |
+ anesthesia |
chronized |
|
|
Hospital, |
|
|
(tracheal |
Random |
0.736–0.948); |
devices: SBP, |
multi-device |
|
|
South Korea |
|
|
intubation– |
Forest, Arti- |
83.7% recall |
MBP, HR, |
monitoring; |
|
|
|
|
|
incision) |
ficial Neural |
with selected |
TIVA doses, |
pre-intuba- |
|
|
|
|
|
|
Network |
features |
ventilator set- |
tion-only train- |
|
|
|
|
|
|
|
|
tings, patient |
ing window; |
|
|
|
|
|
|
|
|
demographics |
feature set |
|
|
|
|
|
|
|
|
|
comparisons |
|
Wijnberge et |
Amsterdam |
64 patients |
Single-centre |
Reduction in |
Hypotension |
HPI group:- |
Arterial |
First RCT on |
|
al., 2020 |
UMC, Neth- |
|
RCT |
intraopera- |
Prediction |
significant re- |
waveform data |
HPI; real-time |
|
|
erlands |
|
|
tive hypoten- |
Index (HPI) |
duction in time |
via FloTracIQ |
alerts; integra- |
|
|
|
|
|
sion duration |
|
spent MAP |
sensor, MAP |
tion with Acu- |
|
|
|
|
|
|
|
<65 mm Hg |
thresholds |
men platform |
|
Baig et al., |
Auckland |
30 patients |
Real-time |
Detection of |
RT-SAAM, |
FLMS-2: |
HR, BP, PV, |
Multi-module |
|
2013 |
City Hos- |
|
fuzzy logic |
absolute hy- |
FLMS-2 |
Kappa = 0.75; |
EtCOâ??; wave- |
fuzzy alarms; |
|
|
pital, New |
|
validation |
povolaemia |
(Fuzzy logic |
RT-SAAM: |
form analysis; |
real-time alerts; |
|
|
Zealand |
|
|
|
systems) |
Kappa = 0.62 |
fuzzy rule sets |
clinician agree- |
|
|
|
|
|
|
|
|
|
ment validation |
|
Rip- |
28 hospitals |
917 patients |
Multicenter |
Moder- |
Hypotension |
No significant |
HPI >80, |
Largest HPI |
|
ollés-Melchor |
(Spain + |
|
RCT |
ate-to-severe |
Prediction |
difference in |
MAP, SVV, |
RCT; diverse |
|
et al., 2025 |
Jordan) |
|
|
AKI within |
Index (HPI) |
AKI incidence |
Eadyn, dP/ |
real-world prac- |
|
|
|
|
|
7 days; |
|
(HPI: 6.1% vs |
dtmax, vaso- |
tices; subgroup |
|
|
|
|
|
complica- |
|
control: 7.0%) |
pressor/fluids |
analysis for |
|
|
|
|
|
tions; 30-day |
|
|
guided by |
hypertension |
|
|
|
|
|
mortality |
|
|
Hemosphere |
effect |
|
Hu et al., |
Henan Univ. |
202 patients |
Deep learn- |
Preoperative |
ANN + CNN |
R² = 0.915 (in- |
Gender, age, |
Multi-stage |
|
2024 |
of Sci. & |
|
ing model |
& intraopera- |
+ LSTM + |
tra-op); MAPE |
weight, SBP, |
model; attention |
|
|
Tech., China |
|
validation |
tive anes- |
Attention |
= 12.25% |
DBP, BIS, HR; |
mechanism; |
|
|
|
|
|
thetic dose |
|
|
time-series |
convolutional |
|
|
|
|
|
prediction |
|
|
vitals |
feature ex- |
|
|
|
|
|
|
|
|
|
traction |
|
Abin et al., |
3 hospitals, |
998 patients |
Cohort ML- |
AKI risk |
AKI– and |
Accuracy: |
8 pre-op |
Dual regression |
|
2024 |
Tehran, Iran |
|
based design |
reduction via |
AKI+ regres- |
80.6%; F1- |
features → |
system; visual |
|
|
|
|
|
anesthesia |
sion planners |
score: 0.821; |
6 anesthesia |
dashboard; vali- |
|
|
|
|
|
management |
|
recall: 84.8% |
targets (CPB |
dated by cardiac |
|
|
|
|
|
|
|
|
time, fluids, |
anesthesiolo- |
|
|
|
|
|
|
|
|
transfusion, |
gists |
|
|
|
|
|
|
|
|
diuretics) |
|
|
Hayase et al., |
Kyoto Chubu |
83,867 EEG |
Observa- |
BIS predic- |
MLPNN |
R = 0.87; |
Poincaré-index |
Hierarchical |
|
2020 |
Medical Cen- |
epochs from |
tional + ML |
tion from |
(deep learn- |
RMSE = 7.09; |
(0.5–47 Hz & |
Poincaré plot |
|
|
ter, Japan |
30 patients |
validation |
EEG scatter |
ing) |
Bias = 0.07 |
20–30 Hz), |
fusion; EMG- |
|
|
|
|
|
patterns |
|
|
EMG70–110 |
aware BIS |
|
|
|
|
|
|
|
|
Hz, suppres- |
estimation; BIS |
|
|
|
|
|
|
|
|
sion ratio |
used as super- |
|
|
|
|
|
|
|
|
|
visor |
|
Šribar et al., |
University |
34 patients |
Single-centre |
Duration, |
Hypotension |
TWA-AUT: |
HPI alert |
First HPI RCT |
|
2023 |
Hospital |
|
RCT |
depth, and |
Prediction |
0.01 mmHg |
threshold ≥90; |
in thoracic sur- |
|
|
Dubrava, |
|
|
frequency of |
Index (HPI) |
(HPI) vs 0.08 |
MAP, SVV, |
gery; structured |
|
|
Croatia |
|
|
IOH during |
vs FloTrac |
mmHg (GDT); |
Eadyn, dP/dt, |
decision tree; |
|
|
|
|
|
thoracic |
GDT |
fewer hypoten- |
CI, SVI, HR, |
high-fidelity |
|
|
|
|
|
surgery |
|
sive episodes |
BP, lactate, |
waveform |
|
|
|
|
|
|
|
in HPI group |
ScvOâ?? |
analysis |
|
Morisson et |
Maison- |
66 patients |
Ancillary |
Prediction of |
ML algo- |
Penalized |
Age, ASA |
First ML analy- |
|
al., 2022 |
neuve-Rose- |
|
RCT analysis |
moderate to |
rithms (elas- |
logistic regres- |
class, BMI, |
sis of intraoper- |
|
|
mont Hospi- |
|
|
severe PACU |
ticnet, RF, |
sion: CV-AUC |
NOL metrics |
ative nocicep- |
|
|
tal, Montréal, |
|
|
pain |
SVM…) |
0.753(0.718– |
(reaction to in- |
tion (NOL) to |
|
|
Canada |
|
|
|
|
0.788); Brier |
tubation/inci- |
predict PACU |
|
|
|
|
|
|
|
score: 0.194 |
sion), surgical |
pain; SHAP |
|
|
|
|
|
|
|
|
TWA NOL |
interpretability; |
|
|
|
|
|
|
|
|
|
surgery-specific |
|
|
|
|
|
|
|
|
|
feature mod- |
|
|
|
|
|
|
|
|
|
eling |
|
Velagapudi et |
4 academic |
100 patients |
Prospective |
Accuracy |
XGBoost |
Glucose: Ac- |
Age, BMI, |
First dual-mod- |
|
al., 2022 |
hospitals |
× 2 models |
validation via |
of anesthe- |
(glucose), |
curacy ↑ from |
ASA, diabetes, |
el validation |
|
|
(USA) |
|
web-based |
siologist |
Random For- |
79% to 84.7%; |
meds, pain his- |
with real clini- |
|
|
|
|
simulation |
predictions |
est (opioids) |
Opioids: ↑ |
tory, surgery |
cians; REDCap |
|
|
|
|
|
for glucose & |
|
from 18% to |
type, duration, |
interface; |
|
|
|
|
|
opioid needs |
|
42% |
anesthesia type |
SHAP-informed |
|
|
|
|
|
|
|
|
|
feedback |
|
Luckscheiter |
Southwest |
25,556 trau- |
Regis- |
Need for |
Random For- |
RF: AUROC |
24 features via |
First large-scale |
|
et al., 2022 |
Germany |
ma patients |
try-based |
preclinical |
est (RF) & |
0.96; PRC |
PCA: auscul- |
airway predic- |
|
|
EMS registry |
|
retrospective |
airway man- |
Naive Bayes |
area 0.83; NB: |
tation, injury |
tion model in |
|
|
(MIND 3.1) |
|
analysis |
agement in |
(NB) |
AUROC 0.93; |
pattern, oxy- |
EMS; validated |
|
|
|
|
|
trauma |
|
PRC area 0.66 |
gen therapy, |
on real-world |
|
|
|
|
|
|
|
|
shock index, |
trauma registry; |
|
|
|
|
|
|
|
|
vital signs, |
SMOTE ap- |
|
|
|
|
|
|
|
|
interventions |
plied for class |
|
|
|
|
|
|
|
|
|
balance |
|
Tan et al., |
Huashan |
406 carotid |
Retrospective |
Early Phase |
Gradient |
AUROC: |
100+ perioper- |
First ML model |
|
2021 |
Hospital, Fu- |
endarterec- |
single-center |
Postoperative |
Boosted |
0.77 (95% CI: |
ative variables |
for EPOH |
|
|
dan Universi- |
tomy (CEA) |
cohort study |
Hypertension |
Regres- |
0.62–0.92); |
including: |
prediction; |
|
|
ty, Shanghai, |
procedures |
|
(EPOH) |
sion Trees |
Sensitivity |
intraoperative |
feature impor- |
|
|
China |
|
|
requiring |
(GBRT) via |
~90%; Speci- |
peak SBP, |
tance analysis |
|
|
|
|
|
intravenous |
XGBoost |
ficity ~52% |
cardiac index, |
via XGBoost |
|
|
|
|
|
vasodilators |
|
|
anesthetic dos- |
gain scores; |
|
|
|
|
|
within 24 |
|
|
ages (propo- |
validated with |
|
|
|
|
|
hours post- |
|
|
fol, fentanyl, |
4-fold stratified |
|
|
|
|
|
CEA |
|
|
ephedrine), lab |
cross-valida- |
|
|
|
|
|
|
|
|
values (glu- |
tion; strong |
|
|
|
|
|
|
|
|
cose, choles- |
association |
|
|
|
|
|
|
|
|
terol, alkaline |
found between |
|
|
|
|
|
|
|
|
phosphatase), |
intraoperative |
|
|
|
|
|
|
|
|
Doppler |
BP spikes and |
|
|
|
|
|
|
|
|
echocardiog- |
EPOH risk |
|
|
|
|
|
|
|
|
raphy metrics, |
|
|
|
|
|
|
|
|
|
clamping time, |
|
|
|
|
|
|
|
|
|
and demo- |
|
|
|
|
|
|
|
|
|
graphic data |
|
The sample sizes were very different. For example, Schenk et al. (2021) had only 60 patients (n=60), while Lee et al. (2022) had 454,404 patients in a very large retrospective dataset [11]. The ORACLE Trial (Fritz et al., 2024) was the biggest interventional study [12]. It randomly assigned 5,071 patients to either ML- assisted or standard perioperative care. (Table 2.)
The Main Results of the Studies Were Different and Included
- Most of the time, intraoperative hypotension (IOH) is defined as a mean arterial pressure (MAP) of less than 65 mmHg (e.g., Schenk et al., 2021; Wijnberge et al., 2020).
- Acute kidney injury (AKI) is often used as a secondary endpoint (for example, Fritz et al., 2024; Ripollés-Melchor et al., 2025).
- Mortality after surgery: both 30 days after surgery and while in the hospital (e.g., Fritz et al., 2019; Lee et al., 2022).
- PIHI (Te et al., 2024) is an example of a composite haemodynamic instability index.
Modelling Methods and Algorithmic Techniques
There were a number of different predictive modelling methods used in the studies. These can be divided into three main groups: traditional ML algorithms, deep learning architectures, and biomarker-based indices (especially HPI).
Some of the models that were tested the most were random forests (RF), logistic regression (LR), gradient boosting machines (like XGBoost), and support vector machines (SVM).
Lee et al. (2022) directly compared several algorithms for predicting 30-day mortality. They found that XGBoost did better than both RF (AUROC = 0.92) and DNN models, with AUROC = 0.942 and AUPRC = 0.175.
Luckscheiter et al. (2022) made models for bad things that happen during surgery. RF got an AUROC of 0.96 and a PRC area of 0.83 [13].
Deep Learning Structures
Deep learning methods took advantage of the changing nature of data streams during surgery.
Fritz et al. (2019) used a combination of convolutional neural networks (CNNs) and LSTM-based architectures (MPCNN) to predict 30-day mortality with an AUROC of 0.867 and an AUPRC of 0.097. Hu et al. (2024) used these ideas on high-frequency arterial waveform data and found that R² = 0.915 and MAPE = 12.25% were good for predicting changes in blood flow during surgery [14].
The Hypotension Prediction Index (HPI)
HPI, a proprietary algorithm trained to predict IOH minutes before onset, was evaluated in a number of RCTs and observational studies (Schenk et al., 2021; Wijnberge et al., 2020; Ripollés- Melchor et al., 2025). In addition to a 77% decrease in the severity and duration of hypotension in the intervention group, Schenk et al. (2021) reported 88% sensitivity and 87% specificity for HPI. When HPI guidance was used, Wijnberge et al. (2020) also showed a significant decrease in cumulative hypotension time.
Innovative and Hybrid Methods
Te et al. (2024) developed the Post-Intubation Hemodynamic Instability (PIHI) index using 14 clinical and ventilatory variables, achieving R² = 0.9047. [15,16] For AKI risk, Abin et al. (2024) used a dual regression approach, reporting accuracy = 80.6%, F1- score = 0.821, and recall = 84.8%.
Predictive Factors and the Significance of Features
MAP and derived waveform parameters were universally critical, according to a comparative analysis of input features (Table 3).
In HPI-based studies, significant attention was paid to variables such as systolic and diastolic blood pressure (SBP and DBP), heart rate (HR), pulse volume (PV), stroke volume variation (SVV), and dP/dtmax (Frassanito et al., 2023; Šribar et al., 2023[16]).
Larger retrospective models included preoperative variables like age, comorbidity indices, baseline renal function, and ASA status (Lee et al., 2022; Kang et al., 2020) [17].
Analysis of Comparative Performance
Among the best-performing models (AUROC > 0.90) were: '
- Lee et al. (2022): AUROC = 0.942, AUPRC = 0.175 (XGBoost).
- Luckscheiter et al. (2022) : PRC = 0.83 (RF), AUROC = 0.96.
- Te et al. (2024)]: R² = 0.9047 for continuous hemodynamic predictions.
Performers in the middle (AUROC 0.80–0.89):
- Fritz et al. (2019) : AUROC = 0.867, AUPRC = 0.097 (MPCNN).
- Fritz et al. (2024, ORACLE) : AUROC = 0.807 for 30-day mortality; 0.766 for AKI.
Models for experiments or lower performers:
- Tan et al. (2021): sensitivity ~90%, specificity ~52%, AUROC = 0.77 (95% CI: 0.62–0.92).
- Morisson et al. (2022) : Brier score = 0.194, AUC = 0.753, penalised logistic regression [18]. (Table 3.)
|
Stu dies /Va riab le |
Hea rt Rat e (H R) |
Blo od Pre ssur e (BP ) |
Pul se Vol um e (PV ) |
Me an Art eria l Pre ssur |
Mo rtal ity (an y) |
AI Mo dels Co mp are d |
Bes t Mo del Per for ma |
AU RO C |
IC V Val ues |
Typ e of Sur ger y |
Ris k of AK I |
Hy pert ensi on Ris k |
Air way Ma nag eme nt |
Dos age for Ane sthe sia |
Wa rni ng Syst em vs Sta |
30- Day Pos top erat ive Mo |
Pos top erat ive Co mpl icat |
Ana esth esia Dep th Mo nito |
Posto perati ve Pain |
|
|
|
|
|
e (M AP) |
|
|
nce |
|
|
|
|
|
|
|
nda rd Car e |
rtal ity |
ions |
ring |
|
|
Frit z et al., 201 9 |
Use d (pre - and intr aop erati ve) |
Use d (sys tolic & dias tolic , intr aop erati ve) |
NR |
Deri ved fro m BP, not expl icitl y repo rted |
30- day post oper ativ e mor talit y (1% ) |
MP CN N (CN N + LST M), DN N, SV M, RF, LR |
MP CN N (AU RO C: 0.86 7, AU PR C: 0.09 7) |
|
Not use d |
All type s (ma inly AS A II– III) |
NR |
Incl ude d as com orbi dity |
Incl ude d (onl y intu bate d pati ents ) |
Incl ude d (cu mul ativ e intr aop dos es) |
Des igne d for real - time war ning inte grat ion |
Pri mar y end poin t |
Not strat ifie d |
NR |
NR |
|
Lee et al., 202 2 |
Use d (pre oper ativ e only ) |
Use d (pre oper ativ e only ) |
NR |
NR |
30- day post oper ativ e mor talit y (0.2 – 0.4 %) |
XG Boo st, Ran dom For est, Log istic Reg ress ion, DN N |
XG Boo st (AU RO C: 0.94 2, AU PR C: 0.17 5) |
|
Not use d |
Non - card iac surg erie s acro ss spec ialti es |
NR |
Incl ude d as inpu t feat ure |
Not expl icitl y desc ribe d |
Not incl ude d |
Offl ine mod el for preo pera tive risk strat ifica tion |
Pri mar y end poin t |
Not strat ifie d |
NR |
NR |
|
Te et al., 202 4 |
Use d (pre oper ativ e |
Use d (sys tolic & dias |
NR |
NR |
No dire ct mor talit y |
Extr a Tre e Reg ress |
ET R (R²: 0.90 47, MA |
|
Pre dict ed as pri mar |
Gen eral anes thes ia with |
NR |
Incl ude d as bina ry inpu |
All pati ents had end otra |
Incl ude d (init ial infu |
Not real - time ; pred |
Not mea sure d |
Foc use d only on PIH |
NR |
NR |
|
|
only ) |
tolic , preo pera tive ) |
|
|
outc ome |
or, XG Boo st, SV R, ML R, ML P |
E: 0.05 12) |
|
y outp ut |
trac heal intu bati on (AS A I– II) |
|
t feat ure |
che al intu bati on |
sion : fent anyl , pro pof ol, etc.) |
ictiv e sup port only |
|
I (he mod yna mic inst abili ty) |
|
|
|
Sch enk et al. (HP I Tri al) |
Not expl icitl y repo rted |
MA P targ et: >65 mm Hg |
Cap ture d via Flo Tra cIQ sens or |
Not spec ifie d |
Not repo rted dire ctly (30- day outc ome s) |
HPI - guid ed vs Stan dard Car e |
HPI algo rith m |
~0.9 0– 0.92 (bas ed on sens itivi ty/s peci ficit y) |
Not appl icab le |
Elec tive non card iac (ma inly GI and gyn ecol ogic al) |
Ele vate d in pres enc e of IOH /PO H |
Peri oper ativ e MA P man age men t use d to miti gate extr eme s |
Gen eral ana esth esia with mec hani cal vent ilati on |
Sev oflu rane med ian: 1.55 – 1.64 vol % |
HPI guid anc e vs rout ine hae mod yna mic prot ocol s |
Not dire ctly asse ssed |
4 pati ents with Cla vien - Din do Gra de ≥ III (ble edin g inte rven tion s) |
Not expl icitl y repo rted |
Not discus sed in this sub- study |
|
OR AC LE Tri al (Fri tz et al., 202 4) |
Ava ilabl e via EH R; use d in ML inpu t feat ures |
MA P mon itor ed; pred ictiv e rele van ce for AKI and mor talit y |
Not repo rted as stan dalo ne met ric |
MA P use d by ML mod el for pred ictio n |
30- day post oper ativ e mor talit y asse ssed ; 2.2 % inci den ce |
ML - assi sted vs ML - una ssist ed clini cian pred ictio n |
Dea th: AU RO C 0.80 7; AKI : AU RO C 0.76 6 |
Dea th (M L mod el): 0.80 7; AKI (M L mod el): 0.76 6 |
Not appl icab le |
Mix ed elec tive surg erie s acro ss 10+ spec ialti es |
11.1 % AKI inci den ce; ML pred ictio n com pare d to clini cian acc urac |
Inco rpor ated into ML risk mod els |
Stan dard GA prot ocol s, man age d rem otel y |
Not deta iled; capt ured in EH R for ML inpu t |
ML pred ictio ns visi ble or hidd en to clini cian s |
Ass esse d via EH R; 98 deat hs amo ng 507 1 pati ents |
AKI in 450 pati ents ; com plic atio ns infl uen ced by ML pred ictio |
Not repo rted expl icitl y; dept h infe rred fro m EH R inpu ts |
Not analyz ed |
|
|
|
|
|
|
|
|
|
|
|
|
y |
|
|
|
|
|
n |
|
|
|
Fra ssa nito et al., 202 3 |
Con tinu ousl y mon itor ed; data not indi vidu ally repo rted |
MA P targ et: >65 mm Hg; inte rven tion thre shol d was HPI ≥85 |
Mo nito red indi rectl y via dP/ dtm ax and SV V, not repo rted as stan dalo ne |
Pri mar y mea sure for IOH and seve re hyp oten sion (cut offs : 65 and 50 mm Hg) |
1 deat h with in 30 day s in Con trol gro up |
HPI + mod ifie d GD T prot ocol vs stan dard GD T prot ocol |
82 % redu ctio n in IOH eve nts (97 vs 313 ); shor ter time to treat men t inte rven tion |
Not repo rted num eric ally |
Not asse ssed |
Maj or gyn aec olog ic onc olog ic surg ery (lap aros copi c, lapa roto my, com bine d) |
Not dire ctly asse ssed ; IOH link ed to orga n inju ry in cite d liter atur e |
Slig htly high er inci den ce of MA P > 110 mm Hg in HPI gro up |
Gen eral anes thes ia with mec hani cal vent ilati on |
Pro pof ol, sufe ntan il, and sev oflu rane (BI S- targ eted ); mor e dob uta min e in HPI gro up |
Rea l- time HPI alert s (≥8 5) trig gere d proa ctiv e deci sion - mak ing |
1 pati ent (Co ntro l gro up) |
Pleu ral effu sion (HP I 3%, Con trol 20 %), arrh yth mia, card iac isch emi a |
BIS mon itor ed; targ et rang e 40– 50 |
Manag ed via intrath ecal morph ine or epidur al cathete r |
|
Ka ng et al. (20 20) |
Incl ude d in feat ure set |
SBP , MB P use d as key pred icto rs |
Not dire ctly repo rted |
MB P <65 mm Hg use d to defi ne hyp oten |
Not pri mar y end poin t |
Naï ve Bay es, LR, RF, AN N |
RF: AU RO C 0.84 2; 83.7 % reca ll |
RF: 0.84 2 (CI: 0.73 6– 0.94 8) |
Not appl icab le |
Lap aros copi c chol ecys tect omy |
Not asse ssed |
Hyp erte nsio n incl ude d in com orbi dity feat |
GA with TIV A and intu bati on |
Pro pof ol/r emi fent anil via TCI pum p |
ML mod el trai ned on pre- intu bati on data |
Not asse ssed |
Not repo rted |
BIS use d; not anal yze d |
Not reporte d |
|
|
|
|
|
sion |
|
|
|
|
|
|
|
ures |
|
|
|
|
|
|
|
|
Wij nbe rge et al. (20 20) |
Mo nito red via arte rial wav efor m |
MA P <65 mm Hg thre shol d for inte rven tion |
Flo Tra cIQ wav efor m anal ysis |
Pri mar y outc ome : MA P <65 mm Hg dura tion |
Not stati stic ally pow ered for mor talit y |
HPI vs con vent iona l care |
Sig nifi cant redu ctio n in MA P <65 mm Hg dura tion |
Not repo rted ; prio r stud ies sug gest >0. 9 |
Not appl icab le |
Hig h- risk non card iac surg ery |
Not asse ssed |
MA P cont rol strat egy use d to prev ent extr eme s |
GA with inva sive mon itori ng |
Not deta iled |
Rea l- time HPI alert s trig gere d earl y inte rven tion |
Not stati stic ally pow ered |
Not repo rted |
Not spec ifie d |
Not reporte d |
|
Bai g et al. (20 13) |
Use d as a fuzz y inpu t for hyp ovol aem ia diag nosi s |
BP wav efor m anal ysis with fuzz y thre shol ds |
PV fro m plet hys mog raph use d in fuzz y rule engi ne |
Cen tral to alar m gen erati on logi c |
Co mpa red clini cal imp act usin g Kap pa agre eme nt |
RT- SA AM , FL MS- 2 (Fu zzy logi c exp ert syst ems ) |
Kap pa agre eme nt: FL MS- 2 = 0.75 ; RT- SA AM = 0.62 |
Not use d; Kap pa stati stic for diag nost ic agre eme nt |
Not appl icab le |
Mo dera te- to- high bloo d loss proc edur es |
Not dire ctly eval uate d |
Rul e- base d flag ging of elev ated BP |
Not deta iled |
Not spec ifie d |
Mul ti- laye r fuzz y alert s: pro babi listi c, SP V, fuzz y rule |
Mo nito red indi rectl y via user feed bac k |
Dia gno stic acc urac y vali date d with clini cian agre eme nt |
Not part of fuzz y syst em logi c |
Pain control studie d in other fuzzy system s |
|
Rip ollé s- Mel cho r et al. (20 25) |
Mo nito red; not anal yze d sepa ratel y |
MA P <65 mm Hg thre shol d; HPI alert at > 80 |
dP/ dtm ax and SV V use d in HPI algo rith m |
Pri mar y outc ome : AKI link ed to MA P <65 mm Hg |
30- day mor talit y: 1.1 % (HP I) vs 0.9 % (con trol) |
HPI - guid ed vs real - wor ld stan dard care |
No sign ifica nt AKI redu ctio n; RR = 0.89 ; P = 0.66 |
Not repo rted ; pri mar y outc ome anal yze d via RR |
Not appl icab le |
Mo dera te- to- high risk elec tive abd omi nal surg ery |
Pri mar y end poin t; no sign ifica nt redu ctio n |
Sub gro up anal ysis sho wed tren d in non - hyp erte nsiv es |
GA with BIS ; vent ilati on stan dard ized |
Bal anc ed crys tallo id; eph edri ne, nore pine phri ne use d |
HPI alert at > 80 trig gere d algo rith mic inte rven tion |
1.1 % (HP I) vs 0.9 % (con trol) ; no sign ifica nt diff eren ce |
31.9 % (HP I) vs 29.7 % (con trol) ; no sign ifica nt diff eren ce |
BIS use d in both gro ups |
Not analyz ed |
|
Hu et al. (20 24) |
Use d as inpu t to CN N and LST M |
SBP & DB P use d for dos e pred ictio n |
Extr acte d via CN N feat ure laye rs |
Tre nd mod eled via LST M |
Not eval uate d |
AN N + CN N + LST M with atte ntio n |
R² = 0.91 5 (intr a- op); MA PE = 12.2 5% |
Not appl icab le; regr essi on focu s |
Not appl icab le |
Vari ous; not strat ifie d |
Not asse ssed |
Not strat ifie d |
GA assu med ; not deta iled |
Eto mid ate, atra curi um, sufe ntan il pred icte d |
AI dosi ng mod el; atte ntio n high ligh ts key feat ures |
Not eval uate d |
Not repo rted |
BIS inpu t coll ecte d; not outp ut vari able |
Not analyz ed |
|
Abi n et al. (20 |
Inp ut to dual |
Cr use d as inpu |
Not dire ctly mod |
Use d to strat ify |
5– 10 % AKI |
Dua l regr essi |
Acc urac y = 80.6 |
Not repo rted ; |
Not appl icab le |
Car diac surg ery |
Pri mar y end |
HT N incl ude |
GA assu med ; |
CP B time , |
AKI + and AKI |
5– 10 % AKI |
AKI seve rity trac |
Not mod eled |
Not analyz ed |
|
24) |
regr essi on mod el |
t; MA P indi rectl y mod eled |
eled |
AKI risk |
- relat ed mor talit y cite d |
on mod els (AK I+ / AKI –) |
%; F1 = 0.82 1; reca ll = 84.8 % |
regr essi on- base d |
|
(CA BG, valv e, tran spla nt) |
poin t; AKI + vs AKI – mod elin g |
d in PM H feat ures |
not deta iled |
flui ds, diur etic s, tran sfus ions pred icte d |
– plan ners + visu al das hbo ard |
- relat ed mor talit y cite d |
ked via Cr leve ls (KD IGO ) |
|
|
|
Hay ase et al. (20 20) |
Cap ture d; use d as cont ext for EE G patt erns |
Rel ated to BIS acc urac y via EM G cont ami nati on |
Not dire ctly use d; EE G- deri ved via 20– 30H z targ eted freq uen cy |
Indi rectl y relat ed to EM G/B IS fide lity at low and high dept h |
Not asse ssed |
Poi ncar é- inde x fusi on via ML PN N (dee p lear ning appr oac h) |
R = 0.87 , RM SE = 7.09 (Pre dBI S vs mBI S corr elati on) |
Not repo rted ; line ar regr essi on use d for vali dati on |
Not appl icab le |
Mix ed (30 pati ents , incl udin g spin al and GA case s) |
Not anal yze d |
EM G artif acts >8 0 BIS sign al pro ble mati c in ligh t anes thes ia |
GA with BIS ; som e spin al with pro pof ol |
Sev oflu rane , rem ifen tanil , pro pof ol; mus cle rela xant use strat ifie d |
Poi ncar é- inde x– driv en ML PN N alert s; laye red EE G anal ysis |
Not anal yze d |
Not asse ssed |
BIS + EE G scat ter via Poi ncar é indi ces |
Not evalua ted |
|
Šri bar et al. (20 |
Mo nito red; high er |
MA P <65 mm Hg |
Deri ved via wav efor |
Pri mar y end poin |
1 deat h (Ac ume |
HPI vs Flo Tra c |
TW A- AU T: 0.01 |
Not repo rted ; TW |
Not appl icab le |
Maj or thor acic (lun |
1 case (Flo trac gro |
MA P ≥65 mm Hg |
GA with one- lung vent |
Pro pof ol, sufe ntan |
HPI alert ≥90 + deci |
1 deat h (Ac ume |
No MI, CVI , or AKI |
BIS use d; MA C |
Epidur al or IV sufent anil |
|
23) |
base line in HPI gro up |
thre shol d |
m anal ysis |
t |
nIQ gro up) |
GD T |
vs 0.08 mm Hg |
A use d |
|
g/es oph agu s) |
up) |
targ et |
ilati on |
il, sev oflu rane |
sion tree |
nIQ gro up) |
in HPI gro up |
0.8– 1 |
|
|
Mo riss on et al. (20 22) |
Incl ude d in NO L algo rith m |
MA P indi rectl y mod eled via NO L |
Part of NO L algo rith m |
NO L- base d thre shol ds (10 – 25) |
Not asse ssed |
ML algo rith ms (ela stic net, RF, SV M …) |
CV- AU C 0.75 3 (0.7 18– 0.78 8); Brie r scor e 0.19 4 |
CV- AU C 0.75 3 |
Not appl icab le |
Gyn ecol ogic lapa rosc opy |
Not asse ssed |
Not strat ifie d |
GA with intu bati on |
Fent anyl guid ed by NO L |
NO L- guid ed fent anyl vs MA P/H R- base d care |
Not asse ssed |
63.6 % had mod erat e to seve re PA CU pain |
BIS use d |
|
|
Vel aga pud i et al. (20 22) |
Incl ude d in ML inpu t feat ures and NO L reac tion anal |
Imp licit in case para met ers; MA P not mod eled dire ctly |
Not repo rted ; NO L- base d surr ogat e feat ures likel |
Use d indi rectl y via preo pera tive AS A stat us and |
Not asse ssed |
XG Boo st for gluc ose pred ictio n; Ran dom For est for |
Glu cose : Acc urac y ↑ fro m 79 % to 84.7 %; Opi |
Opi oid mod el cros s- vali date d AU RO C: 0.75 3 |
Not appl icab le |
Mix ed amb ulat ory and inpa tient proc edur es |
Not eval uate d |
Imp lied in diab etes /glu cose cont rol mod els |
Intu bate d GA; not strat ifie d |
Fent anyl dosi ng mod ifie d base d on NO L and ML aid |
ML pred ictio ns pres ente d via RE DC ap inte rfac e; |
Not asse ssed |
Foc use d on pain and gluc ose mis esti mati on; not trac ked |
BIS use d; not mod eled |
Primar y outco me: ML- aided predict ion of opioid require ment |
|
|
ysis |
|
y con side red |
surg ery type |
|
opio id nee d |
oids : Acc urac y ↑ fro m 18 % to 42 % |
|
|
|
|
|
|
|
use d as refe renc e |
|
for mall y |
|
|
|
Luc ksc heit er et al., 202 2 |
Incl ude d as a key inpu t feat ure; rank ed thir d in imp orta nce in RF mod el |
Syst olic BP use d; top- rank ed feat ure in RF mod el |
Not dire ctly mod eled ; infe rred via sho ck inde x and ausc ultat ion |
Not expl icitl y mod eled ; indi rectl y refl ecte d via syst olic BP and sho ck inde x |
Not asse ssed |
Ran dom For est (RF ) vs Nai ve Bay es (NB ) |
RF: AU RO C 0.96 ; PR C area 0.83 ; NB: AU RO C 0.93 ; PR C area 0.66 |
RF: 0.96 (95 % CI: 0.96 – 0.97 ); NB: 0.93 (95 % CI: 0.92 – 0.93 ) |
Not appl icab le |
Not surg ical; preh ospi tal trau ma regi stry (E MS- base d) |
Not eval uate d |
Indi rectl y mod eled via syst olic BP and sho ck inde x |
Pri mar y end poi nt; 5.7 % of pati ents requ ired prec linic al airw ay man age men t |
Not mod eled ; airw ay man age men t infe rred via inte rven tion s and med icati ons |
ML mod el trai ned on EM S regi stry data ; no real - time alert s depl oye d |
Not asse ssed |
Not asse ssed ; preh ospi tal setti ng only |
Not appl icab le |
Not assess ed |
|
Tan et al., 202 1 |
Pre oper ativ e HR mea sure d; slig htly high er in EP OH gro up (not stati stic ally sign ifica nt) |
SBP >1 60 mm Hg defi nes EP OH; intr aop erati ve and post oper ativ e pea k SBP /DB P sign ifica ntly high er in EP OH gro up |
Not dire ctly mod eled ; MA P vari abili ty and vas oact ive resp ons e indi rectl y invo lved |
Not calc ulat ed dire ctly; SBP /DB P tren ds use d as surr ogat es for hyp erte nsiv e patt erns |
No deat hs occ urre d duri ng hos pital stay in eith er gro up |
Gra dien t Boo sted Reg ress ion Tre es (GB RT) via XG Boo st |
Mea n AU RO C: 0.77 (95 % CI: 0.62 – 0.92 ); Sen sitiv ity ~90 %; Spe cific ity 0.52 |
0.77 (cro ss- vali dati on aver age) |
Not appl icab le |
Car otid end arte rect omy (CE A) und er gen eral anes thes ia |
Not asse ssed |
EP OH risk spec ifica lly mod eled ; intr aop erati ve BP spik es high ly pred ictiv e |
GA with stan dard intu bati on; not a vari able in pred ictio n |
Sev oflu rane , pro pof ol, fent anyl , phe nyle phri ne, eph edri ne trac ked; thei r dos ages use d as inpu t feat ures |
ML mod el vali date d retr osp ecti vely ; no live alert inte grat ion |
Not asse ssed |
Stro ke: 5 case s; CH S: 4 case s (all in EP OH gro up); Cer ebra l hem orrh age: 2 case s (bot h EP OH) |
Not mod eled dire ctly; BIS not repo rted |
Not assess ed |
Discussion
This review emphasises how ML-based perioperative prediction models perform better than conventional techniques in terms of clinical utility and accuracy. Logistic regression is clearly outperformed by ensemble models such as XGBoost (AUROC up to 0.942) and RF (AUROC 0.96). By utilising time-series data, deep learning architectures provide additional incremental benefits.
Since several RCTs showed significant decreases in IOH burden, confirming the idea of predictive haemodynamic monitoring, the Hypotension Prediction Index (HPI) is noteworthy for its real- time clinical applicability. The idea that IOH is a modifiable intraoperative risk factor and that proactive ML-guided interventions can lessen its severity and duration is supported by this data.
ML-Driven vs. Biomarker-Based Models Biomarker- based methods (HPI)
Using arterial waveform analysis, provide timely, actionable alerts proven to lower IOH in clinical settings [19]. Reliance on proprietary algorithms and a comparatively limited prediction scope (hypotension only) are among thelimitations.
Models Powered by Machine Learning
Predict broader outcomes (mortality, AKI, composite complications) by integrating multi-modal data sources. Show improved discriminative abilities.
Model complexity, interpretability, and the requirement for sizable, superior datasets are among the difficulties.
Clinical Consequences
The findings of this synthesis point to a potential paradigm shift in perioperative care: predictive analytics, like capnography or pulse oximetry, may soon be used as a routine supplement to anaesthesia monitoring.
Proactive management techniques like prompt vasopressor titration, fluid optimisation, and renal protection protocols may be made possible by machine learning (ML) models integrated into anaesthesia information management systems (AIMS).
The Evidence's Limitations
Heterogeneity in outcome definitions, inconsistent performance metric reporting, and a small number of highly effective RCTs focussing on hard endpoints (mortality, AKI, myocardial injury) limit the body of evidence. Few studies evaluated cost-effectiveness or implementation feasibility in standard clinical practice, and proprietary algorithms like HPI further restrict reproducibility.
Prospects for the Future
Multicenter, sufficiently powered RCTs assessing clinically significant endpoints should be the top priority for future research. Explainable AI (XAI) to increase clinician trust and transparency combining machine learning models with biomarker-based indices to provide hybrid real-time decision support.
Health economics studies to assess implementation logistics and cost-effectiveness.
To guarantee generalisability, external validation is conducted across a variety of populations.
Conclusion
Perioperative risk management is about to be redefined by machine learning and biomarker-based monitoring. These models reliably forecast and reduce intraoperative complications more successfully than conventional techniques by utilising real-time waveform data, EHR variables, and sophisticated analytics. Large-scale retrospective studies validate the scalability of ML- based approaches, while HPI-guided trials demonstrate the clinical utility and viability of predictive monitoring.
However, strong proof of better patient outcomes—rather than merely surrogate metrics—will be necessary for wider adoption. In order to guarantee that predictive alerts result in significant clinical actions that lower morbidity and mortality, future research should focus on integrating ML algorithms into perioperative care pathways.
Acknowledgements
Declaration of Al-Assisted Technologies in the Writing Process: During the preparation of this work the authors used Chat-GPT version 3.5 to check grammar and spelling. After using this tool, the authors reviewed and edited the content and take full responsibility for the content of the publication.
Declaration of Interest
None of the authors has any conflicts of interest to disclose.
Funding
This work was conducted without any financial support from funding agencies in the public, commercial, or not-for-profit sectors.
References
- Kang, A. R., Lee, J., Jung, W., Lee, M., Park, S. Y., et al. (2020). Development of a prediction model for hypotension after induction of anesthesia using machine learning. PLOS ONE, 15(4), e0231172.
- Fritz, B. A., Cui, Z., Zhang, M., He, Y., Chen, Y., et al. (2019). Deep-learning model for predicting 30-day postoperative mortality. British Journal of Anaesthesia, 123(5), 688–695.
- Fritz, B. A., King, C. R., Abdelhack, M., Chen, Y., Kronzer, A., et al. (2024). Effect of machine learning models on clinician prediction of postoperative complications: The Perioperative ORACLE randomised clinical trial. British Journal of Anaesthesia, 133(5), 1042–1050.
- Hayase, K., Hayashi, K., & Sawa, T. (2020). Hierarchical Poincaré analysis for anaesthesia monitoring. Journal of Clinical Monitoring and Computing, 34(6), 1321–1330.
- Wijnberge, M., Geerts, B. F., Hol, L., Lemmers, N., Mulder,M. P., et al. (2020). Effect of a machine learning–derived early warning system for intraoperative hypotension vs standard care on depth and duration of intraoperative hypotension during elective noncardiac surgery: The HYPE randomized clinical trial. JAMA, 323(11), 1052–1060.
- Ripollés-Melchor, J., Tomé-Roca, J. L., Zorrilla-Vaca, A., Aldecoa, C., Colomina, M. J., et al. (2025). Hemodynamic management guided by the Hypotension Prediction Index in abdominal surgery: A multicenter randomized clinical trial. Anesthesiology, 142(4), 639–654.
- Schenk, J., Wijnberge, M., Maaskant, J. M., Hollmann, M. W., Hol, L., et al. (2021). Effect of Hypotension Prediction Index-guided intraoperative haemodynamic care on depth and duration of postoperative hypotension: A sub-study of the Hypotension Prediction trial. British Journal of Anaesthesia, 127(5), 681–688.
- Frassanito, L., Giuri, P. P., Vassalli, F., Piersanti, A., Garcia,M. I. M., et al. (2023). Hypotension Prediction Index guided goal-directed therapy and the amount of hypotension during major gynaecologic oncologic surgery: A randomized controlled clinical trial. Journal of Clinical Monitoring and Computing, 37(4), 1081–1093.
- Šribar, A., SokoloviÄ? Jurinjak, I., Almahariq, H., BandiÄ?, I., MatoševiÄ?, J., et al. (2023). Hypotension prediction index guided versus conventional goal-directed therapy to reduce intraoperative hypotension during thoracic surgery: A randomized trial. BMC Anesthesiology, 23, 101.
- Mansoor Baig, M., Gholamhosseini, H., & Harrison, M. J. (2013). Fuzzy logic based anaesthesia monitoring systems for the detection of absolute hypovolaemia. Computer Methods and Programs in Biomedicine, 43(6), 683–692.
- Lee, S. W., Lee, H. C., Suh, J., Lee, K. H., Lee, H., et al.(2022). Multi-center validation of machine learning model for preoperative prediction of postoperative mortality. NPJ Digital Medicine, 5(1), 91.
- Te, R., Zhu, B., Ma, H., Zhang, X., Chen, S., et al. (2024).Machine learning approach for predicting post-intubation hemodynamic instability (PIHI) index values: Towards enhanced perioperative anesthesia quality and safety. BMC Anesthesiology, 24(1), 136.
- Luckscheiter, A., Zink, W., Lohs, T., Eisenberger, J., Thiel, M., et al. (2022). Machine learning for the prediction of preclinical airway management in injured patients: A registry- based trial. Clinical and Experimental Emergency Medicine, 9(4), 304–313.
- Hu, Z., Pan, G., Wang, X., & Li, K. (2024). Intelligent algorithm based on deep learning to predict the dosage for anesthesia: A study on prediction of drug efficacy based on deep learning. Health Science Reports, 7, e2113.
- Abin, A. A., Molla, A., Ejmalian, A., Nabavi, S., Memari, B., et al. (2024). Anesthetic management recommendations using a machine learning algorithm to reduce the risk of acute kidney injury after cardiac surgeries. Anesthesiology and Pain Medicine, 14(3), e143853.
- Tan, J., Wang, Q., Shi, W., Liang, K., Yu, B., et al. (2021). A machine learning approach for predicting early phase postoperative hypertension in patients undergoing carotid endarterectomy. Annals of Vascular Surgery, 71, 121–131.
- Morisson, L., Nadeau-Vallee, D. M., Espitalier, F., Laferrière- Langlois, P., Idrissi, M., et al. (2023). Prediction of acute postoperative pain based on intraoperative nociception level (NOL) index values: The impact of machine learning-based analysis. Journal of Clinical Monitoring and Computing, 37, 337–344.
- Velagapudi, M., Nair, A. A., Strodtbeck, W., Flynn, D. N., Howell, K., et al. (2023). Evaluation of machine learning models as decision aids for anesthesiologists. Journal of Clinical Monitoring and Computing, 37, 155–163.
- U.S. Food and Drug Administration. (2021). Good machine learning practice for medical device development: Guiding principles.

