inner-banner-bg

Journal of Future Medicine and Healthcare Innovation(JFMHI)

ISSN: 3065-7628 | DOI: 10.33140/JFMHI

Research Article - (2026) Volume 4, Issue 1

Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review

Antonio Andrea Camastra MD *, Matheus Requena Escobar MD , Daniel Macedo Oliveira MD , Laiz G. C. Novaes MD , Andre Busatto de Donato MD , Lucas Teixeira Baldo MD , Joao Evangelista Ponte Conrado MD , Cecilia Schettini Gueiros MS , Raphael Matheus de Souza Makiyama Lopes MD and Thomas Rolf Erdmann MD, MsC, PhD
 
1Anesthesia and Intensive Care Department, Università Magna Grecia di Catanzaro, ORCID: 0009-0003-6025-7772, Italy
2Centro Universitário Lusíada, Brazil
3Center of Medical Sciences, Universidade Federal da Paraíba, Paraíba, Brazil https://orcid.org/0009- 0000-4207-3994, Italy
4Department of Medicine, Estácio de Sá University, Rio de Janeiro, Brazil
5Departamento de Anestesiologia, Universidade de São Paulo, São Paulo, Brazil
6Instituto de Cirurgia do Lago, Brasilia, https://orcid.org/0009-0008-6045-9768, Brazil
7Faculdade de Medicina da Universidade Federal do Ceara, Brazil
8Faculdade Pernambucana de Saude, Brazil
9SANIT (servico de anestesiologia de itajai), Brazil
10Departamento de Cirurgia, Universidade Federal de Santa Catarina, Santa Catarina, Brazil https:// orcid.org/0000-0003-4741-0245, Brazil
 
*Corresponding Author: Antonio Andrea Camastra MD, Anesthesia and Intensive Care Department, Università Magna Grecia di Catanzaro, Italy

Received Date: Jan 22, 2026 / Accepted Date: Feb 12, 2026 / Published Date: Feb 17, 2026

Copyright: ©2026 Antonio Andrea Camastra. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Citation: Camastra, A. A., Escobar, M. R., Oliveira, D. M., Novaes, L. G. C., Donato, A. B. D., et al. (2026). Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review. J Future Med Healthcare Innovation, 4(1), 01-20.

Abstract

Background: Artificial intelligence (AI) is transforming medicine by enabling real-time data analysis and improved decision-making. In anaesthesiology, AI tools are increasingly used for perioperative risk assessment and intraoperative monitoring, but evidence on their real-world performance and safety remains limited.

Methods: We conducted a systematic review and meta-analysis following PRISMA guidelines, including studies from 2010 to May 2025 that evaluated AI applications—machine learning (ML), deep learning, neural networks, and fuzzy logic— in adult patients undergoing general or regional anaesthesia. Primary outcomes were perioperative complications (e.g., hypotension, hypoxia, bradycardia, delirium, vomiting, cardiac arrest, mortality, acute kidney injury [AKI]); secondary outcomes included haemodynamic stability, ICU admission, and length of stay. Risk of bias was assessed using RoB 2 and ROBINS-I, and random-effects models were applied.

Results: Eighteen studies with diverse surgical settings and sample sizes (60 to >450,000 patients) were included. ML models consistently outperformed conventional statistical methods. Ensemble algorithms, such as XGBoost and random forests, achieved AUROC values of 0.942 and 0.96, respectively. Deep learning models, including Max-Pooling Convolutional Neural Networks, predicted mortality with AUROC 0.867. Hypotension Prediction Index (HPI) trials showed 88% sensitivity, 87% specificity, and a 77% reduction in hypotension burden. Hybrid models integrating waveform and electronic health record data reported AUROCs of 0.807 for mortality and 0.766 for AKI.

Conclusions: AI-based monitoring, especially ML and biomarker-guided strategies, offers substantial improvements in perioperative risk stratification and haemodynamic management. Wider clinical adoption requires external validation, explainable AI frameworks, and rigorously designed randomized controlled trials demonstrating meaningful patient outcome benefits.

Keywords

Machine Learning, Hypotension Prediction Index Intraoperative Hypotension, Acute Kidney Injury, Perioperative Prediction, Ensemble Models and Deep Learning

PROSPERO registration n: CRD420251060430.

Key Message

1. AI-based perioperative monitoring systems, encompassing machine learning and biomarker-guided methodologies, exhibit enhanced diagnostic precision relative to conventional statistical techniques in various surgical environments.

2. Randomised controlled evidence demonstrates the clinical efficacy of AI-assisted haemodynamic management, resulting in substantial decreases in the occurrence, severity, and duration of intraoperative hypotension.

3. The transition of AI from predictive accuracy to enhanced patient-centered outcomes is dependent on extensive, multicenter randomised trials, external validation, and transparent AI frameworks.

Introduction

Artificial intelligence (AI) has been instrumental in changing the global healthcare scene over the last 20 years, paving the way for new developments in clinical care, diagnosis, individualised treatment, and healthcare data management. Because anaesthesiology requires the real-time interpretation and processing of vast volumes of physiological and pharmacological data, it is a particularly promising field for the adoption of AI-based technologies. Therefore, perioperative safety and the standard of anaesthesia care could be significantly improved by integrating AI-assisted predictive and decision support tools.

The precise administration of fast-acting medications, the timely adaptation to significant physiological changes, and the continuous, high-precision monitoring of multiple vital parameters are all necessary for modern anaesthesiology. However, many clinical decisions are still based on doctors' subjective experiences and established practices rather than standardised, predictive approaches, even with major advancements in medical technology. In this regard, moving from reactive to proactive decision-making models that can foresee crucial events before they materialise clinically is made possible by artificial intelligence and its subfields, such as machine learning (ML), deep learning (DL), and artificial neural networks [1-4].

Predicting intraoperative and postoperative complications, including bradycardia, hypotension, hypoxia, postoperative delirium, postoperative nausea and vomiting (PONV), intensive care admission, and mortality, is a primary objective of artificial intelligence in anaesthesiology. Many of these incidents happen unexpectedly, necessitating quick action to avoid negative outcomes. AI systems analyse preoperative clinical and anamnestic data, as well as real-time multi-parameter signals from automated anaesthetic delivery systems, electroencephalograms (EEGs), and physiological monitors. Finding patterns that are hidden from view, estimating the likelihood of future occurrences, and proposing prompt intervention techniques are the objectives.

Numerous observational and randomised clinical studies have assessed the efficacy of these technologies at different points during the perioperative period during the past ten years. For instance, the Hypotension Prediction Index (HPI) software was created with the help of artificial intelligence (AI) and has been thoroughly investigated for its capacity to anticipate intraoperative hypotension episodes up to several minutes beforehand [5-9]. This enables medical professionals to use fluidic interventions or pharmacological modulation to stop the event from happening. The automated administration of intravenous anaesthetics, including propofol and remifentanil, has also been optimised through the use of AI algorithms. This has improved the depth of anaesthesia and haemodynamic stability while allowing for more customisation than conventional target-controlled infusion (TCI) systems.

More effective control of arterial pressure, depth of anaesthesia, fluid administration, and mechanical ventilation has been made possible by intelligent controllers built on fuzzy logic or neuro- adaptive predictive models [10]. By minimising inter-individual variability in anaesthetic response and maximising the balance between efficacy and safety, these tools have been demonstrated to increase the precision and consistency of therapeutic strategies. Additionally, AI-powered clinical information management tools, like predictive electronic anaesthesia records, have enhanced perioperative data collection and analysis, opening up new avenues for quality improvement and clinical audit.

The current body of scientific literature is disjointed and frequently contradictory, despite the excitement these innovations have generated. Few studies have shown a significant impact on critical clinical outcomes like length of hospital stay, major complication rates, or mortality, despite the fact that many have reported encouraging results in terms of predictive accuracy and improved intraoperative parameters. Moreover, it is challenging to reach firm conclusions regarding the applicability of these technologies in routine clinical practice due to variations in the AI models used, the clinical contexts examined, and the methodological calibre of publications.

In light of this, the goal of this systematic review and meta- analysis is to present a comprehensive summary of the data that is currently available regarding the application of AI in perioperative anaesthesia. Through a comparative analysis of data from observational and randomised clinical trials, the review will assess the safety and effectiveness of AI-based tools for intraoperative monitoring and complication prediction. The results will be stratified by the technology (deep learning (DL), fuzzy logic, neural networks, or machine learning (ML), the type of complication targeted, the operative setting (major versus minor surgery; elective versus urgent), and the methodological quality of the included studies.

The PICOT framework, which is advised for creating structured clinical questions, was used to define the inclusion criteria that were chosen for this review. Adult patients undergoing regional or general anaesthesia make up the target population. The use of artificial intelligence (AI)-based technologies for intraoperative monitoring or perioperative adverse event prediction is the intervention (I). Standard, non-AI-assisted monitoring techniques are used in comparison (C). Predictive accuracy in relation to particular complications (such as hypotension, hypoxia, post- operative vomiting syndrome, delirium, and death) is the primary outcome (O), and clinical variables like length of hospital stay, ICU admission rate, and intraoperative stability are the secondary outcomes. Since 2010 is when AI-based technologies were first introduced and used in anaesthesiology, both prospective observational studies and randomised controlled trials (RCTs) and relevant retrospective studies published since then were included.

This meta-analysis differs from earlier synthesis attempts in that it focusses on studies published after 2010 and has stricter inclusion criteria. Methodological quality, which is evaluated with particular instruments like ROBINS-I for observational studies and RoB 2 for randomised controlled trials (RCTs), is emphasised. Additionally, subgroup analyses will be carried out based on the kind of algorithm employed, the perioperative phase of application (pre-, intra-, or postoperative), and the kind of complication that is being targeted. There are also moral, legal, and financial concerns with the broad use of AI in anaesthesia. Thorough consideration must be given to issues like algorithm transparency, medico-legal liability for mistakes, sensitive data protection, and guaranteeing fair access to these advancements. Training medical staff to enable anaesthetists to consciously incorporate the new technologies into their daily clinical practice is another essential component. Therefore, educational institutions and healthcare facilities ought to support professional development courses that address fundamental AI concepts, data analysis, and human-machine interaction.

Establishing a strong body of evidence to direct the clinical application of AI tools in anaesthesiology and to guide future research aimed at creating dependable, secure, and genuinely practical predictive models for daily use is the ultimate goal of this work. Technical innovation, strong proof of clinical efficacy, safety, and acceptability, as well as long-term economic viability, are necessary for the adoption of such technologies.

Methods

To maintain openness, reproducibility, and methodological rigour, this systematic review with meta-analysis closely adhered to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Because the protocol was pre- registered with the PROSPERO (International Prospective Register of Systematic Reviews) database, the authors were bound to adhere to a predetermined plan, reducing the possibility of selective bias and boosting the accuracy of the findings.

Type of Studies Included

Randomised controlled trials (RCTs), retrospective studies and prospective observational studies on the application of AI-based technologies in anaesthesiology that were carried out in adult human populations were included in the review. The recent adoption of AI technologies in clinical practice, which has reduced the quantity of RCTs available, served as the impetus for the decision to incorporate observational studies. This does not, however, rule out the possibility of gathering pertinent data from carefully planned non-randomized research.

Inclusion Criteria

Every study that satisfied the following requirements was deemed qualified:

- To guarantee the inclusion of current research in line with recent advancements in AI technologies, the publication date should be between January 1, 2010, and December 31, 2024.

- Research design: observational studies with a control group and prospective RCTs.

- Adult subjects (≥18 years old) undergoing general or loco- regional anaesthesia in a surgical or procedural setting comprise the population.

- Intervention: the application of models, algorithms, or predictive tools based on artificial intelligence, such as expert systems, machine learning, deep learning, artificial neural networks, or fuzzy logic algorithms.

- The ability to predict or prevent perioperative complications, such as intraoperative hypotension, hypoxia, adverse cardiovascular events, bradycardia, postoperative delirium, PONV, cardiac arrest, and AKI, is the main outcome.

- Long hospital stays, the need to be admitted to an intensive care unit, haemodynamic stability, and the effectiveness of intraoperative physiological monitoring are examples of secondary outcomes.

Exclusion Criteria

Research that satisfied these requirements was not included: - research done on children (less than 18 years old) or in animal models.

- Research that lacked a clear and documented AI-based intervention.

- Simulation studies, case reports, editorials, letters to the editor, and conference abstracts without peer-reviewed full texts are all examples.

- To guarantee linguistic homogeneity in data extraction and interpretation, studies published in languages other than English were disqualified.

Inclusion criteria

Exclusion criteria

Adult subjects (≥18 years old)

Research done on children (less than 18 years old) or in animal models

observational or RCTs

Research that lacked a clear and documented AI-based intervention

publication date between January 1, 2010- present.

not observational or RCTs

All AI technologies or system

studies published in languages other than English

Table. Inclusion and Exclusion Criteria

Sources of Information and Search Methodology

Three primary biomedical databases—PubMed/MEDLINE, Embase, and the Cochrane Central Register of Controlled Trials (CENTRAL)—were thoroughly searched. The search approach was predetermined and comprised MeSH (Medical Subject Headings) terms and keywords related to the fields of artificial intelligence, perioperative complications, and anaesthesia. Terms like "artificial intelligence," "machine learning," "deep learning," "anaesthesia," "perioperative complications," "monitoring," "prediction," and others were examples of Boolean combinations. Only human subjects’ studies published between 2010 and May 2025 were included in the results.

Search Strategy

Study Selection Process

Two reviewers (A.A.C., M. R. E.) independently screened all identified abstracts and titles. Using the predetermined criteria, the full texts of potentially eligible articles were retrieved and evaluated for inclusion. A third reviewer (T. R. E.) was consulted or discussed in order to settle disagreements. (Table 1.) (Table 1.)

Stage

Number of Records

Notes

Records identified

2772

Using the defined search strategy (see below)

Records after duplicates removed

2586

Duplicate records removed

Records screened (title and abstract)

2586

Screening based on predefined PICOTT criteria

Records excluded

2552

Excluded for:

  • Not RCTs or observational
  • Not related to anaesthesiology
  • Not adult patients

Full-text articles assessed for retrieval

34

Full-text retrieval attempted

Full-text articles not retrieved

0

Full texts not accessible

Full-text articles assessed for eligibility

34

Evaluated according to inclusion/exclusion criteria

Full-text articles excluded

16

Common reasons for exclusion:

  • Not RCT
  • AI not directly used
  • Irrelevant outcomes

Studies included in the systematic review

18

All were randomized controlled trials on AI in

perioperative anaesthesia

Studies included in the meta-analysis

18

All provided extractable quantitative outcome data

Data extraction

A standardised collection form that was tested in a pilot phase was used to extract pertinent data from each includedstudy.

This form was used to extract pertinent data from each included study. The variables that were gathered included the author's name, the year of publication, the study design, the sample that was analysed, the population's demographics, the type and characteristics of the AI algorithm that was used, the clinical setting, the type of surgery or anaesthesia that was performed, the primary and secondary outcomes, the main results, and the authors' main conclusions. Two authors independently collected the data in duplicate.

Risk of Bias Assessment

Validated instruments were used to evaluate the included studies' methodological quality:

- RoB 2.0 for RCTs, in accordance with the Cochrane Collaboration's guidelines. Randomisation, treatment deviation, attrition, outcome measurement, and the selection of reported data were among the biases evaluated.

Non-randomized observational studies were conducted using ROBINS-I, which takes into account biases in participant selection, confounding factors, intervention classification, missing data, and analysis techniques.

Two authors conducted each evaluation independently, and any disagreements over the results were discussed. Each study's overall quality was assigned a risk of bias rating of "low," "moderate," or "high."

Statistical Analysis and Data Synthesis

Meta-analysis was used to synthesise quantitative data with homogeneous populations, interventions, and outcomes. To account for variation amongst studies, a random effects model was used. With 95% confidence intervals, effect sizes were presented as odds ratios (OR), risk ratios (RR), or mean differences (MD). Cochran's Q statistic was used to evaluate study heterogeneity, and the I2 index was used to quantify it based on three criteria: 0–25% (low), 25–50% (moderate), and >50% (high).

To investigate the efficacy of AI, subgroup analyses were designed according to the following criteria: the algorithm type (e.g., HPI, fuzzy logic, deep learning, or generic predictive models); the type of complication (e.g., neurological, cardiovascular, or respiratory); and the surgical context (major vs. minor surgery; elective vs. emergency).

Studies with a high risk of bias were excluded through sensitivity analyses.

Affiliation and Peer Review

A multidisciplinary team unaffiliated with academic institutions carried out the work, and during the protocol phase, the review was not exposed to external peer review. However, after the manuscript is finished, it will be submitted for peer review and publication.

Results

There were 18 studies in all, and they were a mix of randomised controlled trials (RCTs), prospective observational studies, and retrospective analyses. The studies took place in many different parts of the world, including Europe (the Netherlands, Italy, Germany, and Spain), North America (the United States), and Asia (Korea and China). This shows that people all over the world are interested in using machine learning (ML) and biomarkers to predict and manage perioperative events. (Table 2.)

Study (Au- thor, Year)

Setting

Sample Size

Study Type

Primary Outcome

Model(s) Used

Best Model Performance

Input Fea-

tures

Unique Fea-

tures

Schenk et al.,

Amster-

60 (54 ana-

Randomized

Time-weight-

Hypotension

88% sensitivi-

Continuous

Machine learn-

2021

dam UMC,

lyzed)

Controlled

ed average

Prediction

ty, 87% speci-

arterial wave-

ing-based early

 

Netherlands

 

Sub-Study

(TWA) of

Index (HPI)

ficity for IOH;

form; MAP

warning system;

 

PACU

 

 

postoperative

 

77% reduction

<65 mm Hg;

integration with

 

 

 

 

hypotension

 

in IOH depth/

FloTracIQ

HemoSphere

 

 

 

 

(POH)

 

duration

sensor data

monitor

Fritz et al.,

Single center

95,907

Retrospective

30-day

MPCNN

AUROC:

>70 static and

Real-time dy-

2019

(USA)

 

 

postoperative

(CNN +

0.867,

time-series

namic predic-

 

 

 

 

mortality

LSTM),

AUPRC: 0.097

variables

tion, intraopera-

 

 

 

 

 

DNN, SVM,

 

 

tive monitoring

 

 

 

 

 

RF, LR

 

 

 

Lee et al.,

Multi-center

454,404

Retrospective

30-day

XGBoost,

AUROC:

12–18

External vali-

2022

(Korea)

 

 

postoperative

LR, RF, DNN

0.942,

preoperative

dation across

 

 

 

 

mortality

 

AUPRC: 0.175

variables

4 hospitals,

 

 

 

 

 

 

(XGBoost)

 

compact model

Te et al., 2024

Single center

23,305

Retrospective

PIHI Index

Extra Tree

R²: 0.9047,

14 variables:

Continuous

 

(China)

 

 

(post-in-

Regressor,

MAPE:

demographics

outcome, ICV

 

 

 

 

tubation

XGBoost,

20.86% (ETR)

+ drug dosages

index creation,

 

 

 

 

instability)

MLP, SVR,

 

 

SMOTETomek

 

 

 

 

 

MLR

 

 

balancing

Fritz et al.,

Barnes-Jew-

5071 pa-

Single-centre

30-day

ML models

Death predic-

Electronic

Clinicians

2024

ish Hospital,

tients

Randomised

all-cause

for death &

tion: AUROC

health records:

randomized to

 

Saint Louis,

 

Controlled

postoperative

AKI

0.807; AKI

demographics,

ML-assisted vs

 

MO, USA

 

Trial

mortality and

 

prediction:

labs, pre-op

unassisted re-

 

 

 

 

AKI within 7

 

AUROC 0.766

data, anaesthe-

view; live EHR

 

 

 

 

days

 

 

sia record

integration;

 

 

 

 

 

 

 

 

Likert-based

 

 

 

 

 

 

 

 

risk rating scale

Frassanito et

IRCCS

60 patients

Single-centre

TWA-MAP

HPI +

82% reduction

HPI alerts,

Real-time HPI

al., 2023

Policlinico

 

RCT

<65 mm Hg;

Modified

in IOH events;

SVV, dP/

alert protocol;

 

Gemelli,

 

 

IOH after

Goal-Direct-

TWA-MAP

dtmax, Eadyn,

faster interven-

 

Rome, Italy

 

 

induction;

ed Therapy

<65 mm Hg:

MAP, BIS, flu-

tion; tailored

 

 

 

 

severe hypo-

 

0.14 vs 0.77

id/vasopressor

hemodynamic

 

 

 

 

tension

 

mm Hg

response

algorithm

Kang et al.,

Soonchunhy-

222 patients

Retrospective

Hypotension

Naïve Bayes,

Random For-

89 features

High-res

2020

ang Univer-

 

ML study

after anesthe-

Logistic

est: AUROC

from EHR

time-syn-

 

sity Bucheon

 

 

sia induction

Regression,

0.842 (CI:

+ anesthesia

chronized

 

Hospital,

 

 

(tracheal

Random

0.736–0.948);

devices: SBP,

multi-device

 

South Korea

 

 

intubation–

Forest, Arti-

83.7% recall

MBP, HR,

monitoring;

 

 

 

 

incision)

ficial Neural

with selected

TIVA doses,

pre-intuba-

 

 

 

 

 

Network

features

ventilator set-

tion-only train-

 

 

 

 

 

 

 

tings, patient

ing window;

 

 

 

 

 

 

 

demographics

feature set

 

 

 

 

 

 

 

 

comparisons

Wijnberge et

Amsterdam

64 patients

Single-centre

Reduction in

Hypotension

HPI group:-

Arterial

First RCT on

al., 2020

UMC, Neth-

 

RCT

intraopera-

Prediction

significant re-

waveform data

HPI; real-time

 

erlands

 

 

tive hypoten-

Index (HPI)

duction in time

via FloTracIQ

alerts; integra-

 

 

 

 

sion duration

 

spent MAP

sensor, MAP

tion with Acu-

 

 

 

 

 

 

<65 mm Hg

thresholds

men platform

Baig et al.,

Auckland

30 patients

Real-time

Detection of

RT-SAAM,

FLMS-2:

HR, BP, PV,

Multi-module

2013

City Hos-

 

fuzzy logic

absolute hy-

FLMS-2

Kappa = 0.75;

EtCOâ??; wave-

fuzzy alarms;

 

pital, New

 

validation

povolaemia

(Fuzzy logic

RT-SAAM:

form analysis;

real-time alerts;

 

Zealand

 

 

 

systems)

Kappa = 0.62

fuzzy rule sets

clinician agree-

 

 

 

 

 

 

 

 

ment validation

Rip-

28 hospitals

917 patients

Multicenter

Moder-

Hypotension

No significant

HPI >80,

Largest HPI

ollés-Melchor

(Spain +

 

RCT

ate-to-severe

Prediction

difference in

MAP, SVV,

RCT; diverse

et al., 2025

Jordan)

 

 

AKI within

Index (HPI)

AKI incidence

Eadyn, dP/

real-world prac-

 

 

 

 

7 days;

 

(HPI: 6.1% vs

dtmax, vaso-

tices; subgroup

 

 

 

 

complica-

 

control: 7.0%)

pressor/fluids

analysis for

 

 

 

 

tions; 30-day

 

 

guided by

hypertension

 

 

 

 

mortality

 

 

Hemosphere

effect

Hu et al.,

Henan Univ.

202 patients

Deep learn-

Preoperative

ANN + CNN

R² = 0.915 (in-

Gender, age,

Multi-stage

2024

of Sci. &

 

ing model

& intraopera-

+ LSTM +

tra-op); MAPE

weight, SBP,

model; attention

 

Tech., China

 

validation

tive anes-

Attention

= 12.25%

DBP, BIS, HR;

mechanism;

 

 

 

 

thetic dose

 

 

time-series

convolutional

 

 

 

 

prediction

 

 

vitals

feature ex-

 

 

 

 

 

 

 

 

traction

Abin et al.,

3 hospitals,

998 patients

Cohort ML-

AKI risk

AKI– and

Accuracy:

8 pre-op

Dual regression

2024

Tehran, Iran

 

based design

reduction via

AKI+ regres-

80.6%; F1-

features

system; visual

 

 

 

 

anesthesia

sion planners

score: 0.821;

6 anesthesia

dashboard; vali-

 

 

 

 

management

 

recall: 84.8%

targets (CPB

dated by cardiac

 

 

 

 

 

 

 

time, fluids,

anesthesiolo-

 

 

 

 

 

 

 

transfusion,

gists

 

 

 

 

 

 

 

diuretics)

 

Hayase et al.,

Kyoto Chubu

83,867 EEG

Observa-

BIS predic-

MLPNN

R = 0.87;

Poincaré-index

Hierarchical

2020

Medical Cen-

epochs from

tional + ML

tion from

(deep learn-

RMSE = 7.09;

(0.5–47 Hz &

Poincaré plot

 

ter, Japan

30 patients

validation

EEG scatter

ing)

Bias = 0.07

20–30 Hz),

fusion; EMG-

 

 

 

 

patterns

 

 

EMG70–110

aware BIS

 

 

 

 

 

 

 

Hz, suppres-

estimation; BIS

 

 

 

 

 

 

 

sion ratio

used as super-

 

 

 

 

 

 

 

 

visor

Šribar et al.,

University

34 patients

Single-centre

Duration,

Hypotension

TWA-AUT:

HPI alert

First HPI RCT

2023

Hospital

 

RCT

depth, and

Prediction

0.01 mmHg

threshold ≥90;

in thoracic sur-

 

Dubrava,

 

 

frequency of

Index (HPI)

(HPI) vs 0.08

MAP, SVV,

gery; structured

 

Croatia

 

 

IOH during

vs FloTrac

mmHg (GDT);

Eadyn, dP/dt,

decision tree;

 

 

 

 

thoracic

GDT

fewer hypoten-

CI, SVI, HR,

high-fidelity

 

 

 

 

surgery

 

sive episodes

BP, lactate,

waveform

 

 

 

 

 

 

in HPI group

ScvOâ??

analysis

Morisson et

Maison-

66 patients

Ancillary

Prediction of

ML algo-

Penalized

Age, ASA

First ML analy-

al., 2022

neuve-Rose-

 

RCT analysis

moderate to

rithms (elas-

logistic regres-

class, BMI,

sis of intraoper-

 

mont Hospi-

 

 

severe PACU

ticnet, RF,

sion: CV-AUC

NOL metrics

ative nocicep-

 

tal, Montréal,

 

 

pain

SVM…)

0.753(0.718–

(reaction to in-

tion (NOL) to

 

Canada

 

 

 

 

0.788); Brier

tubation/inci-

predict PACU

 

 

 

 

 

 

score: 0.194

sion), surgical

pain; SHAP

 

 

 

 

 

 

 

TWA NOL

interpretability;

 

 

 

 

 

 

 

 

surgery-specific

 

 

 

 

 

 

 

 

feature mod-

 

 

 

 

 

 

 

 

eling

Velagapudi et

4 academic

100 patients

Prospective

Accuracy

XGBoost

Glucose: Ac-

Age, BMI,

First dual-mod-

al., 2022

hospitals

× 2 models

validation via

of anesthe-

(glucose),

curacy ↑ from

ASA, diabetes,

el validation

 

(USA)

 

web-based

siologist

Random For-

79% to 84.7%;

meds, pain his-

with real clini-

 

 

 

simulation

predictions

est (opioids)

Opioids:

tory, surgery

cians; REDCap

 

 

 

 

for glucose &

 

from 18% to

type, duration,

interface;

 

 

 

 

opioid needs

 

42%

anesthesia type

SHAP-informed

 

 

 

 

 

 

 

 

feedback

Luckscheiter

Southwest

25,556 trau-

Regis-

Need for

Random For-

RF: AUROC

24 features via

First large-scale

et al., 2022

Germany

ma patients

try-based

preclinical

est (RF) &

0.96; PRC

PCA: auscul-

airway predic-

 

EMS registry

 

retrospective

airway man-

Naive Bayes

area 0.83; NB:

tation, injury

tion model in

 

(MIND 3.1)

 

analysis

agement in

(NB)

AUROC 0.93;

pattern, oxy-

EMS; validated

 

 

 

 

trauma

 

PRC area 0.66

gen therapy,

on real-world

 

 

 

 

 

 

 

shock index,

trauma registry;

 

 

 

 

 

 

 

vital signs,

SMOTE ap-

 

 

 

 

 

 

 

interventions

plied for class

 

 

 

 

 

 

 

 

balance

Tan et al.,

Huashan

406 carotid

Retrospective

Early Phase

Gradient

AUROC:

100+ perioper-

First ML model

2021

Hospital, Fu-

endarterec-

single-center

Postoperative

Boosted

0.77 (95% CI:

ative variables

for EPOH

 

dan Universi-

tomy (CEA)

cohort study

Hypertension

Regres-

0.62–0.92);

including:

prediction;

 

ty, Shanghai,

procedures

 

(EPOH)

sion Trees

Sensitivity

intraoperative

feature impor-

 

China

 

 

requiring

(GBRT) via

~90%; Speci-

peak SBP,

tance analysis

 

 

 

 

intravenous

XGBoost

ficity ~52%

cardiac index,

via XGBoost

 

 

 

 

vasodilators

 

 

anesthetic dos-

gain scores;

 

 

 

 

within 24

 

 

ages (propo-

validated with

 

 

 

 

hours post-

 

 

fol, fentanyl,

4-fold stratified

 

 

 

 

CEA

 

 

ephedrine), lab

cross-valida-

 

 

 

 

 

 

 

values (glu-

tion; strong

 

 

 

 

 

 

 

cose, choles-

association

 

 

 

 

 

 

 

terol, alkaline

found between

 

 

 

 

 

 

 

phosphatase),

intraoperative

 

 

 

 

 

 

 

Doppler

BP spikes and

 

 

 

 

 

 

 

echocardiog-

EPOH risk

 

 

 

 

 

 

 

raphy metrics,

 

 

 

 

 

 

 

 

clamping time,

 

 

 

 

 

 

 

 

and demo-

 

 

 

 

 

 

 

 

graphic data

 

The sample sizes were very different. For example, Schenk et al. (2021) had only 60 patients (n=60), while Lee et al. (2022) had 454,404 patients in a very large retrospective dataset [11]. The ORACLE Trial (Fritz et al., 2024) was the biggest interventional study [12]. It randomly assigned 5,071 patients to either ML- assisted or standard perioperative care. (Table 2.)

The Main Results of the Studies Were Different and Included

- Most of the time, intraoperative hypotension (IOH) is defined as a mean arterial pressure (MAP) of less than 65 mmHg (e.g., Schenk et al., 2021; Wijnberge et al., 2020).

- Acute kidney injury (AKI) is often used as a secondary endpoint (for example, Fritz et al., 2024; Ripollés-Melchor et al., 2025).

- Mortality after surgery: both 30 days after surgery and while in the hospital (e.g., Fritz et al., 2019; Lee et al., 2022).

- PIHI (Te et al., 2024) is an example of a composite haemodynamic instability index.

Modelling Methods and Algorithmic Techniques

There were a number of different predictive modelling methods used in the studies. These can be divided into three main groups: traditional ML algorithms, deep learning architectures, and biomarker-based indices (especially HPI).

Some of the models that were tested the most were random forests (RF), logistic regression (LR), gradient boosting machines (like XGBoost), and support vector machines (SVM).

Lee et al. (2022) directly compared several algorithms for predicting 30-day mortality. They found that XGBoost did better than both RF (AUROC = 0.92) and DNN models, with AUROC = 0.942 and AUPRC = 0.175.

Luckscheiter et al. (2022) made models for bad things that happen during surgery. RF got an AUROC of 0.96 and a PRC area of 0.83 [13].

Deep Learning Structures

Deep learning methods took advantage of the changing nature of data streams during surgery.

Fritz et al. (2019) used a combination of convolutional neural networks (CNNs) and LSTM-based architectures (MPCNN) to predict 30-day mortality with an AUROC of 0.867 and an AUPRC of 0.097. Hu et al. (2024) used these ideas on high-frequency arterial waveform data and found that R² = 0.915 and MAPE = 12.25% were good for predicting changes in blood flow during surgery [14].

The Hypotension Prediction Index (HPI)

HPI, a proprietary algorithm trained to predict IOH minutes before onset, was evaluated in a number of RCTs and observational studies (Schenk et al., 2021; Wijnberge et al., 2020; Ripollés- Melchor et al., 2025). In addition to a 77% decrease in the severity and duration of hypotension in the intervention group, Schenk et al. (2021) reported 88% sensitivity and 87% specificity for HPI. When HPI guidance was used, Wijnberge et al. (2020) also showed a significant decrease in cumulative hypotension time.

Innovative and Hybrid Methods

Te et al. (2024) developed the Post-Intubation Hemodynamic Instability (PIHI) index using 14 clinical and ventilatory variables, achieving R² = 0.9047. [15,16] For AKI risk, Abin et al. (2024) used a dual regression approach, reporting accuracy = 80.6%, F1- score = 0.821, and recall = 84.8%.

Predictive Factors and the Significance of Features

MAP and derived waveform parameters were universally critical, according to a comparative analysis of input features (Table 3).

In HPI-based studies, significant attention was paid to variables such as systolic and diastolic blood pressure (SBP and DBP), heart rate (HR), pulse volume (PV), stroke volume variation (SVV), and dP/dtmax (Frassanito et al., 2023; Šribar et al., 2023[16]).

Larger retrospective models included preoperative variables like age, comorbidity indices, baseline renal function, and ASA status (Lee et al., 2022; Kang et al., 2020) [17].

Analysis of Comparative Performance

Among the best-performing models (AUROC > 0.90) were: '

- Lee et al. (2022): AUROC = 0.942, AUPRC = 0.175 (XGBoost).

- Luckscheiter et al. (2022) : PRC = 0.83 (RF), AUROC = 0.96.

- Te et al. (2024)]: R² = 0.9047 for continuous hemodynamic predictions.

Performers in the middle (AUROC 0.80–0.89):

- Fritz et al. (2019) : AUROC = 0.867, AUPRC = 0.097 (MPCNN).

- Fritz et al. (2024, ORACLE) : AUROC = 0.807 for 30-day mortality; 0.766 for AKI.

Models for experiments or lower performers:

- Tan et al. (2021): sensitivity ~90%, specificity ~52%, AUROC = 0.77 (95% CI: 0.62–0.92).

- Morisson et al. (2022) : Brier score = 0.194, AUC = 0.753, penalised logistic regression [18]. (Table 3.)

Stu dies

/Va riab le

Hea rt Rat e (H

R)

Blo od Pre ssur e (BP

)

Pul se Vol um e (PV

)

Me an Art eria l Pre

ssur

Mo rtal ity (an

y)

AI

Mo dels Co mp are d

Bes t Mo del Per for

ma

AU RO C

IC V

Val ues

Typ e of Sur ger y

Ris k of AK I

Hy pert ensi on Ris k

Air way Ma nag eme nt

Dos age for Ane sthe sia

Wa rni ng Syst em vs

Sta

30-

Day Pos top erat ive Mo

Pos top erat ive Co mpl

icat

Ana esth esia Dep th Mo

nito

Posto perati ve Pain

 

 

 

 

e

(M

AP)

 

 

nce

 

 

 

 

 

 

 

nda

rd Car

e

rtal

ity

ions

ring

 

Frit z et al., 201

9

Use d (pre

-

and intr aop erati ve)

Use d (sys tolic & dias tolic

,

intr aop erati ve)

NR

Deri ved fro m BP,

not expl icitl y repo rted

30-

day post oper ativ e mor talit y (1%

)

MP CN N (CN N + LST M), DN N, SV M,

RF,

LR

MP CN N (AU RO C: 0.86

7, AU PR C: 0.09

7)

 

Not use d

All type s (ma inly AS A II– III)

NR

Incl ude d as com orbi dity

Incl ude d (onl y intu bate d pati ents

)

Incl ude d (cu mul ativ e intr aop dos es)

Des igne d for real

-

time war ning inte grat ion

Pri mar y end poin t

Not strat ifie d

NR

NR

Lee et al., 202

2

Use d (pre oper ativ e only

)

Use d (pre oper ativ e only

)

NR

NR

30-

day post oper ativ e mor talit y (0.2

0.4

%)

XG

Boo st, Ran dom For est, Log istic Reg ress ion, DN

N

XG

Boo st (AU RO C: 0.94

2, AU PR C: 0.17

5)

 

Not use d

Non

-

card iac surg erie s acro ss spec ialti es

NR

Incl ude d as inpu t feat ure

Not expl icitl y desc ribe d

Not incl ude d

Offl ine mod el for preo pera tive risk strat ifica tion

Pri mar y end poin t

Not strat ifie d

NR

NR

Te et al., 202

4

Use d (pre oper ativ

e

Use d (sys tolic &

dias

NR

NR

No dire ct mor talit

y

Extr a Tre e Reg

ress

ET R (R²: 0.90

47,

MA

 

Pre dict ed as pri

mar

Gen eral anes thes ia

with

NR

Incl ude d as bina ry

inpu

All pati ents had end

otra

Incl ude d (init ial

infu

Not real

-

time

;

pred

Not mea sure d

Foc use d only on

PIH

NR

NR

 

only

)

tolic

,

preo pera tive

)

 

 

outc ome

or, XG

Boo st, SV R, ML R, ML

P

E: 0.05

12)

 

y outp ut

trac heal intu bati on (AS A I–

II)

 

t feat ure

che al intu bati on

sion

:

fent anyl

,

pro pof ol, etc.)

ictiv e sup port only

 

I

(he mod yna mic inst abili ty)

 

 

Sch enk et al.

(HP I

Tri al)

Not expl icitl y repo rted

MA P

targ et:

>65

mm Hg

Cap ture d via Flo Tra cIQ sens or

Not spec ifie d

Not repo rted dire ctly (30-

day outc ome s)

HPI

-

guid ed vs Stan dard Car e

HPI

algo rith m

~0.9 0–

0.92

(bas ed on sens itivi ty/s peci ficit y)

Not appl icab le

Elec tive non card iac (ma inly GI

and gyn ecol ogic al)

Ele vate d in pres enc e of IOH

/PO H

Peri oper ativ e MA P

man age men t use d to miti gate extr eme s

Gen eral ana esth esia with mec hani cal vent ilati on

Sev oflu rane med ian: 1.55 1.64

vol

%

HPI

guid anc e vs rout ine hae mod yna mic prot ocol s

Not dire ctly asse ssed

4

pati ents with Cla vien

-

Din do Gra de ≥ III

(ble edin g inte rven tion

s)

Not expl icitl y repo rted

Not discus sed in this sub- study

OR AC LE

Tri al (Fri tz

et

al., 202

4)

Ava ilabl e via EH R;

use

d in

ML

inpu t feat ures

MA P

mon itor ed; pred ictiv

e

rele van ce for AKI

and mor talit

y

Not repo rted as stan dalo ne

met

ric

MA P

use d by ML

mod el

for

pred ictio n

30-

day post oper ativ e mor

talit

y asse ssed

;

2.2

%

inci den ce

ML

-

assi sted vs ML

-

una

ssist ed clini cian pred ictio n

Dea th: AU RO C 0.80

7;

AKI

: AU RO C 0.76

6

Dea th (M L

mod el): 0.80

7;

AKI (M L

mod el): 0.76

6

Not appl icab le

Mix ed elec tive surg erie s

acro

ss 10+

spec ialti es

11.1

% AKI

inci den ce; ML

pred

ictio n com pare d to clini cian acc

urac

Inco rpor ated into ML

risk mod els

Stan dard GA

prot ocol s, man

age

d rem otel y

Not deta iled; capt ured in EH

R

for ML

inpu t

ML

pred ictio ns visi ble or

hidd

en to clini cian s

Ass esse d via EH R; 98

deat

hs amo ng 507

1

pati ents

AKI

in 450

pati ents

;

com plic

atio

ns infl uen ced by ML

pred

ictio

Not repo rted expl icitl y; dept

h

infe rred fro m EH R

inpu ts

Not analyz ed

 

 

 

 

 

 

 

 

 

 

 

y

 

 

 

 

 

n

 

 

Fra ssa nito et al., 202

3

Con tinu ousl y mon itor ed; data not indi vidu ally repo rted

MA P

targ et:

>65

mm Hg; inte rven tion thre shol d was HPI

≥85

Mo nito red indi rectl y via dP/ dtm ax and SV V,

not repo rted as stan dalo ne

Pri mar y mea sure for IOH

and seve re hyp oten sion (cut offs

: 65

and 50

mm Hg)

1

deat h with in 30

day s in Con trol gro up

HPI

+

mod ifie d GD T

prot ocol vs stan dard GD T

prot ocol

82

%

redu ctio n in IOH

eve nts (97

vs 313

);

shor ter time to treat men t inte rven tion

Not repo rted num eric ally

Not asse ssed

Maj or gyn aec olog ic onc olog ic surg ery (lap aros copi c, lapa roto my, com bine d)

Not dire ctly asse ssed

; IOH

link ed to orga n inju ry in cite d liter atur e

Slig htly high er inci den ce of MA P > 110

mm Hg in HPI

gro up

Gen eral anes thes ia with mec hani cal vent ilati on

Pro pof ol, sufe ntan il, and sev oflu rane (BI

S-

targ eted

);

mor e dob uta min e in HPI

gro

up

Rea l- time HPI

alert s (≥8

5)

trig gere d proa ctiv e deci sion

-

mak ing

1

pati ent (Co ntro l gro up)

Pleu ral effu sion (HP I 3%,

Con trol 20

%),

arrh yth mia, card iac isch emi a

BIS

mon itor ed; targ et rang e 40–

50

Manag ed via intrath ecal morph ine or epidur al cathete r

Ka ng

et

al.

(20

20)

Incl ude

d in

feat ure set

SBP

, MB

P

use d as key pred icto rs

Not dire

ctly

repo rted

MB P

<65

mm Hg use d to defi ne hyp

oten

Not pri

mar

y end poin t

Naï ve

Bay

es,

LR,

RF, AN N

RF: AU

RO

C 0.84

2;

83.7

%

reca ll

RF: 0.84

2

(CI: 0.73

6–

0.94

8)

Not appl

icab

le

Lap aros

copi

c chol ecys tect omy

Not asse ssed

Hyp erte

nsio

n incl ude d in com orbi dity

feat

GA

with TIV

A

and intu bati on

Pro pof

ol/r

emi fent anil via TCI

pum p

ML

mod el

trai ned on pre- intu bati on

data

Not asse ssed

Not repo rted

BIS

use d;

not anal yze d

Not reporte d

 

 

 

 

sion

 

 

 

 

 

 

 

ures

 

 

 

 

 

 

 

Wij nbe rge et al.

(20

20)

Mo nito red via arte rial wav efor m

MA P

<65

mm Hg thre shol d for inte rven tion

Flo Tra cIQ wav efor m anal ysis

Pri mar y outc ome

: MA P

<65

mm Hg dura tion

Not stati stic ally pow ered for mor talit y

HPI

vs con vent iona l care

Sig nifi cant redu ctio n in MA P

<65

mm Hg dura tion

Not repo rted

;

prio r stud ies sug gest

>0.

9

Not appl icab le

Hig h- risk non card iac surg ery

Not asse ssed

MA P

cont rol strat egy use d to prev ent extr eme s

GA

with inva sive mon itori ng

Not deta iled

Rea l- time HPI

alert s trig gere d earl y inte rven

tion

Not stati stic ally pow ered

Not repo rted

Not spec ifie d

Not reporte d

Bai g et al.

(20

13)

Use d as a fuzz y inpu t for hyp ovol aem ia diag nosi s

BP

wav efor m anal ysis with fuzz y thre shol ds

PV

fro m plet hys mog raph use d in fuzz y rule engi ne

Cen tral to alar m gen erati on logi c

Co mpa red clini cal imp act usin g Kap pa agre eme nt

RT- SA AM

, FL

MS- 2

(Fu zzy logi c exp ert syst

ems

)

Kap pa agre eme nt: FL MS-

2 =

0.75

;

RT- SA AM

=

0.62

Not use d; Kap pa stati stic for diag nost ic agre eme nt

Not appl icab le

Mo dera te-

to- high bloo d loss proc edur es

Not dire ctly eval uate d

Rul e- base d flag ging of elev ated BP

Not deta iled

Not spec ifie d

Mul ti- laye r fuzz y alert s: pro babi listi c, SP V,

fuzz

y rule

Mo nito red indi rectl y via user feed bac k

Dia gno stic acc urac y vali date d with clini cian agre eme

nt

Not part of fuzz y syst em logi c

Pain control studie d        in

other fuzzy system s

Rip ollé s- Mel cho r et al.

(20

25)

Mo nito red; not anal yze d sepa ratel y

MA P

<65

mm Hg thre shol d; HPI

alert at > 80

dP/ dtm ax and SV V

use d in HPI

algo rith m

Pri mar y outc ome

: AKI

link ed to MA P

<65

mm Hg

30-

day mor talit y:

1.1

% (HP

I)

vs 0.9

%

(con trol)

HPI

-

guid ed vs real

-

wor ld stan dard care

No sign ifica nt AKI

redu ctio n; RR

= 0.89

;    P

= 0.66

Not repo rted

;

pri mar y outc ome anal yze d via RR

Not appl icab le

Mo dera te-

to- high risk elec tive abd omi nal surg ery

Pri mar y end poin t; no sign ifica nt redu ctio n

Sub gro up anal ysis sho wed tren d in non

-

hyp erte nsiv es

GA

with BIS

;

vent ilati on stan dard ized

Bal anc ed crys tallo id; eph edri ne, nore pine phri ne use d

HPI

alert at > 80

trig gere d algo rith mic inte rven tion

1.1

% (HP

I)

vs 0.9

%

(con trol)

; no sign ifica nt diff eren

ce

31.9

% (HP

I)

vs 29.7

%

(con trol)

; no sign ifica nt diff eren

ce

BIS

use d in both gro ups

Not analyz ed

Hu et al.

(20

24)

Use d as inpu t to CN N

and LST M

SBP

& DB P

use d for dos e pred ictio n

Extr acte d via CN N

feat ure laye rs

Tre nd mod eled via LST M

Not eval uate d

AN N + CN N + LST M

with atte ntio n

R² = 0.91

5

(intr a- op); MA PE

= 12.2

5%

Not appl icab le; regr essi on focu s

Not appl icab le

Vari ous; not strat ifie d

Not asse ssed

Not strat ifie d

GA

assu med

;

not deta iled

Eto mid ate, atra curi um, sufe ntan il pred icte d

AI

dosi ng mod el; atte ntio n high ligh ts key feat

ures

Not eval uate d

Not repo rted

BIS

inpu t coll ecte d; not outp ut vari able

Not analyz ed

Abi n et al.

(20

Inp ut to

dual

Cr use d as

inpu

Not dire ctly

mod

Use d to strat

ify

5–

10

% AKI

Dua l regr

essi

Acc urac y =

80.6

Not repo rted

;

Not appl icab

le

Car diac surg

ery

Pri mar y

end

HT N

incl

ude

GA

assu med

;

CP B

time

,

AKI

+

and AKI

5–

10

% AKI

AKI

seve rity

trac

Not mod eled

Not analyz ed

24)

regr essi on mod el

t; MA P

indi rectl y mod eled

eled

AKI

risk

-

relat ed mor talit y cite d

on mod els (AK

I+ / AKI –)

%; F1

= 0.82

1;

reca ll = 84.8

%

regr essi on- base d

 

(CA

BG,

valv e, tran spla nt)

poin t; AKI

+ vs AKI

mod elin g

d in PM H

feat ures

not deta iled

flui ds, diur etic s, tran sfus ions pred icte

d

plan ners

+

visu al das hbo ard

-

relat ed mor talit y cite d

ked via Cr leve ls (KD IGO

)

 

 

Hay ase et al.

(20

20)

Cap ture d; use d as cont ext for EE G

patt erns

Rel ated to BIS

acc urac y via EM G

cont ami nati on

Not dire ctly use d; EE G-

deri ved via 20–

30H

z targ eted freq uen cy

Indi rectl y relat ed to EM

G/B IS

fide lity at low and high dept h

Not asse ssed

Poi ncar é- inde x fusi on via ML PN N

(dee p lear ning appr oac h)

R = 0.87

, RM SE

= 7.09

(Pre dBI S vs mBI S

corr elati on)

Not repo rted

;

line ar regr essi on use d for vali dati on

Not appl icab le

Mix ed (30

pati ents

,

incl udin g spin al and GA

case s)

Not anal yze d

EM G

artif acts

>8

0

BIS

sign al pro ble mati c in ligh t anes thes ia

GA

with BIS

;

som e spin al with pro pof ol

Sev oflu rane

,

rem ifen tanil

,

pro pof ol; mus cle rela xant use strat ifie

d

Poi ncar é- inde x– driv en ML PN N

alert s; laye red EE G

anal ysis

Not anal yze d

Not asse ssed

BIS

+ EE G

scat ter via Poi ncar é indi ces

Not evalua ted

Šri bar et al.

(20

Mo nito red; high

er

MA P

<65

mm Hg

Deri ved via wav

efor

Pri mar y end

poin

1

deat h (Ac ume

HPI

vs Flo Tra c

TW A- AU T:

0.01

Not repo rted

;

TW

Not appl icab le

Maj or thor acic

(lun

1

case (Flo trac gro

MA P

≥65 mm

Hg

GA

with one- lung vent

Pro pof ol, sufe

ntan

HPI

alert

≥90

+

deci

1

deat h (Ac ume

No MI, CVI

, or

AKI

BIS

use d; MA C

Epidur al or IV

sufent

anil

23)

base

line

in HPI

gro

up

thre

shol

d

m

anal

ysis

t

nIQ

gro

up)

GD T

vs

0.08

mm Hg

A

use

d

 

g/es

oph

agu s)

up)

targ et

ilati on

il,

sev

oflu rane

sion tree

nIQ

gro

up)

in

HPI

gro up

0.8–

1

 

Mo riss on et al.

(20

22)

Incl ude d in NO L

algo rith m

MA P

indi rectl y mod eled via NO L

Part of NO L

algo rith m

NO L-

base d thre shol ds (10

25)

Not asse ssed

ML

algo rith ms (ela stic net, RF, SV M

…)

CV- AU C 0.75

3

(0.7

18–

0.78

8);

Brie r scor e 0.19

4

CV- AU C 0.75

3

Not appl icab le

Gyn ecol ogic lapa rosc opy

Not asse ssed

Not strat ifie d

GA

with intu bati on

Fent anyl guid ed by NO L

NO L-

guid ed fent anyl vs MA

P/H R-

base d care

Not asse ssed

63.6

%

had mod erat e to seve re PA CU

pain

BIS

use d

 

Vel aga pud i et al.

(20

22)

Incl ude d in ML

inpu t feat ures and NO L

reac tion anal

Imp licit in case para met ers; MA P

not mod eled dire ctly

Not repo rted

; NO L-

base d surr ogat e feat ures likel

Use d indi rectl y via preo pera tive AS A

stat us and

Not asse ssed

XG

Boo st for gluc ose pred ictio n; Ran dom For est for

Glu cose

:

Acc urac y ↑ fro m 79

%

to 84.7

%;

Opi

Opi oid mod el cros s- vali date d AU RO C: 0.75

3

Not appl icab le

Mix ed amb ulat ory and inpa tient proc edur es

Not eval uate d

Imp lied in diab etes

/glu cose cont rol mod els

Intu bate d GA;

not strat ifie d

Fent anyl dosi ng mod ifie d base d on NO L

and ML

aid

ML

pred ictio ns pres ente d via RE DC

ap inte rfac e;

Not asse ssed

Foc use d on pain and gluc ose mis esti mati on; not trac ked

BIS

use d; not mod eled

Primar y outco me:

ML-

aided predict ion of opioid require ment

 

ysis

 

y con side red

surg ery type

 

opio id nee d

oids

:

Acc urac

fro m 18

%

to 42

%

 

 

 

 

 

 

 

use d as refe

renc

e

 

for mall y

 

 

Luc ksc heit er et al., 202

2

Incl ude d as a key inpu t feat ure; rank ed thir d in imp orta nce in RF

mod el

Syst olic BP

use d; top- rank ed feat ure in RF

mod el

Not dire ctly mod eled

;

infe rred via sho ck inde x and ausc ultat ion

Not expl icitl y mod eled

;

indi rectl y refl ecte d via syst olic BP

and sho ck inde

x

Not asse ssed

Ran dom For est (RF

) vs Nai ve Bay es (NB

)

RF: AU RO C 0.96

; PR C

area 0.83

;

NB: AU RO C 0.93

; PR C

area 0.66

RF: 0.96

(95

% CI: 0.96 0.97

); NB: 0.93

(95

% CI: 0.92 0.93

)

Not appl icab le

Not surg ical; preh ospi tal trau ma regi stry (E

MS-

base d)

Not eval uate d

Indi rectl y mod eled via syst olic BP

and sho ck inde x

Pri mar y end poi nt;

5.7

%

of pati ents requ ired prec linic al airw ay man age men

t

Not mod eled

;

airw ay man age men t infe rred via inte rven tion s and med icati ons

ML

mod el trai ned on EM S

regi stry data

; no real

-

time alert s depl oye d

Not asse ssed

Not asse ssed

;

preh ospi tal setti ng only

Not appl icab le

Not assess ed

Tan et al., 202

1

Pre oper ativ e HR

mea

sure

d; slig htly high er in EP OH

gro up (not stati stic ally sign ifica nt)

SBP

>1

60

mm Hg defi

nes

EP OH;

intr aop erati ve and post oper ativ e pea k SBP

/DB P

sign ifica ntly high er in EP OH

gro

up

Not dire ctly mod eled

;

MA

P

vari abili ty and vas oact ive resp ons e indi rectl y invo lved

Not calc ulat ed dire ctly;

SBP

/DB P

tren ds use d as surr ogat es for hyp erte nsiv e patt erns

No deat hs occ urre d

duri

ng hos pital stay in eith er gro up

Gra dien t Boo sted Reg

ress

ion Tre es (GB

RT)

via XG

Boo st

Mea n AU RO C: 0.77

(95

% CI: 0.62 0.92

);

Sen sitiv ity

~90

%;

Spe cific ity 0.52

0.77

(cro ss- vali dati on

aver

age)

Not appl icab le

Car otid end arte rect omy

(CE

A)

und er gen eral anes thes ia

Not asse ssed

EP OH

risk spec ifica lly

mod

eled

;

intr aop erati ve BP

spik es high ly pred ictiv e

GA

with stan dard intu bati

on;

not a vari able in pred ictio n

Sev oflu rane

,

pro

pof ol,

fent anyl

,

phe nyle phri ne, eph edri ne trac ked; thei r dos ages use d as inpu t feat ures

ML

mod el vali date d

retr

osp ecti vely

; no live alert inte grat ion

Not asse ssed

Stro ke: 5

case s; CH

S: 4

case s (all in EP OH

gro up); Cer ebra l hem orrh age: 2

case s (bot h EP OH)

Not mod eled dire ctly; BIS

not

repo rted

Not assess ed

Discussion

This review emphasises how ML-based perioperative prediction models perform better than conventional techniques in terms of clinical utility and accuracy. Logistic regression is clearly outperformed by ensemble models such as XGBoost (AUROC up to 0.942) and RF (AUROC 0.96). By utilising time-series data, deep learning architectures provide additional incremental benefits.

Since several RCTs showed significant decreases in IOH burden, confirming the idea of predictive haemodynamic monitoring, the Hypotension Prediction Index (HPI) is noteworthy for its real- time clinical applicability. The idea that IOH is a modifiable intraoperative risk factor and that proactive ML-guided interventions can lessen its severity and duration is supported by this data.

ML-Driven vs. Biomarker-Based Models Biomarker- based methods (HPI)

Using arterial waveform analysis, provide timely, actionable alerts proven to lower IOH in clinical settings [19]. Reliance on proprietary algorithms and a comparatively limited prediction scope (hypotension only) are among thelimitations.

Models Powered by Machine Learning

Predict broader outcomes (mortality, AKI, composite complications) by integrating multi-modal data sources. Show improved discriminative abilities.

Model complexity, interpretability, and the requirement for sizable, superior datasets are among the difficulties.

Clinical Consequences

The findings of this synthesis point to a potential paradigm shift in perioperative care: predictive analytics, like capnography or pulse oximetry, may soon be used as a routine supplement to anaesthesia monitoring.

Proactive management techniques like prompt vasopressor titration, fluid optimisation, and renal protection protocols may be made possible by machine learning (ML) models integrated into anaesthesia information management systems (AIMS).

The Evidence's Limitations

Heterogeneity in outcome definitions, inconsistent performance metric reporting, and a small number of highly effective RCTs focussing on hard endpoints (mortality, AKI, myocardial injury) limit the body of evidence. Few studies evaluated cost-effectiveness or implementation feasibility in standard clinical practice, and proprietary algorithms like HPI further restrict reproducibility.

Prospects for the Future

Multicenter, sufficiently powered RCTs assessing clinically significant endpoints should be the top priority for future research. Explainable AI (XAI) to increase clinician trust and transparency combining machine learning models with biomarker-based indices to provide hybrid real-time decision support.

Health economics studies to assess implementation logistics and cost-effectiveness.

To guarantee generalisability, external validation is conducted across a variety of populations.

Conclusion

Perioperative risk management is about to be redefined by machine learning and biomarker-based monitoring. These models reliably forecast and reduce intraoperative complications more successfully than conventional techniques by utilising real-time waveform data, EHR variables, and sophisticated analytics. Large-scale retrospective studies validate the scalability of ML- based approaches, while HPI-guided trials demonstrate the clinical utility and viability of predictive monitoring.

However, strong proof of better patient outcomes—rather than merely surrogate metrics—will be necessary for wider adoption. In order to guarantee that predictive alerts result in significant clinical actions that lower morbidity and mortality, future research should focus on integrating ML algorithms into perioperative care pathways.

Acknowledgements

Declaration of Al-Assisted Technologies in the Writing Process: During the preparation of this work the authors used Chat-GPT version 3.5 to check grammar and spelling. After using this tool, the authors reviewed and edited the content and take full responsibility for the content of the publication.

Declaration of Interest

None of the authors has any conflicts of interest to disclose.

Funding

This work was conducted without any financial support from funding agencies in the public, commercial, or not-for-profit sectors.

References

  1. Kang, A. R., Lee, J., Jung, W., Lee, M., Park, S. Y., et al. (2020). Development of a prediction model for hypotension after induction of anesthesia using machine learning. PLOS ONE, 15(4), e0231172.
  2. Fritz, B. A., Cui, Z., Zhang, M., He, Y., Chen, Y., et al. (2019). Deep-learning model for predicting 30-day postoperative mortality. British Journal of Anaesthesia, 123(5), 688–695.
  3. Fritz, B. A., King, C. R., Abdelhack, M., Chen, Y., Kronzer, A., et al. (2024). Effect of machine learning models on clinician prediction of postoperative complications: The Perioperative ORACLE randomised clinical trial. British Journal of Anaesthesia, 133(5), 1042–1050.
  4. Hayase, K., Hayashi, K., & Sawa, T. (2020). Hierarchical Poincaré analysis for anaesthesia monitoring. Journal of Clinical Monitoring and Computing, 34(6), 1321–1330.
  5. Wijnberge, M., Geerts, B. F., Hol, L., Lemmers, N., Mulder,M. P., et al. (2020). Effect of a machine learning–derived early warning system for intraoperative hypotension vs standard care on depth and duration of intraoperative hypotension during elective noncardiac surgery: The HYPE randomized clinical trial. JAMA, 323(11), 1052–1060.
  6. Ripollés-Melchor, J., Tomé-Roca, J. L., Zorrilla-Vaca, A., Aldecoa, C., Colomina, M. J., et al. (2025). Hemodynamic management guided by the Hypotension Prediction Index in abdominal surgery: A multicenter randomized clinical trial. Anesthesiology, 142(4), 639–654.
  7. Schenk, J., Wijnberge, M., Maaskant, J. M., Hollmann, M. W., Hol, L., et al. (2021). Effect of Hypotension Prediction Index-guided intraoperative haemodynamic care on depth and duration of postoperative hypotension: A sub-study of the Hypotension Prediction trial. British Journal of Anaesthesia, 127(5), 681–688.
  8. Frassanito, L., Giuri, P. P., Vassalli, F., Piersanti, A., Garcia,M. I. M., et al. (2023). Hypotension Prediction Index guided goal-directed therapy and the amount of hypotension during major gynaecologic oncologic surgery: A randomized controlled clinical trial. Journal of Clinical Monitoring and Computing, 37(4), 1081–1093.
  9. Šribar, A., SokoloviÄ? Jurinjak, I., Almahariq, H., BandiÄ?, I., MatoševiÄ?, J., et al. (2023). Hypotension prediction index guided versus conventional goal-directed therapy to reduce intraoperative hypotension during thoracic surgery: A randomized trial. BMC Anesthesiology, 23, 101.
  10. Mansoor Baig, M., Gholamhosseini, H., & Harrison, M. J. (2013). Fuzzy logic based anaesthesia monitoring systems for the detection of absolute hypovolaemia. Computer Methods and Programs in Biomedicine, 43(6), 683–692.
  11. Lee, S. W., Lee, H. C., Suh, J., Lee, K. H., Lee, H., et al.(2022). Multi-center validation of machine learning model for preoperative prediction of postoperative mortality. NPJ Digital Medicine, 5(1), 91.
  12. Te, R., Zhu, B., Ma, H., Zhang, X., Chen, S., et al. (2024).Machine learning approach for predicting post-intubation hemodynamic instability (PIHI) index values: Towards enhanced perioperative anesthesia quality and safety. BMC Anesthesiology, 24(1), 136.
  13. Luckscheiter, A., Zink, W., Lohs, T., Eisenberger, J., Thiel, M., et al. (2022). Machine learning for the prediction of preclinical airway management in injured patients: A registry- based trial. Clinical and Experimental Emergency Medicine, 9(4), 304–313.
  14. Hu, Z., Pan, G., Wang, X., & Li, K. (2024). Intelligent algorithm based on deep learning to predict the dosage for anesthesia: A study on prediction of drug efficacy based on deep learning. Health Science Reports, 7, e2113.
  15. Abin, A. A., Molla, A., Ejmalian, A., Nabavi, S., Memari, B., et al. (2024). Anesthetic management recommendations using a machine learning algorithm to reduce the risk of acute kidney injury after cardiac surgeries. Anesthesiology and Pain Medicine, 14(3), e143853.
  16. Tan, J., Wang, Q., Shi, W., Liang, K., Yu, B., et al. (2021). A machine learning approach for predicting early phase postoperative hypertension in patients undergoing carotid endarterectomy. Annals of Vascular Surgery, 71, 121–131.
  17. Morisson, L., Nadeau-Vallee, D. M., Espitalier, F., Laferrière- Langlois, P., Idrissi, M., et al. (2023). Prediction of acute postoperative pain based on intraoperative nociception level (NOL) index values: The impact of machine learning-based analysis. Journal of Clinical Monitoring and Computing, 37, 337–344.
  18. Velagapudi, M., Nair, A. A., Strodtbeck, W., Flynn, D. N., Howell, K., et al. (2023). Evaluation of machine learning models as decision aids for anesthesiologists. Journal of Clinical Monitoring and Computing, 37, 155–163.
  19. U.S. Food and Drug Administration. (2021). Good machine learning practice for medical device development: Guiding principles.