Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review

Antonio Andrea Camastra MD; Matheus Requena Escobar MD; Daniel Macedo Oliveira MD; Laiz G. C. Novaes MD; Andre Busatto de Donato MD; Lucas Teixeira Baldo MD; Joao Evangelista Ponte Conrado MD; CecÃ­lia Schettini Gueiros MS; Raphael Matheus de Souza Makiyama Lopes MD; Thomas Rolf Erdmann MD; MsC; PhD

doi:10.33140/JFMHI.04.01.03

Journal of Future Medicine and Healthcare Innovation(JFMHI)

ISSN: 3065-7628 | DOI: 10.33140/JFMHI

Researchers and authors can directly submit their manuscript online through this link Online Manuscript Submission.

Track Your Submission

Share this page:

Open Access Journals

Research Article - (2026) Volume 4, Issue 1

View PDF Download PDF

Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review

Antonio Andrea Camastra MD ^*, Matheus Requena Escobar MD , Daniel Macedo Oliveira MD , Laiz G. C. Novaes MD , Andre Busatto de Donato MD , Lucas Teixeira Baldo MD , Joao Evangelista Ponte Conrado MD , Cecilia Schettini Gueiros MS , Raphael Matheus de Souza Makiyama Lopes MD and Thomas Rolf Erdmann MD, MsC, PhD

¹Anesthesia and Intensive Care Department, Università Magna Grecia di Catanzaro, ORCID: 0009-0003-6025-7772, Italy
²Centro Universitário Lusíada, Brazil
³Center of Medical Sciences, Universidade Federal da Paraíba, Paraíba, Brazil https://orcid.org/0009- 0000-4207-3994, Italy
⁴Department of Medicine, Estácio de Sá University, Rio de Janeiro, Brazil
⁵Departamento de Anestesiologia, Universidade de São Paulo, São Paulo, Brazil
⁶Instituto de Cirurgia do Lago, Brasilia, https://orcid.org/0009-0008-6045-9768, Brazil
⁷Faculdade de Medicina da Universidade Federal do Ceara, Brazil
⁸Faculdade Pernambucana de Saude, Brazil
⁹SANIT (servico de anestesiologia de itajai), Brazil
¹⁰Departamento de Cirurgia, Universidade Federal de Santa Catarina, Santa Catarina, Brazil https:// orcid.org/0000-0003-4741-0245, Brazil

^*Corresponding Author: Antonio Andrea Camastra MD, Anesthesia and Intensive Care Department, Università Magna Grecia di Catanzaro, Italy

Received Date: Jan 22, 2026 / Accepted Date: Feb 12, 2026 / Published Date: Feb 17, 2026

Copyright: ©2026 Antonio Andrea Camastra. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Citation: Camastra, A. A., Escobar, M. R., Oliveira, D. M., Novaes, L. G. C., Donato, A. B. D., et al. (2026). Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review. J Future Med Healthcare Innovation, 4(1), 01-20.

Abstract

Background: Artificial intelligence (AI) is transforming medicine by enabling real-time data analysis and improved decision-making. In anaesthesiology, AI tools are increasingly used for perioperative risk assessment and intraoperative monitoring, but evidence on their real-world performance and safety remains limited.

Methods: We conducted a systematic review and meta-analysis following PRISMA guidelines, including studies from 2010 to May 2025 that evaluated AI applications—machine learning (ML), deep learning, neural networks, and fuzzy logic— in adult patients undergoing general or regional anaesthesia. Primary outcomes were perioperative complications (e.g., hypotension, hypoxia, bradycardia, delirium, vomiting, cardiac arrest, mortality, acute kidney injury [AKI]); secondary outcomes included haemodynamic stability, ICU admission, and length of stay. Risk of bias was assessed using RoB 2 and ROBINS-I, and random-effects models were applied.

Results: Eighteen studies with diverse surgical settings and sample sizes (60 to >450,000 patients) were included. ML models consistently outperformed conventional statistical methods. Ensemble algorithms, such as XGBoost and random forests, achieved AUROC values of 0.942 and 0.96, respectively. Deep learning models, including Max-Pooling Convolutional Neural Networks, predicted mortality with AUROC 0.867. Hypotension Prediction Index (HPI) trials showed 88% sensitivity, 87% specificity, and a 77% reduction in hypotension burden. Hybrid models integrating waveform and electronic health record data reported AUROCs of 0.807 for mortality and 0.766 for AKI.

Conclusions: AI-based monitoring, especially ML and biomarker-guided strategies, offers substantial improvements in perioperative risk stratification and haemodynamic management. Wider clinical adoption requires external validation, explainable AI frameworks, and rigorously designed randomized controlled trials demonstrating meaningful patient outcome benefits.

Keywords

Machine Learning, Hypotension Prediction Index Intraoperative Hypotension, Acute Kidney Injury, Perioperative Prediction, Ensemble Models and Deep Learning

PROSPERO registration n: CRD420251060430.

Key Message

1. AI-based perioperative monitoring systems, encompassing machine learning and biomarker-guided methodologies, exhibit enhanced diagnostic precision relative to conventional statistical techniques in various surgical environments.

2. Randomised controlled evidence demonstrates the clinical efficacy of AI-assisted haemodynamic management, resulting in substantial decreases in the occurrence, severity, and duration of intraoperative hypotension.

3. The transition of AI from predictive accuracy to enhanced patient-centered outcomes is dependent on extensive, multicenter randomised trials, external validation, and transparent AI frameworks.

Introduction

Artificial intelligence (AI) has been instrumental in changing the global healthcare scene over the last 20 years, paving the way for new developments in clinical care, diagnosis, individualised treatment, and healthcare data management. Because anaesthesiology requires the real-time interpretation and processing of vast volumes of physiological and pharmacological data, it is a particularly promising field for the adoption of AI-based technologies. Therefore, perioperative safety and the standard of anaesthesia care could be significantly improved by integrating AI-assisted predictive and decision support tools.

The precise administration of fast-acting medications, the timely adaptation to significant physiological changes, and the continuous, high-precision monitoring of multiple vital parameters are all necessary for modern anaesthesiology. However, many clinical decisions are still based on doctors' subjective experiences and established practices rather than standardised, predictive approaches, even with major advancements in medical technology. In this regard, moving from reactive to proactive decision-making models that can foresee crucial events before they materialise clinically is made possible by artificial intelligence and its subfields, such as machine learning (ML), deep learning (DL), and artificial neural networks [1-4].

Predicting intraoperative and postoperative complications, including bradycardia, hypotension, hypoxia, postoperative delirium, postoperative nausea and vomiting (PONV), intensive care admission, and mortality, is a primary objective of artificial intelligence in anaesthesiology. Many of these incidents happen unexpectedly, necessitating quick action to avoid negative outcomes. AI systems analyse preoperative clinical and anamnestic data, as well as real-time multi-parameter signals from automated anaesthetic delivery systems, electroencephalograms (EEGs), and physiological monitors. Finding patterns that are hidden from view, estimating the likelihood of future occurrences, and proposing prompt intervention techniques are the objectives.

Numerous observational and randomised clinical studies have assessed the efficacy of these technologies at different points during the perioperative period during the past ten years. For instance, the Hypotension Prediction Index (HPI) software was created with the help of artificial intelligence (AI) and has been thoroughly investigated for its capacity to anticipate intraoperative hypotension episodes up to several minutes beforehand [5-9]. This enables medical professionals to use fluidic interventions or pharmacological modulation to stop the event from happening. The automated administration of intravenous anaesthetics, including propofol and remifentanil, has also been optimised through the use of AI algorithms. This has improved the depth of anaesthesia and haemodynamic stability while allowing for more customisation than conventional target-controlled infusion (TCI) systems.

More effective control of arterial pressure, depth of anaesthesia, fluid administration, and mechanical ventilation has been made possible by intelligent controllers built on fuzzy logic or neuro- adaptive predictive models [10]. By minimising inter-individual variability in anaesthetic response and maximising the balance between efficacy and safety, these tools have been demonstrated to increase the precision and consistency of therapeutic strategies. Additionally, AI-powered clinical information management tools, like predictive electronic anaesthesia records, have enhanced perioperative data collection and analysis, opening up new avenues for quality improvement and clinical audit.

The current body of scientific literature is disjointed and frequently contradictory, despite the excitement these innovations have generated. Few studies have shown a significant impact on critical clinical outcomes like length of hospital stay, major complication rates, or mortality, despite the fact that many have reported encouraging results in terms of predictive accuracy and improved intraoperative parameters. Moreover, it is challenging to reach firm conclusions regarding the applicability of these technologies in routine clinical practice due to variations in the AI models used, the clinical contexts examined, and the methodological calibre of publications.

In light of this, the goal of this systematic review and meta- analysis is to present a comprehensive summary of the data that is currently available regarding the application of AI in perioperative anaesthesia. Through a comparative analysis of data from observational and randomised clinical trials, the review will assess the safety and effectiveness of AI-based tools for intraoperative monitoring and complication prediction. The results will be stratified by the technology (deep learning (DL), fuzzy logic, neural networks, or machine learning (ML), the type of complication targeted, the operative setting (major versus minor surgery; elective versus urgent), and the methodological quality of the included studies.

The PICOT framework, which is advised for creating structured clinical questions, was used to define the inclusion criteria that were chosen for this review. Adult patients undergoing regional or general anaesthesia make up the target population. The use of artificial intelligence (AI)-based technologies for intraoperative monitoring or perioperative adverse event prediction is the intervention (I). Standard, non-AI-assisted monitoring techniques are used in comparison (C). Predictive accuracy in relation to particular complications (such as hypotension, hypoxia, post- operative vomiting syndrome, delirium, and death) is the primary outcome (O), and clinical variables like length of hospital stay, ICU admission rate, and intraoperative stability are the secondary outcomes. Since 2010 is when AI-based technologies were first introduced and used in anaesthesiology, both prospective observational studies and randomised controlled trials (RCTs) and relevant retrospective studies published since then were included.

This meta-analysis differs from earlier synthesis attempts in that it focusses on studies published after 2010 and has stricter inclusion criteria. Methodological quality, which is evaluated with particular instruments like ROBINS-I for observational studies and RoB 2 for randomised controlled trials (RCTs), is emphasised. Additionally, subgroup analyses will be carried out based on the kind of algorithm employed, the perioperative phase of application (pre-, intra-, or postoperative), and the kind of complication that is being targeted. There are also moral, legal, and financial concerns with the broad use of AI in anaesthesia. Thorough consideration must be given to issues like algorithm transparency, medico-legal liability for mistakes, sensitive data protection, and guaranteeing fair access to these advancements. Training medical staff to enable anaesthetists to consciously incorporate the new technologies into their daily clinical practice is another essential component. Therefore, educational institutions and healthcare facilities ought to support professional development courses that address fundamental AI concepts, data analysis, and human-machine interaction.

Establishing a strong body of evidence to direct the clinical application of AI tools in anaesthesiology and to guide future research aimed at creating dependable, secure, and genuinely practical predictive models for daily use is the ultimate goal of this work. Technical innovation, strong proof of clinical efficacy, safety, and acceptability, as well as long-term economic viability, are necessary for the adoption of such technologies.

Methods

To maintain openness, reproducibility, and methodological rigour, this systematic review with meta-analysis closely adhered to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Because the protocol was pre- registered with the PROSPERO (International Prospective Register of Systematic Reviews) database, the authors were bound to adhere to a predetermined plan, reducing the possibility of selective bias and boosting the accuracy of the findings.

Type of Studies Included

Randomised controlled trials (RCTs), retrospective studies and prospective observational studies on the application of AI-based technologies in anaesthesiology that were carried out in adult human populations were included in the review. The recent adoption of AI technologies in clinical practice, which has reduced the quantity of RCTs available, served as the impetus for the decision to incorporate observational studies. This does not, however, rule out the possibility of gathering pertinent data from carefully planned non-randomized research.

Inclusion Criteria

Every study that satisfied the following requirements was deemed qualified:

- To guarantee the inclusion of current research in line with recent advancements in AI technologies, the publication date should be between January 1, 2010, and December 31, 2024.

- Research design: observational studies with a control group and prospective RCTs.

- Adult subjects (≥18 years old) undergoing general or loco- regional anaesthesia in a surgical or procedural setting comprise the population.

- Intervention: the application of models, algorithms, or predictive tools based on artificial intelligence, such as expert systems, machine learning, deep learning, artificial neural networks, or fuzzy logic algorithms.

- The ability to predict or prevent perioperative complications, such as intraoperative hypotension, hypoxia, adverse cardiovascular events, bradycardia, postoperative delirium, PONV, cardiac arrest, and AKI, is the main outcome.

- Long hospital stays, the need to be admitted to an intensive care unit, haemodynamic stability, and the effectiveness of intraoperative physiological monitoring are examples of secondary outcomes.

Exclusion Criteria

Research that satisfied these requirements was not included: - research done on children (less than 18 years old) or in animal models.

- Research that lacked a clear and documented AI-based intervention.

- Simulation studies, case reports, editorials, letters to the editor, and conference abstracts without peer-reviewed full texts are all examples.

- To guarantee linguistic homogeneity in data extraction and interpretation, studies published in languages other than English were disqualified.

Inclusion criteria	Exclusion criteria
Adult subjects (≥18 years old)	Research done on children (less than 18 years old) or in animal models
observational or RCTs	Research that lacked a clear and documented AI-based intervention
publication date between January 1, 2010- present.	not observational or RCTs
All AI technologies or system	studies published in languages other than English

Table. Inclusion and Exclusion Criteria

Sources of Information and Search Methodology

Three primary biomedical databases—PubMed/MEDLINE, Embase, and the Cochrane Central Register of Controlled Trials (CENTRAL)—were thoroughly searched. The search approach was predetermined and comprised MeSH (Medical Subject Headings) terms and keywords related to the fields of artificial intelligence, perioperative complications, and anaesthesia. Terms like "artificial intelligence," "machine learning," "deep learning," "anaesthesia," "perioperative complications," "monitoring," "prediction," and others were examples of Boolean combinations. Only human subjects’ studies published between 2010 and May 2025 were included in the results.

Search Strategy

Study Selection Process

Two reviewers (A.A.C., M. R. E.) independently screened all identified abstracts and titles. Using the predetermined criteria, the full texts of potentially eligible articles were retrieved and evaluated for inclusion. A third reviewer (T. R. E.) was consulted or discussed in order to settle disagreements. (Table 1.) (Table 1.)

Stage	Number of Records	Notes
Records identified	2772	Using the defined search strategy (see below)
Records after duplicates removed	2586	Duplicate records removed
Records screened (title and abstract)	2586	Screening based on predefined PICOTT criteria
Records excluded	2552	Excluded for: Not RCTs or observational Not related to anaesthesiology Not adult patients
Full-text articles assessed for retrieval	34	Full-text retrieval attempted
Full-text articles not retrieved	0	Full texts not accessible
Full-text articles assessed for eligibility	34	Evaluated according to inclusion/exclusion criteria

Full-text articles excluded

16

Common reasons for exclusion:

Not RCT
AI not directly used
Irrelevant outcomes

Studies included in the systematic review

18

All were randomized controlled trials on AI in

perioperative anaesthesia

Studies included in the meta-analysis

18

All provided extractable quantitative outcome data

Data extraction

A standardised collection form that was tested in a pilot phase was used to extract pertinent data from each includedstudy.

This form was used to extract pertinent data from each included study. The variables that were gathered included the author's name, the year of publication, the study design, the sample that was analysed, the population's demographics, the type and characteristics of the AI algorithm that was used, the clinical setting, the type of surgery or anaesthesia that was performed, the primary and secondary outcomes, the main results, and the authors' main conclusions. Two authors independently collected the data in duplicate.

Risk of Bias Assessment

Validated instruments were used to evaluate the included studies' methodological quality:

- RoB 2.0 for RCTs, in accordance with the Cochrane Collaboration's guidelines. Randomisation, treatment deviation, attrition, outcome measurement, and the selection of reported data were among the biases evaluated.

Non-randomized observational studies were conducted using ROBINS-I, which takes into account biases in participant selection, confounding factors, intervention classification, missing data, and analysis techniques.

Two authors conducted each evaluation independently, and any disagreements over the results were discussed. Each study's overall quality was assigned a risk of bias rating of "low," "moderate," or "high."

Statistical Analysis and Data Synthesis

Meta-analysis was used to synthesise quantitative data with homogeneous populations, interventions, and outcomes. To account for variation amongst studies, a random effects model was used. With 95% confidence intervals, effect sizes were presented as odds ratios (OR), risk ratios (RR), or mean differences (MD). Cochran's Q statistic was used to evaluate study heterogeneity, and the I2 index was used to quantify it based on three criteria: 0–25% (low), 25–50% (moderate), and >50% (high).

To investigate the efficacy of AI, subgroup analyses were designed according to the following criteria: the algorithm type (e.g., HPI, fuzzy logic, deep learning, or generic predictive models); the type of complication (e.g., neurological, cardiovascular, or respiratory); and the surgical context (major vs. minor surgery; elective vs. emergency).

Studies with a high risk of bias were excluded through sensitivity analyses.

Affiliation and Peer Review

A multidisciplinary team unaffiliated with academic institutions carried out the work, and during the protocol phase, the review was not exposed to external peer review. However, after the manuscript is finished, it will be submitted for peer review and publication.

Results

There were 18 studies in all, and they were a mix of randomised controlled trials (RCTs), prospective observational studies, and retrospective analyses. The studies took place in many different parts of the world, including Europe (the Netherlands, Italy, Germany, and Spain), North America (the United States), and Asia (Korea and China). This shows that people all over the world are interested in using machine learning (ML) and biomarkers to predict and manage perioperative events. (Table 2.)

Study (Au- thor, Year)	Setting	Sample Size	Study Type	Primary Outcome	Model(s) Used	Best Model Performance	Input Fea- tures	Unique Fea- tures
Schenk et al.,	Amster-	60 (54 ana-	Randomized	Time-weight-	Hypotension	88% sensitivi-	Continuous	Machine learn-
2021	dam UMC,	lyzed)	Controlled	ed average	Prediction	ty, 87% speci-	arterial wave-	ing-based early
	Netherlands		Sub-Study	(TWA) of	Index (HPI)	ficity for IOH;	form; MAP	warning system;
	– PACU			postoperative		77% reduction	<65 mm Hg;	integration with
				hypotension		in IOH depth/	FloTracIQ	HemoSphere
				(POH)		duration	sensor data	monitor

Fritz et al.,	Single center	95,907	Retrospective	30-day	MPCNN	AUROC:	>70 static and	Real-time dy-
2019	(USA)			postoperative	(CNN +	0.867,	time-series	namic predic-
				mortality	LSTM),	AUPRC: 0.097	variables	tion, intraopera-
					DNN, SVM,			tive monitoring
					RF, LR
Lee et al.,	Multi-center	454,404	Retrospective	30-day	XGBoost,	AUROC:	12–18	External vali-
2022	(Korea)			postoperative	LR, RF, DNN	0.942,	preoperative	dation across
				mortality		AUPRC: 0.175	variables	4 hospitals,
						(XGBoost)		compact model
Te et al., 2024	Single center	23,305	Retrospective	PIHI Index	Extra Tree	R²: 0.9047,	14 variables:	Continuous
	(China)			(post-in-	Regressor,	MAPE:	demographics	outcome, ICV
				tubation	XGBoost,	20.86% (ETR)	+ drug dosages	index creation,
				instability)	MLP, SVR,			SMOTETomek
					MLR			balancing
Fritz et al.,	Barnes-Jew-	5071 pa-	Single-centre	30-day	ML models	Death predic-	Electronic	Clinicians
2024	ish Hospital,	tients	Randomised	all-cause	for death &	tion: AUROC	health records:	randomized to
	Saint Louis,		Controlled	postoperative	AKI	0.807; AKI	demographics,	ML-assisted vs
	MO, USA		Trial	mortality and		prediction:	labs, pre-op	unassisted re-
				AKI within 7		AUROC 0.766	data, anaesthe-	view; live EHR
				days			sia record	integration;
								Likert-based
								risk rating scale
Frassanito et	IRCCS	60 patients	Single-centre	TWA-MAP	HPI +	82% reduction	HPI alerts,	Real-time HPI
al., 2023	Policlinico		RCT	<65 mm Hg;	Modified	in IOH events;	SVV, dP/	alert protocol;
	Gemelli,			IOH after	Goal-Direct-	TWA-MAP	dtmax, Eadyn,	faster interven-
	Rome, Italy			induction;	ed Therapy	<65 mm Hg:	MAP, BIS, flu-	tion; tailored
				severe hypo-		0.14 vs 0.77	id/vasopressor	hemodynamic
				tension		mm Hg	response	algorithm
Kang et al.,	Soonchunhy-	222 patients	Retrospective	Hypotension	Naïve Bayes,	Random For-	89 features	High-res
2020	ang Univer-		ML study	after anesthe-	Logistic	est: AUROC	from EHR	time-syn-
	sity Bucheon			sia induction	Regression,	0.842 (CI:	+ anesthesia	chronized
	Hospital,			(tracheal	Random	0.736–0.948);	devices: SBP,	multi-device
	South Korea			intubation–	Forest, Arti-	83.7% recall	MBP, HR,	monitoring;
				incision)	ficial Neural	with selected	TIVA doses,	pre-intuba-
					Network	features	ventilator set-	tion-only train-
							tings, patient	ing window;
							demographics	feature set
								comparisons
Wijnberge et	Amsterdam	64 patients	Single-centre	Reduction in	Hypotension	HPI group:-	Arterial	First RCT on
al., 2020	UMC, Neth-		RCT	intraopera-	Prediction	significant re-	waveform data	HPI; real-time
	erlands			tive hypoten-	Index (HPI)	duction in time	via FloTracIQ	alerts; integra-
				sion duration		spent MAP	sensor, MAP	tion with Acu-
						<65 mm Hg	thresholds	men platform
Baig et al.,	Auckland	30 patients	Real-time	Detection of	RT-SAAM,	FLMS-2:	HR, BP, PV,	Multi-module
2013	City Hos-		fuzzy logic	absolute hy-	FLMS-2	Kappa = 0.75;	EtCOâ??; wave-	fuzzy alarms;
	pital, New		validation	povolaemia	(Fuzzy logic	RT-SAAM:	form analysis;	real-time alerts;
	Zealand				systems)	Kappa = 0.62	fuzzy rule sets	clinician agree-
								ment validation
Rip-	28 hospitals	917 patients	Multicenter	Moder-	Hypotension	No significant	HPI >80,	Largest HPI
ollés-Melchor	(Spain +		RCT	ate-to-severe	Prediction	difference in	MAP, SVV,	RCT; diverse
et al., 2025	Jordan)			AKI within	Index (HPI)	AKI incidence	Eadyn, dP/	real-world prac-
				7 days;		(HPI: 6.1% vs	dtmax, vaso-	tices; subgroup
				complica-		control: 7.0%)	pressor/fluids	analysis for
				tions; 30-day			guided by	hypertension
				mortality			Hemosphere	effect

Hu et al.,	Henan Univ.	202 patients	Deep learn-	Preoperative	ANN + CNN	R² = 0.915 (in-	Gender, age,	Multi-stage
2024	of Sci. &		ing model	& intraopera-	+ LSTM +	tra-op); MAPE	weight, SBP,	model; attention
	Tech., China		validation	tive anes-	Attention	= 12.25%	DBP, BIS, HR;	mechanism;
				thetic dose			time-series	convolutional
				prediction			vitals	feature ex-
								traction
Abin et al.,	3 hospitals,	998 patients	Cohort ML-	AKI risk	AKI– and	Accuracy:	8 pre-op	Dual regression
2024	Tehran, Iran		based design	reduction via	AKI+ regres-	80.6%; F1-	features →	system; visual
				anesthesia	sion planners	score: 0.821;	6 anesthesia	dashboard; vali-
				management		recall: 84.8%	targets (CPB	dated by cardiac
							time, fluids,	anesthesiolo-
							transfusion,	gists
							diuretics)
Hayase et al.,	Kyoto Chubu	83,867 EEG	Observa-	BIS predic-	MLPNN	R = 0.87;	Poincaré-index	Hierarchical
2020	Medical Cen-	epochs from	tional + ML	tion from	(deep learn-	RMSE = 7.09;	(0.5–47 Hz &	Poincaré plot
	ter, Japan	30 patients	validation	EEG scatter	ing)	Bias = 0.07	20–30 Hz),	fusion; EMG-
				patterns			EMG70–110	aware BIS
							Hz, suppres-	estimation; BIS
							sion ratio	used as super-
								visor
Šribar et al.,	University	34 patients	Single-centre	Duration,	Hypotension	TWA-AUT:	HPI alert	First HPI RCT
2023	Hospital		RCT	depth, and	Prediction	0.01 mmHg	threshold ≥90;	in thoracic sur-
	Dubrava,			frequency of	Index (HPI)	(HPI) vs 0.08	MAP, SVV,	gery; structured
	Croatia			IOH during	vs FloTrac	mmHg (GDT);	Eadyn, dP/dt,	decision tree;
				thoracic	GDT	fewer hypoten-	CI, SVI, HR,	high-fidelity
				surgery		sive episodes	BP, lactate,	waveform
						in HPI group	ScvOâ??	analysis
Morisson et	Maison-	66 patients	Ancillary	Prediction of	ML algo-	Penalized	Age, ASA	First ML analy-
al., 2022	neuve-Rose-		RCT analysis	moderate to	rithms (elas-	logistic regres-	class, BMI,	sis of intraoper-
	mont Hospi-			severe PACU	ticnet, RF,	sion: CV-AUC	NOL metrics	ative nocicep-
	tal, Montréal,			pain	SVM…)	0.753(0.718–	(reaction to in-	tion (NOL) to
	Canada					0.788); Brier	tubation/inci-	predict PACU
						score: 0.194	sion), surgical	pain; SHAP
							TWA NOL	interpretability;
								surgery-specific
								feature mod-
								eling
Velagapudi et	4 academic	100 patients	Prospective	Accuracy	XGBoost	Glucose: Ac-	Age, BMI,	First dual-mod-
al., 2022	hospitals	× 2 models	validation via	of anesthe-	(glucose),	curacy ↑ from	ASA, diabetes,	el validation
	(USA)		web-based	siologist	Random For-	79% to 84.7%;	meds, pain his-	with real clini-
			simulation	predictions	est (opioids)	Opioids: ↑	tory, surgery	cians; REDCap
				for glucose &		from 18% to	type, duration,	interface;
				opioid needs		42%	anesthesia type	SHAP-informed
								feedback
Luckscheiter	Southwest	25,556 trau-	Regis-	Need for	Random For-	RF: AUROC	24 features via	First large-scale
et al., 2022	Germany	ma patients	try-based	preclinical	est (RF) &	0.96; PRC	PCA: auscul-	airway predic-
	EMS registry		retrospective	airway man-	Naive Bayes	area 0.83; NB:	tation, injury	tion model in
	(MIND 3.1)		analysis	agement in	(NB)	AUROC 0.93;	pattern, oxy-	EMS; validated
				trauma		PRC area 0.66	gen therapy,	on real-world
							shock index,	trauma registry;
							vital signs,	SMOTE ap-
							interventions	plied for class
								balance

Tan et al.,	Huashan	406 carotid	Retrospective	Early Phase	Gradient	AUROC:	100+ perioper-	First ML model
2021	Hospital, Fu-	endarterec-	single-center	Postoperative	Boosted	0.77 (95% CI:	ative variables	for EPOH
	dan Universi-	tomy (CEA)	cohort study	Hypertension	Regres-	0.62–0.92);	including:	prediction;
	ty, Shanghai,	procedures		(EPOH)	sion Trees	Sensitivity	intraoperative	feature impor-
	China			requiring	(GBRT) via	~90%; Speci-	peak SBP,	tance analysis
				intravenous	XGBoost	ficity ~52%	cardiac index,	via XGBoost
				vasodilators			anesthetic dos-	gain scores;
				within 24			ages (propo-	validated with
				hours post-			fol, fentanyl,	4-fold stratified
				CEA			ephedrine), lab	cross-valida-
							values (glu-	tion; strong
							cose, choles-	association
							terol, alkaline	found between
							phosphatase),	intraoperative
							Doppler	BP spikes and
							echocardiog-	EPOH risk
							raphy metrics,
							clamping time,
							and demo-
							graphic data

The sample sizes were very different. For example, Schenk et al. (2021) had only 60 patients (n=60), while Lee et al. (2022) had 454,404 patients in a very large retrospective dataset [11]. The ORACLE Trial (Fritz et al., 2024) was the biggest interventional study [12]. It randomly assigned 5,071 patients to either ML- assisted or standard perioperative care. (Table 2.)

The Main Results of the Studies Were Different and Included

- Most of the time, intraoperative hypotension (IOH) is defined as a mean arterial pressure (MAP) of less than 65 mmHg (e.g., Schenk et al., 2021; Wijnberge et al., 2020).

- Acute kidney injury (AKI) is often used as a secondary endpoint (for example, Fritz et al., 2024; Ripollés-Melchor et al., 2025).

- Mortality after surgery: both 30 days after surgery and while in the hospital (e.g., Fritz et al., 2019; Lee et al., 2022).

- PIHI (Te et al., 2024) is an example of a composite haemodynamic instability index.

Modelling Methods and Algorithmic Techniques

There were a number of different predictive modelling methods used in the studies. These can be divided into three main groups: traditional ML algorithms, deep learning architectures, and biomarker-based indices (especially HPI).

Some of the models that were tested the most were random forests (RF), logistic regression (LR), gradient boosting machines (like XGBoost), and support vector machines (SVM).

Lee et al. (2022) directly compared several algorithms for predicting 30-day mortality. They found that XGBoost did better than both RF (AUROC = 0.92) and DNN models, with AUROC = 0.942 and AUPRC = 0.175.

Luckscheiter et al. (2022) made models for bad things that happen during surgery. RF got an AUROC of 0.96 and a PRC area of 0.83 [13].

Deep Learning Structures

Deep learning methods took advantage of the changing nature of data streams during surgery.

Fritz et al. (2019) used a combination of convolutional neural networks (CNNs) and LSTM-based architectures (MPCNN) to predict 30-day mortality with an AUROC of 0.867 and an AUPRC of 0.097. Hu et al. (2024) used these ideas on high-frequency arterial waveform data and found that R² = 0.915 and MAPE = 12.25% were good for predicting changes in blood flow during surgery [14].

The Hypotension Prediction Index (HPI)

HPI, a proprietary algorithm trained to predict IOH minutes before onset, was evaluated in a number of RCTs and observational studies (Schenk et al., 2021; Wijnberge et al., 2020; Ripollés- Melchor et al., 2025). In addition to a 77% decrease in the severity and duration of hypotension in the intervention group, Schenk et al. (2021) reported 88% sensitivity and 87% specificity for HPI. When HPI guidance was used, Wijnberge et al. (2020) also showed a significant decrease in cumulative hypotension time.

Innovative and Hybrid Methods

Te et al. (2024) developed the Post-Intubation Hemodynamic Instability (PIHI) index using 14 clinical and ventilatory variables, achieving R² = 0.9047. [15,16] For AKI risk, Abin et al. (2024) used a dual regression approach, reporting accuracy = 80.6%, F1- score = 0.821, and recall = 84.8%.

Predictive Factors and the Significance of Features

MAP and derived waveform parameters were universally critical, according to a comparative analysis of input features (Table 3).

In HPI-based studies, significant attention was paid to variables such as systolic and diastolic blood pressure (SBP and DBP), heart rate (HR), pulse volume (PV), stroke volume variation (SVV), and dP/dtmax (Frassanito et al., 2023; Šribar et al., 2023[16]).

Larger retrospective models included preoperative variables like age, comorbidity indices, baseline renal function, and ASA status (Lee et al., 2022; Kang et al., 2020) [17].

Analysis of Comparative Performance

Among the best-performing models (AUROC > 0.90) were: '

- Lee et al. (2022): AUROC = 0.942, AUPRC = 0.175 (XGBoost).

- Luckscheiter et al. (2022) : PRC = 0.83 (RF), AUROC = 0.96.

- Te et al. (2024)]: R² = 0.9047 for continuous hemodynamic predictions.

Performers in the middle (AUROC 0.80–0.89):

- Fritz et al. (2019) : AUROC = 0.867, AUPRC = 0.097 (MPCNN).

- Fritz et al. (2024, ORACLE) : AUROC = 0.807 for 30-day mortality; 0.766 for AKI.

Models for experiments or lower performers:

- Tan et al. (2021): sensitivity ~90%, specificity ~52%, AUROC = 0.77 (95% CI: 0.62–0.92).

- Morisson et al. (2022) : Brier score = 0.194, AUC = 0.753, penalised logistic regression [18]. (Table 3.)

Stu dies

/Va riab le

Hea rt Rat e (H

R)

Blo od Pre ssur e (BP

)

Pul se Vol um e (PV

)

Me an Art eria l Pre

ssur

Mo rtal ity (an

y)

AI

Mo dels Co mp are d

Bes t Mo del Per for

ma

AU RO C

IC V

Val ues

Typ e of Sur ger y

Ris k of AK I

Hy pert ensi on Ris k

Air way Ma nag eme nt

Dos age for Ane sthe sia

Wa rni ng Syst em vs

Sta

30-

Day Pos top erat ive Mo

Pos top erat ive Co mpl

icat

Ana esth esia Dep th Mo

nito

Posto perati ve Pain

e

(M

AP)

nce

nda

rd Car

e

rtal

ity

ions

ring

Frit z et al., 201

9

Use d (pre

-

and intr aop erati ve)

Use d (sys tolic & dias tolic

,

intr aop erati ve)

NR

Deri ved fro m BP,

not expl icitl y repo rted

30-

day post oper ativ e mor talit y (1%

)

MP CN N (CN N + LST M), DN N, SV M,

RF,

LR

MP CN N (AU RO C: 0.86

7, AU PR C: 0.09

7)

Not use d

All type s (ma inly AS A II– III)

NR

Incl ude d as com orbi dity

Incl ude d (onl y intu bate d pati ents

)

Incl ude d (cu mul ativ e intr aop dos es)

Des igne d for real

-

time war ning inte grat ion

Pri mar y end poin t

Not strat ifie d

NR

Lee et al., 202

2

Use d (pre oper ativ e only

)

Use d (pre oper ativ e only

)

NR

30-

day post oper ativ e mor talit y (0.2 –

0.4

%)

XG

Boo st, Ran dom For est, Log istic Reg ress ion, DN

N

XG

Boo st (AU RO C: 0.94

2, AU PR C: 0.17

5)

Not use d

Non

-

card iac surg erie s acro ss spec ialti es

NR

Incl ude d as inpu t feat ure

Not expl icitl y desc ribe d

Not incl ude d

Offl ine mod el for preo pera tive risk strat ifica tion

Pri mar y end poin t

Not strat ifie d

NR

Te et al., 202

4

Use d (pre oper ativ

e

Use d (sys tolic &

dias

NR

No dire ct mor talit

y

Extr a Tre e Reg

ress

ET R (R²: 0.90

47,

MA

Pre dict ed as pri

mar

Gen eral anes thes ia

with

NR

Incl ude d as bina ry

inpu

All pati ents had end

otra

Incl ude d (init ial

infu

Not real

-

time

;

pred

Not mea sure d

Foc use d only on

PIH

NR

only

)

tolic

,

preo pera tive

)

outc ome

or, XG

Boo st, SV R, ML R, ML

P

E: 0.05

12)

y outp ut

trac heal intu bati on (AS A I–

II)

t feat ure

che al intu bati on

sion

:

fent anyl

,

pro pof ol, etc.)

ictiv e sup port only

I

(he mod yna mic inst abili ty)

Sch enk et al.

(HP I

Tri al)

Not expl icitl y repo rted

MA P

targ et:

>65

mm Hg

Cap ture d via Flo Tra cIQ sens or

Not spec ifie d

Not repo rted dire ctly (30-

day outc ome s)

HPI

-

guid ed vs Stan dard Car e

HPI

algo rith m

~0.9 0–

0.92

(bas ed on sens itivi ty/s peci ficit y)

Not appl icab le

Elec tive non card iac (ma inly GI

and gyn ecol ogic al)

Ele vate d in pres enc e of IOH

/PO H

Peri oper ativ e MA P

man age men t use d to miti gate extr eme s

Gen eral ana esth esia with mec hani cal vent ilati on

Sev oflu rane med ian: 1.55 – 1.64

vol

%

HPI

guid anc e vs rout ine hae mod yna mic prot ocol s

Not dire ctly asse ssed

4

pati ents with Cla vien

-

Din do Gra de ≥ III

(ble edin g inte rven tion

s)

Not expl icitl y repo rted

Not discus sed in this sub- study

OR AC LE

Tri al (Fri tz

et

al., 202

4)

Ava ilabl e via EH R;

use

d in

ML

inpu t feat ures

MA P

mon itor ed; pred ictiv

e

rele van ce for AKI

and mor talit

y

Not repo rted as stan dalo ne

met

ric

MA P

use d by ML

mod el

for

pred ictio n

30-

day post oper ativ e mor

talit

y asse ssed

;

2.2

%

inci den ce

ML

-

assi sted vs ML

-

una

ssist ed clini cian pred ictio n

Dea th: AU RO C 0.80

7;

AKI

: AU RO C 0.76

6

Dea th (M L

mod el): 0.80

7;

AKI (M L

mod el): 0.76

6

Not appl icab le

Mix ed elec tive surg erie s

acro

ss 10+

spec ialti es

11.1

% AKI

inci den ce; ML

pred

ictio n com pare d to clini cian acc

urac

Inco rpor ated into ML

risk mod els

Stan dard GA

prot ocol s, man

age

d rem otel y

Not deta iled; capt ured in EH

R

for ML

inpu t

ML

pred ictio ns visi ble or

hidd

en to clini cian s

Ass esse d via EH R; 98

deat

hs amo ng 507

1

pati ents

AKI

in 450

pati ents

;

com plic

atio

ns infl uen ced by ML

pred

ictio

Not repo rted expl icitl y; dept

h

infe rred fro m EH R

inpu ts

Not analyz ed

y

n

Fra ssa nito et al., 202

3

Con tinu ousl y mon itor ed; data not indi vidu ally repo rted

MA P

targ et:

>65

mm Hg; inte rven tion thre shol d was HPI

≥85

Mo nito red indi rectl y via dP/ dtm ax and SV V,

not repo rted as stan dalo ne

Pri mar y mea sure for IOH

and seve re hyp oten sion (cut offs

: 65

and 50

mm Hg)

1

deat h with in 30

day s in Con trol gro up

HPI

+

mod ifie d GD T

prot ocol vs stan dard GD T

prot ocol

82

%

redu ctio n in IOH

eve nts (97

vs 313

);

shor ter time to treat men t inte rven tion

Not repo rted num eric ally

Not asse ssed

Maj or gyn aec olog ic onc olog ic surg ery (lap aros copi c, lapa roto my, com bine d)

Not dire ctly asse ssed

; IOH

link ed to orga n inju ry in cite d liter atur e

Slig htly high er inci den ce of MA P > 110

mm Hg in HPI

gro up

Gen eral anes thes ia with mec hani cal vent ilati on

Pro pof ol, sufe ntan il, and sev oflu rane (BI

S-

targ eted

);

mor e dob uta min e in HPI

gro

up

Rea l- time HPI

alert s (≥8

5)

trig gere d proa ctiv e deci sion

-

mak ing

1

pati ent (Co ntro l gro up)

Pleu ral effu sion (HP I 3%,

Con trol 20

%),

arrh yth mia, card iac isch emi a

BIS

mon itor ed; targ et rang e 40–

50

Manag ed via intrath ecal morph ine or epidur al cathete r

Ka ng

et

al.

(20

20)

Incl ude

d in

feat ure set

SBP

, MB

P

use d as key pred icto rs

Not dire

ctly

repo rted

MB P

<65

mm Hg use d to defi ne hyp

oten

Not pri

mar

y end poin t

Naï ve

Bay

es,

LR,

RF, AN N

RF: AU

RO

C 0.84

2;

83.7

%

reca ll

RF: 0.84

2

(CI: 0.73

6–

0.94

8)

Not appl

icab

le

Lap aros

copi

c chol ecys tect omy

Not asse ssed

Hyp erte

nsio

n incl ude d in com orbi dity

feat

GA

with TIV

A

and intu bati on

Pro pof

ol/r

emi fent anil via TCI

pum p

ML

mod el

trai ned on pre- intu bati on

data

Not asse ssed

Not repo rted

BIS

use d;

not anal yze d

Not reporte d

sion

ures

Wij nbe rge et al.

(20

20)

Mo nito red via arte rial wav efor m

MA P

<65

mm Hg thre shol d for inte rven tion

Flo Tra cIQ wav efor m anal ysis

Pri mar y outc ome

: MA P

<65

mm Hg dura tion

Not stati stic ally pow ered for mor talit y

HPI

vs con vent iona l care

Sig nifi cant redu ctio n in MA P

<65

mm Hg dura tion

Not repo rted

;

prio r stud ies sug gest

>0.

9

Not appl icab le

Hig h- risk non card iac surg ery

Not asse ssed

MA P

cont rol strat egy use d to prev ent extr eme s

GA

with inva sive mon itori ng

Not deta iled

Rea l- time HPI

alert s trig gere d earl y inte rven

tion

Not stati stic ally pow ered

Not repo rted

Not spec ifie d

Not reporte d

Bai g et al.

(20

13)

Use d as a fuzz y inpu t for hyp ovol aem ia diag nosi s

BP

wav efor m anal ysis with fuzz y thre shol ds

PV

fro m plet hys mog raph use d in fuzz y rule engi ne

Cen tral to alar m gen erati on logi c

Co mpa red clini cal imp act usin g Kap pa agre eme nt

RT- SA AM

, FL

MS- 2

(Fu zzy logi c exp ert syst

ems

)

Kap pa agre eme nt: FL MS-

2 =

0.75

;

RT- SA AM

=

0.62

Not use d; Kap pa stati stic for diag nost ic agre eme nt

Not appl icab le

Mo dera te-

to- high bloo d loss proc edur es

Not dire ctly eval uate d

Rul e- base d flag ging of elev ated BP

Not deta iled

Not spec ifie d

Mul ti- laye r fuzz y alert s: pro babi listi c, SP V,

fuzz

y rule

Mo nito red indi rectl y via user feed bac k

Dia gno stic acc urac y vali date d with clini cian agre eme

nt

Not part of fuzz y syst em logi c

Pain control studie d in

other fuzzy system s

Rip ollé s- Mel cho r et al.

(20

25)

Mo nito red; not anal yze d sepa ratel y

MA P

<65

mm Hg thre shol d; HPI

alert at > 80

dP/ dtm ax and SV V

use d in HPI

algo rith m

Pri mar y outc ome

: AKI

link ed to MA P

<65

mm Hg

30-

day mor talit y:

1.1

% (HP

I)

vs 0.9

%

(con trol)

HPI

-

guid ed vs real

-

wor ld stan dard care

No sign ifica nt AKI

redu ctio n; RR

= 0.89

; P

= 0.66

Not repo rted

;

pri mar y outc ome anal yze d via RR

Not appl icab le

Mo dera te-

to- high risk elec tive abd omi nal surg ery

Pri mar y end poin t; no sign ifica nt redu ctio n

Sub gro up anal ysis sho wed tren d in non

-

hyp erte nsiv es

GA

with BIS

;

vent ilati on stan dard ized

Bal anc ed crys tallo id; eph edri ne, nore pine phri ne use d

HPI

alert at > 80

trig gere d algo rith mic inte rven tion

1.1

% (HP

I)

vs 0.9

%

(con trol)

; no sign ifica nt diff eren

ce

31.9

% (HP

I)

vs 29.7

%

(con trol)

; no sign ifica nt diff eren

ce

BIS

use d in both gro ups

Not analyz ed

Hu et al.

(20

24)

Use d as inpu t to CN N

and LST M

SBP

& DB P

use d for dos e pred ictio n

Extr acte d via CN N

feat ure laye rs

Tre nd mod eled via LST M

Not eval uate d

AN N + CN N + LST M

with atte ntio n

R² = 0.91

5

(intr a- op); MA PE

= 12.2

5%

Not appl icab le; regr essi on focu s

Not appl icab le

Vari ous; not strat ifie d

Not asse ssed

Not strat ifie d

GA

assu med

;

not deta iled

Eto mid ate, atra curi um, sufe ntan il pred icte d

AI

dosi ng mod el; atte ntio n high ligh ts key feat

ures

Not eval uate d

Not repo rted

BIS

inpu t coll ecte d; not outp ut vari able

Not analyz ed

Abi n et al.

(20

Inp ut to

dual

Cr use d as

inpu

Not dire ctly

mod

Use d to strat

ify

5–

10

% AKI

Dua l regr

essi

Acc urac y =

80.6

Not repo rted

;

Not appl icab

le

Car diac surg

ery

Pri mar y

end

HT N

incl

ude

GA

assu med

;

CP B

time

,

AKI

+

and AKI

5–

10

% AKI

AKI

seve rity

trac

Not mod eled

Not analyz ed

24)

regr essi on mod el

t; MA P

indi rectl y mod eled

eled

AKI

risk

-

relat ed mor talit y cite d

on mod els (AK

I+ / AKI –)

%; F1

= 0.82

1;

reca ll = 84.8

%

regr essi on- base d

(CA

BG,

valv e, tran spla nt)

poin t; AKI

+ vs AKI –

mod elin g

d in PM H

feat ures

not deta iled

flui ds, diur etic s, tran sfus ions pred icte

d

–

plan ners

+

visu al das hbo ard

-

relat ed mor talit y cite d

ked via Cr leve ls (KD IGO

)

Hay ase et al.

(20

20)

Cap ture d; use d as cont ext for EE G

patt erns

Rel ated to BIS

acc urac y via EM G

cont ami nati on

Not dire ctly use d; EE G-

deri ved via 20–

30H

z targ eted freq uen cy

Indi rectl y relat ed to EM

G/B IS

fide lity at low and high dept h

Not asse ssed

Poi ncar é- inde x fusi on via ML PN N

(dee p lear ning appr oac h)

R = 0.87

, RM SE

= 7.09

(Pre dBI S vs mBI S

corr elati on)

Not repo rted

;

line ar regr essi on use d for vali dati on

Not appl icab le

Mix ed (30

pati ents

,

incl udin g spin al and GA

case s)

Not anal yze d

EM G

artif acts

>8

0

BIS

sign al pro ble mati c in ligh t anes thes ia

GA

with BIS

;

som e spin al with pro pof ol

Sev oflu rane

,

rem ifen tanil

,

pro pof ol; mus cle rela xant use strat ifie

d

Poi ncar é- inde x– driv en ML PN N

alert s; laye red EE G

anal ysis

Not anal yze d

Not asse ssed

BIS

+ EE G

scat ter via Poi ncar é indi ces

Not evalua ted

Šri bar et al.

(20

Mo nito red; high

er

MA P

<65

mm Hg

Deri ved via wav

efor

Pri mar y end

poin

1

deat h (Ac ume

HPI

vs Flo Tra c

TW A- AU T:

0.01

Not repo rted

;

TW

Not appl icab le

Maj or thor acic

(lun

1

case (Flo trac gro

MA P

≥65 mm

Hg

GA

with one- lung vent

Pro pof ol, sufe

ntan

HPI

alert

≥90

+

deci

1

deat h (Ac ume

No MI, CVI

, or

AKI

BIS

use d; MA C

Epidur al or IV

sufent

anil

23)

base

line

in HPI

gro

up

thre

shol

d

m

anal

ysis

t

nIQ

gro

up)

GD T

vs

0.08

mm Hg

A

use

d

g/es

oph

agu s)

up)

targ et

ilati on

il,

sev

oflu rane

sion tree

nIQ

gro

up)

in

HPI

gro up

0.8–

1

Mo riss on et al.

(20

22)

Incl ude d in NO L

algo rith m

MA P

indi rectl y mod eled via NO L

Part of NO L

algo rith m

NO L-

base d thre shol ds (10

–

25)

Not asse ssed

ML

algo rith ms (ela stic net, RF, SV M

…)

CV- AU C 0.75

3

(0.7

18–

0.78

8);

Brie r scor e 0.19

4

CV- AU C 0.75

3

Not appl icab le

Gyn ecol ogic lapa rosc opy

Not asse ssed

Not strat ifie d

GA

with intu bati on

Fent anyl guid ed by NO L

NO L-

guid ed fent anyl vs MA

P/H R-

base d care

Not asse ssed

63.6

%

had mod erat e to seve re PA CU

pain

BIS

use d

Vel aga pud i et al.

(20

22)

Incl ude d in ML

inpu t feat ures and NO L

reac tion anal

Imp licit in case para met ers; MA P

not mod eled dire ctly

Not repo rted

; NO L-

base d surr ogat e feat ures likel

Use d indi rectl y via preo pera tive AS A

stat us and

Not asse ssed

XG

Boo st for gluc ose pred ictio n; Ran dom For est for

Glu cose

:

Acc urac y ↑ fro m 79

%

to 84.7

%;

Opi

Opi oid mod el cros s- vali date d AU RO C: 0.75

3

Not appl icab le

Mix ed amb ulat ory and inpa tient proc edur es

Not eval uate d

Imp lied in diab etes

/glu cose cont rol mod els

Intu bate d GA;

not strat ifie d

Fent anyl dosi ng mod ifie d base d on NO L

and ML

aid

ML

pred ictio ns pres ente d via RE DC

ap inte rfac e;

Not asse ssed

Foc use d on pain and gluc ose mis esti mati on; not trac ked

BIS

use d; not mod eled

Primar y outco me:

ML-

aided predict ion of opioid require ment

ysis

y con side red

surg ery type

opio id nee d

oids

:

Acc urac

y ↑

fro m 18

%

to 42

%

use d as refe

renc

e

for mall y

Luc ksc heit er et al., 202

2

Incl ude d as a key inpu t feat ure; rank ed thir d in imp orta nce in RF

mod el

Syst olic BP

use d; top- rank ed feat ure in RF

mod el

Not dire ctly mod eled

;

infe rred via sho ck inde x and ausc ultat ion

Not expl icitl y mod eled

;

indi rectl y refl ecte d via syst olic BP

and sho ck inde

x

Not asse ssed

Ran dom For est (RF

) vs Nai ve Bay es (NB

)

RF: AU RO C 0.96

; PR C

area 0.83

;

NB: AU RO C 0.93

; PR C

area 0.66

RF: 0.96

(95

% CI: 0.96 – 0.97

); NB: 0.93

(95

% CI: 0.92 – 0.93

)

Not appl icab le

Not surg ical; preh ospi tal trau ma regi stry (E

MS-

base d)

Not eval uate d

Indi rectl y mod eled via syst olic BP

and sho ck inde x

Pri mar y end poi nt;

5.7

%

of pati ents requ ired prec linic al airw ay man age men

t

Not mod eled

;

airw ay man age men t infe rred via inte rven tion s and med icati ons

ML

mod el trai ned on EM S

regi stry data

; no real

-

time alert s depl oye d

Not asse ssed

;

preh ospi tal setti ng only

Not appl icab le

Not assess ed

Tan et al., 202

1

Pre oper ativ e HR

mea

sure

d; slig htly high er in EP OH

gro up (not stati stic ally sign ifica nt)

SBP

>1

60

mm Hg defi

nes

EP OH;

intr aop erati ve and post oper ativ e pea k SBP

/DB P

sign ifica ntly high er in EP OH

gro

up

Not dire ctly mod eled

;

MA

P

vari abili ty and vas oact ive resp ons e indi rectl y invo lved

Not calc ulat ed dire ctly;

SBP

/DB P

tren ds use d as surr ogat es for hyp erte nsiv e patt erns

No deat hs occ urre d

duri

ng hos pital stay in eith er gro up

Gra dien t Boo sted Reg

ress

ion Tre es (GB

RT)

via XG

Boo st

Mea n AU RO C: 0.77

(95

% CI: 0.62 – 0.92

);

Sen sitiv ity

~90

%;

Spe cific ity 0.52

0.77

(cro ss- vali dati on

aver

age)

Not appl icab le

Car otid end arte rect omy

(CE

A)

und er gen eral anes thes ia

Not asse ssed

EP OH

risk spec ifica lly

mod

eled

;

intr aop erati ve BP

spik es high ly pred ictiv e

GA

with stan dard intu bati

on;

not a vari able in pred ictio n

Sev oflu rane

,

pro

pof ol,

fent anyl

,

phe nyle phri ne, eph edri ne trac ked; thei r dos ages use d as inpu t feat ures

ML

mod el vali date d

retr

osp ecti vely

; no live alert inte grat ion

Not asse ssed

Stro ke: 5

case s; CH

S: 4

case s (all in EP OH

gro up); Cer ebra l hem orrh age: 2

case s (bot h EP OH)

Not mod eled dire ctly; BIS

not

repo rted

Not assess ed

Discussion

This review emphasises how ML-based perioperative prediction models perform better than conventional techniques in terms of clinical utility and accuracy. Logistic regression is clearly outperformed by ensemble models such as XGBoost (AUROC up to 0.942) and RF (AUROC 0.96). By utilising time-series data, deep learning architectures provide additional incremental benefits.

Since several RCTs showed significant decreases in IOH burden, confirming the idea of predictive haemodynamic monitoring, the Hypotension Prediction Index (HPI) is noteworthy for its real- time clinical applicability. The idea that IOH is a modifiable intraoperative risk factor and that proactive ML-guided interventions can lessen its severity and duration is supported by this data.

ML-Driven vs. Biomarker-Based Models Biomarker- based methods (HPI)

Using arterial waveform analysis, provide timely, actionable alerts proven to lower IOH in clinical settings [19]. Reliance on proprietary algorithms and a comparatively limited prediction scope (hypotension only) are among thelimitations.

Models Powered by Machine Learning

Predict broader outcomes (mortality, AKI, composite complications) by integrating multi-modal data sources. Show improved discriminative abilities.

Model complexity, interpretability, and the requirement for sizable, superior datasets are among the difficulties.

Clinical Consequences

The findings of this synthesis point to a potential paradigm shift in perioperative care: predictive analytics, like capnography or pulse oximetry, may soon be used as a routine supplement to anaesthesia monitoring.

Proactive management techniques like prompt vasopressor titration, fluid optimisation, and renal protection protocols may be made possible by machine learning (ML) models integrated into anaesthesia information management systems (AIMS).

The Evidence's Limitations

Heterogeneity in outcome definitions, inconsistent performance metric reporting, and a small number of highly effective RCTs focussing on hard endpoints (mortality, AKI, myocardial injury) limit the body of evidence. Few studies evaluated cost-effectiveness or implementation feasibility in standard clinical practice, and proprietary algorithms like HPI further restrict reproducibility.

Prospects for the Future

Multicenter, sufficiently powered RCTs assessing clinically significant endpoints should be the top priority for future research. Explainable AI (XAI) to increase clinician trust and transparency combining machine learning models with biomarker-based indices to provide hybrid real-time decision support.

Health economics studies to assess implementation logistics and cost-effectiveness.

To guarantee generalisability, external validation is conducted across a variety of populations.

Conclusion

Perioperative risk management is about to be redefined by machine learning and biomarker-based monitoring. These models reliably forecast and reduce intraoperative complications more successfully than conventional techniques by utilising real-time waveform data, EHR variables, and sophisticated analytics. Large-scale retrospective studies validate the scalability of ML- based approaches, while HPI-guided trials demonstrate the clinical utility and viability of predictive monitoring.

However, strong proof of better patient outcomes—rather than merely surrogate metrics—will be necessary for wider adoption. In order to guarantee that predictive alerts result in significant clinical actions that lower morbidity and mortality, future research should focus on integrating ML algorithms into perioperative care pathways.

Acknowledgements

Declaration of Al-Assisted Technologies in the Writing Process: During the preparation of this work the authors used Chat-GPT version 3.5 to check grammar and spelling. After using this tool, the authors reviewed and edited the content and take full responsibility for the content of the publication.

Declaration of Interest

None of the authors has any conflicts of interest to disclose.

Funding

This work was conducted without any financial support from funding agencies in the public, commercial, or not-for-profit sectors.

Journal of Future Medicine and Healthcare Innovation(JFMHI)

ISSN: 3065-7628 | DOI: 10.33140/JFMHI

Journal of Future Medicine and Healthcare Innovation

Open Access Journals

Artificial Intelligence-Assisted Monitoring in Anaesthesia: A Systematic Review and Meta-analysis of Diagnostic Accuracy and Clinical Impact AI in Anaesthesia: Monitoring and Outcomes Review

Abstract

Keywords

Key Message

Introduction

Methods

Results

Discussion

Conclusion

Acknowledgements

Declaration of Interest

Funding

References

Important Links

Locate Us