*Result*: Exploring latent diffusion models for ECG generation on the minute scale.

Title:
Exploring latent diffusion models for ECG generation on the minute scale.
Authors:
Kranz DD; Section on Computational Neurology, Deparment of Neurology, Charité - Universitätsmedizin Berlin, Berlin, Germany; Berlin Institute of Health, Berlin, Germany; Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany. Electronic address: Dominik-dirk.kranz@charite.de., Krämer JF; Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany; Department of Library and Information Science, Humboldt-Universität zu Berlin, Berlin, Germany. Electronic address: Jan.kraemer@hu.berlin.de., Kahriman O; Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany; ID Information und Dokumentation im Gesundheitswesen GmbH & Co. KGaA, Berlin, Germany. Electronic address: Kahrimao@physik.hu-berlin.de., Nelde A; Section on Computational Neurology, Deparment of Neurology, Charité - Universitätsmedizin Berlin, Berlin, Germany; Berlin Institute of Health, Berlin, Germany. Electronic address: Alexander.nelde@charite.de., Wessel N; Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany; MSB Medical School Berlin, Berlin, Germany. Electronic address: Wessel@physik.hu-berlin.de., Meisel C; Section on Computational Neurology, Deparment of Neurology, Charité - Universitätsmedizin Berlin, Berlin, Germany; Berlin Institute of Health, Berlin, Germany. Electronic address: Christian.meisel@charite.de.
Source:
Computer methods and programs in biomedicine [Comput Methods Programs Biomed] 2026 Feb 01; Vol. 274, pp. 109138. Date of Electronic Publication: 2025 Nov 01.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: Elsevier Scientific Publishers Country of Publication: Ireland NLM ID: 8506513 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 1872-7565 (Electronic) Linking ISSN: 01692607 NLM ISO Abbreviation: Comput Methods Programs Biomed Subsets: MEDLINE
Imprint Name(s):
Publication: Limerick : Elsevier Scientific Publishers
Original Publication: Amsterdam : Elsevier Science Publishers, c1984-
Contributed Indexing:
Keywords: Electrocardiogram (ECG); Generative artificial intelligence; Latent diffusion models
Entry Date(s):
Date Created: 20251108 Date Completed: 20251207 Latest Revision: 20251207
Update Code:
20260130
DOI:
10.1016/j.cmpb.2025.109138
PMID:
41205561
Database:
MEDLINE

*Further Information*

*Background and Objective: Artificial Intelligence (AI) models for electrocardiogram (ECG) interpretation rely on large, diverse datasets, but existing clinical datasets are often skewed, overrepresenting normal rhythms and lacking rare pathologies. This limits model performance and generalizability. While generative AI has shown promise in other domains, its application to biosignals has so far been restricted to short segments, reducing clinical utility. Additionally, potentially promising tools like inpainting for artefact removal have not yet been explored in the biosignal domain. To overcome these limitations, we propose ECGEN, a latent diffusion model (LDM) designed to synthesize long-duration, realistic ECGs for data augmentation, rhythm-specific generation, and signal restoration.
Methods: We developed ECGEN in three configurations: (1) ECGEN-Small, a 30-second conditional generator for sinus rhythm and atrial fibrillation (AFib); (2) ECGEN-Medium, a 90-second model for inpainting tasks; and (3) ECGEN-Large, an unconditional model for generating 320-second ECGs. Models were trained on real clinical ECGs from stroke patients using a vector quantized-variational autoencoder (VQ-VAE) for encoding and a denoising diffusion implicit model (DDIM) as the latent space backbone. We evaluated model outputs using heart rate (HR), heart rate variability (HRV), and morphological coherence.
Results: ECGEN-Small achieved an AUC of 0.98 for the AFib vs. sinus rhythm classification. ECGEN-Medium successfully inpainted missing segments with plausible HR dynamics. ECGEN-Large generated long ECGs with consistent morphology but showed distributional shifts in HRV (e.g., inflated standard deviation of normal-to-normal intervals, SdNN), indicating incomplete modeling of global temporal dependencies. Network depth was found to significantly influence output quality and training stability.
Conclusions: ECGEN demonstrates the feasibility of long-duration ECG generation using LDMs, offering applications in dataset augmentation and signal restoration. However, distributional mismatches and occasional artefacts highlight challenges in unsupervised biosignal synthesis. Future work should address long-range temporal modeling and refine realism through conditioning or adversarial training.
(Copyright © 2025. Published by Elsevier B.V.)*

*Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.*