Skip to main navigation Skip to search Skip to main content

Denoising autoencoder framework for reconstructing missing periodontal clinical records

  • Saveetha Institute of Medical and Technical Sciences (Deemed to be University)

Research output: Contribution to journalArticlepeer-review

Abstract

Introduction: Missing clinical data pose a significant challenge for retrospective analyses and predictive modelling in periodontal research. Traditional imputation methods often overlook the complex correlations among variables and produce implausible values. This study examines the application of generative models to reconstruct missing periodontal clinical records and outlines a comprehensive workflow that spans from data generation to evaluation. Methods: A synthetic periodontal dataset of 200 virtual patients was constructed, capturing realistic distributions of demographic factors (age, gender, smoking, and diabetes) and tooth-level measurements (probing depth, attachment loss, furcation involvement, and bleeding on probing) across eight sites. Missingness was introduced at random to 15% of the clinical variables. A denoising autoencoder with a single hidden layer of 18 neurons was trained to reconstruct the original data from corrupted inputs over 100 epochs. The model learned latent representations of the data and was then used to impute missing entries. Results: Performance was assessed by comparing reconstructed values to the original data using mean absolute error (MAE) and root mean squared error (RMSE) across individual variables and categories. Overall, MAE and RMSE were 0.61 and 0.74, respectively, with attachment loss and bleeding on probing exhibiting lower errors than probing depth and furcation involvement. Conclusion: Distribution comparisons showed that imputed values closely matched original distributions. The approach offers a practical framework for handling missing periodontal data and highlights the potential of generative models to improve data completeness without introducing unrealistic values. Limitations include the simplicity of the network architecture. Future work should explore advanced models, integrate multimodal data, and evaluate on real datasets. The workflow—from synthetic data creation and masking to training, imputation, and evaluation—serves as a template for researchers tackling missing clinical data.

Original languageEnglish
Article number1710316
JournalFrontiers in Dental Medicine
Volume7
DOIs
StatePublished - 2026

Keywords

  • artificial intelligence
  • clinical data
  • missing values
  • periodontitis
  • synthetic data

Fingerprint

Dive into the research topics of 'Denoising autoencoder framework for reconstructing missing periodontal clinical records'. Together they form a unique fingerprint.

Cite this