Developing digital twin and condition monitoring models for Floating Offshore Wind Turbines (FOWTs) mooring systems requires massive data across various health, operational, and metocean conditions. The scarcity of real damage-associated data may represent a significant challenge. Deep generative models (DGMs) have recently been introduced as powerful tools for oversampling scarce data. However, most oversampling methods focus on minority intra-class information. The inter-class dynamics between minority and majority classes are often ignored, increasing the risk of overfitting, especially in scenarios with high imbalance ratios. This study proposes a novel hierarchical variational autoencoder (HVAE) utilizing the diffusion probabilistic architecture, healthy (majority) data distribution, and the relation between healthy and damage-associated data in mooring systems of FOWTs to learn the damaged state distribution. We first evaluate HVAE’s ability to augment minority data based on majority distribution, using the MNIST benchmark image dataset for validation. This experiment compares the performance of HVAE with conventional and recent oversampling techniques. The second use case is the OC4-DeepCWind FOWT benchmark. The fine-tuned HVAE can generate damage-associated platform records for various sea states. Experimental results on MNIST indicate that HVAE achieves significant improvements over alternative oversampling techniques in downstream classification tasks, particularly in case of extreme imbalance. In the FOWT use case, the records generated for unseen sea states can incorporate the diversity and complexity of the majority ones, hence decreasing overfitting for the majority of sea states in downstream binary classification, highlighting the efficacy and generalization of HVAE.

Data augmentation for damaged scenarios in floating offshore wind turbines: an approach based on diffusion architecture, hierarchical variational approximation and healthy data distribution / Fathnejat, Hamed; Nava, Vincenzo. - In: ENGINEERING WITH COMPUTERS. - ISSN 0177-0667. - (2025). [10.1007/s00366-025-02148-6]

Data augmentation for damaged scenarios in floating offshore wind turbines: an approach based on diffusion architecture, hierarchical variational approximation and healthy data distribution

Vincenzo Nava
2025

Abstract

Developing digital twin and condition monitoring models for Floating Offshore Wind Turbines (FOWTs) mooring systems requires massive data across various health, operational, and metocean conditions. The scarcity of real damage-associated data may represent a significant challenge. Deep generative models (DGMs) have recently been introduced as powerful tools for oversampling scarce data. However, most oversampling methods focus on minority intra-class information. The inter-class dynamics between minority and majority classes are often ignored, increasing the risk of overfitting, especially in scenarios with high imbalance ratios. This study proposes a novel hierarchical variational autoencoder (HVAE) utilizing the diffusion probabilistic architecture, healthy (majority) data distribution, and the relation between healthy and damage-associated data in mooring systems of FOWTs to learn the damaged state distribution. We first evaluate HVAE’s ability to augment minority data based on majority distribution, using the MNIST benchmark image dataset for validation. This experiment compares the performance of HVAE with conventional and recent oversampling techniques. The second use case is the OC4-DeepCWind FOWT benchmark. The fine-tuned HVAE can generate damage-associated platform records for various sea states. Experimental results on MNIST indicate that HVAE achieves significant improvements over alternative oversampling techniques in downstream classification tasks, particularly in case of extreme imbalance. In the FOWT use case, the records generated for unseen sea states can incorporate the diversity and complexity of the majority ones, hence decreasing overfitting for the majority of sea states in downstream binary classification, highlighting the efficacy and generalization of HVAE.
File in questo prodotto:
File Dimensione Formato  
s00366-025-02148-6+(2) (1)_compressed.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 1.22 MB
Formato Adobe PDF
1.22 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2999897