Modern data-driven applications increasingly demand flexible, scalable, and generalizable solutions to manage complex, heterogeneous workflows. Traditional Workflow Management Systems (WMS) often fall short in dynamic, high-throughput contexts due to their reliance on static configurations and limited adaptability. This paper introduces a modular and extensible framework for building dynamic data processing pipelines capable of integrating multiple data sources and evolving analytical tasks. The proposed system employs a three-layer architecture to facilitate pipeline construction, validation, and deployment through reusable components and standardized metadata. The framework simplifies the creation and connection of processing steps by using a standardized, modular approach that ensures data compatibility and logical consistency, making it accessible and useful for both expert and non-expert users. A real-world use case involving IoT data from the GAIA Metaplatform validates the system’s effectiveness, showcasing features such as runtime block isolation, robust input validation, and dynamic workflow execution. This framework aims to significantly reduce integration overhead while enhancing reusability and adaptability across diverse domains.
Toward Generalizable and Extensible Workflow Automation for Multi-Source Data Processing / Soltanali Khalili, Danial; Viticchié, Alessio; Cetrone, Felice; Patti, Edoardo; Aliberti, Alessandro. - ELETTRONICO. - (2025). (Intervento presentato al convegno 33rd International Conference on Software, Telecommunications, and Computer Networks (SoftCOM 2025) tenutosi a Slpit (HR) nel 18-20 September, 2025).
Toward Generalizable and Extensible Workflow Automation for Multi-Source Data Processing
Danial Soltanali Khalili;Edoardo Patti;Alessandro Aliberti
2025
Abstract
Modern data-driven applications increasingly demand flexible, scalable, and generalizable solutions to manage complex, heterogeneous workflows. Traditional Workflow Management Systems (WMS) often fall short in dynamic, high-throughput contexts due to their reliance on static configurations and limited adaptability. This paper introduces a modular and extensible framework for building dynamic data processing pipelines capable of integrating multiple data sources and evolving analytical tasks. The proposed system employs a three-layer architecture to facilitate pipeline construction, validation, and deployment through reusable components and standardized metadata. The framework simplifies the creation and connection of processing steps by using a standardized, modular approach that ensures data compatibility and logical consistency, making it accessible and useful for both expert and non-expert users. A real-world use case involving IoT data from the GAIA Metaplatform validates the system’s effectiveness, showcasing features such as runtime block isolation, robust input validation, and dynamic workflow execution. This framework aims to significantly reduce integration overhead while enhancing reusability and adaptability across diverse domains.| File | Dimensione | Formato | |
|---|---|---|---|
|
2025243117.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Pubblico - Tutti i diritti riservati
Dimensione
190.97 kB
Formato
Adobe PDF
|
190.97 kB | Adobe PDF | Visualizza/Apri |
|
Toward_Generalizable_and_Extensible_Workflow_Automation_for_Multi-Source_Data_Processing.pdf
accesso riservato
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
255.16 kB
Formato
Adobe PDF
|
255.16 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3003293
