In many AI-enabled scenarios, such as video surveillance systems, besides requiring the data to be stored safely, human operators and AI models must also be able to access audio and video streams continuously while media files are still being collected. However, system throughput and latency are often limited by the use of transactionality to guarantee data persistence. This paper presents a solution providing both high ingestion rates with transactional data persistence and low-latency access to the stream during collection in near real-time. This enables the AI algorithms to be immediately applied as soon as the data is received. The binary data sources fit well with the audio and video capture of surveillance or similar systems, but the proposed solution can be extended through well-defined general interfaces. The scalability of the proposed approach is based on the microservice architecture. Using Apache Kafka and MongoDB replica sets, preliminary results show that the proposed solution provides up to 6 times larger throughput and 4.5 times lower latency than current standard multi-document transactions.

Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models / Militone, Gabriele Scaffidi; Apiletti, Daniele; Malnati, Giovanni. - ELETTRONICO. - (2023), pp. 3149-3153. (Intervento presentato al convegno 2023 IEEE International Conference on Big Data (BigData) tenutosi a Sorrento (IT) nel 15/12/2023 - 18/12/2023) [10.1109/bigdata59044.2023.10386896].

Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models

Militone, Gabriele Scaffidi;Apiletti, Daniele;Malnati, Giovanni
2023

Abstract

In many AI-enabled scenarios, such as video surveillance systems, besides requiring the data to be stored safely, human operators and AI models must also be able to access audio and video streams continuously while media files are still being collected. However, system throughput and latency are often limited by the use of transactionality to guarantee data persistence. This paper presents a solution providing both high ingestion rates with transactional data persistence and low-latency access to the stream during collection in near real-time. This enables the AI algorithms to be immediately applied as soon as the data is received. The binary data sources fit well with the audio and video capture of surveillance or similar systems, but the proposed solution can be extended through well-defined general interfaces. The scalability of the proposed approach is based on the microservice architecture. Using Apache Kafka and MongoDB replica sets, preliminary results show that the proposed solution provides up to 6 times larger throughput and 4.5 times lower latency than current standard multi-document transactions.
2023
979-8-3503-2445-7
File in questo prodotto:
File Dimensione Formato  
Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models.pdf

non disponibili

Descrizione: Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models - Gabriele Scaffidi Militone; Daniele Apiletti; Giovanni Malnati - Pubblicazione su proceedings IEEE BigData2023
Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 269.63 kB
Formato Adobe PDF
269.63 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2986798