In many AI-enabled scenarios, such as video surveillance systems, besides requiring the data to be stored safely, human operators and AI models must also be able to access audio and video streams continuously while media files are still being collected. However, system throughput and latency are often limited by the use of transactionality to guarantee data persistence. This paper presents a solution providing both high ingestion rates with transactional data persistence and low-latency access to the stream during collection in near real-time. This enables the AI algorithms to be immediately applied as soon as the data is received. The binary data sources fit well with the audio and video capture of surveillance or similar systems, but the proposed solution can be extended through well-defined general interfaces. The scalability of the proposed approach is based on the microservice architecture. Using Apache Kafka and MongoDB replica sets, preliminary results show that the proposed solution provides up to 6 times larger throughput and 4.5 times lower latency than current standard multi-document transactions.
Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models / Militone, Gabriele Scaffidi; Apiletti, Daniele; Malnati, Giovanni. - ELETTRONICO. - (2023), pp. 3149-3153. (Intervento presentato al convegno 2023 IEEE International Conference on Big Data (BigData) tenutosi a Sorrento (IT) nel 15/12/2023 - 18/12/2023) [10.1109/bigdata59044.2023.10386896].
Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models
Militone, Gabriele Scaffidi;Apiletti, Daniele;Malnati, Giovanni
2023
Abstract
In many AI-enabled scenarios, such as video surveillance systems, besides requiring the data to be stored safely, human operators and AI models must also be able to access audio and video streams continuously while media files are still being collected. However, system throughput and latency are often limited by the use of transactionality to guarantee data persistence. This paper presents a solution providing both high ingestion rates with transactional data persistence and low-latency access to the stream during collection in near real-time. This enables the AI algorithms to be immediately applied as soon as the data is received. The binary data sources fit well with the audio and video capture of surveillance or similar systems, but the proposed solution can be extended through well-defined general interfaces. The scalability of the proposed approach is based on the microservice architecture. Using Apache Kafka and MongoDB replica sets, preliminary results show that the proposed solution provides up to 6 times larger throughput and 4.5 times lower latency than current standard multi-document transactions.File | Dimensione | Formato | |
---|---|---|---|
Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models.pdf
non disponibili
Descrizione: Combining fault-tolerant persistence and low-latency streaming access to binary data for AI models - Gabriele Scaffidi Militone; Daniele Apiletti; Giovanni Malnati - Pubblicazione su proceedings IEEE BigData2023
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
269.63 kB
Formato
Adobe PDF
|
269.63 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2986798