Real-time communication (RTC) platforms have seen a considerable surge in popularity in recent years, largely due to the COVID-19 pandemic which facilitated remote work. To ensure adequate Quality of Experience (QoE) for users, a good congestion control algorithm is needed. RTC applications use UDP, so congestion control is done on the application layer, leaving way for advanced algorithms. In this paper, we propose ReCoCo, a solution for congestion control in RTC applications based on Reinforcement learning (RL). ReCoCo gains information about the network conditions at the receiver-side, such as receiving rate, one-way delay and loss ratio and predicts the available bandwidth in the next time bin. We train ReCoCo on 9 bandwidth trace files that cover a vast array of network types. We try different algorithms, states and parameters, training both specific and general models. We find that ReCoCo outperforms the de-facto standard heuristic algorithm GCC in both specialized and general models. We also make observations on the difficulty of generalization when using RL.

ReCoCo: Reinforcement learning-based Congestion control for Real-time applications / Markudova, Dena; Meo, Michela. - ELETTRONICO. - (2023), pp. 68-74. (Intervento presentato al convegno 2023 IEEE 24th International Conference on High Performance Switching and Routing (HPSR) tenutosi a Albuquerque, NM, USA nel 05-07 June 2023) [10.1109/HPSR57248.2023.10147986].

ReCoCo: Reinforcement learning-based Congestion control for Real-time applications

Markudova, Dena;Meo, Michela
2023

Abstract

Real-time communication (RTC) platforms have seen a considerable surge in popularity in recent years, largely due to the COVID-19 pandemic which facilitated remote work. To ensure adequate Quality of Experience (QoE) for users, a good congestion control algorithm is needed. RTC applications use UDP, so congestion control is done on the application layer, leaving way for advanced algorithms. In this paper, we propose ReCoCo, a solution for congestion control in RTC applications based on Reinforcement learning (RL). ReCoCo gains information about the network conditions at the receiver-side, such as receiving rate, one-way delay and loss ratio and predicts the available bandwidth in the next time bin. We train ReCoCo on 9 bandwidth trace files that cover a vast array of network types. We try different algorithms, states and parameters, training both specific and general models. We find that ReCoCo outperforms the de-facto standard heuristic algorithm GCC in both specialized and general models. We also make observations on the difficulty of generalization when using RL.
2023
978-1-6654-7640-9
File in questo prodotto:
File Dimensione Formato  
ReCoCo_Reinforcement_learning-based_Congestion_control_for_Real-time_applications.pdf

accesso riservato

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 1.18 MB
Formato Adobe PDF
1.18 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
ReCoCo_HPSR_2023.pdf

accesso aperto

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: Pubblico - Tutti i diritti riservati
Dimensione 448.98 kB
Formato Adobe PDF
448.98 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2979663