Bandwidth estimation (BWE) serves as a pivotal mechanism in real-time communications (RTC), supporting congestion control (CC) by governing traffic sending rate and thereby optimizing network performance. In recent years, reinforcement learning (RL) has surged to prominence, transcending conventional approaches as a superior bandwidth estimator. However, as the hype continues to escalate, the relentless pursuit of increasingly sophisticated algorithms eclipses the investigation into the core principles of CC, while the inherent flaws of RL appear to be overlooked. In this work, we rethink the necessity of RL in RTC, and contend that BWE itself, rather than reward-driven learning, is the deterministic factor, thus rendering a regressor sufficient. We first underscore the paramount importance of BWE accuracy and propose a simple feedforward neural network-based regressor with dual training stages: an offline stage imitating the perfect estimator and an online stage accommodating traffic dynamics. Utilizing an advanced RTC platform, we benchmark our solution against multiple state-of-the-art RL-based BWE algorithms. Conclusively, our regressor achieves superior performance, potentially charting a new course for BWE and CC in RTC.
Debunking Reinforcement Learning for Bandwidth Estimation in Real-Time Communications: When a Simple Regressor Suffices / Song, Tailai; Meo, Michela. - ELETTRONICO. - (2025). ( 2025 21st International Conference on Network and Service Management (CNSM) Bologna (Ita) 27-31 October 2025) [10.23919/cnsm67658.2025.11297463].
Debunking Reinforcement Learning for Bandwidth Estimation in Real-Time Communications: When a Simple Regressor Suffices
Song, Tailai;Meo, Michela
2025
Abstract
Bandwidth estimation (BWE) serves as a pivotal mechanism in real-time communications (RTC), supporting congestion control (CC) by governing traffic sending rate and thereby optimizing network performance. In recent years, reinforcement learning (RL) has surged to prominence, transcending conventional approaches as a superior bandwidth estimator. However, as the hype continues to escalate, the relentless pursuit of increasingly sophisticated algorithms eclipses the investigation into the core principles of CC, while the inherent flaws of RL appear to be overlooked. In this work, we rethink the necessity of RL in RTC, and contend that BWE itself, rather than reward-driven learning, is the deterministic factor, thus rendering a regressor sufficient. We first underscore the paramount importance of BWE accuracy and propose a simple feedforward neural network-based regressor with dual training stages: an offline stage imitating the perfect estimator and an online stage accommodating traffic dynamics. Utilizing an advanced RTC platform, we benchmark our solution against multiple state-of-the-art RL-based BWE algorithms. Conclusively, our regressor achieves superior performance, potentially charting a new course for BWE and CC in RTC.| File | Dimensione | Formato | |
|---|---|---|---|
|
Debunking_RL_for_BWE_in_RTC__When_a_Simple_Regressor_Suffices__CNSM___camera_ready_.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Pubblico - Tutti i diritti riservati
Dimensione
964.09 kB
Formato
Adobe PDF
|
964.09 kB | Adobe PDF | Visualizza/Apri |
|
Debunking_Reinforcement_Learning_for_Bandwidth_Estimation_in_Real-Time_Communications_When_a_Simple_Regressor_Suffices.pdf
accesso riservato
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
975.09 kB
Formato
Adobe PDF
|
975.09 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3007597
