Edge computing enables resource-constrained de- vices to execute machine learning applications via task offloading. To this aim, radio access network (RAN) slicing is instrumental to provide the necessary network resources. However, current RAN slicing approaches rely on static computing models, thereby constraining their ability to leverage the dynamic semantic data representation capabilities enabled by recent neural architec- tures. In this paper, we propose OMNIS, a semantic RAN slicing framework for edge computing built on dynamic split neural models. OMNIS embeds a dynamic form of neural compression paired with adaptive data encoding for task offloading, enabling flexible communication payload and computing options. We explicitly study the interplay between neural compression and information quantization in computer vision tasks and design a novel “Box” quantization scheme that improves resiliency to bit errors as a function of compression rate. Considering partial observability and differing objectives of mobile devices and the edge server, we formulate neural gate control and resource slicing optimization problems and solve them via multi- agent contextual multi-armed bandits and convex optimization algorithms. Experimental results show that OMNIS improves inference accuracy by up to 22.85% and reduces quality of service violations by up to 10x. The evaluation code is available at https://github.com/qlt315/OMNIS.
OMNIS: Semantic RAN Slicing via Dynamic Split Neural Networks / Qin, Langtian; Andrew Harshbarger, Ian; Nasraoui, Leila; Chiasserini, Carla Fabiana; Levorato, Marco. - (2026). ( IEEE INFOCOM 2026 Tokyo (Jap) 18 - 21 May 2026).
OMNIS: Semantic RAN Slicing via Dynamic Split Neural Networks
Carla Fabiana Chiasserini;
2026
Abstract
Edge computing enables resource-constrained de- vices to execute machine learning applications via task offloading. To this aim, radio access network (RAN) slicing is instrumental to provide the necessary network resources. However, current RAN slicing approaches rely on static computing models, thereby constraining their ability to leverage the dynamic semantic data representation capabilities enabled by recent neural architec- tures. In this paper, we propose OMNIS, a semantic RAN slicing framework for edge computing built on dynamic split neural models. OMNIS embeds a dynamic form of neural compression paired with adaptive data encoding for task offloading, enabling flexible communication payload and computing options. We explicitly study the interplay between neural compression and information quantization in computer vision tasks and design a novel “Box” quantization scheme that improves resiliency to bit errors as a function of compression rate. Considering partial observability and differing objectives of mobile devices and the edge server, we formulate neural gate control and resource slicing optimization problems and solve them via multi- agent contextual multi-armed bandits and convex optimization algorithms. Experimental results show that OMNIS improves inference accuracy by up to 22.85% and reduces quality of service violations by up to 10x. The evaluation code is available at https://github.com/qlt315/OMNIS.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3005737
