Voice user interfaces rely on keyword spotting (KWS) to detect wake-word commands, enabling low-power devices to switch from drowsy to active states and initiate more complex tasks. In embedded systems, KWS combines handcrafted acoustic feature extraction with lightweight neural network classifiers to achieve accurate detection within strict resource constraints. Adapting KWS to time-varying energy budgets requires optimization strategies that operate at runtime. Most existing approaches adjust the complexity of the neural model but overlook that a substantial amount of latency, and thus energy consumption, is due to feature extraction, which remains unaffected by model scaling. This work introduces Runtime Feature Compression (RFC), a dynamic rescaling strategy that modulates the workload of the entire KWS pipeline. RFC promotes the hop-length parameter of the Short-Time Fourier Transform as a runtime control knob to adjust the number of time frames in speech features, allowing a single model to operate across multiple latency modes. To support this flexibility, we introduce two training-time techniques: HopAugment, a data augmentation scheme that exposes the model to variable hop lengths during training, and Masked Layers, which preserve consistent activation statistics during training and inference under compressed feature settings. Evaluations on four KWS datasets using the TC-ResNet model family show that RFC outperforms model scaling techniques, offering a wider range of latency-accuracy trade-offs. RFC achieves up to 31.8% lower latency without accuracy degradation, or up to 0.30% higher accuracy within equivalent latency bounds. These results demonstrate that RFC improves adaptability in energy-constrained IoT speech interfaces.
A set of ablation studies further demonstrates the robustness of RFC by evaluating the role of its training components and batching strategies, its ability to preserve accuracy with a shared weight set, its scalability across operating modes, and its applicability to different model architectures.
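As a concrete illustration of the control knob described above (not the authors' code): in a framed STFT front end, the hop length directly sets how many time frames a fixed-duration utterance produces, and hence the workload of both feature extraction and the classifier. The sketch below uses the standard no-padding frame-count formula with a typical KWS configuration (1 s of 16 kHz audio, 25 ms analysis window); the specific hop values are illustrative assumptions, not the paper's operating modes.

```python
# Minimal sketch: the STFT hop length controls the number of time frames
# in the feature map, and therefore the compute of the whole KWS pipeline.

def num_frames(n_samples: int, win_length: int, hop_length: int) -> int:
    """Frame count for a framed STFT without edge padding."""
    return 1 + (n_samples - win_length) // hop_length

# 1 s of audio at 16 kHz with a 25 ms (400-sample) window -- a common
# KWS front-end setup (assumed here for illustration).
n_samples, win = 16_000, 400
for hop in (160, 240, 320):  # 10 ms, 15 ms, 20 ms hops (hypothetical modes)
    print(f"hop={hop:3d} -> {num_frames(n_samples, win, hop)} frames")
```

Doubling the hop roughly halves the frame count (98 frames at a 10 ms hop versus 49 at 20 ms in this setup), which is why promoting it to a runtime parameter rescales the latency of feature extraction and inference together.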
Peluso, Valentino; Calimera, Andrea; Macii, Enrico; Montuschi, Paolo. Runtime Feature Compression for Adaptive Keyword Spotting on Embedded Systems. IEEE Internet of Things Journal, ISSN 2327-4662, 2026, pp. 1-1. DOI: 10.1109/jiot.2026.3676917
Runtime Feature Compression for Adaptive Keyword Spotting on Embedded Systems
Peluso, Valentino; Calimera, Andrea; Macii, Enrico; Montuschi, Paolo
2026
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/3010606
Notice: the data displayed have not been validated by the university.
