Enhancing Smart Home Interaction through Multimodal Command Disambiguation

Calò, Tommaso; De Russis, Luigi

doi:10.1007/s00779-024-01827-3

Smart speakers are entering our homes and enriching the connected ecosystem already present in them. Home inhabitants can use those to execute relatively simple commands, e.g., turning a lamp on. Their capabilities to interpret more complex and ambiguous commands (e.g., make this room warmer) are limited, if not absent. Large Language Models (LLMs) can offer creative and viable solutions to enable a practical and user-acceptable interpretation of such ambiguous commands. This paper introduces an interactive disambiguation approach that integrates visual and textual cues with natural language commands. After contextualizing the approach with a use case, we test it in an experiment where users are prompted to select the appropriate cue (an image or a textual description) to clarify ambiguous commands, thereby refining the accuracy of the system's interpretations. Outcomes from the study indicate that the disambiguation system produces responses well-aligned with user intentions, and that participants found the textual descriptions slightly more effective. Finally, interviews reveal heightened satisfaction with the smart-home system when engaging with the proposed disambiguation approach.

Enhancing Smart Home Interaction through Multimodal Command Disambiguation / Calò, Tommaso; DE RUSSIS, Luigi. - In: PERSONAL AND UBIQUITOUS COMPUTING. - ISSN 1617-4909. - 28:6(2024), pp. 985-1000. [10.1007/s00779-024-01827-3]

Enhancing Smart Home Interaction through Multimodal Command Disambiguation

Tommaso Calò;Luigi de Russis

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2024
			
	Codice DOI
	
				https://dx.doi.org/10.1007/s00779-024-01827-3
			
	Titolo della Rivista
	
				PERSONAL AND UBIQUITOUS COMPUTING
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
s00779-024-01827-3.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 1.09 MB Formato Adobe PDF Visualizza/Apri	1.09 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2990415

PORTO @ Archivio Istituzionale della Ricerca

Enhancing Smart Home Interaction through Multimodal Command Disambiguation

Tommaso Calò;Luigi de Russis

2024

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)