With the recent development of deep learning and Natural Language Generation (NLG) techniques in particular, machines have been given the ability to write and speak, overcoming the limitations of traditional template-based approaches. However, implementing sophisticated models for practical usage is definitely demanding, especially in business contexts in which the final output quality has a direct impact on the economic performance of the service. Our work deals with proposing a specific framework for data-to-text and text-to-speech generation, mainly based on a pre-trained language model and popular NLP tasks, as Named Entity Recognition. The ultimate objective is to generate detailed product descriptions autonomously, starting from the product sheet, in order to promote automation and reduce human effort. Rather than simply reporting product features, the system independently creates a coherent structure by interpreting the information it receives. In this way, it is able to formulate reasonable considerations and motivate possible benefits while trying to remain consistent with the source information. The obtained results present the tendency to properly reproduce the semantic, lexical and general linguistic style of the given context.

Generation of textual/video descriptions for technological products based on structured data / Avignone, Andrea; Fiori, Alessandro; Chiusano, Silvia; Rizzo, Giuseppe. - (2023). (Intervento presentato al convegno IEEE International Conference Application of Information and Communication Technologies tenutosi a Baku (AZ) nel 18-20 October 2023) [10.1109/AICT59525.2023.10313177].

Generation of textual/video descriptions for technological products based on structured data

Avignone, Andrea;Fiori, Alessandro;Chiusano, Silvia;Rizzo, Giuseppe
2023

Abstract

With the recent development of deep learning and Natural Language Generation (NLG) techniques in particular, machines have been given the ability to write and speak, overcoming the limitations of traditional template-based approaches. However, implementing sophisticated models for practical usage is definitely demanding, especially in business contexts in which the final output quality has a direct impact on the economic performance of the service. Our work deals with proposing a specific framework for data-to-text and text-to-speech generation, mainly based on a pre-trained language model and popular NLP tasks, as Named Entity Recognition. The ultimate objective is to generate detailed product descriptions autonomously, starting from the product sheet, in order to promote automation and reduce human effort. Rather than simply reporting product features, the system independently creates a coherent structure by interpreting the information it receives. In this way, it is able to formulate reasonable considerations and motivate possible benefits while trying to remain consistent with the source information. The obtained results present the tendency to properly reproduce the semantic, lexical and general linguistic style of the given context.
2023
979-8-3503-0356-8
File in questo prodotto:
File Dimensione Formato  
Generation_of_Textual_Video_Descriptions_for_Technological_Products_Based_on_Structured_Data.pdf

non disponibili

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 2.15 MB
Formato Adobe PDF
2.15 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2982742