With the recent development of deep learning and Natural Language Generation (NLG) techniques in particular, machines have been given the ability to write and speak, overcoming the limitations of traditional template-based approaches. However, implementing sophisticated models for practical usage is definitely demanding, especially in business contexts in which the final output quality has a direct impact on the economic performance of the service. Our work deals with proposing a specific framework for data-to-text and text-to-speech generation, mainly based on a pre-trained language model and popular NLP tasks, as Named Entity Recognition. The ultimate objective is to generate detailed product descriptions autonomously, starting from the product sheet, in order to promote automation and reduce human effort. Rather than simply reporting product features, the system independently creates a coherent structure by interpreting the information it receives. In this way, it is able to formulate reasonable considerations and motivate possible benefits while trying to remain consistent with the source information. The obtained results present the tendency to properly reproduce the semantic, lexical and general linguistic style of the given context.
Generation of textual/video descriptions for technological products based on structured data / Avignone, Andrea; Fiori, Alessandro; Chiusano, Silvia; Rizzo, Giuseppe. - (2023). (Intervento presentato al convegno IEEE International Conference Application of Information and Communication Technologies tenutosi a Baku (AZ) nel 18-20 October 2023) [10.1109/AICT59525.2023.10313177].
Generation of textual/video descriptions for technological products based on structured data
Avignone, Andrea;Fiori, Alessandro;Chiusano, Silvia;Rizzo, Giuseppe
2023
Abstract
With the recent development of deep learning and Natural Language Generation (NLG) techniques in particular, machines have been given the ability to write and speak, overcoming the limitations of traditional template-based approaches. However, implementing sophisticated models for practical usage is definitely demanding, especially in business contexts in which the final output quality has a direct impact on the economic performance of the service. Our work deals with proposing a specific framework for data-to-text and text-to-speech generation, mainly based on a pre-trained language model and popular NLP tasks, as Named Entity Recognition. The ultimate objective is to generate detailed product descriptions autonomously, starting from the product sheet, in order to promote automation and reduce human effort. Rather than simply reporting product features, the system independently creates a coherent structure by interpreting the information it receives. In this way, it is able to formulate reasonable considerations and motivate possible benefits while trying to remain consistent with the source information. The obtained results present the tendency to properly reproduce the semantic, lexical and general linguistic style of the given context.File | Dimensione | Formato | |
---|---|---|---|
Generation_of_Textual_Video_Descriptions_for_Technological_Products_Based_on_Structured_Data.pdf
non disponibili
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
2.15 MB
Formato
Adobe PDF
|
2.15 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2982742