Modeling the evolution of topics and forecast future trends is a crucial task when analyzing scientific papers. In this work we propose tASKE (temporal Automated System for Knowledge Extraction), a dynamic topic modeling approach which exploits zero-shot classification and contextual embeddings in order to track topic evolution through time. The approach is evaluated against a corpus of data science papers, assessing the ability of tASKE to correctly classify documents and retrieving relevant derivation relationships between older and new topics in time.

Exploiting Contextual Embeddings to Extract Topic Genealogy from Scientific Literature / Ferrara, A.; Montanelli, S.; Picascia, S.; Riva, D.. - 3656:(2023). (Intervento presentato al convegno 3rd International Workshop on Scientific Document Understanding (SDU 2023) tenutosi a Remote nel February 14, 2023).

Exploiting Contextual Embeddings to Extract Topic Genealogy from Scientific Literature

Riva D.
2023

Abstract

Modeling the evolution of topics and forecast future trends is a crucial task when analyzing scientific papers. In this work we propose tASKE (temporal Automated System for Knowledge Extraction), a dynamic topic modeling approach which exploits zero-shot classification and contextual embeddings in order to track topic evolution through time. The approach is evaluated against a corpus of data science papers, assessing the ability of tASKE to correctly classify documents and retrieving relevant derivation relationships between older and new topics in time.
2023
File in questo prodotto:
File Dimensione Formato  
paper5.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 884.83 kB
Formato Adobe PDF
884.83 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2992901