Modeling the evolution of topics and forecast future trends is a crucial task when analyzing scientific papers. In this work we propose tASKE (temporal Automated System for Knowledge Extraction), a dynamic topic modeling approach which exploits zero-shot classification and contextual embeddings in order to track topic evolution through time. The approach is evaluated against a corpus of data science papers, assessing the ability of tASKE to correctly classify documents and retrieving relevant derivation relationships between older and new topics in time.
Exploiting Contextual Embeddings to Extract Topic Genealogy from Scientific Literature / Ferrara, A.; Montanelli, S.; Picascia, S.; Riva, D.. - 3656:(2023). (Intervento presentato al convegno 3rd International Workshop on Scientific Document Understanding (SDU 2023) tenutosi a Remote nel February 14, 2023).
Exploiting Contextual Embeddings to Extract Topic Genealogy from Scientific Literature
Riva D.
2023
Abstract
Modeling the evolution of topics and forecast future trends is a crucial task when analyzing scientific papers. In this work we propose tASKE (temporal Automated System for Knowledge Extraction), a dynamic topic modeling approach which exploits zero-shot classification and contextual embeddings in order to track topic evolution through time. The approach is evaluated against a corpus of data science papers, assessing the ability of tASKE to correctly classify documents and retrieving relevant derivation relationships between older and new topics in time.File | Dimensione | Formato | |
---|---|---|---|
paper5.pdf
accesso aperto
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Creative commons
Dimensione
884.83 kB
Formato
Adobe PDF
|
884.83 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2992901