Extracting structured knowledge from scientific literature is crucial for helping professionals make well-informed decisions. This paper presents an approach to distilling knowledge from biomedical documents within the context of Named Entity Disambiguation (NED). The proposed method leverages a joint representation of biomedical entities, combining pre-trained language models with graph machine learning techniques. A Siamese Neural Network (SNN) is trained to optimize this joint representation by integrating the contextual text embeddings of entity mentions with the graph embeddings of corresponding canonical entities in a biomedical Knowledge Graph (KG). During the inference phase, the SNN model assigns a score to this joint representation to disambiguate the target entity among a set of candidates. To the best of our knowledge, this is the first NED method in the biomedical domain that incorporates graph embeddings using a neural model. We empirically evaluated the effectiveness of our approach against well-known biomedical datasets, such as MedMentions and BC5CDR. The results demonstrate a promising direction in utilizing the relational knowledge captured by graph embeddings for the NED task.
Towards Named Entity Disambiguation with Graph Embeddings / Colliani, Felice Paolo; Futia, Giuseppe; Garifo, Giovanni; Vetro, Antonio; De Martin, Juan Carlos. - (2024), pp. 1-6. (Intervento presentato al convegno 2024 IEEE 18th International Conference on Application of Information and Communication Technologies (AICT) tenutosi a Turin (IT) nel 25-27 September 2024) [10.1109/aict61888.2024.10740424].
Towards Named Entity Disambiguation with Graph Embeddings
Colliani, Felice Paolo;Futia, Giuseppe;Garifo, Giovanni;Vetro, Antonio;De Martin, Juan Carlos
2024
Abstract
Extracting structured knowledge from scientific literature is crucial for helping professionals make well-informed decisions. This paper presents an approach to distilling knowledge from biomedical documents within the context of Named Entity Disambiguation (NED). The proposed method leverages a joint representation of biomedical entities, combining pre-trained language models with graph machine learning techniques. A Siamese Neural Network (SNN) is trained to optimize this joint representation by integrating the contextual text embeddings of entity mentions with the graph embeddings of corresponding canonical entities in a biomedical Knowledge Graph (KG). During the inference phase, the SNN model assigns a score to this joint representation to disambiguate the target entity among a set of candidates. To the best of our knowledge, this is the first NED method in the biomedical domain that incorporates graph embeddings using a neural model. We empirically evaluated the effectiveness of our approach against well-known biomedical datasets, such as MedMentions and BC5CDR. The results demonstrate a promising direction in utilizing the relational knowledge captured by graph embeddings for the NED task.File | Dimensione | Formato | |
---|---|---|---|
Towards_Named_Entity_Disambiguation_with_Graph_Embeddings.pdf
non disponibili
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
370.44 kB
Formato
Adobe PDF
|
370.44 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2994321