Extracting structured knowledge from scientific literature is crucial for helping professionals make well-informed decisions. This paper presents an approach to distilling knowledge from biomedical documents within the context of Named Entity Disambiguation (NED). The proposed method leverages a joint representation of biomedical entities, combining pre-trained language models with graph machine learning techniques. A Siamese Neural Network (SNN) is trained to optimize this joint representation by integrating the contextual text embeddings of entity mentions with the graph embeddings of corresponding canonical entities in a biomedical Knowledge Graph (KG). During the inference phase, the SNN model assigns a score to this joint representation to disambiguate the target entity among a set of candidates. To the best of our knowledge, this is the first NED method in the biomedical domain that incorporates graph embeddings using a neural model. We empirically evaluated the effectiveness of our approach against well-known biomedical datasets, such as MedMentions and BC5CDR. The results demonstrate a promising direction in utilizing the relational knowledge captured by graph embeddings for the NED task.

Towards Named Entity Disambiguation with Graph Embeddings / Colliani, Felice Paolo; Futia, Giuseppe; Garifo, Giovanni; Vetro, Antonio; De Martin, Juan Carlos. - (2024), pp. 1-6. (Intervento presentato al convegno 2024 IEEE 18th International Conference on Application of Information and Communication Technologies (AICT) tenutosi a Turin (IT) nel 25-27 September 2024) [10.1109/aict61888.2024.10740424].

Towards Named Entity Disambiguation with Graph Embeddings

Colliani, Felice Paolo;Futia, Giuseppe;Garifo, Giovanni;Vetro, Antonio;De Martin, Juan Carlos
2024

Abstract

Extracting structured knowledge from scientific literature is crucial for helping professionals make well-informed decisions. This paper presents an approach to distilling knowledge from biomedical documents within the context of Named Entity Disambiguation (NED). The proposed method leverages a joint representation of biomedical entities, combining pre-trained language models with graph machine learning techniques. A Siamese Neural Network (SNN) is trained to optimize this joint representation by integrating the contextual text embeddings of entity mentions with the graph embeddings of corresponding canonical entities in a biomedical Knowledge Graph (KG). During the inference phase, the SNN model assigns a score to this joint representation to disambiguate the target entity among a set of candidates. To the best of our knowledge, this is the first NED method in the biomedical domain that incorporates graph embeddings using a neural model. We empirically evaluated the effectiveness of our approach against well-known biomedical datasets, such as MedMentions and BC5CDR. The results demonstrate a promising direction in utilizing the relational knowledge captured by graph embeddings for the NED task.
2024
979-8-3503-8753-7
File in questo prodotto:
File Dimensione Formato  
Towards_Named_Entity_Disambiguation_with_Graph_Embeddings.pdf

non disponibili

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 370.44 kB
Formato Adobe PDF
370.44 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2994321