This paper shows the effectiveness of a DBpedia-based approach for text categorization in the e-government field. Our use case is the analysis of all the speech transcripts of current White House members. This task is performed by means of TellMeFirst, an open-source software that leverages the DBpedia knowledge base and the English Wikipedia linguistic corpus for topic extraction. Analysis results allow to identify the main political trends addressed by the White House, increasing the citizens' awareness to issues discussed by politicians. Unlike methods based on string recognition, TellMeFirst semantically classifies documents through DBpedia URIs, gathering all the words that belong to a similar area of meaning (such as synonyms, hypernyms and hyponyms of a lemma) under the same unambiguous concept.
Exploiting Linked Open Data and Natural Language Processing for Classification of Political Speech / Futia, Giuseppe; Cairo, Federico; Morando, Federico; Leschiutta, Luca. - (2014). (Intervento presentato al convegno International Conference for E-Democracy and Open Government 2014 tenutosi a Krems (Austria) nel 21.05.2014 - 23.05.2014).
Exploiting Linked Open Data and Natural Language Processing for Classification of Political Speech
FUTIA, GIUSEPPE;CAIRO, FEDERICO;MORANDO, FEDERICO;LESCHIUTTA, LUCA
2014
Abstract
This paper shows the effectiveness of a DBpedia-based approach for text categorization in the e-government field. Our use case is the analysis of all the speech transcripts of current White House members. This task is performed by means of TellMeFirst, an open-source software that leverages the DBpedia knowledge base and the English Wikipedia linguistic corpus for topic extraction. Analysis results allow to identify the main political trends addressed by the White House, increasing the citizens' awareness to issues discussed by politicians. Unlike methods based on string recognition, TellMeFirst semantically classifies documents through DBpedia URIs, gathering all the words that belong to a similar area of meaning (such as synonyms, hypernyms and hyponyms of a lemma) under the same unambiguous concept.File | Dimensione | Formato | |
---|---|---|---|
futia2014exploiting.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Creative commons
Dimensione
422.71 kB
Formato
Adobe PDF
|
422.71 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2540694
Attenzione
Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo