Publication

Unsupervised Graph-Based Topic Labelling using DBpedia

Hulpus, Ioana
Hayes, Conor
Karnstedt, Marcel
Greene, Derek
Loading...
Thumbnail Image
Identifiers
http://dl.acm.org/citation.cfm?id=2433454
http://hdl.handle.net/10379/4528
https://doi.org/10.13025/21045
Publication Date
2013
Keywords
Type
Conference Paper
Downloads
Citation
Ioana Hulpus and Conor Hayes and Marcel Karnstedt and Derek Greene (2013) Unsupervised Graph-Based Topic Labelling using DBpedia . In: Stefano Leonardi, Alessandro Panconesi eds. Web Search and Data Mining - WSDM 2013
Abstract
Automated topic labelling brings benefits for users aiming at analysing and understanding document collections, as well as for search engines targetting at the linkage between groups of words and their inherent topics. Current approaches to achieve this suffer in quality, but we argue their performances might be improved by setting the focus on the structure in the data. Building upon research for concept disambiguation and linking to DBpedia, we are taking a novel approach to topic labelling by making use of structured data exposed by DBpedia. We start from the hypothesis that words co-occuring in text likely refer to concepts that belong closely together in the DBpedia graph. Using graph centrality measures, we show that we are able to identify the concepts that best represent the topics. We comparatively evaluate our graph-based approach and the standard text-based approach, on topics extracted from three corpora, based on results gathered in a crowd-sourcing experiment. Our research shows that graph-based analysis of DBpedia can achieve better results for topic labelling in terms of both precision and topic coverage.
Funder
|~|SFI|~|
Publisher
Publisher DOI
Rights
Attribution-NonCommercial-NoDerivs 3.0 Ireland