The colloquial WordNet: Extending Princeton WordNet with neologisms
McCrae, John P. ; Wood, Ian D. ; HIcks, Amanda
McCrae, John P.
Wood, Ian D.
HIcks, Amanda
Loading...
Repository DOI
Publication Date
2017-05-27
Keywords
Type
Article
Downloads
Citation
McCrae J.P., Wood I., Hicks A. (2017) The Colloquial WordNet: Extending Princeton WordNet with Neologisms. In: Gracia J., Bond F., McCrae J., Buitelaar P., Chiarcos C., Hellmann S. (eds) Language, Data, and Knowledge. LDK 2017. Lecture Notes in Computer Science, vol 10318. Springer, Cham
Abstract
Princeton WordNet is one of the most important resources for natural language processing, but has not been updated for over ten years and is not suitable for analyzing the fast moving language as used on social media. We propose an extension to WordNet, with new terms that have been found from Twitter and Reddit, and cover language usage that is emergent or vulgar. In addition to our methodology for extraction, we analyze new terms to provide information about how new words are entering the English language. Finally, we discuss publishing this resource both as linguistic linked open data and as part of the Global WordNet Association’s Interlingual Index.
Publisher
Springer International Publishing
Publisher DOI
10.1007/978-3-319-59888-8_17
Rights
Attribution-NonCommercial-NoDerivs 3.0 Ireland