Title | EmojiNet: A Machine Readable Emoji Sense Inventory |
Publication Type | Report |
Year of Publication | 2016 |
Authors | Sanjaya Wijeratne, Lakshika Balasuriya, Amit Sheth, Derek Doran |
Series Title | Wright Brother's Day, Wright State University |
Date Published | 10/2016 |
City | Dayton, Ohio, USA |
Keywords | Emoji Analysis, Emoji Sense Disambiguation, EmojiNet |
Abstract | With the rise of social media, ‘emoji’ have become extremely popular in online communications. People have started using emoji as a new language in social media to add color and whimsiness to their messages. Without rigid semantics attached to them, emoji symbols take on different meanings based on the context of a message. This has resulted in ambiguity in emoji use. Similar to word sense disambiguation, machine readable sense inventories that list emoji meanings are essential for machines to understand emoji without ambiguity. As the first step towards building machines that can understand emoji, this paper presents EmojiNet, the first machine readable sense inventory for emoji. It links Unicode emoji representations to their English meanings extracted from the Web, enabling systems to link emoji with their context-specific meaning. EmojiNet is automatically constructed by integrating multiple emoji resources with BabelNet, which is the most comprehensive multilingual sense inventory available to date. The paper discusses its construction, evaluates the automatic resource creation process, and presents a use case where EmojiNet disambiguates emoji usage in tweets. EmojiNet is available online for use at http://emojinet.knoesis.org. |