Twitter sentiment analysis in the era of emojis

Li, Mengdi (2018) Twitter sentiment analysis in the era of emojis. PhD thesis, University of Nottingham.

[img] PDF (Thesis - as examined) - Repository staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (3MB)


Twitter has become an important site for national discussions where we can get a new and timely update of the public opinion towards any event. Twitter Sentiment Analysis (TSA) can be an effective method for unpacking the deep insights embodied within the opinions of the public. Recently, various TSA techniques have been developed, but little consideration has gone into emojis, which is a new invention and has been popularly shared by Twitter users from different countries, with various demographic characteristics, and diverse cultural backgrounds. The ubiquitous adoption of emojis on Twitter provides new opportunities to analyse sentiment expressions in a textual context. Emojis should be included when conducting TSA as the meaning of a Twitter post and its sentiment can be identified with greater clarity and accuracy with emojis. This research aims to develop novel approaches that handle emojis properly and tackle current open issues in TSA. Consisting of four phases, this thesis presents a comprehensive and in-depth research work in the field of Emoji Analytics and TSA. Several studies have been conducted to investigate emoji usage on Twitter and evaluate their effects on TSA. The experimental results demonstrate that emojis has become an essential component of Twitter communication and it is an important area of study complementary to TSA, implying promising future research opportunities for TSA. A novel TSA methodological framework that collects, pre-processes, analyses and maps citizen sentiments from Twitter in helping learn citizens’ moods has been implemented and proved to be effective. The novel framework identifies the best setting for TSA when involving emojis, and proposes an effective emoji training heuristic, which is feasible for both ternary and multi-class classification of tweets. Besides, it innovatively includes the visualisation of user-generated contents in a location-based manner on geographical maps, which provides a much easier-to-understand visual representation of the sentiment. The methodological framework has been proved applicable in real-world scenarios and can be used to support research in other fields. Being the first to consider popularity of emojis on Twitter and include them in performing TSA, this research is considered to be a pioneering work in the field, suggesting a new direction for TSA in the era of emojis.

Item Type: Thesis (University of Nottingham only) (PhD)
Supervisors: Ch'ng, Eugene
Chong, Alain
Keywords: twitter, sentiment analysis, emojis
Subjects: Q Science > QA Mathematics
Faculties/Schools: UNNC Ningbo, China Campus > Faculty of Humanities and Social Sciences > School of International Communications
Item ID: 52019
Depositing User: LI, Mengdi
Date Deposited: 14 Aug 2018 02:04
Last Modified: 07 Feb 2019 18:46

Actions (Archive Staff Only)

Edit View Edit View