SENTIMENT ANALYSIS OF PEOPLE’S ACCEPTANCE TOWARDS THE NEW MALAYSIAN GOVERNMENT USING NAÏVE BAYES METHOD
DOI:
https://doi.org/10.46754/umtjur.v2i3.170Keywords:
Sentiment analysis, Naive Bayes method, natural language processingAbstract
Sentiment analysis is a field of research that has a significant impact on today’s nations, politics and businesses. It is an algorithmic process to comprehend the opinions of a given subject based on the Natural Language Processing (NLP) methodologies. It has received much attention in recent years and is proven vital in various fields, e.g., online product reviews and social media analysis (Twitter, Facebook, etc.). This paper reports the outcome of sentiment analysis to investigate people’s acceptance of Pakatan Harapan, as the new Malaysian government, spearheaded by Tun Dr. Mahathir Mohamad and Dr. Wan Azizah, with an influence of Dato Seri Anwar Ibrahim. The objective is to classify tweets into three types of sentiments; positive, neutral and negative using Naïve Bayes method which is readily available in Python. The first step is tweets extraction for a month (March to April 2019) using search queries: {Pakatan Harapan, Mahathir, Anwar Ibrahim, Wan Azizah}. It is followed by tweets wrangling using NLP library and lastly output visualization in the form of a word cloud. A word cloud is a visual representation of text data with various font sizes depending on its probabilities. Final results showed that the tweets related to new government consist of neutral sentiment (41%) followed by positive sentiment (30%) and negative sentiment (29%). Malaysians do prefer the new government. However careful mitigation steps must be crafted to overcome controversial issues such as the ‘Rome Statute’ to avoid negative digital footprint, hence winning the Malaysians’ heart.
References
Alec Go, R. B. (2009). Twitter Sentiment Classification using Distant Supervision. Natural Language Processing, 50-62.
Anderson T. W., & Golden L.L. (1984), ”Lifestyle and Psychographics: a Critical Review and Recommendation”, in NA - Advances in Consumer Research Volume 11, eds. Thomas C. Kinnear, Provo, UT : Association for Consumer Research, Pages: 405-411.
Andrew McCallum, K. N. (1998). A Comparison of Event Models for Naive Bayes Text Classification. Washinghton: AAAI Technical Report.
Antoine Boutet, H. K. (2012). What’s in Your Tweets? I Know Who You Supported in the UK 2010 General Election. ASONAM ‘12 Proceedings of 2012 International Conference on Advances in Social Networks Analysis and Mining (pp. 132-139). Washington,DC: IEEE Computer Society.
Bird, Steven, Edward Loper & Ewan Klein (2009). Natural Language Processing with Python. O’Reilly Media Inc.
Bo Pang, L. L. (2002). Thumbs up? Sentiment Classification using Machine Learning Techniques. Proceedings of the conference on Empirical Methods in Natural Language Processing (EMNLP), (pp. 79-86).
Castillo, O. C. (2006). Text Analysis Using Different Graph-Based Representations. SIGIR ‘06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval (pp. 485-492). Seattle: ACM New York.
Christopher D. Manning, P. R. (2008). Text classification and Naive Bayes. In Introduction to Information Retrieval (pp. 253-287). Cambridge University Press.
Friedman, J. H. (1997). On bias, variance, 0/1 - loss, and the curse-of-dimensionality. Data Mining and Knowledge Discovery, 55-77.
Hern, Alex (April 10, 2018). “How to check whether Facebook shared your data with Cambridge Analytica”. The Guardian. https://www.theguardian.com/technology/2018/apr/10/facebook-notify-users-data-harvested-cambridge-analytica
Hays, J. (June, 2015). POLITICS AND POLITICAL PARTIES IN MALAYSIA. Facts And Details: http://factsanddetails.com/southeast-asia/Malaysia/sub5_4d/entry-3668.html
Hutto, C. a. (2014). VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text. Eighth International Conference on Weblogs and Social Media (ICWSM-14) (pp. 123-134). Ann Arbor, MI: Association for the Advancement of Artificial Intelligence.
John Violos, K. T. (2009). Sentiment Analysis using Word-Graphs. WIMS ‘16 Proceedings of the 6th International Conference on Web Intelligence,Mining and Semantics. Nimes,France: ACM New York.
Kushal Dave, S. L. (2003). Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews. WWW’03 Proceedings of the 12th International conferenc on World Wide Web (pp. 519-528). Budapest,Hungary: ACM New York.
Nir Friedman, D. G. (1997). Bayesian network classifiers. Machine Learning, 131-163.
Pat Langley, W. I. (1992). An analysis of Bayesian classifiers. AAAI ‘92 Proceedings of the tenth national conference on Artificial Intellegence (pp. 223-228). Son Jose,California: AAAI Press.
Perkins, J. (2014). Python 3 Text Processing with NLTK 3 Cookbook. Birmingham,UK: Packt Publishing Ltd.
Russell, R. (2018). Machine Learning:Step-by-Step Guide To Implement Machine Learning Algorithms with Python. California: CreateSpace Independent Publishing Platform.
Sahami, M. (1996). Learning limited dependence Bayesian classifiers. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (pp. 335-338). AAAI Press.
Sue, A. (6 May, 2013). Analisa Keputusan Pilihanraya Umum ke 13 Malaysia : BN menang 133 kerusi, PR 89 kerusi. Wanista. https://www.wanista.com/2013/12524/analisa-keputusan-pilihanraya-umum-ke-13-malaysia-bn-menang-133-kerusi-pr-89-kerusi/
Teacher, L. (Friday February, 2018). The Political Parties in Malaysia. Free Law Essay. https://www.lawteacher.net/free-law-essays/administrative-law/the-political-parties-in-malaysia-administrative-law-essay.php
Tetsuya Nasukawa, J. Y. (2003). Sentiment Analysis :Capturing favorability using Natural Language Processing. K-CAP ‘03 Proceedings of the 2nd International Conference on Knowledge capture (pp. 70-77). Sanibel Island,FL: ACM New York.
Turney, P. D. (2002). Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL) (pp. 417-424). Philadelphia: Association for Computational Linguistic.