Natural Language Processing (NLP) for Sentiment Analysis in Social Media

Authors

  • Thomas Joseph, Strathmore University

DOI:

https://doi.org/10.47941/ijce.2135

Keywords:

Sentiment Analysis, Natural Language Processing (NLP), Machine Learning, Deep Learning, Social Media

Abstract

Purpose: This study examined the use of Natural Language Processing (NLP) for sentiment analysis in social media.

Methodology: The study adopted a desk research methodology. Desk research relies on secondary data, that is, data that can be collected without fieldwork. Because it draws on existing resources, it is often considered a low-cost technique compared with field research; the main costs are the executive’s time, telephone charges and directories. The study therefore relied on previously published studies, reports and statistics, which were readily accessed through online journals and libraries.

Findings: The findings reveal a contextual and methodological gap in research on Natural Language Processing (NLP) for sentiment analysis in social media. A preliminary empirical review showed that advanced computational techniques have significantly improved the understanding of sentiment across diverse social media platforms. Machine learning algorithms and deep learning models such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) demonstrated robust capabilities in categorizing sentiment accurately and in capturing contextual nuances such as sarcasm and slang. The research highlighted the interdisciplinary nature of NLP applications, which integrate linguistics with computer science and the social sciences to develop effective frameworks for analyzing large-scale social media data. These findings contribute to better decision-making in marketing, politics, and public opinion research, and point towards hybrid NLP models as a future direction for improving sentiment analysis across languages and cultural contexts.
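As a rough illustration of what categorizing sentiment with a machine learning algorithm can look like, the following is a minimal sketch of a multinomial Naive Bayes classifier with add-one smoothing. It is not the method of any study reviewed here. The training sentences, the `pos`/`neg` labels, and the names `TRAIN`, `tokenize`, `train`, `predict` and `MODEL` are all hypothetical and chosen only for this example; deep learning models such as CNNs and RNNs would replace the word counts with learned representations.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy training data; not drawn from the study.
TRAIN = [
    ("i love this phone", "pos"),
    ("great service and friendly staff", "pos"),
    ("what a wonderful day", "pos"),
    ("i hate waiting in line", "neg"),
    ("terrible service and rude staff", "neg"),
    ("this phone is awful", "neg"),
]


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(examples):
    """Count words per label and documents per label."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    for text, label in examples:
        label_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab


def predict(text, model):
    """Return the label with the highest smoothed log-probability."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    scores = {}
    for label in label_counts:
        # Log prior: share of training documents carrying this label.
        score = math.log(label_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in tokenize(text):
            if word not in vocab:
                continue  # words never seen in training are ignored
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][word] + 1) / (total_words + len(vocab))
            )
        scores[label] = score
    return max(scores, key=scores.get)


MODEL = train(TRAIN)
print(predict("i love the friendly staff", MODEL))
print(predict("rude and awful", MODEL))
```

A word-count model of this kind has no notion of word order or context, which is one reason sarcasm and slang are usually handled with richer models and larger annotated corpora.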

Unique Contribution to Theory, Practice and Policy: Social Constructionism, Cognitive Linguistics and Discourse Analysis Theory may be used to anchor future studies on Natural Language Processing (NLP). The recommendations aim to advance theoretical foundations by exploring deep learning models and nuanced sentiment lexicons. For practice, they call for the development of scalable NLP tools for real-time data processing and their integration into social media platforms. For policy, they focus on establishing ethical guidelines for data privacy and bias mitigation in sentiment analysis algorithms. Cross-disciplinary collaboration that integrates insights from computer science, linguistics, psychology, and the social sciences is recommended as a way to foster innovation. Education initiatives and international collaborations are prioritized to build capacity and standardize methodologies globally, supporting advances in both research and the practical deployment of sentiment analysis technologies.

Published

2024-07-31

How to Cite

Joseph, T. (2024). Natural Language Processing (NLP) for Sentiment Analysis in Social Media. International Journal of Computing and Engineering, 6(2), 35–48. https://doi.org/10.47941/ijce.2135

Section

Articles