Identification of Fake News Using Emotional Profiling as an Approach to Text Analysis

Document Type : Research Article

Authors

1 Department of Economy and Finance, V.I. Vernadsky Crimean Federal University, Simferopol, Russian Federation

2 Department of Management, University of Isfahan, Isfahan, Iran

Abstract
This article presents new methods and tools used in the field of text analysis to identify fake news in the media. The problem with the research is that, as a rule, to identify fakes, a training dataset is required, on which thematic fakes were tested. This is not always feasible and requires additional resources. To solve this problem, a comprehensive research methodology has been developed that covers most detection tools, even in the absence of an established database containing reliable and fake news. The study includes a combination of various algorithms combined into a single analytical structure, presented in the work in the form of pseudocode. The authors introduce the concept of an "emotional fake model" similar to individual emotions included in a broader emotional spectrum. The essence of the model is to evaluate fakes based on the structure of definitions of emotions formed in fakes, which differ from the original signals due to different weight coefficients. The innovation involves a two—stage identification of fakes - initially clusters of messages from the text corpus are identified, and then, based on text analysis tools, their linguistic features and emotional differences are revealed (based on a set of emotions POMS). In the context of creating fake news using neural networks, emotional coloring plays a crucial role, providing a permanent foundation that can serve as a cornerstone for identification.

Keywords


[1] Malyshenko, K. A., Malyshenko, V. A., and Anashkina, M. V. Methodical approaches to forecasting dynamics of the stock market based on Text-Mining. In Proceedings of the 32nd International Business Information Management Association Conference, IBIMA 2018 - Vision 2020: Sustainable Economic Development and Application of Innovation Management from Regional expansion to Global Growth (IBIMA 2018 - Vision 2020, pages 1030–1044, 2018.
[2] Malyshenko, K. A., Malyshenko, V. A., and Anashkina, M. V. Using the orange software package to assess the accuracy of ARIMA and VAR models for predicting stock prices on the securities market. In CEUR Workshop Proceedings, papes 171–185, 2021.
[3] Nan, Qiong, Juan Cao, Yongchun Zhu, Yanyan Wang, and Jintao Li. MDFEND: Multi-domain Fake News Detection. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pages 3343–3347, 2021. doi: https://doi.org/10.1145/3459637.3482139.
[4] Hlaing, May Me Me, and Nang, Saing Moon Kham. Comparative study of fake news detection using machine learning and neural network approaches. In 11th International Workshop on Computer Science and Engineering (WCSE 2021), pages 59–64, 2021. doi: https://doi.org/10.18178/wcse.2021.02.010
[5] Morales-Vargas, Alejandro, Rafael PedrazaJim´enez, and Llu´ıs Codina. Website quality in digital media: literature review on general evaluation methods and indicators and reliability attributes. In Revista Latina de Comunicacion Social, vol. 80, pages 39–63, 2022. doi: https://doi.org/10.4185/RLCS-2022-1515.
[6] Mahmud, Fahim Belal, Mahi Md Sadek Rayhan, Mahdi Hasan Shuvo, Islam Sadia, and Md Kishor Morol. A comparative analysis of Graph Neural Networks and commonly used machine learning algorithms on fake news detection. In Proceedings - 2022 7th International Conference on Data Science and Machine Learning Applications (CDMA-2022) , pages 97–102, 2022. doi: https://doi.org/10.1109/CDMA54072.2022.00021.
[7] Lampridis, Orestis, Dimitra, Karanatsiou, and Athena, Vakali. Manifesto: a Human-centric Explainable Approach for Fake News Spreaders Detection. In Computing, vol. 104, pages 717–739, 2022. doi: https://doi.org/10.1007/s00607-021-01013-w.
[8] Stitini, Oumaima, Soulaimane, Kaloun, and Omar, Bencharef. Towards the Detection of Fake News on Social Networks Contributing to the Improvement of Trust and Transparency in Recommendation Systems: Trends and Challenges. In Information (Switzerland), vol. 13, no. 3, pages 128, 2022. doi: https://doi.org/10.3390/info13030128.
[9] Varlamis, Iraklis, Dimitrios Michail, Foteini Glykou, and Panagiotis Tsantilas. A Survey on the Use of Graph Convolutional Networks for Combating Fake News. In Future Internet, vol. 14, no. 3, pages 70, 2022. doi: https://doi.org/10.3390/fi14030070.
[10] Sharma, Shakshi, and Rajesh Sharma. Identifying Possible Rumor Spreaders on Twitter: A Weak Supervised Learning Approach. In Proceedings of the International Joint Conference on Neural Networks, pages 1–8, 2021. doi: https://doi.org/10.1109/IJCNN52387.2021.9534185.
[11] Jason C. Young, Brandyn Boyd, Katya Yefimova, Stacey Wedlake, Chris Coward and Rolf Hapel. The role of libraries in misinformation programming: A research agenda. In Journal of Librarianship and Information Science, vol. 53, no. 4, pages 539–550, 2021. doi: https://doi.org/10.1177/0961000620966650.
[12] Guo, Bin, Yasan Ding, Yueheng Sun, Shuai Ma, Ke Li, and Zhiwen Yu. The mass, fake news, and cognition security. In Frontiers of Computer Science, vol. 15, no. 3, pages 1–13, 2021. doi: https://doi.org/10.1007/s11704-020-9256-0.
[13] de Souza, Mariana Caravanti, Bruno Magalh˜aes Nogueira, Rafael Geraldeli Rossi, Ricardo Marcondes Marcacini, Brucce Neves Dos Santos, and Solange Oliveira Rezende. A networkbased positive and unlabeled learning approach for fake news detection. In Machine Learning, vol. 111, no. 10, pages 3549–3592, 2022. doi: https://doi.org/10.1007/s10994-021-06111-6.
[14] Shu, Kai, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. Fake news detection on social media: A data mining perspective. In ACM SIGKDD Explorations Newsletter, vol. 19, no. 1, pages 22–36, 2017.
[15] Vosoughi, Soroush, Deb Roy, and Sinan Aral. The spread of true and false news online. In Science, vol. 359, no. 6380, pages 1146–1151, 2018.
[16] Kim, M., Kang, J., and Yoo, C. D. Semantically regularized LSTM for imbalanced sentiment classification on Twitter. In Information Sciences, vol. 468, pages 66–80, 2018.
[17] Thorne, James, and Andreas Vlachos. Automated fact-checking: Task formulations, methods and future directions. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2 (Short Papers), pages 21–27, 2018.
[18] Wang, William Yang. ”Liar, liar pants on fire”: A new benchmark dataset for fake news detection. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 2: (Short Papers), pages 422–426, 2017.
[19] Jin, Y., Luo, D., Zhang, F., and Zhang, Y. Multimodal fake news detection via stacked heterogeneous attention network. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1230–1240, 2020.
[20] Natali Ruchansky, Sungyong Seo, Yan Liu. Csi: A hybrid deep model for fake news detection. In arXiv, 2017. doi: arXiv preprint arXiv:1703.09857.
[21] Ciampaglia, Giovanni Luca, Prashant Shiralkar, Luis M. Rocha, Johan Bollen, Filippo Menczer, and Alessandro Flammini. Computational fact checking from knowledge networks. In PLoS ONE, vol. 13, no. 1, 2018.
[22] Potthast, Martin, Sebastian K¨opsel, Benno Stein, and Matthias Hagen. Clickbait detection. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 827–830, 2017.
[23] Ma, X., Wu, J., Feng, S., and Zhou, J. Detecting rumors from microblogs with recurrent neural networks. In Knowledge-Based Systems, pages 194, 105623, 2020.
[24] Mohammadrezaei, Mohammadreza, Ebrahim Shiri Mohammad, and Amir Masoud Rahmani. Detection of Fake Accounts in Social Networks Based on One Class Classification. In ISC Int’l Journal of Information Security, vol. 11, no. 2, pages 173–183, 2019. doi: https://doi.org/10.22042/isecure.2019.165312.450.
[25] Mohammadhassanzadeh, Hossein, and Hamid Reza Shahriari. Prediction of User’s Trustworthiness in Web-based Social Networks via Text Mining. In ISC Int’l Journal of Information Security, vol. 5, no. 2, pages 171–187, 2013. doi: https://doi.org/10.22042/isecure.2014.5.2.5.
[26] Clarke, Jonathan, Hailiang Chen, Ding Du, and Yu Jeffrey Hu. Fake news, investor attention, and market reaction. In Information Systems Research, vol. 32, no. 1, pages 35–52, 2020.
[27] Paschen, Jeannette. Investigating the emotional appeal of fake news using artificial intelligence and human contributions. In Journal of Product & Brand Management, vol. 29, no. 2, pages 223–233, 2020. doi: https://doi.org/10.1108/JPBM-12-2018-2179.
[28] Denault, Vincent, Pierrich Plusquellec, Louise M.Jupe, Michel St-Yves, Norah E. Dunbar, Maria Hartwig, Siegfried L. Sporer et al. The analysis of nonverbal communication: The dangers of pseudoscience in security and justice contexts. In Anuario de Psicolog´ıa Jur´ıdica, vol. 30, no. 1, pages 1–12, 2020. doi: https://doi.org/10.5093/apj2019a9.
[29] Elstrodt, J¨urgen. The Life and Work of Gustav Lejeune Dirichlet (1805–1859). In Analytic Number Theory, vol. 7, pages 1–37, 2007.
[30] Shu, Kai, Deepak Mahudeswaran, Suhang Wang, Dongwon Lee, and Huan Liu. Fake News Net: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media. In Big data, vol. 8, no. 3, pages 171–188, 2020. doi: https://doi.org/10.1089/big.2020.0062.
[31] Ekman, Paul. An Argument for Basic Emotions. In Cognition & emotion, vol. 6, no. 3-4, pages 169–200, 1992.
[32] Plutchik, Robert. A general psychoevolutionary theory of emotion. In Theories of emotion, vol. 1, pages 3–33, Academic press, 1980.
[33] Altszyler, Edgar, Sidarta Ribeiro, Mariano Sigman, and Diego Fern´andez Slezak. The interpretation of dream meaning: Resolving ambiguity using Latent Semantic Analysis in a small corpus of text. In Consciousness and cognition, vol. 56, pages 178–187, 2017. doi: https://doi.org/10.1016/j.concog.2017.09.004.
[34] M. Saif Mohammad. Chapter 11-Sentiment analysis: Automatically detecting valence, emotions, and other affectual states from text, Editor(s): Herbert L. Meiselman. In Emotion Measurement(Second Edition), pages 323–379, Woodhead Publishing, 2021. doi: https://doi.org/10.1016/B978-0-12-821124-3.00011-9.
[35] Fennell, Peter G., Zhiya Zuo, and Kristina Lerman. Predicting and explaining behavioral data with structured feature space decomposition. In EPJ Data Science, vol. 8, no. 1, pages 1–27, 2019. doi: https://doi.org/10.1140/epjds/s13688-019-0201-0.
[36] Sievert, Carson, and Kenneth Shirley. LDAvis: A method for visualizing and interpreting topics. In Proceedings of the workshop on interactive language learning, visualization, and interfaces, pages 63–70, 2014.
[37] G´alvez, Ramiro H., and Agust´ın Gravano. Assessing the usefulness of online message board mining in automatic stock prediction systems. In Journal of Computational Science, vol. 19, pages 43–56, 2017. doi: https://doi.org/10.1016/j.jocs.2017.01.001.
[38] Stuart, Keith, and Ana Botella. Corpus Linguistics, Network Analysis and Co-occurrence Matrices. In International Journal of English Studies, vol. 9, no. 3, pages 1–20, 2009.
[39] Hutto, Clayton, and Eric Gilbert. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the international AAAI conference on web and social media. Ann Arbor, MI, June 2014, (ICWSM-14), vol. 8, no. 1, pages 216–225, 2014.
[40] Curran, Shelly L., Michael A. Andrykowski, and Jamie L. Studts. Short Form of the Profile of Mood States (POMS-SF): Psychometric information. In Psychological Assessment, vol. 7, no. 1, pages 80–83, 1995. doi: https://doi.org/10.1037/1040-3590.7.1.80.
[41] Muller, Thomas, Ryan Cotterell, Alexander Fraser, and Hinrich Sch¨utze. Joint Lemmatization and Morphological Tagging with LEMMING. In 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon: Association for Computational Linguistics, pages 2268–2274, 2015. doi: https://doi.org/10.18653/v1/D15-1272.
[42] Nan, Qiong, Juan Cao, Yongchun Zhu, Yanyan Wang, and Jintao Li. MDFEND: Multi-domain Fake News Detection. In International Conference on Information and Knowledge Managements. Proceedings of the 30th ACM International Conference on Information & Knowledge Management, pages 3343–3347, 2021. doi: https://doi.org/10.1145/3459637.3482139.
[43] Tabaeeian, Reihaneh Alsadat, and Majid Mohammad Shafiee. Factors Affecting the Motivation of Games Users in Social Networks and their Impact on the User Attitude and Shopping Intention. In New Marketing Research Journal, vol. 12, no. 4, pages 51–68, 2023.
[44] Van Den Burg, Gerrit J. J., and Patrick J. F., Groenen. GenSVM: A generalized multiclass support vector machine. In Journal of Machine Learning Research, vol. 17, no. 224, pages 1–42, 2016.
[45] Denault, Vincent, Pierrich Plusquellec, Louise M.Jupe, Michel St-Yves, Norah E. Dunbar, Maria Hartwig, Siegfried L. Sporer et al. The Analysis of Nonverbal Communication: The Dangers of Pseudoscience in Security and Justice Contexts. In Anuario de Psicolog´ıa Jur´ıdica, vol. 30, no. 1, pages 1–12, 2020. doi: https://doi.org/10.5093/apj2019a9.
[46] Ma, Jing, Wei Gao, Prasenjit Mitra, Sejeong Kwon, Bernard J. Jansen, Kam-Fai Wong, and Meeyoung Cha. Detecting rumors from microblogs with recurrent neural networks. In International Joint Conference on Artificial Intelligence, 2016-
January (IJCAI), pages 3818–3824, 2016.
[47] Ghanem, Bilal, Paolo Rosso, and Francisco Rangel. An emotional analysis of false information in social media and news articles. In ACM Transactions on Internet Technology (TOIT), vol.20, no. 1, pages 1–18, 2020.