Vestnik of Northern (Arctic) Federal University.
Series "Humanitarian and Social Sciences"
ISSN 2227-6564 e-ISSN 2687-1505 DOI:10.37482/2687-1505
Legal and postal addresses of the publisher: office 1336, 17 Naberezhnaya Severnoy Dviny, Arkhangelsk, 163002, Russian Federation, Northern (Arctic) Federal University named after M.V. Lomonosov
Phone: (818-2) 21-61-21, ext. 18-20
Section: Linguistics
UDC: 81’33:811.161.1
DOI: 10.37482/2227-6564-V038

Authors
Alina V. Malikova
Siberian Federal University; prosp. Svobodnyy 82A, Krasnoyarsk, 660041, Russian Federation; ORCID: https://orcid.org/0000-0002-3438-1839
e-mail: malikovaav1304@gmail.com

Abstract
This article describes the initial stages of a project that aims to design a classifier of Russian-language Internet texts by emotional tonality. To create a sentiment analysis algorithm that attributes texts to one of the 8 basic emotions of Lövheim’s cube model, it is necessary to do the following: carefully select the language material for the training sample; label its tonality with the assistance of an independent expert; carry out an expert linguistic analysis of the data in order to determine the emotion markers; validate the markers using corpus analysis tools; and, if they prove quantitatively significant in the emotion corpora, validate them in a prototype classifier. The author examined the possibility of using non-verbal emotion markers as classification parameters. The linguistic analysis revealed two potential parameters: lexemes written in capital letters and numbers written in figures. Double validation of the identified markers makes it possible to determine which of them improve classification accuracy. The numbers-in-figures marker leads to a 2% increase in the overall accuracy of the sentiment analysis algorithm, as well as to a 7% increase in the classification accuracy for the basic emotion of interest/excitement and a 3% increase for the basic emotions of surprise/startle and enjoyment/joy. Non-verbal markers are slightly less effective for the sentiment analysis of texts than lexical, semantic or punctuation markers, but are as effective as syntactic markers.
The results indicate the need to consider this type of marker alongside verbal markers of emotion and to explore specific non-verbal markers in more detail as potential classifier parameters.

Keywords
Russian-language Internet texts, text classifier, machine learning, non-verbal emotion markers, sentiment analysis
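
The two non-verbal markers named in the abstract (lexemes written in capital letters and numbers written in figures) could be turned into classifier features roughly as follows. This is only an illustrative sketch, not the author's implementation: the abstract does not say how the markers were operationalised, so the regular expressions, the function name `nonverbal_features` and the feature names are all assumptions.

```python
import re

# Assumption: a "lexeme written in capital letters" is a word of two or more
# letters, all upper case, in either Cyrillic or Latin script.
CAPS_WORD = re.compile(r"\b[A-ZА-ЯЁ]{2,}\b")

# Assumption: a "number written in figures" is any run of digits,
# optionally followed by a decimal separator and more digits.
NUMBER = re.compile(r"\d+(?:[.,]\d+)?")


def nonverbal_features(text: str) -> dict:
    """Return hypothetical non-verbal marker features for one text."""
    tokens = text.split()
    n_tokens = max(len(tokens), 1)  # avoid division by zero on empty input
    caps = CAPS_WORD.findall(text)
    nums = NUMBER.findall(text)
    return {
        "caps_count": len(caps),
        "caps_ratio": len(caps) / n_tokens,
        "number_count": len(nums),
        "has_number": bool(nums),
    }


if __name__ == "__main__":
    sample = "Это было ОЧЕНЬ круто, 3 гола за 10 минут!"
    print(nonverbal_features(sample))
```

Features of this kind would normally be appended to the lexical, semantic, punctuation and syntactic features before training, so that their contribution to accuracy can be measured by comparing runs with and without them.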