推文情感分析通过单词嵌入和机器学习技术

论文标题

推文情感分析通过单词嵌入和机器学习技术

Tweets Sentiment Analysis via Word Embeddings and Machine Learning Techniques

论文作者

Sharma, Aditya, Daniels, Alex

论文摘要

社交媒体数据的情感分析包括可以认为是人类思维方式的态度，评估和情感。将大量文档集中到正面和负面方面是一项非常困难的任务。 Twitter，Facebook和Instagram等社交网络提供了一个平台，以收集有关人们情感和观点的信息。考虑到人们每天在社交媒体上花费数小时并分享各种不同主题的看法有助于我们更好地分析情感这一事实。越来越多的公司正在使用社交媒体工具提供各种服务并与客户互动。情感分析（SA）将给定推文的极性分类为正面和负面推文，以了解公众的情感。本文旨在使用功能选择模型Word2VEC和机器学习算法随机森林进行情感分类的“ 2019年选举Twitter数据”进行情感分析。与传统方法（如BOW和TF-IDF）相比，具有随机森林的Word2Vec可显着提高情感分析的准确性。 Word2Vec通过在文本中考虑文字的上下文语义来提高功能的质量，从而提高了机器学习和情感分析的准确性。

Sentiment analysis of social media data consists of attitudes, assessments, and emotions which can be considered a way human think. Understanding and classifying the large collection of documents into positive and negative aspects are a very difficult task. Social networks such as Twitter, Facebook, and Instagram provide a platform in order to gather information about peoples sentiments and opinions. Considering the fact that people spend hours daily on social media and share their opinion on various different topics helps us analyze sentiments better. More and more companies are using social media tools to provide various services and interact with customers. Sentiment Analysis (SA) classifies the polarity of given tweets to positive and negative tweets in order to understand the sentiments of the public. This paper aims to perform sentiment analysis of real-time 2019 election twitter data using the feature selection model word2vec and the machine learning algorithm random forest for sentiment classification. Word2vec with Random Forest improves the accuracy of sentiment analysis significantly compared to traditional methods such as BOW and TF-IDF. Word2vec improves the quality of features by considering contextual semantics of words in a text hence improving the accuracy of machine learning and sentiment analysis.

下载PDF全文

下载文献需遵守相关版权规定

论文标题