A novel hybrid approach for sentiment classification of TURKISH tweets for GSM operators

The increase in the amount of content shared on social media makes it difficult to extract meaningful information from scientific studies. Accordingly in recent years researchers have been working extensively on sentiment analysis studies for the automatic evaluation of social media data. One of the...

Full description

Bibliographic Details
Main Authors: Yelmen, Ilkay, Zontul, Metin, Kaynar, Oǧuz, Sönmez, Ferdi
Format: Article in Journal/Newspaper
Language:English
Published: North Atlantic University Union 2018
Subjects:
Online Access:https://hdl.handle.net/20.500.12469/1784
Description
Summary:The increase in the amount of content shared on social media makes it difficult to extract meaningful information from scientific studies. Accordingly in recent years researchers have been working extensively on sentiment analysis studies for the automatic evaluation of social media data. One of the focuses of these studies is sentiment analysis on tweets. The more tweets are available the more features in terms of words exist. This leads to the curse of dimensionality and sparsity resulting in a decrease in the success of the classification. In this study Gini Index Information Gain and Genetic Algorithm (GA) are used for feature selection and Support Vector Machines (SVMs) Artificial Neural Networks (ANN) and Centroid Based classification algorithms are used for the classification of Turkish tweets obtained from 3 different GSM operators. The feature selection methods are combined with the classification methods to investigate the effect on the success rate of analysis. Especially when the SVMs are used with the GA as a hybrid 96.8% success has been achieved for the classification of the tweets as positive or negative. © 2018 North Atlantic University Union. All rights reserved.