A comparative study of the class imbalance problem in Twitter spam detection
Citations Over TimeTop 10% of 2017 papers
Abstract
Summary Recently, online social network (OSN) such as Twitter has become an important and popular source for real‐time information and news dissemination, and Twitter is inevitably a prime target of spammers. It has been showed that the security threats caused by Twitter spam can reach far beyond the social media platform itself. To mitigate the damage caused by Twitter spam, machine learning classification algorithms have been employed by researchers and communities to detect the Twitter spam. However, most of these studies have overlooked the class imbalance problem in Twitter spam detection. In this paper, we have studied the class imbalance problem in Twitter spam detection. Firstly, we have conducted a comparative study regarding some popular methods in handling the class imbalance problem in order to identify the most effective approach for addressing the class imbalance problem. Then, we have conducted another comparative study from Twitter spam detection based on several classic techniques. Experimental results demonstrate that a fuzy‐based ensemble learning can significantly improve the classification performance on imbalance ground truth Twitter data.
Related Papers
- → Definition of spam 2.0: New spamming boom(2010)44 cited
- → The harvester, the botmaster, and the spammer(2014)31 cited
- NEIGHBORWATCHER: A Content-Agnostic Comment Spam Inference System.(2013)
- → Detection of Web Spambot in the Presence of Decoy Actions(2014)3 cited
- → Handling Web Spamming Using Logic Approach(2018)1 cited