A Comparative Study of Different State-of-the-Art Hate Speech Detection Methods in Hindi-English Code-Mixed Data
Arrow@dit (Dublin Institute of Technology)2020pp. 42–48
Citations Over TimeTop 10% of 2020 papers
Priya Rani, Shardul Suryawanshi, Koustava Goswami, Bharathi Raja Chakravarthi, Theodorus Fransen, John P. McCrae
Abstract
Hate speech detection in social media communication has become one of the primary concerns to avoid conflicts and curb undesired activities. In an environment where multilingual speakers switch among multiple languages, hate speech detection becomes a challenging task using methods that are designed for monolingual corpora. In our work, we attempt to analyze, detect and provide a comparative study of hate speech in a code-mixed social media text. We also provide a Hindi-English code-mixed data set consisting of Facebook and Twitter posts and comments. Our experiments show that deep learning models trained on this code-mixed corpus perform better.
Related Papers
- → Named Entity Recognition for Code-Mixed Indian Corpus using Meta Embedding(2020)94 cited
- → A Survey of Current Datasets for Code-Switching Research(2020)93 cited
- → Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages(2019)42 cited
- → Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text(2020)109 cited
- → A Sentiment Analysis Dataset for Code-Mixed Malayalam-English(2020)38 cited