A Multi-task Model for Sentiment Aided Cyberbullying Detection in Code-Mixed Indian Languages

Maity, Krishanu; Saha, Sriparna

doi:10.1007/978-3-030-92273-3_36

Krishanu Maity¹³ &
Sriparna Saha¹³

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 13111))

Included in the following conference series:

International Conference on Neural Information Processing

2178 Accesses
8 Citations

Abstract

With the expansion of digital sphere and advancement of technology, cyberbullying has become increasingly common, especially among teenagers. In this work, we have created a benchmark Hindi-English code-mixed corpus called BullySent, annotated with bully and sentiment labels for investigating how sentiment label information helps to identify cyberbully in a better way. For a vast portion of India, both of these languages constitute the primary means of communication, and language mixing is common in everyday speech. A multi-task framework called MT-BERT+VecMap based on two different embedding schemes for the efficient representations of code-mixed data, has been developed. Our proposed multi-task framework outperforms all the single-task baselines with the highest accuracy values of 81.12(+/−1.65)% and 77.46(+/−0.99)% for the cyberbully detection task and sentiment analysis task, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

COVID-19 and cyberbullying: deep ensemble model to identify cyberbullying from code-switched languages during the pandemic

Article 08 January 2022

Deep learning based sentiment analysis and offensive language identification on multilingual code-mixed data

Article Open access 13 December 2022

BERT-Capsule Model for Cyberbullying Detection in Code-Mixed Indian Languages

Notes

References

Artetxe, M., Labaka, G., Agirre, E.: Learning bilingual word embeddings with (almost) no bilingual data. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 451–462 (2017)
Google Scholar
Badjatiya, P., Gupta, S., Gupta, M., Varma, V.: Deep learning for hate speech detection in tweets. In: Proceedings of the 26th International Conference on World Wide Web Companion, pp. 759–760 (2017)
Google Scholar
Bohra, A., Vijay, D., Singh, V., Akhtar, S.S., Shrivastava, M.: A dataset of hindi-english code-mixed social media text for hate speech detection. In: Proceedings of the Second Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media, pp. 36–41 (2018)
Google Scholar
Caruana, R.: Multitask learning. Mach. Learn. 28(1), 41–75 (1997)
Google Scholar
Chauhan, D.S., Dhanush, S., Ekbal, A., Bhattacharyya, P.: Sentiment and emotion help sarcasm? a multi-task learning framework for multi-modal sarcasm, sentiment and emotion analysis. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 4351–4360 (2020)
Google Scholar
Cho, K., Van Merriënboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259 (2014)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Dinakar, K., Reichart, R., Lieberman, H.: Modeling the detection of textual cyberbullying. In: Proceedings of the International Conference on Weblog and Social Media 2011. Citeseer (2011)
Google Scholar
Grave, E., Bojanowski, P., Gupta, P., Joulin, A., Mikolov, T.: Learning word vectors for 157 languages. arXiv preprint arXiv:1802.06893 (2018)
Gupta, D., Ekbal, A., Bhattacharyya, P.: A deep neural network based approach for entity extraction in code-mixed indian social media text. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (2018)
Google Scholar
Khapra, M.M., Ramanathan, A., Kunchukuttan, A., Visweswariah, K., Bhattacharyya, P.: When transliteration met crowdsourcing: an empirical study of transliteration via crowdsourcing using efficient, non-redundant and fair quality control. In: LREC, pp. 196–202. Citeseer (2014)
Google Scholar
Myers-Scotton, C.: Duelling Languages: Grammatical Structure in Codeswitching. Oxford University Press, Oxford (1997)
Google Scholar
Pang, B., Lee, L.: Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. arXiv preprint cs/0506075 (2005)
Google Scholar
Reynolds, K., Kontostathis, A., Edwards, L.: Using machine learning to detect cyberbullying. In: 2011 10th International Conference on Machine Learning and Applications and Workshops, vol. 2, pp. 241–244. IEEE (2011)
Google Scholar
Singh, A., Saha, S., Hasanuzzaman, M., Dey, K.: Multitask learning for complaint identification and sentiment analysis. Cognitive Computation, pp. 1–16 (2021)
Google Scholar
Smith, P.K., Mahdavi, J., Carvalho, M., Fisher, S., Russell, S., Tippett, N.: Cyberbullying: its nature and impact in secondary school pupils. J. Child Psychol. Psychiatry 49(4), 376–385 (2008)
Article Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Google Scholar
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1480–1489 (2016)
Google Scholar

Download references

Acknowledgement

The Authors would like to acknowledge the support of Ministry of Home Affairs (MHA), India for conducting this research.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology, Patna, India
Krishanu Maity & Sriparna Saha

Authors

Krishanu Maity
View author publications
You can also search for this author in PubMed Google Scholar
Sriparna Saha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Krishanu Maity .

Editor information

Editors and Affiliations

Sampoerna University, Jakarta, Indonesia
Teddy Mantoro
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee
Sampoerna University, Jakarta, Indonesia
Media Anugerah Ayu
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Universitas Indonesia, Depok, Indonesia
Achmad Nizar Hidayanto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Maity, K., Saha, S. (2021). A Multi-task Model for Sentiment Aided Cyberbullying Detection in Code-Mixed Indian Languages. In: Mantoro, T., Lee, M., Ayu, M.A., Wong, K.W., Hidayanto, A.N. (eds) Neural Information Processing. ICONIP 2021. Lecture Notes in Computer Science(), vol 13111. Springer, Cham. https://doi.org/10.1007/978-3-030-92273-3_36

Download citation

DOI: https://doi.org/10.1007/978-3-030-92273-3_36
Published: 05 December 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92272-6
Online ISBN: 978-3-030-92273-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Multi-task Model for Sentiment Aided Cyberbullying Detection in Code-Mixed Indian Languages

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

COVID-19 and cyberbullying: deep ensemble model to identify cyberbullying from code-switched languages during the pandemic

Deep learning based sentiment analysis and offensive language identification on multilingual code-mixed data

BERT-Capsule Model for Cyberbullying Detection in Code-Mixed Indian Languages

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

A Multi-task Model for Sentiment Aided Cyberbullying Detection in Code-Mixed Indian Languages

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

COVID-19 and cyberbullying: deep ensemble model to identify cyberbullying from code-switched languages during the pandemic

Deep learning based sentiment analysis and offensive language identification on multilingual code-mixed data

BERT-Capsule Model for Cyberbullying Detection in Code-Mixed Indian Languages

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation