Optimizing Deep Learning-Based Hate Speech Classification on Social Media Through Frame Semantic Theory and Hyperparameter Tuning
Main Article Content
Abstract
This study uses deep learning, frame semantic theory, and precise hyperparameter tuning to classify social media hate speech. Three carefully selected hate speech datasets, arranged according to a frame semantic framework, are used to categorize opposing content, helping to better grasp context than simply looking for words. The Synthetic Minority Over-sampling Technique (SMOTE) enhanced minority-class detection to mitigate class imbalance. To find optimal configurations, deep learning architectures, activation functions, and data split algorithms were tested. The best approach used 70% of the data for training and 30% for testing, a model with hidden layers, a Rectifier activation function, and 10 epochs, achieving 96.30% accuracy, 96.95% recall, 99.15% precision, and 0.98 F1-score. These findings show that frame semantic theory, deep learning, and rigorous hyperparameter optimization can significantly improve social media hate speech detection systems. The method lays the foundation for context-aware and socially responsible content control.
Article Details
References
- F. Poletto, V. Basile, M. Sanguinetti, C. Bosco, V. Patti, Resources and Benchmark Corpora for Hate Speech Detection: A Systematic Review, Lang. Resour. Eval. 55 (2020), 477-523. https://doi.org/10.1007/s10579-020-09502-8.
- M.S. Jahan, M. Oussalah, A Systematic Review of Hate Speech Automatic Detection Using Natural Language Processing, Neurocomputing 546 (2023), 126232. https://doi.org/10.1016/j.neucom.2023.126232.
- C.J. Fillmore, Frame semantics, in: Linguistics in the Morning Calm, Hanshin Publishing Co., Seoul, pp. 111–137, 1982.
- M.R. Petruck, Frame Semantics, in: Handbook of Pragmatics, John Benjamins Publishing Company, Amsterdam, pp. 592–601, 2022.
- A. Schmidt, M. Wiegand, A Survey on Hate Speech Detection Using Natural Language Processing, in: Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, Association for Computational Linguistics, Stroudsburg, PA, 2017. https://doi.org/10.18653/v1/w17-1101.
- K. Florio, V. Basile, M. Polignano, P. Basile, V. Patti, Time of Your Hate: The Challenge of Time in Hate Speech Detection on Social Media, Appl. Sci. 10 (2020), 4180. https://doi.org/10.3390/app10124180.
- G. Kovács, P. Alonso, R. Saini, Challenges of Hate Speech Detection in Social Media, SN Comput. Sci. 2 (2021), 95. https://doi.org/10.1007/s42979-021-00457-3.
- B. Chen, Z. Zhang, N. Langrené, S. Zhu, Unleashing the Potential of Prompt Engineering for Large Language Models, arXiv:2310.14735 (2023). https://doi.org/10.1016/j.patter.2025.101260.
- T. Ige, C. Kiekintveld, A. Piplai, An Investigation into the Performances of the State-Of-The-Art Machine Learning Approaches for Various Cyber-Attack Detection: A Survey, in: 2024 IEEE International Conference on Electro Information Technology (eIT), IEEE, 2024, pp. 135-144. https://doi.org/10.1109/eit60633.2024.10609847.
- B. Chen, Z. Zhang, N. Langrené, S. Zhu, Unleashing the Potential of Prompt Engineering for Large Language Models, Patterns 6 (2025), 101260. https://doi.org/10.1016/j.patter.2025.101260.
- X. Zhang, J. Zhao, Y. LeCun, Character-Level Convolutional Networks for Text Classification, arXiv:1509.01626 (2015). https://doi.org/10.48550/arXiv.1509.01626.
- J. Devlin, M.W. Chang, K. Lee, K. Toutanova, in: Proceedings of the 2019 Conference of the North, Association for Computational Linguistics, Stroudsburg, PA, USA, 2019, pp. 4171-4186. https://doi.org/10.18653/v1/N19-1423.
- S. Minaee, N. Kalchbrenner, E. Cambria, N. Nikzad, M. Chenaghlu, et al., Deep Learning--Based Text Classification, ACM Comput. Surv. 54 (2021), 1-40. https://doi.org/10.1145/3439726.
- Q. Li, H. Peng, J. Li, C. Xia, R. Yang, et al., A Survey on Text Classification: From Traditional to Deep Learning, ACM Trans. Intell. Syst. Technol. 13 (2022), 1-41. https://doi.org/10.1145/3495162.
- C.N. Kamath, S.S. Bukhari, A. Dengel, Comparative Study Between Traditional Machine Learning and Deep Learning Approaches for Text Classification, in: Proceedings of the ACM Symposium on Document Engineering 2018, ACM, New York, NY, USA, 2018, pp. 1-11. https://doi.org/10.1145/3209280.3209526.
- J. Liu, W.C. Chang, Y. Wu, Y. Yang, Deep Learning for Extreme Multi-Label Text Classification, in: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, New York, NY, USA, 2017, pp. 115-124. https://doi.org/10.1145/3077136.3080834.
- X. Wang, Y. Zhao, F. Pourpanah, Recent Advances in Deep Learning, Int. J. Mach. Learn. Cybern. 11 (2020), 747-750. https://doi.org/10.1007/s13042-020-01096-5.
- H. Wu, Y. Liu, J. Wang, Review of Text Classification Methods on Deep Learning, Comput. Mater. Contin. 63 (2020), 1309-1321. https://doi.org/10.32604/cmc.2020.010172.
- K. Kowsari, D.E. Brown, M. Heidarysafa, K. Jafari Meimandi, M.S. Gerber, et al., HDLTex: Hierarchical Deep Learning for Text Classification, in: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), IEEE, 2017, pp. 364-371. https://doi.org/10.1109/icmla.2017.0-134.
- D.S.N. Ulum, A.S. Girsang, Hyperparameter Optimization of Long-Short Term Memory Using Symbiotic Organism Search for Stock Prediction, Int. J. Innov. Res. Sci. Stud. 5 (2022), 121-133. https://doi.org/10.53894/ijirss.v5i2.415.
- O. Ussatova, A. Zhumabekova, V. Karyukin, E.T. Matson, N. Ussatov, The Development of a Model for the Threat Detection System with the Use of Machine Learning and Neural Network Methods, Int. J. Innov. Res. Sci. Stud. 7 (2024), 863-877. https://doi.org/10.53894/ijirss.v7i3.2957.
- C. Baker, FrameNet: A Knowledge Base for Natural Language Processing, in: Proceedings of Frame Semantics in NLP: A Workshop in Honor of Chuck Fillmore (1929-2014), Association for Computational Linguistics, Stroudsburg, PA, USA, 2014. https://doi.org/10.3115/v1/w14-3001.
- G. Zymovets, Frame Semantics as an Explanatory Model for Commercial Names Formation, Acta Linguist. Lith. 85 (2021), 245-262. https://doi.org/10.35321/all85-13.
- N.V. Chawla, K.W. Bowyer, L.O. Hall, W.P. Kegelmeyer, SMOTE: Synthetic Minority Over-Sampling Technique, J. Artif. Intell. Res. 16 (2002), 321-357. https://doi.org/10.1613/jair.953.
- B. Dou, Z. Zhu, E. Merkurjev, L. Ke, L. Chen, et al., Machine Learning Methods for Small Data Challenges in Molecular Science, Chem. Rev. 123 (2023), 8736-8780. https://doi.org/10.1021/acs.chemrev.3c00189.
- J. Li, M. Zhou, H. Wu, L. Wang, J. Zhang, et al., Machine Learning‐Assisted Property Prediction of Solid‐State Electrolyte, Adv. Energy Mater. 14 (2024), 2304480. https://doi.org/10.1002/aenm.202304480.
- M. Kulichenko, B. Nebgen, N. Lubbers, J.S. Smith, K. Barros, et al., Data Generation for Machine Learning Interatomic Potentials and Beyond, Chem. Rev. 124 (2024), 13681-13714. https://doi.org/10.1021/acs.chemrev.4c00572.
- A. Zhumabekova, O. Ussatova, V. Karyukin, E.T. Matson, N. Ussatov, et al., The Application of Machine and Deep Learning Models for Detecting Malware Threats in Healthcare Organizations, in: 2025 5th International Conference on Advanced Research in Computing (ICARC), IEEE, 2025, pp. 1-6. https://doi.org/10.1109/icarc64760.2025.10962965.
- T. Davidson, D. Warmsley, M. Macy, I. Weber, Automated Hate Speech Detection and the Problem of Offensive Language, Proc. Int. AAAI Conf. Web Soc. Media 11 (2017), 512-515. https://doi.org/10.1609/icwsm.v11i1.14955.