Volume 3 , Issue 1, January 2023

A Method for SMS Spam Message Detection Using Machine Learning

Abstract

 In recent years, it has become increasingly common for individuals to connect with their relatives and friends, read the most recent news, and discuss various social topics by using online social platforms such as Twitter and Facebook. As a consequence of this, anything that is considered spam can quickly spread among them. The identification of spam is widely regarded as one of the most significant problems involved in text analysis. Previous studies on the detection of spam concentrated primarily on English-language content and paid little attention to other languages. The information gathered by the University of California; Irvine served as the basis for the development of our spam detection technology (UCI). In this research study, the effectiveness of various supervised machine learning algorithms, such as the J48, KNN, and DT, in identifying spam and ham communications is investigated. SMS spam is becoming more widespread as the number of individuals who use the internet continues to rise and an increasing number of businesses disclose their customers' personal information. E-mail spam filtering is the progenitor of SMS spam filtering, which inherits a significant number of its features. We evaluate the proposed method based on accuracy, recall, and precision. Experiments showed that DT has obtained higher accuracy compared to other machine learning classifiers.

References

  1. Bakshy, E., Rosenn, I., Marlow, C., & Adamic, L. (2012, April). The role of social networks in information diffusion. In Proceedings of the 21st international conference on World Wide Web (pp. 519-528).
  2. Luo, Y., & Xu, X. (2021). Comparative study of deep learning models for analyzing online restaurant reviews in the era of the COVID-19 pandemic. International Journal of Hospitality Management, 94, 102849.
  3. Rao, S., Verma, A. K., & Bhatia, T. (2021). A review on social spam detection: challenges, open issues, and future directions. Expert Systems with Applications, 186, 115742.
  4. Gao, J., Lanchantin, J., Soffa, M. L., & Qi, Y. (2018, May). Black-box generation of adversarial text sequences to evade deep learning classifiers. In 2018 IEEE Security and Privacy Workshops (SPW) (pp. 50-56). IEEE.
  5. Jain, G., Sharma, M., & Agarwal, B. (2019). Optimizing semantic LSTM for spam detection. International Journal of Information Technology, 11(2), 239-250.
  6. Jain, A. K., Yadav, S. K., & Choudhary, N. (2020). A novel approach to detect spam and smishing SMS using machine learning techniques. International Journal of E-Services and Mobile Applications (IJESMA), 12(1), 21-38.
  7. Almeida, T. A., Hidalgo, J. M. G., & Yamakami, A. (2011, September). Contributions to the study of SMS spam filtering: new collection and results. In Proceedings of the 11th ACM symposium on Document engineering (pp. 259-262).
  8. Cormack, G. V. (2008). Email spam filtering: A systematic review. Foundations and Trends® in Information Retrieval, 1(4), 335-455.
  9. Ji, H., & Zhang, H. (2015). Analysis on the content features and their correlation of web pages for spam detection. China Communications, 12(3), 84-94.
Published February 14, 2023
Keywords
  • Short Message Service,
  • Data Cleaning,
  • Spam Detection,
  • Machine Learning
How to Cite
Saeed, vaman A. (2023). A Method for SMS Spam Message Detection Using Machine Learning. Artificial Intelligence & Robotics Development Journal, 3(1), 214-228. https://doi.org/10.52098/airdj.202366