Development of Proposed Ensemble Model for Spam e-mail Classification

Akhilesh Kumar Shrivas; Amit Kumar Dewangan; S M Ghosh; Devendra Singh

doi:10.5755/j01.itc.50.3.27349

Authors

Akhilesh Kumar Shrivas Guru Ghasidas Vishwavidyalaya, Bilaspur, Chhattisgarh
Amit Kumar Dewangan Dr. C. V. Raman University Bilaspur, Chhattisgarh
S M Ghosh Dr. C. V. Raman University Bilaspur, Chhattisgarh
Devendra Singh

DOI:

https://doi.org/10.5755/j01.itc.50.3.27349

Keywords:

Ensemble Model (E-Model), Classification, Cross Validation, Spam E-mail, Text Mining

Abstract

Spam e-mail documents classification is a very challenging task for e-mail users, especially non IT users. Billions of people using the internet and face the problem of spam e-mails. The automatic identification and classification of spam e-mails help to reduce the problem of e-mail users in managing a large amount of e-mails. This work aims to do a significant contribution by building a robust model for classification of spam e-mail documents using data mining techniques. In this paper, we use Enorn1 data set which consists of spam and ham documents collected from Kaggle repository. We propose an Ensemble Model-1 that is an ensemble of Multilayer Perceptron (MLP), Naïve Bayes and Random Forest (RF) to obtain better accuracy for the classification of spam and hame-mail documents. Experimental results reveal that the proposed Ensemble Model-1 outperforms other existing classifiers as well as other proposed ensemble models in terms of classification accuracy. The suggested and proposed Ensemble Model-1 produces a high accuracy of 97.25% for classification of spam e-mail documents.

Development of Proposed Ensemble Model for Spam e-mail Classification

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

crossref2

crossref

Information