Abstract—The continuing explosive growth of textual content within the World Wide Web has given rise to the need for sophisticated Text Classification (TC) techniques that combine efficiency with high quality of results. E-mail filtering and email organization is an application rife with the potential to streamline the management of the vast amount of information that accumulates in the inbox. Even though a large body of research has delved into this area, there is a paucity of survey that indicates trends and directions. This paper attempts to categorize the prevalent popular techniques for classifying email as spam or legitimate and suggests possible techniques to fill in the lacunae in the arena of automatic management of emails. Our findings suggest that context-based email organization has the most potential in improving quality by learning various contexts such as n-gram phrases, linguistic constructs or users’ profile based context to tailor his/her filtering scheme.
Index Terms—Context Based TC, Context Interpretation, Email Management, Statistical TC.
Upasana Pandey, (e-mail:firstname.lastname@example.org) S. Chakraverty, (e-mail:email@example.com) Division of Computer Engineering, Netaji Subhas Inst. of Technology, New Delhi-110078
Cite: Upasana Pandey and S. Chakraverty, "A Review of Text Classification Approaches for E-mail Management," International Journal of Engineering and Technology vol. 3, no. 2, pp. 137-144, 2011.