Neuerscheinungen 2010Stand: 2020-01-07 |
Schnellsuche
ISBN/Stichwort/Autor
|
Herderstraße 10 10625 Berlin Tel.: 030 315 714 16 Fax 030 315 714 14 info@buchspektrum.de |
Abdelwadood Mesleh
SUPPORT VECTOR MACHINE TEXT CLASSIFIER FOR ARABIC ARTICLES
USING ANT COLONY OPTIMIZATION-BASED FEATURE SUBSET SELECTION
2010. 132 S.
Verlag/Jahr: VDM VERLAG DR. MÜLLER 2010
ISBN: 3-639-27141-6 (3639271416)
Neue ISBN: 978-3-639-27141-6 (9783639271416)
Preis und Lieferzeit: Bitte klicken
In this book, we have implemented a support vector machine (SVM) text classifier for Arabic articles. Experimental results show that the SVM classifier outperformed Na‹ve Bayesian (NB) and k-nearest neighbor (kNN) classifiers. We investigated the effectiveness of six state-of-the-art feature subset selection (FSS) methods, which are commonly used in text classification (TC) tasks, for our Arabic SVM text classification system. We implemented an Ant Colony Optimization Based-Feature Subset Selection (ACO Based-FSS) method for our Arabic SVM text classifier. The proposed FSS method adapted Chi-square statistic as heuristic information and the effectiveness of the SVM classifier as a guide to improving the selection of features for each category. Compared to the six state-of-the-art FSS methods, our ACO Based-FSS algorithm achieved better TC effectiveness. Evaluation used an in-house Arabic TC corpus that consists of 1445 documents independently classified into nine categories. The experimental results were presented in terms of macro-averaging precision, macro-averaging recall and macro-averaging F1 measures.
Dr. Mesleh received his Ph.D. in CIS in 2008, and his BSc and MSc in Computer Engineering in 1995 & 1998 respectively. His research interests are Arabic NLP, IR, Feature Selection and Speech Recognition. He participated in peer paper revision for many Journals. He is a fluent speaker of Arabic, English and Chinese.