TY - GEN
T1 - Unsupervised text feature selection technique based on particle swarm optimization algorithm for improving the text clustering
AU - Abualigah, Laith Mohammad
AU - Khader, Ahamad Tajudin
AU - Al-Betar, Mohammed Azmi
AU - Hanandeh, Essam Said
PY - 2017/2/27
Y1 - 2017/2/27
N2 - After incensing the amount of text information on internet web pages, the dealing with this information is very complex due to the volume of information. Text clustering technique is an appropriate task to deal with a huge amount of text documents by grouping set of documents into groups. Text documents contain uninformative features, which decrease the performance of the text clustering technique. Feature selection is an unsupervised technique used to select informative features by creating a new subset of informative features. This technique used to improve the performance of the underlying algorithm. Latterly, several complex optimization problems are success solved by meta- heuristic algorithms. In this paper, we proposed the Particle swarm optimization algorithm to solve the feature selection problem, namely, (FSPSOTC). The feature selection technique encourages the k-mean text clustering technique to obtain more accurate clusters. Experiments were conducted using four standard benchmark text datasets with different characteris-tics. Experimental results showed that the proposed method (FSPSOTC) is enhanced the performance of the text clustering technique by dealing with a new subset of informative features.
AB - After incensing the amount of text information on internet web pages, the dealing with this information is very complex due to the volume of information. Text clustering technique is an appropriate task to deal with a huge amount of text documents by grouping set of documents into groups. Text documents contain uninformative features, which decrease the performance of the text clustering technique. Feature selection is an unsupervised technique used to select informative features by creating a new subset of informative features. This technique used to improve the performance of the underlying algorithm. Latterly, several complex optimization problems are success solved by meta- heuristic algorithms. In this paper, we proposed the Particle swarm optimization algorithm to solve the feature selection problem, namely, (FSPSOTC). The feature selection technique encourages the k-mean text clustering technique to obtain more accurate clusters. Experiments were conducted using four standard benchmark text datasets with different characteris-tics. Experimental results showed that the proposed method (FSPSOTC) is enhanced the performance of the text clustering technique by dealing with a new subset of informative features.
KW - Informative features
KW - K-mean text clustering technique
KW - Particle swarm optimization algorithm
KW - Unsupervised feature selection
UR - https://www.scopus.com/pages/publications/85088756468
U2 - 10.4108/eai.27-2-2017.152282
DO - 10.4108/eai.27-2-2017.152282
M3 - Conference contribution
AN - SCOPUS:85088756468
T3 - COMPSE 2016 - 1st EAI International Conference on Computer Science and Engineering
BT - COMPSE 2016 - 1st EAI International Conference on Computer Science and Engineering
A2 - Vasant, Pandian
A2 - Duy, Vo Hoang
PB - EAI
T2 - 1st EAI International Conference on Computer Science and Engineering, COMPSE 2016
Y2 - 11 November 2016 through 12 November 2016
ER -