TY - GEN
T1 - Multi-objectives-based text clustering technique using K-mean algorithm
AU - Abualigah, Laith Mohammad
AU - Khader, Ahamad Tajudin
AU - Al-Betar, Mohammed Azmi
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/8/23
Y1 - 2016/8/23
N2 - Text documents clustering is a popular unsupervised text mining tool. It is used for partitioning a collection of text documents into similar clusters based on the distance or similarity measure as decided by an objective function. Text clustering algorithm often makes prior assumptions to satisfy objective function, which is optimized either through traditional techniques or meta-heuristic techniques. In text clustering techniques, the right decision for any document distribution is done using an objective function. Normally, clustering algorithms perform poorly when the configuration of the well-formulated objective function is not sound and complete. Therefore, we proposed multi-objectives-based method namely, combine distance and similarity measure for improving the text clustering technique. Multi-objectives text clustering method is combined with two evaluating criteria which emerge as a robust alternative in several situations. In particular, the multi-objective function in the text clustering domain is not a popular, and it is a core issue that affects the performance of the text clustering technique. The performance of multi-objectives function is investigated using the k-mean text clustering technique. The experiments were conducted using seven standard text datasets. The results showed that the proposed multi-objectives based method outperforms the other measures in term of the performance of the text clustering, evaluated by using two common clustering measures, namely, Accuracy and F-measure.
AB - Text documents clustering is a popular unsupervised text mining tool. It is used for partitioning a collection of text documents into similar clusters based on the distance or similarity measure as decided by an objective function. Text clustering algorithm often makes prior assumptions to satisfy objective function, which is optimized either through traditional techniques or meta-heuristic techniques. In text clustering techniques, the right decision for any document distribution is done using an objective function. Normally, clustering algorithms perform poorly when the configuration of the well-formulated objective function is not sound and complete. Therefore, we proposed multi-objectives-based method namely, combine distance and similarity measure for improving the text clustering technique. Multi-objectives text clustering method is combined with two evaluating criteria which emerge as a robust alternative in several situations. In particular, the multi-objective function in the text clustering domain is not a popular, and it is a core issue that affects the performance of the text clustering technique. The performance of multi-objectives function is investigated using the k-mean text clustering technique. The experiments were conducted using seven standard text datasets. The results showed that the proposed multi-objectives based method outperforms the other measures in term of the performance of the text clustering, evaluated by using two common clustering measures, namely, Accuracy and F-measure.
KW - Clustering performance
KW - Distance measure
KW - K-mean technique
KW - Multi-objectives function
KW - Similarity measure
KW - Text clustering
UR - https://www.scopus.com/pages/publications/84987680119
U2 - 10.1109/CSIT.2016.7549464
DO - 10.1109/CSIT.2016.7549464
M3 - Conference contribution
AN - SCOPUS:84987680119
T3 - Proceedings - CSIT 2016: 2016 7th International Conference on Computer Science and Information Technology
BT - Proceedings - CSIT 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Computer Science and Information Technology, CSIT 2016
Y2 - 13 July 2016 through 14 July 2016
ER -