Skip to main navigation Skip to search Skip to main content

Multi-objectives-based text clustering technique using K-mean algorithm

  • Universiti Sains Malaysia
  • Al-Balqa Applied University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

48 Scopus citations

Abstract

Text documents clustering is a popular unsupervised text mining tool. It is used for partitioning a collection of text documents into similar clusters based on the distance or similarity measure as decided by an objective function. Text clustering algorithm often makes prior assumptions to satisfy objective function, which is optimized either through traditional techniques or meta-heuristic techniques. In text clustering techniques, the right decision for any document distribution is done using an objective function. Normally, clustering algorithms perform poorly when the configuration of the well-formulated objective function is not sound and complete. Therefore, we proposed multi-objectives-based method namely, combine distance and similarity measure for improving the text clustering technique. Multi-objectives text clustering method is combined with two evaluating criteria which emerge as a robust alternative in several situations. In particular, the multi-objective function in the text clustering domain is not a popular, and it is a core issue that affects the performance of the text clustering technique. The performance of multi-objectives function is investigated using the k-mean text clustering technique. The experiments were conducted using seven standard text datasets. The results showed that the proposed multi-objectives based method outperforms the other measures in term of the performance of the text clustering, evaluated by using two common clustering measures, namely, Accuracy and F-measure.

Original languageEnglish
Title of host publicationProceedings - CSIT 2016
Subtitle of host publication2016 7th International Conference on Computer Science and Information Technology
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781467389136
DOIs
StatePublished - 23 Aug 2016
Externally publishedYes
Event7th International Conference on Computer Science and Information Technology, CSIT 2016 - Amman, Jordan
Duration: 13 Jul 201614 Jul 2016

Publication series

NameProceedings - CSIT 2016: 2016 7th International Conference on Computer Science and Information Technology

Conference

Conference7th International Conference on Computer Science and Information Technology, CSIT 2016
Country/TerritoryJordan
CityAmman
Period13/07/1614/07/16

Keywords

  • Clustering performance
  • Distance measure
  • K-mean technique
  • Multi-objectives function
  • Similarity measure
  • Text clustering

Fingerprint

Dive into the research topics of 'Multi-objectives-based text clustering technique using K-mean algorithm'. Together they form a unique fingerprint.

Cite this