Skip to main navigation Skip to search Skip to main content

Feature Selection with β-Hill Climbing Search for Text Clustering Application

  • Laith Mohammad Abualigah
  • , Ahamad Tajudin Khader
  • , Mohammed Azmi Al-Betar
  • , Zaid Abdi Alkareem Alyasseri
  • , Osama Ahmad Alomari
  • , Essam Said Hanandeh
  • Universiti Sains Malaysia
  • Al-Balqa Applied University
  • University of Kufa
  • Zarqa University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

40 Scopus citations

Abstract

In the bases of increasing the volume of text information, the dealing with text information has become incredibly complicated. The text clustering is a suitable technique used in dealing with a tremendous amount of text documents by classifying these set of text documents into clusters. Ultimately, text documents hold sparse, non-uniform distribution and uninformative features are difficult to cluster. The text feature selection is a primary unsupervised learning method that is utilized to choose a new subset of informational text features. In this paper, a new algorithm is proposed based on β-hill climbing technique for text feature selection problem to improve the text clustering (B-FSTC). The results of the proposed method for β-hill climbing and original Hill climbing (i.e., H-FSTC) are examined using the k-mean text clustering and compared with each other. Experiments were conducted on four standard text datasets with varying characteristics. Interestingly, the proposed β-hill climbing algorithm obtains superior results in comparison with the other well-regard techniques by producing a new subset of informational text features. Lastly, the β-hill climbing-based feature selection method supports the k-mean clustering algorithm to achieve more precise clusters.

Original languageEnglish
Title of host publicationProceedings - 2017 Palestinian International Conference on Information and Communication Technology, PICICT 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages22-27
Number of pages6
ISBN (Electronic)9781509065387
DOIs
StatePublished - 14 Sep 2017
Externally publishedYes
Event2nd Palestinian International Conference on Information and Communication Technology, PICICT 2017 - Gaza, Gaza Strip, Palestine, State of
Duration: 8 May 20179 May 2017

Publication series

NameProceedings - 2017 Palestinian International Conference on Information and Communication Technology, PICICT 2017

Conference

Conference2nd Palestinian International Conference on Information and Communication Technology, PICICT 2017
Country/TerritoryPalestine, State of
CityGaza, Gaza Strip
Period8/05/179/05/17

Keywords

  • Informative features
  • K-mean Text document Clustering
  • Unsupervised Feature Selection
  • β-Hill Climbing

Fingerprint

Dive into the research topics of 'Feature Selection with β-Hill Climbing Search for Text Clustering Application'. Together they form a unique fingerprint.

Cite this