
Multi-Document News Web Page Summarization Using Content Extraction and Lexical Chain Based Key Phrase Extraction

  • Chandrakala Arya
  • Manoj Diwakar
  • Prabhishek Singh
  • Vijendra Singh
  • Seifedine Kadry
  • Jungeun Kim
  • Graphic Era Hill University
  • Graphic Era
  • Bennett University
  • University of Petroleum and Energy Studies
  • Noroff University College
  • Lebanese American University
  • Kongju National University

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

Text summarization has advanced significantly in recent years, and current work focuses increasingly on news summarization. Producing a summary of multiple news articles in the presence of erroneous online data requires an approach that can extract, compare, and rank sentences. Because articles on the same event overlap heavily, a news summarization system must also handle multi-document input and the content redundancy that comes with it. This paper presents a method for summarizing multi-document news web pages based on similarity models and sentence ranking, in which relevant sentences are extracted from the original articles. English-language articles covering the same topic and event are collected from five news websites. In our experiments, the approach yields better results than other recent news summarization methods.
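
The general idea of similarity-based sentence ranking with redundancy filtering can be sketched as follows. This is a minimal illustration only, not the method of the paper: it assumes plain term-frequency vectors, cosine similarity against a centroid of all sentences, and a greedy filter for near-duplicates. The function names (`tokenize`, `cosine`, `summarize`) and the parameters `max_sentences` and `redundancy_threshold` are hypothetical choices made for this sketch.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", sentence.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def summarize(documents, max_sentences=2, redundancy_threshold=0.6):
    """Rank sentences pooled from several documents and skip near-duplicates.

    Assumption of this sketch: a sentence is relevant when it is similar
    to the centroid (summed term frequencies) of all pooled sentences.
    """
    # Pool the sentences of every document into one candidate list.
    sentences = []
    for doc in documents:
        parts = re.split(r"(?<=[.!?])\s+", doc)
        sentences.extend(p.strip() for p in parts if p.strip())

    vectors = [Counter(tokenize(s)) for s in sentences]
    centroid = Counter()
    for vec in vectors:
        centroid.update(vec)

    # Rank candidates by similarity to the centroid, most similar first.
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: cosine(vectors[i], centroid),
        reverse=True,
    )

    # Greedily keep a sentence only if it is not too close to one already kept.
    chosen = []
    for i in ranked:
        if all(cosine(vectors[i], vectors[j]) < redundancy_threshold for j in chosen):
            chosen.append(i)
        if len(chosen) == max_sentences:
            break

    # Return the selected sentences in their original pooled order.
    return [sentences[i] for i in sorted(chosen)]
```

When two articles contain the same sentence, the redundancy filter keeps only one copy, which is the behaviour that multi-document input calls for. A real system would likely replace raw term frequencies with a stronger similarity model and add features such as sentence length or key phrases.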

Original language: English
Article number: 1762
Journal: Mathematics
Volume: 11
Issue number: 8
State: Published - Apr 2023

Keywords

  • ROUGE
  • extractive summarization
  • keyphrase extraction
  • multi-document summarization
  • news web page summarization
  • sentence length
  • sentence ranking
  • similarity measure
