Abstract
Intense competition in the news industry puts editors under pressure to post articles that are likely to attract user attention. Anticipating the popularity of a news article can help editorial teams decide whether to post it. Article similarity, extracted from articles posted within a short period of time, has been found to be a useful feature in existing popularity prediction approaches. This work proposes a new approach that estimates the popularity of news articles by adding semantics to the similarity-based approach to popularity estimation. The proposed semantically enriched model estimates news popularity by measuring the cosine similarity between document embeddings of news articles. A Word2vec model is used to generate distributed representations of the news content. In this work, we define popularity as the number of times a news article is posted on different websites. We collect data from websites that post cybersecurity news and estimate the popularity of that news. The proposed approach is compared with other models and is shown to outperform them.
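The core idea of comparing document embeddings by cosine similarity can be illustrated with a minimal sketch. It is not the authors' implementation: the tiny `TOY_VECTORS` table is a hypothetical stand-in for trained Word2vec vectors, averaging word vectors into a document embedding is an assumed pooling choice, and the `threshold` value and the helper names are illustrative only. Popularity is approximated here as the number of other articles whose embedding is close to the target's, in the spirit of the definition above (how often an article appears across websites).

```python
import math

# Hypothetical 3-dimensional word vectors standing in for trained
# Word2vec embeddings (values are made up for illustration).
TOY_VECTORS = {
    "ransomware": [0.9, 0.1, 0.0],
    "malware": [0.8, 0.2, 0.0],
    "attack": [0.7, 0.3, 0.1],
    "hospital": [0.2, 0.8, 0.1],
    "patch": [0.1, 0.2, 0.9],
    "update": [0.0, 0.3, 0.8],
}


def document_embedding(tokens, vectors):
    """Average the vectors of known tokens (an assumed pooling choice)."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None  # no known words, so no embedding can be built
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def estimate_popularity(index, articles, vectors, threshold=0.95):
    """Count the other articles whose embedding is at least `threshold`
    similar to the article at `index` (threshold is an assumption)."""
    target = document_embedding(articles[index], vectors)
    if target is None:
        return 0
    count = 0
    for j, tokens in enumerate(articles):
        if j == index:
            continue
        other = document_embedding(tokens, vectors)
        if other is not None and cosine_similarity(target, other) >= threshold:
            count += 1
    return count


# Tokenized toy articles, imagined as posts collected from different websites.
articles = [
    ["ransomware", "attack", "hospital"],
    ["malware", "attack", "hospital"],
    ["patch", "update"],
]

for i in range(len(articles)):
    print(i, estimate_popularity(i, articles, TOY_VECTORS))
```

In this toy data the first two articles describe the same event with different wording, so an embedding-based comparison can still match them, whereas an exact word-overlap measure would score them lower. In practice the vectors would come from a trained Word2vec model rather than a hand-written table.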
| Original language | English |
|---|---|
| Pages (from-to) | 533-547 |
| Number of pages | 15 |
| Journal | CMES - Computer Modeling in Engineering and Sciences |
| Volume | 127 |
| Issue number | 2 |
| DOIs | |
| State | Published - 2021 |
| Externally published | Yes |
Keywords
- Cosine similarity
- Embeddings
- Popularity
- Semantics
- Word2vec
Title
An automated system to predict popular cybersecurity news using document embeddings