Skip to main navigation Skip to search Skip to main content

Speeding DBLP querying using hadoop and spark

  • Jordan University of Science and Technology

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

Big data is becoming bigger every day. Even for simple applications such as the Digital Bibliography & Library Project (DBLP) database, the data is becoming unmanageable using the conventional databases because of its size. Applying big data processing methods such as Hadoop and Spark is becoming more popular because of that. In this work, we investigate the use of Hadoop and Spark in the querying process of big data and we compare the performance of them in terms of their execution time. We use the DBLP database as a case study. Results show that Hadoop and Spark enhances the query execution time significantly when compared with conventional database management systems. We also found that Spark enhances the execution time over Hadoop.

Original languageEnglish
Article number012003
JournalIOP Conference Series: Materials Science and Engineering
Volume459
Issue number1
DOIs
StatePublished - 2018
Externally publishedYes
EventAegean International Textile and Advanced Engineering Conference, AITAE 2018 - Lesvos, Greece
Duration: 5 Sep 20187 Sep 2018

Fingerprint

Dive into the research topics of 'Speeding DBLP querying using hadoop and spark'. Together they form a unique fingerprint.

Cite this