Abstract
The Map Reduce paradigm is now considered a standard platform that is used for large-scale data processing and management. A major operation that the Map Reduce platform relies on greatly is tasks scheduling. Although many schedulers have been presented, task scheduling is still one of the major problems that face Map Reduce frameworks. Schedulers need to maintain data locality to achieve an acceptable performance by avoiding several data transmissions. Hence, in this paper, we propose a new scheduling algorithm named 'MTL' that utilises multi-threading principles. The MTL scheduler assigns a dedicated thread for each data block. Indeed, the multi-threading approach shows great results that make our MTL scheduler a scalable one that performs well. At the same time, it maintains the locality property. During the evaluation of the MTL scheduler performance, two main factors were taken into consideration; the simulation time and the energy consumption. The MTL scheduler is then compared with other existing schedulers such as FIFO, matchmaking, and delay schedulers. The MTL scheduler showed favourable results and proved its advantages over other existing schedulers.
| Original language | English |
|---|---|
| Pages (from-to) | 44-54 |
| Number of pages | 11 |
| Journal | International Journal of Computational Science and Engineering |
| Volume | 14 |
| Issue number | 1 |
| DOIs | |
| State | Published - 2017 |
| Externally published | Yes |
Keywords
- Clustering
- Computational science
- Hadoop
- Map reduce
- Multi-threading
- Scalability
- Schedulers
Fingerprint
Dive into the research topics of 'A scalable Map Reduce tasks scheduling: A threading-based approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver