TY - GEN
T1 - Using GPUs to speed-up Levenshtein edit distance computation
AU - Balhaf, Khaled
AU - Shehab, Mohammed A.
AU - Al-Sarayrah, Wala'a T.
AU - Al-Ayyoub, Mahmoud
AU - Al-Saleh, Mohammed
AU - Jararweh, Yaser
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/5/20
Y1 - 2016/5/20
N2 - Sequence comparison problems such as sequence alignment and approximate string matching are part of the fundamental problems in many fields such as natural language processing, data mining and bioinformatics. However, the algorithms proposed to address these problems suffer from high computational complexities prohibiting them from being widely used in practical large-scale settings. Many researchers used parallel programming to reduce the execution time of these algorithms. In this paper, we follow this approach and use the parallelism capabilities of the Graphics Processing Unit (GPU) to accelerate one of the most common algorithms to compute the edit distance between two strings, which is known as the Levenshtein distance. To take full advantage of the large number of cores in a GPU, we employ a diagonal-based tracing technique which results in even greater improvements in terms of the running time. In fact, our CUDA implementation of the Levenshtein algorithm is about 11X faster than the sequential implementation. This is achieved without affecting the accuracy.
AB - Sequence comparison problems such as sequence alignment and approximate string matching are part of the fundamental problems in many fields such as natural language processing, data mining and bioinformatics. However, the algorithms proposed to address these problems suffer from high computational complexities prohibiting them from being widely used in practical large-scale settings. Many researchers used parallel programming to reduce the execution time of these algorithms. In this paper, we follow this approach and use the parallelism capabilities of the Graphics Processing Unit (GPU) to accelerate one of the most common algorithms to compute the edit distance between two strings, which is known as the Levenshtein distance. To take full advantage of the large number of cores in a GPU, we employ a diagonal-based tracing technique which results in even greater improvements in terms of the running time. In fact, our CUDA implementation of the Levenshtein algorithm is about 11X faster than the sequential implementation. This is achieved without affecting the accuracy.
UR - https://www.scopus.com/pages/publications/84973868382
U2 - 10.1109/IACS.2016.7476090
DO - 10.1109/IACS.2016.7476090
M3 - Conference contribution
AN - SCOPUS:84973868382
T3 - 2016 7th International Conference on Information and Communication Systems, ICICS 2016
SP - 80
EP - 84
BT - 2016 7th International Conference on Information and Communication Systems, ICICS 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Information and Communication Systems, ICICS 2016
Y2 - 5 April 2016 through 7 April 2016
ER -