Header menu link for other important links
X
Developing an efficient algorithm for representation and compression of large Bengali text
, M.P. Uddin, M.I. Afjal, M.D. Haque
Published in Institute of Electrical and Electronics Engineers Inc.
2014
Pages: 22 - 25
Abstract
Efficient coding is one of the challenging aspects of information and communication theory. On the other hand, the natural languages such as Bengali is coded using Unicode technology which requires more space and thus takes more time to transfer the data of that language. In this paper, we have proposed a novel algorithm to represent Bengali text efficiently and then to compress the text offering a better compression ratio. Each Bengali character is represented by a unique 2-digit intermediate decimal value. Indexing and sorting all the word values successive subtraction is performed on the values in hope to reduce the weight of the numbers. The new values of each word can now be encoded with a very few bits. In comparison to other compressors, the compression ratio of the proposed algorithm decreases in a big amount for the large text which may contain more duplicate or redundant words, more words with the same length and more words of the same length with the same prefix called Uposorgo in Bengali. © 2014 IEEE.
About the journal
JournalData powered by Typeset2014 9th International Forum on Strategic Technology, IFOST 2014
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.