International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 90 - Number 11 |
Year of Publication: 2014 |
Authors: Kashfia Sailunaz, Mohammed Rokibul Alam Kotwal, Mohammad Nurul Huda |
10.5120/15765-4456 |
Kashfia Sailunaz, Mohammed Rokibul Alam Kotwal, Mohammad Nurul Huda . Data Compression Considering Text Files. International Journal of Computer Applications. 90, 11 ( March 2014), 27-32. DOI=10.5120/15765-4456
Lossless text data compression is an important field as it significantly reduces storage requirement and communication cost. In this work, the focus is directed mainly to different file compression coding techniques and comparisons between them. Some memory efficient encoding schemes are analyzed and implemented in this work. They are: Shannon Fano Coding, Huffman Coding, Repeated Huffman Coding and Run-Length coding. A new algorithm "Modified Run-Length Coding" is also proposed and compared with the other algorithms. These analyses show how these coding techniques work, how much compression is possible for these coding techniques, the amount of memory needed for each technique, comparison between these techniques to find out which technique is better in what conditions. It is observed from the experiments that the repeated Huffman Coding shows higher compression ratio. Besides, the proposed Modified run length coding shows a higher performance than the conventional one.