Fast text compression using multiple static dictionaries

Carus A.; Mesut A.

Date

2010

Authors

Carus A.

Mesut A.

Publisher

Asian Network for Scientific Information

Access Rights

info:eu-repo/semantics/openAccess

Abstract

We developed a fast text compression method based on multiple static dictionaries and named it STECA (Static Text Compression Algorithm). The algorithm is language dependent because of its static structure; however, it owes its speed to that structure. To evaluate the encoding and decoding performance of STECA on different languages, we selected English and Turkish, which have different grammatical structures. Compression time, decompression time, and compression ratio are compared with those of the LZW, LZRW1, LZP1, LZOP, WRT, DEFLATE (Gzip), BWCA (Bzip2), and PPMd algorithms. Our evaluation experiments show that, if speed is the primary consideration, STECA is an efficient algorithm for compressing natural language texts. © 2010 Asian Network for Scientific Information.
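
The abstract does not describe how STECA works internally, so the following is only a minimal sketch of the general idea named in the keywords: digram coding against a static dictionary, with one dictionary per language. It is not the authors' algorithm. The table contents, the names `STATIC_DIGRAMS`, `FIRST_CODE`, `encode`, and `decode`, and the restriction to 7-bit ASCII input are all assumptions made for illustration.

```python
# Hypothetical static digram tables, one per language. The entries are
# illustrative guesses, not the dictionaries used in the paper.
STATIC_DIGRAMS = {
    "en": ["e ", "th", "he", " t", "in", "er", "an", "s "],
    "tr": ["ar", "la", "an", "er", "in", "le", "de", "n "],
}

FIRST_CODE = 128  # byte values 128..255 are reserved for digram codes


def encode(text: str, lang: str) -> bytes:
    """Greedy digram coding of 7-bit ASCII text with a fixed table."""
    table = {dg: FIRST_CODE + i for i, dg in enumerate(STATIC_DIGRAMS[lang])}
    data = text.encode("ascii")  # raises UnicodeEncodeError on non-ASCII input
    out = bytearray()
    i = 0
    while i < len(data):
        pair = data[i:i + 2].decode("ascii")
        code = table.get(pair)
        if code is not None:
            out.append(code)     # one byte stands for two characters
            i += 2
        else:
            out.append(data[i])  # literal byte, always below 128
            i += 1
    return bytes(out)


def decode(blob: bytes, lang: str) -> str:
    """Invert encode() using the same static table."""
    digrams = STATIC_DIGRAMS[lang]
    parts = []
    for b in blob:
        if b >= FIRST_CODE:
            parts.append(digrams[b - FIRST_CODE])
        else:
            parts.append(chr(b))
    return "".join(parts)


sample = "the thin hen"
packed_en = encode(sample, "en")
packed_tr = encode(sample, "tr")
```

With these assumed tables, the 12-character sample packs into 7 bytes using the English table and 11 bytes using the Turkish one, which illustrates why a static scheme is language dependent. Because the table is fixed in advance, nothing has to be learned from the input or stored with the output, which is one plausible reason a static structure can be fast.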

Keywords

Digram Coding; Dictionary Based Compression; Multiple Dictionary; Static Dictionary; Text Compression; Compression Ratio (Machinery); Dictionary-Based Compressions; Evaluation Experiments; Grammatical Structure; Multiple Dictionaries; Natural Language Text; Text Compression Methods; Text Compressions; Natural Language Processing Systems

Source

Information Technology Journal

Scopus Q Value

N/A

Volume

9

Issue

5

Link

https://doi.org/10.3923/itj.2010.1013.1021
https://hdl.handle.net/20.500.14551/16703

Collection

Scopus Indexed Publications Collection
