Multi-Stream Word-Based Compression Algorithm
dc.authorid | Öztürk, Emir/0000-0002-3734-5171 | |
dc.authorid | Mesut, Altan/0000-0002-1477-3093; | |
dc.authorwosid | Öztürk, Emir/Z-1726-2018 | |
dc.authorwosid | Mesut, Altan/AAE-8734-2019 | |
dc.authorwosid | Diri, Banu/AAA-1020-2021 | |
dc.contributor.author | Ozturk, Emir | |
dc.contributor.author | Mesut, Altan | |
dc.contributor.author | Diri, Banu | |
dc.date.accessioned | 2024-06-12T10:59:09Z | |
dc.date.available | 2024-06-12T10:59:09Z | |
dc.date.issued | 2017 | |
dc.department | Trakya Üniversitesi | en_US |
dc.description | 2017 International Conference on Computer Science and Engineering (UBMK) -- OCT 05-08, 2017 -- Antalya, TURKEY | en_US |
dc.description.abstract | In this article, we present a novel word-based lossless compression algorithm for text files which uses a semi-static model. We named our algorithm as Multi-stream Word-based Compression Algorithm (MWCA), because it stores the compressed forms of the words in three individual streams depending on their frequencies in the text. It also stores two dictionaries and a bit vector as a side information. In our experiments MWCA obtains compression ratio over 3,23 bpc on average and 2,88 bpc on files larger than 50 MB. If a variable length encoder like Huffman Coding is used after MWCA, given ratios will reduce to 2,63 and 2,44 bpc respectively. With the advantage of its multi-stream structure MWCA could become a good solution especially for storing and searching big text data. | en_US |
dc.description.sponsorship | IEEE Adv Technol Human,Istanbul Teknik Univ,Gazi Univ,Atilim Univ,TBV,Akdeniz Univ,Tmmob Bilgisayar Muhendisleri Odasi | en_US |
dc.identifier.endpage | 37 | en_US |
dc.identifier.isbn | 978-1-5386-0930-9 | |
dc.identifier.scopus | 2-s2.0-85040605764 | en_US |
dc.identifier.scopusquality | N/A | en_US |
dc.identifier.startpage | 34 | en_US |
dc.identifier.uri | https://hdl.handle.net/20.500.14551/20337 | |
dc.identifier.wos | WOS:000426856900007 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | tr | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | 2017 International Conference On Computer Science And Engineering (Ubmk) | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Data Compression | en_US |
dc.subject | Text Compression | en_US |
dc.subject | Natural-Language Text | en_US |
dc.title | Multi-Stream Word-Based Compression Algorithm | en_US |
dc.type | Conference Object | en_US |