Lightweight natural language text compression

NR Brisaboa, A Farina, G Navarro, JR Paramá - Information retrieval, 2007 - Springer
Variants of Huffman codes where words are taken as the source symbols are currently the
most attractive choices to compress natural language text databases. In particular, Tagged …

New adaptive compressors for natural language text

NR Brisaboa, A Farina, G Navarro… - Software: Practice and …, 2008 - Wiley Online Library
Semistatic byte‐oriented word‐based compression codes have been shown to be an
attractive alternative to compress natural language text databases, because of the …

Efficiently decodable and searchable natural language adaptive compression

NR Brisaboa, A Farina, G Navarro… - Proceedings of the 28th …, 2005 - dl.acm.org
We address the problem of adaptive compression of natural language text, focusing on the
case where low bandwidth is available and the receiver has little processing power, as in …

Improving semistatic compression via pair-based coding

NR Brisaboa, A Farina, G Navarro… - International Andrei Ershov …, 2006 - Springer
In the last years, new semistatic word-based byte-oriented compressors, such as Plain and
Tagged Huffman and the Dense Codes, have been used to improve the efficiency of text …

Adapted LZW Protocol for‎ ECG Data Compression

SMK Al-alak, IH Alwan… - Journal Of University Of …, 2017 - journalofbabylon.com
Lempel–Ziv–Welch (LZW) is a data compression method, which is adopted by many
applications likes Electrocardiography (ECG) data to reduce the size of transferred data …

[HTML][HTML] A fast dynamic compression scheme for natural language texts

A Gupta, S Agarwal - Computers & Mathematics with Applications, 2010 - Elsevier
We adapted Word-based Tagged Code (WBTC) to obtain its dynamic version. The aim of
designing a dynamic version of WBTC is to adapt it for real-time transmission. The problem …

Making dense codes even denser

S Grabowski - … Górniczo-Hutnicza im. Stanisława Staszica w …, 2008 - yadda.icm.edu.pl
The task of compressed pattern matching is to report all the occurrences of a given pattern P
in a text T available in compressed form. Certain compression algorithms allow for searching …

Compressing dynamic text collections via phrase-based coding

NR Brisaboa, A Farina, G Navarro… - … Conference on Theory and …, 2005 - Springer
We present a new statistical compression method, which we call Phrase Based Dense Code
(PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection …

[CARTE][B] New algorithms for exact and approximate text matching

S Grabowski - 2009 - szgrabowski.kis.p.lodz.pl
Searching is arguably the most important of all problems that computer science deals with.
Very broadly speaking, this problem consists in reporting occurrences of an object (key) in a …

[PDF][PDF] Term-frequency surrogates in text similarity computations

S Pohl, A Moffat - Proc. 13th Australasian Document …, 2008 - adcs-conference.org
Inverted indexes on external storage perform best when accesses are ordered and data is
read sequentially, so that seek times are minimized. As a consequence, the various items …