Lightweight natural language text compression
Variants of Huffman codes where words are taken as the source symbols are currently the
most attractive choices to compress natural language text databases. In particular, Tagged …
most attractive choices to compress natural language text databases. In particular, Tagged …
New adaptive compressors for natural language text
Semistatic byte‐oriented word‐based compression codes have been shown to be an
attractive alternative to compress natural language text databases, because of the …
attractive alternative to compress natural language text databases, because of the …
Efficiently decodable and searchable natural language adaptive compression
We address the problem of adaptive compression of natural language text, focusing on the
case where low bandwidth is available and the receiver has little processing power, as in …
case where low bandwidth is available and the receiver has little processing power, as in …
Improving semistatic compression via pair-based coding
In the last years, new semistatic word-based byte-oriented compressors, such as Plain and
Tagged Huffman and the Dense Codes, have been used to improve the efficiency of text …
Tagged Huffman and the Dense Codes, have been used to improve the efficiency of text …
Adapted LZW Protocol for ECG Data Compression
Lempel–Ziv–Welch (LZW) is a data compression method, which is adopted by many
applications likes Electrocardiography (ECG) data to reduce the size of transferred data …
applications likes Electrocardiography (ECG) data to reduce the size of transferred data …
[HTML][HTML] A fast dynamic compression scheme for natural language texts
A Gupta, S Agarwal - Computers & Mathematics with Applications, 2010 - Elsevier
We adapted Word-based Tagged Code (WBTC) to obtain its dynamic version. The aim of
designing a dynamic version of WBTC is to adapt it for real-time transmission. The problem …
designing a dynamic version of WBTC is to adapt it for real-time transmission. The problem …
Making dense codes even denser
S Grabowski - … Górniczo-Hutnicza im. Stanisława Staszica w …, 2008 - yadda.icm.edu.pl
The task of compressed pattern matching is to report all the occurrences of a given pattern P
in a text T available in compressed form. Certain compression algorithms allow for searching …
in a text T available in compressed form. Certain compression algorithms allow for searching …
Compressing dynamic text collections via phrase-based coding
We present a new statistical compression method, which we call Phrase Based Dense Code
(PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection …
(PBDC), aimed at compressing large digital libraries. PBDC compresses the text collection …
[CARTE][B] New algorithms for exact and approximate text matching
S Grabowski - 2009 - szgrabowski.kis.p.lodz.pl
Searching is arguably the most important of all problems that computer science deals with.
Very broadly speaking, this problem consists in reporting occurrences of an object (key) in a …
Very broadly speaking, this problem consists in reporting occurrences of an object (key) in a …
[PDF][PDF] Term-frequency surrogates in text similarity computations
S Pohl, A Moffat - Proc. 13th Australasian Document …, 2008 - adcs-conference.org
Inverted indexes on external storage perform best when accesses are ordered and data is
read sequentially, so that seek times are minimized. As a consequence, the various items …
read sequentially, so that seek times are minimized. As a consequence, the various items …