


Distance Coding (DC) is an algorithm proposed by Edgar Binder in 2000. No official paper about DC has been published yet, but some messages from Edgar Binder in the comp.compression newsgroup are available. DC is a replacement for the Move To Front (MTF) stage within the Burrows-Wheeler Compression Algorithm (BWCA). As with the Inversion Frequencies (IF) stage, the output of the DC stage consists of index distances, which can be greater than 255. The DC stage scans the input sequence sequentially and, for each symbol C, outputs the distance between the current index and the next occurrence of C.
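The distance mapping described above can be illustrated with a minimal Python sketch. It is not Edgar Binder's implementation: the function name distance_code is hypothetical, the use of 0 to mark "no further occurrence" is an assumption made for this example, and the sketch leaves out both the refinements based on the properties of the DC output and any side information a decoder would need.

```python
def distance_code(data: bytes) -> list[int]:
    """Naive distance mapping: for each position, the distance to the
    next occurrence of the same symbol (0 if there is none; the 0
    marker is an assumption of this sketch, not part of the source)."""
    next_pos: dict[int, int] = {}   # symbol -> index of its nearest later occurrence
    out = [0] * len(data)
    # Scan right to left so the nearest later occurrence is always known.
    for i in range(len(data) - 1, -1, -1):
        sym = data[i]
        if sym in next_pos:
            out[i] = next_pos[sym] - i
        next_pos[sym] = i
    return out


if __name__ == "__main__":
    print(distance_code(b"abracadabra"))
    # A long run between two equal symbols yields a distance above 255.
    print(distance_code(b"a" + b"b" * 300 + b"a")[0])
```

Because the outputs are index distances rather than byte values, a following entropy coder has to handle an alphabet larger than 256 symbols.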
Title

Description


Distance Coding newsgroup message

The famous newsgroup posting from 2000 by Edgar Binder, in which he describes the DC algorithm with an example, together with three important properties of the DC output sequence.


Second step algorithms in the Burrows-Wheeler compression algorithm

This 2002 publication by Sebastian Deorowicz in "Software: Practice and Experience" gives a fairly complete overview of the post-BWT stages used within the Burrows-Wheeler Compression Algorithm. Besides his own Weighted Frequency Count algorithm and other post-BWT stages, Sebastian briefly but clearly describes variations of the Move To Front scheme, the Inversion Frequencies algorithm, and the Distance Coding algorithm with the three properties from the newsgroup posting. This is one of my favourite BWCA papers. This BWCA approach achieves a compression rate of 2.25 bps for the Calgary Corpus.


An analysis of the second step algorithms in the Burrows-Wheeler compression algorithm

A publication from 2000 by Sebastian Deorowicz, quite similar to his article "Second step algorithms in the Burrows-Wheeler compression algorithm", describing his Weighted Frequency Count algorithm. This BWCA approach achieves a compression rate of 2.25 bps for the Calgary Corpus.

Name

Description


Edgar Binder

Edgar Binder is the author of the Distance Coding algorithm (DC) and a member of the Snarlup software team.


Sebastian Deorowicz

Sebastian is the author of the Weighted Frequency Count algorithm (WFC) and a doctoral student at the Silesian University of Technology, Poland. He has just finished his dissertation.




