Author Page
Seminal arithmetic coding source code
The source code to the famous Witten, Neal, and Cleary 1987 CACM article on arithmetic coding. The paper is probably not legally on line anywhere, but can be found in the book Text Compression, as well as the journal. This FTP site has three different variations on the source.
An FTP site for the Calgary Corpus
The Calgary Corpus is a set of files that were put together by compression mavens Bell, Cleary, and Witten in 1989 for benchmarking lossless compression algorithms. Files included in this set include English text, source code, executable code, and some data files.
ftp://ftp.cpsc.ucalgary.ca/pub/projects/text.compression.corpus/
“Block Sorting Text Compression” by Peter Fenwick
A paper discussing BWT text compression in Proceedings of the 19th Australasian Computer Science Conference, Melbourne, Australia. Jan 31 - Feb 2, 1996.
“Experiments with a Block-Sorting Text Compression Algorithm” by Peter Fenwick
The University of Auckland, Department of Computer Science, Technical Report 111, March 1995.
“Improvements to the Block-Sorting Text Compression Algorithm” by Peter Fenwick
The University of Auckland, Department of Computer Science, Technical Report 120, July 1995.
“Block Sorting Text Compression — Final Report” by Peter Fenwick,
The University of Auckland, Department of Computer Science, Technical Report 130, April 1996.
Zip file format specification
This file is based on the PKWare APPNOTE.TXT file dated 15 February 1996. This is what is normally used as the original documentation specifying the Zip file format. This file is from the documentation site for the InfoZip project. InfoZip is the source for the popular cross-platform Zip and UnZip programs.
ftp://ftp.uu.net/pub/archiving/zip/doc/appnote-970311-iz.zip
ari_b - by Michael Schindler
This ftp site contains arithmetic coding source from Michael Schindler. The library code consists of a a byte oriented arithmetic coder. The arithmetic coder is based on Alstair Moffat’s paper “Arithmetic Coding Revisited,” presented at the Data Compression Conference 95 (Snowbird, UT). The source is stored in both tar.z and zip format. The 1.101 release […]
A Mathematical Theory of Communication by Claude E. Shannon
A reprint of an important paper. This site has links to the paper in PDF and Postscript formats. Claude E. Shannon is widely acknowledged to be the father of Information Theory. The publication of this paper established that reputation and gave birth to this area of scientific endeavor.
D.J. Wheeler’s FTP site
Contains his implementation of the BWT algorithm, in the program bred. Along with this are some notes and papers on his implementation of BWT
Current list of files:
bexp.c
bexp3.c
bred.c
bred.ps
bred.ps.Z
bred2
bred3.c
bred3.ps
exp.c
huff.ps
mintext.ps
mintext.tex
red.c
tea.ps
tub.ps
wake.ps
xtea.ps
xxtea.ps
Tak’Asic
Tak’Asic provides Lossless/Lossy still image compression/decompression ICs solution. .
Our ICs support JBIG, MH, MR, MMR and JPEG standards.
Our latest product, Tak’B3, is the world fastest JBIG codec available. It is able to process images at 133Mpixel/sec minimum and up to 2128Mpixel/sec. It also provides code conversion (MR/MR/MMR JBIG) and scaling functionality.
