Data-Compression.org

data compression link collection

The British National Corpus

Published in Files, Benchmarks

The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of current British English, both spoken and written.

http://info.ox.ac.uk/bnc/

         

Posted in September 21st, 1999

The Calgary Corpus

This is the home page for the Calgary Corpus. This set of files has long been the standard used for comparison of various lossless compression techniques.

http://links.uwaterloo.ca/calgary.corpus.html

         

Posted in September 21st, 1999

Ross Williams

Ross Williams did some seminal work in the area of dictionary based encoders in the late 1980s and early 1990s. His LZRW algorithms were not only innovative and interesting, but they managed to place Ross right in the middle of some early software patent issues.

http://www.ross.net/compression

* * * *  

Posted in September 21st, 1999

Shorten

Shorten is an audio compression program by SoftSound. Shorten is advertised as a low complexity audio coder that can compress in lossless or lossy mode. This is the home page with links to the download page. Includes source and binaries for DOS, Windows, and Linux. Evaluation version is available.

http://www.softsound.com/Shorten.html

* * *    

Posted in September 21st, 1999

Greg Roelofs

Greg Roelofs’ home page. Greg has invested many years of his life towards good works such as PNG, zlib, and Info-ZIP. You might know him as Cave Newt.

http://www.sonic.net/~roelofs/

* * * * *

Posted in September 21st, 1999

Bibliography on Source Coding/Data Compression

Mitsuharu Arimura’s page of links and references to a wide variety of papers and books on lossless compression. Some of the links are listed in English, others in Japanese.

http://www.hn.is.uec.ac.jp/~arimura/compression_papers.html

         

Posted in September 21st, 1999

Sending TV Down the Phone Line

ADAM CLARK could be sitting on an invention with the potential to turn the computer world on its head, not to mention the worlds of telecommunications and broadcasting. The 22-year-old Knoxfield developer claims to have cracked a conundrum that has stumped researchers for years - how to deliver broadcast quality sound and video down plain old telephone line.
Note: there is a $1AUS charge for viewing this article. :-(

http://newsstore.f2.com.au/rlprod/members_rlsearcher?ac=viewDocument&docID=news980519_0332
5622&docType=N&rs=1&sy=age&kw=adam+clark&pb=all_ffx&dt=selectRange&dr=5years&so=relevance&
&la=search&ss=AGE&sf=article&rc=10&rm=200&clsPage=1

* * *    

Posted in September 21st, 1999

GIF89A Specification

This is a copy of the second GIF specification from CompuServe. It added quite a few features to the GIF format. Probably the best well known of these would be the ability to add animation to GIF files.

Reader Andrew T. says: The definitive document, what more can you ask for?

http://www.w3.org/Graphics/GIF/spec-gif89a.txt

         

Posted in September 21st, 1999

PNG (Portable Network Graphics) Specification Version 1.0

Published in Standards, PNG

W3C Recommendation 01-October-1996. This document describes PNG (Portable Network Graphics), an extensible file format for the lossless, portable, well-compressed storage of raster images. PNG provides a patent-free replacement for GIF and can also replace many common uses of TIFF. Indexed-color, grayscale, and truecolor images are supported, plus an optional alpha channel. Sample depths range from 1 to 16 bits.

http://www.w3.org/TR/png.html

* * * * *

Posted in September 21st, 1999

Wotsit’s Archive Formats

Published in Standards, Archiving

Wotsit’s Format, the complete programmer’s resource on the net. This site contains file format information on hundreds of different file types and all sorts of other useful programming information; algorithms, source code, specifications, etc. This page has information on scads of archive formats.

http://www.wotsit.org/search.asp?s=archive

* * * * *

Posted in September 21st, 1999

LHA Source code site

FTP site that contains LHA source code. Most of this page is in Japanese.

ftp://ftp.vector.co.jp/pack/dos/util/arc/lha/

         

Posted in February 18th, 1999

LIMIT version 1.2 — Data Compression/rchive utility

LIMIT is a fast and compact data compressor/archiver. The compression and decompression routines have been written entirely in 80286 assembly, so it runs faster than many other archivers. LIMIT also compresses better than other archivers; it uses a compression method based on 32K sliding window dictionary plus huffman encoding, similar to LHA/PKZIP/ARJ, but with some improvments.

ftp://garbo.uwasa.fi/pc/arcers/limit12.zip

* * * *  

Posted in February 18th, 1999

SCRNCH

SCRNCH is a data compressor designed for people, such as software developers, who need to send programs or files to a large number of people cheaply. It provides a high degree of compression and the ability to customize the self-extracting compressed file. SCRNCH takes its time to produce optimal compression, but decompression is automatic and very fast.

ftp://garbo.uwasa.fi/pc/arcers/scrnch02.zip

         

Posted in February 18th, 1999

Unzip 5.32 - 16 bit Intel executables

UnZip is an extraction utility for archives compressed in .zip format (also
called “zipfiles”). Although highly compatible both with PKWARE’s PKZIP
and PKUNZIP utilities for MS-DOS and with Info-ZIP’s own Zip program, our
primary objectives have been portability and non-MSDOS functionality.

ftp://garbo.uwasa.fi/pc/arcers/unz532x.exe

         

Posted in February 18th, 1999

The JPEG Playground

Published in Mirror Site

This site looks like it got started but never completed

http://mu.org/~jimmy/jpeg_tests/testgrnd.htm

         

Posted in February 18th, 1999

Announcing XWPL, the X Wavelet Packet Laboratory

Published in Mirror Site

XWPL is an X based tool to examine one-dimensional real-valued signals using
wavelets and wavelet packets. It has been designed to be as easy to use as
possible for beginners. It is intended more as an educational and exploratory
tool than as a numerical analysis program, even though it uses fast, optimized
wavelet and wavelet packet transforms. NOT USING BECAUSE IT ISN’T SPECIFICALLY RELATED TO DATA COMPRESSION.

http://pascal.math.yale.edu/pub/wavelets/software/xwpl/

         

Posted in February 18th, 1999

Wavelet Resources - Yale

Links from the Yale Computational Mathematics Group. Contains links to software and papers. Most of the links don’t show up in this database as they are not necessarily related to using wavelets for data compression.The papers that appear off this page are all compressed postscript and have not been added to the database.

http://www.math.yale.edu/wavelets/

*        

Posted in February 18th, 1999

Yleisimpiä kuvanpakkausmenetelmiä ja niiden vertailua

This appears to be an overview of various data compression methods applied to image compression, both lossy and lossless. The page is entirely in what appears to be Finnish.

http://www.hut.fi/~pemakela/courses/maa-57.351/compressing.html

* * *    

Posted in February 17th, 1999

PICTools Imaging Library by Pegasus Imaging

Pegasus Imaging Corporation, Tampa FL.announces the release today of new version of the PICTools imaging libraries for software developers.

http://www.pegasusimaging.com/new_pictools_pr.htm

*        

Posted in February 17th, 1999

EE372 Home Page QUANTIZATION and Data Compression and

A basic syllabus and links giving a wide survey of the field.

http://www.stanford.edu/class/ee372/

         

Posted in February 17th, 1999

Gordon V. Comack

The email address for Gordon V. Comack, the author of DMC.

mailto:cormack@uwaterloo.ca

         

Posted in February 17th, 1999

Tom Lane

Published in People, JPEG

email address for Tom Lane, organizer of the Independent JPEG group, omniscient poster to comp.compression on JPEG issues.

mailto:tgl@netcom.com

         

Posted in February 17th, 1999

I need specs for graphics formats

Links found in the comp.compression FAQ that help with this question

http://www.faqs.org/faqs/compression-faq/part1/section-29.html

         

Posted in February 15th, 1999

JPEGDUMP from Colosseum Builders, Inc.

This application dumps the contents of JPEG blocks in a JFIF file.

http://www.colosseumbuilders.com/Source/jpegdump.zip

* * * * *

Posted in February 15th, 1999

Compression Basics by Pasi ‘Albert’ Ojala

An introductory paper. Includes information on Huffman coding, Info Theory, Coding, LZ77, LZ78, and more. This page also has a good set of links.

http://www.cs.tut.fi/~albert/Dev/pucrunch/packing.html

* * * * *

Posted in February 15th, 1999

Ian H. Witten

Published in People, Data Compression

Ian H. Witten’s Home Page. I’m Professor of Computer Science here in sunny New Zealand — Department of Computer ScienceUniversity of Waikato, New Zealand

http://www.cs.waikato.ac.nz/~ihw/

         

Posted in February 15th, 1999

History of the Portable Network Graphics (PNG) Format

This article originally appeared in the electronic Linux Gazette in January 1997 and was subsequently (re)printed in the April 1997 issue of Linux Journal. The main text is current as of early January 1997, with updates appearing at the very end as Author’s Notes.

http://www.libpng.org/pub/png/pnghist.html

         

Posted in February 15th, 1999

Markov Coders

A set of Markov compressors by Charles Bloom, including source code. This includes links to a wide variety of his programs, including Context Coders, List LRU, and DefSum, along with a link to an early paper of his.

http://www.cbloom.com/src/indexppm.html

* * * * *

Posted in February 14th, 1999

Fractal Image Compression : Theory and Application

Published in Books, Fractals

by Yuval Fisher (Editor). Featuring a collection of articles by twelve experts in the field of fractal image compression, this book contains the complete details of how to encode and decode images, offering working codes that are usable in applications. Includes some of the latest results in this field..
If you are interested in buying this book, please use the link on this page. Your purchase will help to support this site.

http://www.amazon.com/exec/obidos/ASIN/0387942114/theinternetdatac

* * * * *

Posted in February 13th, 1999

Lossless Audio Compression with LTAC

LTAC allows for lossless audio compression of PCM coded audio signals (16 bit), i.e. decoding results bit for bit in the original bit stream of the PCM coded signal. In contrast to LTAC there exist a lot of lossy coding algorithms like MPEG, ATRAC and AC-3, which yield higher compression ratios but in any case lead to coding distortions - nothing for HiFi purists and professionals.

User comment: Please note that LTAC is not developed further; it has a successor called LPAC; the rating refers to LPAC which is really an excellent small program.

http://www.nue.tu-berlin.de/wer/liebchen/ltac.html

* * * * *

Posted in February 4th, 1999