The British National Corpus
The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of current British English, both spoken and written.
The Calgary Corpus
This is the home page for the Calgary Corpus. This set of files has long been the standard used for comparison of various lossless compression techniques.
http://links.uwaterloo.ca/calgary.corpus.html
Ross Williams
Ross Williams did some seminal work in the area of dictionary based encoders in the late 1980s and early 1990s. His LZRW algorithms were not only innovative and interesting, but they managed to place Ross right in the middle of some early software patent issues.
http://www.ross.net/compression
Shorten
Shorten is an audio compression program by SoftSound. Shorten is advertised as a low complexity audio coder that can compress in lossless or lossy mode. This is the home page with links to the download page. Includes source and binaries for DOS, Windows, and Linux. Evaluation version is available.
http://www.softsound.com/Shorten.html
Greg Roelofs
Greg Roelofs’ home page. Greg has invested many years of his life towards good works such as PNG, zlib, and Info-ZIP. You might know him as Cave Newt.
http://www.sonic.net/~roelofs/
Bibliography on Source Coding/Data Compression
Mitsuharu Arimura’s page of links and references to a wide variety of papers and books on lossless compression. Some of the links are listed in English, others in Japanese.
http://www.hn.is.uec.ac.jp/~arimura/compression_papers.html
Sending TV Down the Phone Line
ADAM CLARK could be sitting on an invention with the potential to turn the computer world on its head, not to mention the worlds of telecommunications and broadcasting. The 22-year-old Knoxfield developer claims to have cracked a conundrum that has stumped researchers for years - how to deliver broadcast quality sound and video down plain old telephone line.
Note: there is a $1AUS charge for viewing this article.
GIF89A Specification
This is a copy of the second GIF specification from CompuServe. It added quite a few features to the GIF format. Probably the best well known of these would be the ability to add animation to GIF files.
Reader Andrew T. says: The definitive document, what more can you ask for?
http://www.w3.org/Graphics/GIF/spec-gif89a.txt
PNG (Portable Network Graphics) Specification Version 1.0
W3C Recommendation 01-October-1996. This document describes PNG (Portable Network Graphics), an extensible file format for the lossless, portable, well-compressed storage of raster images. PNG provides a patent-free replacement for GIF and can also replace many common uses of TIFF. Indexed-color, grayscale, and truecolor images are supported, plus an optional alpha channel. Sample depths range from 1 to 16 bits.
Wotsit’s Archive Formats
Wotsit’s Format, the complete programmer’s resource on the net. This site contains file format information on hundreds of different file types and all sorts of other useful programming information; algorithms, source code, specifications, etc. This page has information on scads of archive formats.
http://www.wotsit.org/search.asp?s=archive
LHA Source code site
FTP site that contains LHA source code. Most of this page is in Japanese.
ftp://ftp.vector.co.jp/pack/dos/util/arc/lha/
LIMIT version 1.2 — Data Compression/rchive utility
LIMIT is a fast and compact data compressor/archiver. The compression and decompression routines have been written entirely in 80286 assembly, so it runs faster than many other archivers. LIMIT also compresses better than other archivers; it uses a compression method based on 32K sliding window dictionary plus huffman encoding, similar to LHA/PKZIP/ARJ, but with some improvments.
ftp://garbo.uwasa.fi/pc/arcers/limit12.zip
SCRNCH
SCRNCH is a data compressor designed for people, such as software developers, who need to send programs or files to a large number of people cheaply. It provides a high degree of compression and the ability to customize the self-extracting compressed file. SCRNCH takes its time to produce optimal compression, but decompression is automatic and very fast.
ftp://garbo.uwasa.fi/pc/arcers/scrnch02.zip
Unzip 5.32 - 16 bit Intel executables
UnZip is an extraction utility for archives compressed in .zip format (also
called “zipfiles”). Although highly compatible both with PKWARE’s PKZIP
and PKUNZIP utilities for MS-DOS and with Info-ZIP’s own Zip program, our
primary objectives have been portability and non-MSDOS functionality.
ftp://garbo.uwasa.fi/pc/arcers/unz532x.exe
The JPEG Playground
This site looks like it got started but never completed
http://mu.org/~jimmy/jpeg_tests/testgrnd.htm
Announcing XWPL, the X Wavelet Packet Laboratory
XWPL is an X based tool to examine one-dimensional real-valued signals using
wavelets and wavelet packets. It has been designed to be as easy to use as
possible for beginners. It is intended more as an educational and exploratory
tool than as a numerical analysis program, even though it uses fast, optimized
wavelet and wavelet packet transforms. NOT USING BECAUSE IT ISN’T SPECIFICALLY RELATED TO DATA COMPRESSION.
http://pascal.math.yale.edu/pub/wavelets/software/xwpl/
Wavelet Resources - Yale
Links from the Yale Computational Mathematics Group. Contains links to software and papers. Most of the links don’t show up in this database as they are not necessarily related to using wavelets for data compression.The papers that appear off this page are all compressed postscript and have not been added to the database.
http://www.math.yale.edu/wavelets/
Yleisimpiä kuvanpakkausmenetelmiä ja niiden vertailua
This appears to be an overview of various data compression methods applied to image compression, both lossy and lossless. The page is entirely in what appears to be Finnish.
http://www.hut.fi/~pemakela/courses/maa-57.351/compressing.html
PICTools Imaging Library by Pegasus Imaging
Pegasus Imaging Corporation, Tampa FL.announces the release today of new version of the PICTools imaging libraries for software developers.
http://www.pegasusimaging.com/new_pictools_pr.htm
EE372 Home Page QUANTIZATION and Data Compression and
Gordon V. Comack
Tom Lane
email address for Tom Lane, organizer of the Independent JPEG group, omniscient poster to comp.compression on JPEG issues.
I need specs for graphics formats
Links found in the comp.compression FAQ that help with this question
http://www.faqs.org/faqs/compression-faq/part1/section-29.html
JPEGDUMP from Colosseum Builders, Inc.
This application dumps the contents of JPEG blocks in a JFIF file.
http://www.colosseumbuilders.com/Source/jpegdump.zip
Compression Basics by Pasi ‘Albert’ Ojala
An introductory paper. Includes information on Huffman coding, Info Theory, Coding, LZ77, LZ78, and more. This page also has a good set of links.
http://www.cs.tut.fi/~albert/Dev/pucrunch/packing.html
Ian H. Witten
Ian H. Witten’s Home Page. I’m Professor of Computer Science here in sunny New Zealand — Department of Computer ScienceUniversity of Waikato, New Zealand
http://www.cs.waikato.ac.nz/~ihw/
History of the Portable Network Graphics (PNG) Format
This article originally appeared in the electronic Linux Gazette in January 1997 and was subsequently (re)printed in the April 1997 issue of Linux Journal. The main text is current as of early January 1997, with updates appearing at the very end as Author’s Notes.
http://www.libpng.org/pub/png/pnghist.html
Markov Coders
A set of Markov compressors by Charles Bloom, including source code. This includes links to a wide variety of his programs, including Context Coders, List LRU, and DefSum, along with a link to an early paper of his.
http://www.cbloom.com/src/indexppm.html
Fractal Image Compression : Theory and Application
by Yuval Fisher (Editor). Featuring a collection of articles by twelve experts in the field of fractal image compression, this book contains the complete details of how to encode and decode images, offering working codes that are usable in applications. Includes some of the latest results in this field..
If you are interested in buying this book, please use the link on this page. Your purchase will help to support this site.
http://www.amazon.com/exec/obidos/ASIN/0387942114/theinternetdatac
Lossless Audio Compression with LTAC
LTAC allows for lossless audio compression of PCM coded audio signals (16 bit), i.e. decoding results bit for bit in the original bit stream of the PCM coded signal. In contrast to LTAC there exist a lot of lossy coding algorithms like MPEG, ATRAC and AC-3, which yield higher compression ratios but in any case lead to coding distortions - nothing for HiFi purists and professionals.
User comment: Please note that LTAC is not developed further; it has a successor called LPAC; the rating refers to LPAC which is really an excellent small program.
http://www.nue.tu-berlin.de/wer/liebchen/ltac.html