Data-Compression.org

data compression link collection

Author Page

Dynamic Huffman Coder

This dynamic Huffman coder from Karl Malbrain is written in C and includes weight scaling. It is modeled on the Vitter algorithm.
A DataCompression.info user notes that this site has been undergoing continual changes, and perhaps would benefit from some sort of “last modified on” field.

http://www.geocities.com/malbrain/vitter_c.html

read more from admin's post

Posted in November 20th, 2002

Introduction to Video Coding

A nice set of lecture notes by Iain Richardson. The notes have a good level of detail, so there is value here even without the lecture.

http://www.vcodex.fsnet.co.uk/videocoding2b.pdf

read more from admin's post

Posted in November 17th, 2002

From Ukkonen to McCreight and Weiner: A Unifying View of Linear-Time Suffix Tree Construction

1997, Robert Giegerich, Stefan Kurtz. We review the linear time suffix tree constructions by Weiner, McCreight, and Ukkonen. We use the terminology of the most recent algorithm, Ukkonen’s online construction, to explain its historic predecessors. The submitter of this paper indicates that it has user-friendly terminology, always welcome in Journal papers.

http://citeseer.nj.nec.com/giegerich97from.html

read more from admin's post

Posted in November 16th, 2002

ITU Recommendation V.44

This is the data compression standard that implements the LZJH algorithm, and is used in V.90 and V.92 modems. The ITU wants to charge you a few bucks for this standard, but if you believe the post from Pete Fraser (listed elsewhere on DataCompression.info) you can get three free standards per year. Maybe this ought […]

http://www.itu.int/rec/recommendation.asp?type=items&lang=e&parent=T-REC-V.44-200011-I

read more from admin's post

Posted in November 9th, 2002

Sequitir - inferring hierarchies from sequences

SEQUITUR is a method for inferring compositional hierarchies from strings. It detects repetition and factors it out of the string by forming rules in a grammar. The rules can be composed of non-terminals, giving rise to a hierarchy. It is useful for recognizing lexical structure in strings, and excels at very long sequences.

http://sequence.rutgers.edu/sequitur/

read more from admin's post

Posted in November 9th, 2002

PlusV

PlusV is a brand new audio compression enhancement technology that allows audio files to be compressed in as little as 64 or even 48 kbits/s. PlusV is not a compression scheme of its own, it is an extension that can be applied to existing audio formats. When combined with the MP3 technology, MP3+V files are […]

http://www.plusv.org/

read more from admin's post

Posted in November 8th, 2002

IP*Works! Zip Component Suite

IP*Works! Zip allows developers to easily integrate compression and decompression functionality into web or desktop applications using popular file and data compression standards including Zip, Tar, Gzip and Jar standards for compression. All IP*Works! Zip components have been built from the ground up entirely in native .NET and Java code, combining exceptional speed, excellent […]

http://www.nsoftware.com/products/ipworkszip.aspx

read more from admin's post

Posted in November 7th, 2002

BICOM - BIjective COMpressor

BICOM is a freely available open source compressor. It uses a souped-up PPM algorithm, and is completely bijective.
Reader comment:
Wow this is hot! …a bijective compressor
using full size Rijndael encryption…

http://www3.sympatico.ca/mt0000/bicom/bicom.html

read more from admin's post

Posted in November 1st, 2002

szip homepage

Szip is a freeware portable general purpose lossless compression program. It has a high speed and compression, but high memory demands (up to 20MB) too. The compression is done using a variant of blocksorting, which explains its rather high memory requirements.

Update: Michael Schindler has at long last posted the source code for szip.

http://www.compressconsult.com/szip/

read more from admin's post

Posted in October 31st, 2002

In-Place Calculation of Minimum-Redundancy Codes

The abstract for a paper on calculation of Huffman codes. The paper isn’t here, but the source code is. Alistair says that if you sort your array of counts, you can create the Canonical Huffman tree in memory.

http://www.cs.mu.oz.au/~alistair/abstracts/mk95%3Awads.html

read more from admin's post

Posted in October 31st, 2002

Embedded BWT Compression

An implementation of BWT designed with the goal of minimizing memory usage. Source code and a documentation page.

http://www.iahu.ca:8080/bwt.html

read more from admin's post

Posted in October 31st, 2002

Helix DNA Client

Real is making their client software available in an Open Source program. Download this software and you can develop your own MP3 player! Free of royalties if you are distributing it for free - commercial products pay a royalty.

https://www.helixcommunity.org/2002/intro/client

read more from admin's post

Posted in October 30th, 2002

SuperZip

Published in Commercial Programs, Zip

A full-featured release of a Win32 Zip program. Version 3.0 was released in October, 2002. Trial version available.

http://www.integsoft.com/sz.htm

read more from admin's post

Posted in October 30th, 2002

ArchiveXpert

ArchiveExpert supports the standard Zip format, as well as CAB, LHA, LZH, plus extraction support for ARJ, ARC, PAK, RAR, as well as other formats. The 2.02 release shipped in August of 2002.

http://www.c4fsoft.com/products/archivexpert/

read more from admin's post

Posted in October 30th, 2002

Hacking Data Compression

Andy McFadden was nice enough to post a series of a dozen lessons on Data Compression. It looks as though these were posts to the now-forgotten GEnie information service, once a pretender to the CompuServe popularity crown.

http://www.fadden.com/techmisc/hdc/

read more from admin's post

Posted in October 30th, 2002

Sax.net Streaming Compression

Sax.net Streaming Compression helps you keep your data small and fast. Use high-performance compression and data compression code, using a class library that was designed from the ground up for integration with the Microsoft .NET framework.

In addition to being able to specify whether to prefer speed over size, Sax.net Compression offers you a choice of […]

http://www.sax.net/dotnet/compression/

read more from admin's post

Posted in October 30th, 2002

cwenc - MP3 & Ogg Vorbis Encoder for Cakewalk

The cwenc utility is primarily a tool for musicians working with Cakewalk music production software, such as Home Studio and Sonar. cwenc enables you to export your original music to a compressed audio file format, such as MP3 or Ogg Vorbis, from within the mentioned Cakewalk products. cwenc is a direct replacement for the […]

http://cwenc.sourceforge.net/

read more from admin's post

Posted in October 30th, 2002

MP3elf - Network MP3 Player

The MP3elf is a low-cost Ethernet connected MP3 player that receives a MP3 stream from a local area network server and delivers it to amplified speakers or a Hi-Fi system. You can have a single source of MP3 files in your home or office, and browse through playlists from any MP3elf. The Universal IR […]

http://mp3elf.net/

read more from admin's post

Posted in October 30th, 2002

Java and Windows Pdf Extraction Decoding Access Library

A Java library released under the LGPL license for extracting text and images from pdf files, with full source code and examples. It also provides a rasterizer.

A commercial product called WinPedal is also available. WinPedal is an EXE that has grouping functionality that converts PDF text into XML.

http://www.jpedal.org/

read more from admin's post

Posted in October 30th, 2002

Feature Comparison of Popular Zip Compression Components

There are many Zip compression components in the marketplace. This web page is a summary of the most popular Zip components available and provides a side-by-side comparison of the features and pricing of each. This review focuses on components used by Microsoft developers in Visual Basic, ASP, Visual C++, and Visual FoxPro applications.

http://componentreview.tripod.com/

read more from admin's post

Posted in October 30th, 2002

Trie-based dictionary program

Falk Hueffner created a dictionary program that was to be used for a Scrabble-type word game. The source can be found at this link under the name dawg.tar.gz.

http://people.debian.org/~falk/

read more from admin's post

Posted in October 30th, 2002

Efficient Creation of Finite State Automota

This paper describes a method which creates minimal acyclic deterministic FSAs. These can be used to create an efficient representation of a dictionary of words by merging identical subtrees.

http://www-zo.iinf.polsl.gliwice.pl/~sdeor/pub/cd00abs.htm

read more from admin's post

Posted in October 30th, 2002

MPEG-4 Industry Forum Resources Link

Links to scads of papers, web sites, software, etc., all related to the MPEG-4 standard.

http://www.m4if.org/resources.php

read more from admin's post

Posted in October 30th, 2002

Jonathan Bennet’s C++ implementation of LZSS

A C++ implementation of the LZSS / LZ77 algorithm. Also contains a description of the LZSS algorithm and my implementations of it as I learned more about it (hashing, lazy evaluation, etc.) All the code from my first attempt to the current version is included.

An anonymous visitor to Jonathan’s page said it was […]

http://www.hiddensoft.com/cgi-bin/countdown.pl?code/LZSS.zip

read more from admin's post

Posted in October 30th, 2002

FileStream, Inc

FileStream makes quite a few compression-related programs, including TurboZip and TurboSFX. Release 5.,1 appears to have shipped sometime in the latter part of 2002.

http://www.turbozip.com

read more from admin's post

Posted in October 30th, 2002

Karl Malbrain - BWT Source

Karl has created a complete BWT package, and has posted the source on this site. He also has an adaptation of N. Jesper Larsson’s
Burrows-Wheeler Suffix Sorting for your perusal.

http://www.geocities.com/malbrain

read more from admin's post

Posted in October 30th, 2002

Burrows-Wheeler Transformation

Helger Lipmaa created a BWT transformer in PHP, and can demo it to you right on the net at this page.

http://www.tcs.hut.fi/~helger/bwt/index.php

read more from admin's post

Posted in October 23rd, 2002

WebWrap

WebWrap is a small 3-in-1 utility program to compress and/or code your HTML web files. It uses either a modified LZSS algorithm or the URL encryption procedure, and can reduce the size of your files by more than 50%. It also renders the source of any HTML pages you have created, your design features and […]

http://www.tac-soft.com/WebWrap/webwrap.html

read more from admin's post

Posted in October 21st, 2002

TaskZip

Freeware program written in VB5 that automates system backup, using Zip format.

http://jupiter.drw.net/matpie/PBSystems/products/taskzip/TZip.html

read more from admin's post

Posted in October 12th, 2002

Avi-Info

This free tool can be used to extract information about an AVI file, including resolution, bit-rate, video codec, etc. Full source available for download.

http://isengrim.rotegruetze.net/index.php?main=4&id=1&german=

read more from admin's post

Posted in October 12th, 2002