Author Page
Dynamic Huffman Coder
This dynamic Huffman coder from Karl Malbrain is written in C and includes weight scaling. It is modeled on the Vitter algorithm.
A DataCompression.info user notes that this site has been undergoing continual changes, and perhaps would benefit from some sort of “last modified on” field.
Introduction to Video Coding
A nice set of lecture notes by Iain Richardson. The notes have a good level of detail, so there is value here even without the lecture.
From Ukkonen to McCreight and Weiner: A Unifying View of Linear-Time Suffix Tree Construction
1997, Robert Giegerich, Stefan Kurtz. We review the linear time suffix tree constructions by Weiner, McCreight, and Ukkonen. We use the terminology of the most recent algorithm, Ukkonen’s online construction, to explain its historic predecessors. The submitter of this paper indicates that it has user-friendly terminology, always welcome in Journal papers.
ITU Recommendation V.44
This is the data compression standard that implements the LZJH algorithm, and is used in V.90 and V.92 modems. The ITU wants to charge you a few bucks for this standard, but if you believe the post from Pete Fraser (listed elsewhere on DataCompression.info) you can get three free standards per year. Maybe this ought […]
http://www.itu.int/rec/recommendation.asp?type=items&lang=e&parent=T-REC-V.44-200011-I
Sequitir - inferring hierarchies from sequences
SEQUITUR is a method for inferring compositional hierarchies from strings. It detects repetition and factors it out of the string by forming rules in a grammar. The rules can be composed of non-terminals, giving rise to a hierarchy. It is useful for recognizing lexical structure in strings, and excels at very long sequences.
PlusV
PlusV is a brand new audio compression enhancement technology that allows audio files to be compressed in as little as 64 or even 48 kbits/s. PlusV is not a compression scheme of its own, it is an extension that can be applied to existing audio formats. When combined with the MP3 technology, MP3+V files are […]
IP*Works! Zip Component Suite
IP*Works! Zip allows developers to easily integrate compression and decompression functionality into web or desktop applications using popular file and data compression standards including Zip, Tar, Gzip and Jar standards for compression. All IP*Works! Zip components have been built from the ground up entirely in native .NET and Java code, combining exceptional speed, excellent […]
BICOM - BIjective COMpressor
BICOM is a freely available open source compressor. It uses a souped-up PPM algorithm, and is completely bijective.
Reader comment:
Wow this is hot! …a bijective compressor
using full size Rijndael encryption…
szip homepage
Szip is a freeware portable general purpose lossless compression program. It has a high speed and compression, but high memory demands (up to 20MB) too. The compression is done using a variant of blocksorting, which explains its rather high memory requirements.
Update: Michael Schindler has at long last posted the source code for szip.
In-Place Calculation of Minimum-Redundancy Codes
The abstract for a paper on calculation of Huffman codes. The paper isn’t here, but the source code is. Alistair says that if you sort your array of counts, you can create the Canonical Huffman tree in memory.
Embedded BWT Compression
An implementation of BWT designed with the goal of minimizing memory usage. Source code and a documentation page.
Helix DNA Client
Real is making their client software available in an Open Source program. Download this software and you can develop your own MP3 player! Free of royalties if you are distributing it for free - commercial products pay a royalty.
SuperZip
A full-featured release of a Win32 Zip program. Version 3.0 was released in October, 2002. Trial version available.
ArchiveXpert
ArchiveExpert supports the standard Zip format, as well as CAB, LHA, LZH, plus extraction support for ARJ, ARC, PAK, RAR, as well as other formats. The 2.02 release shipped in August of 2002.
Hacking Data Compression
Andy McFadden was nice enough to post a series of a dozen lessons on Data Compression. It looks as though these were posts to the now-forgotten GEnie information service, once a pretender to the CompuServe popularity crown.
Sax.net Streaming Compression
Sax.net Streaming Compression helps you keep your data small and fast. Use high-performance compression and data compression code, using a class library that was designed from the ground up for integration with the Microsoft .NET framework.
In addition to being able to specify whether to prefer speed over size, Sax.net Compression offers you a choice of […]
cwenc - MP3 & Ogg Vorbis Encoder for Cakewalk
The cwenc utility is primarily a tool for musicians working with Cakewalk music production software, such as Home Studio and Sonar. cwenc enables you to export your original music to a compressed audio file format, such as MP3 or Ogg Vorbis, from within the mentioned Cakewalk products. cwenc is a direct replacement for the […]
MP3elf - Network MP3 Player
The MP3elf is a low-cost Ethernet connected MP3 player that receives a MP3 stream from a local area network server and delivers it to amplified speakers or a Hi-Fi system. You can have a single source of MP3 files in your home or office, and browse through playlists from any MP3elf. The Universal IR […]
Java and Windows Pdf Extraction Decoding Access Library
A Java library released under the LGPL license for extracting text and images from pdf files, with full source code and examples. It also provides a rasterizer.
A commercial product called WinPedal is also available. WinPedal is an EXE that has grouping functionality that converts PDF text into XML.
Feature Comparison of Popular Zip Compression Components
There are many Zip compression components in the marketplace. This web page is a summary of the most popular Zip components available and provides a side-by-side comparison of the features and pricing of each. This review focuses on components used by Microsoft developers in Visual Basic, ASP, Visual C++, and Visual FoxPro applications.
Trie-based dictionary program
Falk Hueffner created a dictionary program that was to be used for a Scrabble-type word game. The source can be found at this link under the name dawg.tar.gz.
Efficient Creation of Finite State Automota
This paper describes a method which creates minimal acyclic deterministic FSAs. These can be used to create an efficient representation of a dictionary of words by merging identical subtrees.
MPEG-4 Industry Forum Resources Link
Links to scads of papers, web sites, software, etc., all related to the MPEG-4 standard.
Jonathan Bennet’s C++ implementation of LZSS
A C++ implementation of the LZSS / LZ77 algorithm. Also contains a description of the LZSS algorithm and my implementations of it as I learned more about it (hashing, lazy evaluation, etc.) All the code from my first attempt to the current version is included.
An anonymous visitor to Jonathan’s page said it was […]
http://www.hiddensoft.com/cgi-bin/countdown.pl?code/LZSS.zip
FileStream, Inc
FileStream makes quite a few compression-related programs, including TurboZip and TurboSFX. Release 5.,1 appears to have shipped sometime in the latter part of 2002.
Karl Malbrain - BWT Source
Karl has created a complete BWT package, and has posted the source on this site. He also has an adaptation of N. Jesper Larsson’s
Burrows-Wheeler Suffix Sorting for your perusal.
Burrows-Wheeler Transformation
Helger Lipmaa created a BWT transformer in PHP, and can demo it to you right on the net at this page.
WebWrap
WebWrap is a small 3-in-1 utility program to compress and/or code your HTML web files. It uses either a modified LZSS algorithm or the URL encryption procedure, and can reduce the size of your files by more than 50%. It also renders the source of any HTML pages you have created, your design features and […]
TaskZip
Freeware program written in VB5 that automates system backup, using Zip format.
http://jupiter.drw.net/matpie/PBSystems/products/taskzip/TZip.html
Avi-Info
This free tool can be used to extract information about an AVI file, including resolution, bit-rate, video codec, etc. Full source available for download.
http://isengrim.rotegruetze.net/index.php?main=4&id=1&german=