Fractal compression is a lossy compression method for digital images, based on fractals. The method is best suited for textures and natural images, relying on the fact that parts of an image often resemble other parts of the same image.[1] Fractal algorithms convert these parts into mathematical data called "fractal codes" which are used to recreate the encoded image.
We begin with the representation of a binary image, where the image may be thought of as a subset of . An IFS is a set of contraction mappingsƒ1,...,ƒN,
According to these mapping functions, the IFS describes a two-dimensional set S as the fixed point of the Hutchinson operator
That is, H is an operator mapping sets to sets, and S is the unique set satisfying H(S) = S. The idea is to construct the IFS such that this set S is the input binary image. The set S can be recovered from the IFS by fixed point iteration: for any nonempty compact initial set A0, the iteration Ak+1 = H(Ak) converges to S.
The set S is self-similar because H(S) = S implies that S is a union of mapped copies of itself:
So we see the IFS is a fractal representation of S.
Extension to grayscale
IFS representation can be extended to a grayscale image by considering the image's graph as a subset of . For a grayscale image u(x,y), consider the set
S = {(x,y,u(x,y))}. Then similar to the binary case, S is described by an IFS using a set of contraction mappings ƒ1,...,ƒN, but in ,
Encoding
A challenging problem of ongoing research in fractal image representation is how to choose the ƒ1,...,ƒN such that its fixed point approximates the input image, and how to do this efficiently.
A simple approach[2] for doing so is the following partitioned iterated function system (PIFS):
Partition the image domain into range blocks Ri of size s×s.
For each Ri, search the image to find a block Di of size 2s×2s that is very similar to Ri.
Select the mapping functions such that H(Di) = Ri for each i.
In the second step, it is important to find a similar block so that the IFS accurately represents the input image, so a sufficient number of candidate blocks for Di need to be considered. On the other hand, a large search considering many blocks is computationally costly.
This bottleneck of searching for similar blocks is why PIFS fractal encoding is much slower than for example DCT and wavelet based image representation.
The initial square partitioning and brute-force search algorithm presented by Jacquin provides a starting point for further research and extensions in many possible directions—different ways of partitioning the image into range blocks of various sizes and shapes; fast techniques for quickly finding a close-enough matching domain block for each range block rather than brute-force searching, such as fast motion estimation algorithms; different ways of encoding the mapping from the domain block to the range block; etc.[3]
Other researchers attempt to find algorithms to automatically encode an arbitrary image as RIFS (recurrent iterated function systems) or global IFS, rather than PIFS; and algorithms for fractal video compression including motion compensation and three dimensional iterated function systems.[4][5]
Fractal image compression has many similarities to vector quantization image compression.[6]
Features
With fractal compression, encoding is extremely computationally expensive because of the search used to find the self-similarities. Decoding, however, is quite fast. While this asymmetry has so far made it impractical for real time applications, when video is archived for distribution from disk storage or file downloads fractal compression becomes more competitive.[7][8]
At common compression ratios, up to about 50:1, fractal compression provides similar results to DCT-based algorithms such as JPEG.[9] At high compression ratios fractal compression may offer superior quality. For satellite imagery, ratios of over 170:1[10] have been achieved with acceptable results. Fractal video compression ratios of 25:1–244:1 have been achieved in reasonable compression times (2.4 to 66 sec/frame).[11]
Compression efficiency increases with higher image complexity and color depth, compared to simple grayscale images.
Resolution independence and fractal scaling
An inherent feature of fractal compression is that images become resolution independent[12] after being converted to fractal code. This is because the iterated function systems in the compressed file scale indefinitely. This indefinite scaling property of a fractal is known as "fractal scaling".
Fractal interpolation
The resolution independence of a fractal-encoded image can be used to increase the display resolution of an image. This process is also known as "fractal interpolation". In fractal interpolation, an image is encoded into fractal codes via fractal compression, and subsequently decompressed at a higher resolution. The result is an up-sampled image in which iterated function systems have been used as the interpolant.[13]
Fractal interpolation maintains geometric detail very well compared to traditional interpolation methods like bilinear interpolation and bicubic interpolation.[14][15][16] Since the interpolation cannot reverse Shannon entropy however, it ends up sharpening the image by adding random instead of meaningful detail. One cannot, for example, enlarge an image of a crowd where each person's face is one or two pixels and hope to identify them.
History
Michael Barnsley led the development of fractal compression from 1985 at the Georgia Institute of Technology (where both Barnsley and Sloan were professors in the mathematics department).[17] The work was sponsored by DARPA and the Georgia Tech Research Corporation. The project resulted in several patents from 1987.[18] Barnsley's graduate student Arnaud Jacquin implemented the first automatic algorithm in software in 1992.[19][20] All methods are based on the fractal transform using iterated function systems. Michael Barnsley and Alan Sloan formed Iterated Systems Inc.[21] in 1987 which was granted over 20 additional patents related to fractal compression.
A major breakthrough for Iterated Systems Inc. was the automatic fractal transform process which eliminated the need for human intervention during compression as was the case in early experimentation with fractal compression technology. In 1992, Iterated Systems Inc. received a US$2.1 million government grant[22] to develop a prototype digital image storage and decompression chip using fractal transform image compression technology.
Fractal image compression has been used in a number of commercial applications: onOne Software, developed under license from Iterated Systems Inc., Genuine Fractals 5[23] which is a Photoshop plugin capable of saving files in compressed FIF (Fractal Image Format). To date the most successful use of still fractal image compression is by Microsoft in its Encarta multimedia encyclopedia,[24] also under license.
Iterated Systems Inc. supplied a shareware encoder (Fractal Imager), a stand-alone decoder, a Netscape plug-in decoder and a development package for use under Windows. The redistribution of the "decompressor DLL" provided by the ColorBox III SDK was governed by restrictive per-disk or year-by-year licensing regimes for proprietary software vendors and by a discretionary scheme that entailed the promotion of the Iterated Systems products for certain classes of other users.[25]
ClearVideo – also known as RealVideo (Fractal) – and SoftVideo were early fractal video compression products. ClearFusion was Iterated's freely distributed streaming video plugin for web browsers. In 1994 SoftVideo was licensed to Spectrum Holobyte for use in its CD-ROM games including Falcon Gold and Star Trek: The Next Generation A Final Unity.[26]
In 1996, Iterated Systems Inc. announced[27] an alliance with the Mitsubishi Corporation to market ClearVideo to their Japanese customers. The original ClearVideo 1.2 decoder driver is still supported[28] by Microsoft in Windows Media Player although the encoder is no longer supported.
Two firms, Total Multimedia Inc. and Dimension, both claim to own or have the exclusive licence to Iterated's video technology, but neither has yet released a working product. The technology basis appears to be Dimension's U.S. patents 8639053 and 8351509, which have been considerably analyzed.[29] In summary, it is a simple quadtree block-copying system with neither the bandwidth efficiency nor PSNR quality of traditional DCT-based codecs. In January 2016, TMMI announced that it was abandoning fractal-based technology altogether.
Research papers between 1997 and 2007 discussed possible solutions to improve fractal algorithms and encoding hardware.[30][31][32][33][34][35][36][37][38]
Implementations
A library called Fiasco was created by Ullrich Hafner. In 2001, Fiasco was covered in the Linux Journal.
[39]
According to the 2000-04 Fiasco manual, Fiasco can be used for video compression.
[40]
The Netpbm library includes the Fiasco library.
[41][42]
Femtosoft developed an implementation of fractal image compression in Object Pascal and Java.
[43]
^Uemura, Satoshi; Haseyama, Miki; Kitajima, Hideo (28 January 2003). "EFIFを用いた自己アフィンフラクタル図形の拡大処理に関する考察" [A Note on Expansion Technique for Self-Affine Fractal Objects Using Extended Fractal Interpolation Functions]. IEICE Technical Report (in Japanese). 102 (630): 95–100. doi:10.11485/itetr.27.9.0_95. NAID110003171506.
^Kuroda, Hideo; Hu, Xiaotong; Fujimura, Makoto (1 February 2003). "フラクタル画像符号化におけるスケーリングファクタに関する考察" [Studies on Scaling Factor for Fractal Image Coding]. The Transactions of the Institute of Electronics, Information and Communication Engineers (in Japanese). 86 (2): 359–363. NAID110003170896.
^Barnsley, Michael; Sloan, Alan (January 1988). "A Better Way to Compress Images". Byte. pp. 215–223.
^Wu, Xianwei; Jackson, David Jeff; Chen, Hui-Chuan (September 2005). "A fast fractal image encoding method based on intelligent search of standard deviation". Computers & Electrical Engineering. 31 (6): 402–421. doi:10.1016/j.compeleceng.2005.02.003.
^Wu, Xianwei; Jackson, David Jeff; Chen, Hui-Chuan (2005). "Novel fractal image-encoding algorithm based on a full-binary-tree searchless iterated function system". Optical Engineering. 44 (10): 107002. Bibcode:2005OptEn..44j7002W. doi:10.1117/1.2076828.
^Truong, Trieu-Kien; Jeng, Jyh H. (2000). "Fast classification method for fractal image compression". In Schmalz, Mark S (ed.). Mathematics and Applications of Data/Image Coding, Compression, and Encryption III. Vol. 4122. pp. 190–193. Bibcode:2000SPIE.4122..190T. doi:10.1117/12.409247. S2CID120032052.
^Erra, Ugo (2005). "Toward Real Time Fractal Image Compression Using Graphics Hardware". Advances in Visual Computing. Lecture Notes in Computer Science. Vol. 3804. pp. 723–728. doi:10.1007/11595755_92. hdl:11563/14075. ISBN978-3-540-30750-1.