Fixed-Rate Compressed Floating-Point Arrays

论文链接: Fixed-Rate Compressed Floating-Point Arrays

Abstract

Current compression schemes for ﬂoating-point data commonly take ﬁxed-precision values and compress them to a variable-length bit stream, complicating memory management and random access. We present a ﬁxed-rate, near-lossless compression scheme that maps small blocks of 4d values in d dimensions to a ﬁxed, user-speciﬁed number of bits per block, thereby allowing read and write random access to compressed ﬂoating-point data at block granularity. Our approach is inspired by ﬁxed-rate texture compression methods widely adopted in graphics hardware, but has been tailored to the high dynamic range and precision demands of scientiﬁc applications. Our compressor is based on a new, lifted, orthogonal block transform and embedded coding, allowing each per-block bit stream to be truncated at any point if desired, thus facilitating bit rate selection using a single compression scheme. To avoid compression or decompression upon every data access, we employ a software write-back cache of uncompressed blocks. Our compressor has been designed with computational simplicity and speed in mind to allow for the possibility of a hardware implementation, and uses only a small number of ﬁxed-point arithmetic operations per compressed value. We demonstrate the viability and beneﬁts of lossy compression in several applications, including visualization, quantitative data analysis, and numerical simulation.

Notes

Main

Contribution

first solution that serves as a generalpurpose compressed representation of floating-point arrays
可随机访问（random access）

Details

主要解决问题场景
- visualization: with insufficient I/O bandwidth to store the simulation results
- compression speed sacrificed in favor of as-fast-as-possible decompression.
- 不支持已压缩数据的 random access
main ingredients: orthogonal block transform, embedded coding

主要流程

（指数对齐）align the values in a block to a common exponent
（浮点——>定点）convert the floating-point values to a fixed-point representation
（正交块变换，减少相关性）apply an orthogonal block transform to decorrelate the values
（相关系数排序）order the transform coefficients by expected magnitude
（“位平面编码”）encode the resulting coefficients one “bit plane” at a time

Conversion to Fixed-Point
- 核心思想为 block 整个 array 后分别对每个 block 正交块变换
- converting the double-precision values in $4 \times 4 \times 4$ block
- Q3.60 fixed-point two’s complement format

正交变换
- 对固定点值应用一个正交变换，目的是将空间上相关的值尽可能地去相关化，从而产生许多接近零的系数，这些系数可以被高效压缩。
- 系数排序：变换后的系数按预期大小排序，这有助于后续的编码过程，因为大部分信息都集中在少数几个系数中
嵌入式编码
- 本文采用“位平面”编码。这种编码方式允许每个块的比特流在任何点被截断，从而实现不同的压缩比率，将数据的每个位平面独立编码，从而允许在不同的精度级别上访问数据
- 优势
  - 灵活性：嵌入式编码允许在不同的比特率和质量级别下访问数据，提供了极大的灵活性。
  - 效率：通过嵌入式编码，可以在保持较高压缩效率的同时，实现数据的快速解码和访问。
  - 适应性

对应的 zfp 库 (C 实现)

zfp uses two different algorithms to support lossy and lossless compression. These algorithms are described in detail below.
Lossy Compression
The zfp lossy compression scheme is based on the idea of breaking a d-dimensional array into independent blocks of 4d values each, e.g., 4 × 4 × 4 values in three dimensions. Each block is compressed/decompressed entirely independently from all other blocks. In this sense, zfp is similar to current hardware texture compression schemes for image coding implemented on graphics cards and mobile devices.
The lossy compression scheme implemented in this version of zfp has evolved from the method described in the original paper, and can conceptually be thought of as consisting of eight sequential steps (in practice some steps are consolidated or exist only for illustrative purposes):

The d-dimensional array is partitioned into blocks of dimensions 4d. If the array dimensions are not multiples of four, then blocks near the boundary are padded to the next multiple of four. This padding is invisible to the application.
The independent floating-point values in a block are converted to what is known as a block-floating-point representation, which uses a single, common floating-point exponent for all 4d values. The effect of this conversion is to turn each floating-point value into a 31- or 63-bit signed integer. If the values in the block are all zero or are smaller in magnitude than the fixed-accuracy tolerance (see below), then only a single bit is stored with the block to indicate that it is “empty” and expands to all zeros. Note that the block-floating-point conversion and empty-block encoding are not performed if the input data is represented as integers rather than floating-point numbers.
The integers are decorrelated using a custom, high-speed, near orthogonal transform similar to the discrete cosine transform used in JPEG image coding. The transform exploits separability and is implemented efficiently in-place using the lifting scheme, requiring only 2.5 d integer additions and 1.5 d bit shifts by one per integer in d dimensions. If the data is “smooth,” then this transform will turn most integers into small signed values clustered around zero.
The signed integer coefficients are reordered in a manner similar to JPEG zig-zag ordering so that statistically they appear in a roughly monotonically decreasing order. Coefficients corresponding to low frequencies tend to have larger magnitude and are listed first. In 3D, coefficients corresponding to frequencies i, j, k in the three dimensions are ordered by i + j + k first and then by $i_{2} + j_{2} + k_{2}$ .
The two’s complement signed integers are converted to their negabinary (base negative two) representation using one addition and one bit-wise exclusive or per integer. Because negabinary has no single dedicated sign bit, these integers are subsequently treated as unsigned. Unlike sign-magnitude representations, the leftmost one-bit in negabinary simultaneously encodes the sign and approximate magnitude of a number. Moreover, unlike two’s complement, numbers small in magnitude have many leading zeros in negabinary regardless of sign, which facilitates encoding.
The bits that represent the list of 4d integers are transposed so that instead of being ordered by coefficient they are ordered by bit plane, from most to least significant bit. Viewing each bit plane as an unsigned integer, with the lowest bit corresponding to the lowest frequency coefficient, the anticipation is that the first several of these transposed integers are small, because the coefficients are assumed to be ordered by magnitude.
The transform coefficients are compressed losslessly using embedded coding by exploiting the property that the coefficients tend to have many leading zeros that need not be encoded explicitly. Each bit plane is encoded in two parts, from lowest to highest bit. First, the n lowest bits are emitted verbatim, where n is the smallest number such that the 4d − n highest bits in all previous bit planes are all zero. Initially, n = 0. Then, a variable-length representation of the remaining 4d − n bits, x, is encoded. For such an integer x, a single bit is emitted to indicate if x = 0, in which case we are done with the current bit plane. If not, then bits of x are emitted, starting from the lowest bit, until a one-bit is emitted. This triggers another test whether this is the highest set bit of x, and the result of this test is output as a single bit. If not, then the procedure repeats until all m of x’s value bits have been output, where 2m-1 ≤ x < 2m. This can be thought of as a run-length encoding of the zeros of x, where the run lengths are expressed in unary. The total number of value bits, n, in this bit plane is then incremented by m before being passed to the next bit plane, which is encoded by first emitting its n lowest bits. The assumption is that these bits correspond to n coefficients whose most significant bits have already been output, i.e., these n bits are essentially random and not compressible. Following this, the remaining 4d − n bits of the bit plane are run-length encoded as described above, which potentially results in n being increased.
As an example, x = 000001001101000 with m = 10 is encoded as 010011110110001, where the bits in boldface indicate “group tests” that determine if the remainder of x (to the left) contains any one-bits. Again, this variable-length code is generated and parsed from right to left.
The embedded coder emits one bit at a time, with each successive bit potentially improving the accuracy of the approximation. The early bits are most important and have the greatest impact on accuracy, with the last few bits providing very small changes. The resulting compressed bit stream can be truncated at any point and still allow for a valid approximate reconstruction of the original block of values. The final step truncates the bit stream in one of three ways: to a fixed number of bits (the fixed-rate mode); after some fixed number of bit planes have been encoded (the fixed-precision mode); or until a lowest bit plane number has been encoded, as expressed in relation to the common floating-point exponent within the block (the fixed-accuracy mode).
Various parameters are exposed for controlling the quality and compressed size of a block, and can be specified by the user at a very fine granularity. These parameters are discussed here.
Lossless Compression
The reversible (lossless) compression algorithm shares most steps with the lossy algorithm. The main differences are steps 2, 3, and 8, which are the only sources of error. Since step 2 may introduce loss in the conversion to zfp’s block-floating-point representation, the reversible algorithm adds a test to see if this conversion is lossless. It does so by converting the values back to the source format and testing the result for bitwise equality with the uncompressed data. If this test passes, then a modified decorrelating transform is performed in step 3 that uses reversible integer subtraction operations only. Finally, step 8 is modified so that no one-bits are truncated in the variable-length bit stream. However, all least significant bit planes with all-zero bits are truncated, and the number of encoded bit planes is recorded in step 7. As with lossy compression, a floating-point block consisting of all (“positive”) zeros is represented as a single bit, making it possible to efficiently encode sparse data.
If the block-floating-point transform is not lossless, then the reversible compression algorithm falls back on a simpler scheme that reinterprets floating-point values as integers via type punning. This lossless conversion from floating-point to integer data replaces step 2, and the algorithm proceeds from there with the modified step 3. Moreover, this conversion ensures that special values like infinities, NaNs, and negative zero are preserved.
The lossless algorithm handles integer data also, for which step 2 is omitted.