Title :
Quantization errors in floating-point arithmetic
Author :
Sripad, A. ; Snyder, Donald L.
Author_Institution :
Intermetrics, Inc., Cambridge, MA
fDate :
10/1/1978 12:00:00 AM
Abstract :
In this paper, the quantization of the mantissa in a normalized floating-point number is investigated. A necessary and sufficient condition is given for the mantissa to have a reciprocal probability density. A model to represent a floating-point quantizer with the mantissa having a reciprocal density is developed. The first- and second-order statistical properties of this model are studied.
Keywords :
Biomedical computing; Digital signal processing; Dynamic range; Floating-point arithmetic; Numerical analysis; Probability; Quantization; Random variables; Statistics; Sufficient conditions;
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on
DOI :
10.1109/TASSP.1978.1163135