Consider a (binary) floating point system of the form where . Calculate the relative error, with respect to the -norm, if we convert the vector to the given floating point system. When converting to floating point, first convert to a binary number, then truncate any additional bits. Note: we are interested in the representation accuracy of floating point, so assume that all operations performed (addition, subtraction, etc.) do not increase the error.

Answer:
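
The procedure in the question (convert each component to binary, chop the extra bits, then compare norms) can be sketched in Python. This is only a sketch under stated assumptions, not a worked answer: the number of mantissa bits, the vector, and the norm order are not specified in the text above, so `mantissa_bits`, `v`, and `p` are hypothetical placeholders. It also assumes a normalized mantissa of the form 0.1bb...b (the convention of `math.frexp`) and ignores any limit on the exponent range.

```python
import math

def truncate_to_float(x, mantissa_bits):
    """Chop x to `mantissa_bits` significant binary digits (truncate, no rounding).

    Assumption: mantissa normalized as 0.1bb...b; exponent range is unbounded.
    """
    if x == 0.0:
        return 0.0
    m, e = math.frexp(x)                         # x = m * 2**e with 0.5 <= |m| < 1
    kept = math.trunc(m * 2 ** mantissa_bits)    # keep leading bits, drop the rest
    return math.ldexp(kept, e - mantissa_bits)   # rebuild kept * 2**(e - mantissa_bits)

def p_norm(v, p):
    """p-norm of a sequence; p = math.inf gives the max norm."""
    if p == math.inf:
        return max(abs(c) for c in v)
    return sum(abs(c) ** p for c in v) ** (1.0 / p)

def relative_error(v, mantissa_bits, p):
    """||v - fl(v)||_p / ||v||_p, where fl truncates each component."""
    v_fl = [truncate_to_float(c, mantissa_bits) for c in v]
    diff = [a - b for a, b in zip(v, v_fl)]
    return p_norm(diff, p) / p_norm(v, p)

# Hypothetical inputs, chosen only for illustration.
v = [5.0, 0.1]
print(truncate_to_float(0.1, 4))       # 0.1 chopped to 4 significant bits
print(relative_error(v, 4, math.inf))  # relative error in the max norm
```

With the actual mantissa length, vector, and norm from the problem substituted for the placeholders, `relative_error` gives the requested quantity, since the error comes only from the truncation step.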