External Issue: Floating Point Constants

chapelu · October 20, 2021, 7:34pm

18599, "damianmoz", "Floating Point Constants", "2021-10-20T19:32:43Z"

Floating Point Constants

opened 07:32PM - 20 Oct 21 UTC

Various generic programming issues (which are largely unobtrusive currently) will arise as Chapel has to adapt to handling the real(16) or real(128) types found in some of the latest CPUs. Indeed, one only has to look at what you have to do in C/C++ when trying to program generically to handle floating point constants for float, double and long double.
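To make the concern concrete, here is a minimal sketch of a width-generic routine in Chapel as it stands today. The procedure name `half` is hypothetical and used only for illustration. Because a floating point literal has the default 64-bit real type, the constant has to be cast to the argument's width by hand, otherwise the result is promoted to real(64).

```chapel
// Hypothetical helper, generic over the width w of its argument.
proc half(x: real(?w)) {
  // Without the cast, 0.5 is a real(64) and the product is widened
  // to real(64) even when x is a real(32).
  return x * 0.5:real(w);
}

const a = 3.0:real(32);
const b = 3.0;                      // default real, i.e. real(64)

writeln(half(a).type:string);       // real(32)
writeln(half(b).type:string);       // real(64)
```

Every literal inside such a routine needs the same treatment, which is the per-constant burden that C/C++ programmers carry with the f and L suffixes.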

The only way around this is to assume that Chapel effectively defines a floating point constant as having infinite precision until it is used, or at least the maximum precision handled by the hardware, or some compiler-flag-restricted maximum precision. The latter still allows us to mandate that a floating point constant is 64 bits wide so existing programs are not broken. The programmer can still specify the size occupied by that constant from the start of execution by a cast, as in

const SQRT_2_0 = 1.41421356237309504880168872420969807856967187537694807317667973799:real(w);
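As a rough sketch of how the cast above behaves in current Chapel, assume that `w` is supplied as a `config param` (that declaration is an assumption, it is not part of the original line). The literal is read as a 64-bit real first and only then converted to real(w), so digits beyond what a real(64) can hold cannot influence the final value.

```chapel
// Assumed declaration of the width; a config param can be overridden
// when compiling, for example with the compiler's -s option.
config param w = 32;

// The literal is a real(64) before the cast narrows it to real(w).
const SQRT_2_0 = 1.41421356237309504880168872420969807856967187537694807317667973799:real(w);

writeln(SQRT_2_0.type:string);   // real(32) with the default w
writeln(SQRT_2_0);
```

Under the proposal in this issue, the literal would instead keep its full precision until the cast, so the conversion to real(w) would involve a single rounding step.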

The question then arises of how you specify the precision of a symbolic constant. These are things like INFINITY, e and pi (the last two of which drive me crazy because they conflict with names my programs use), and others.

This will also mean that the compiler needs a flag which allows a programmer to specify the rounding mode applied at compile time to an expression like the above or the one below:

param t = 1 / 3.0:real(32);

Similarly, is the expression on the right-hand side of this,

const t = 1 / 3.0:real(32);

evaluated at compile time and determined by that previously mentioned flag, or by the rounding flag in effect at run-time, or is it determined by a separate compiler flag?

Let us see where this discussion goes. I am sure this discussion has implications for the optimiser.

This discussion may lead to the need to define a parameterised type with, say,

type zuse(w) = uint(w);

in Chapel. But it may not.
