glibc/sysdeps/loongarch
Adhemerval Zanella 5fa89852fa math: Use exp10m1f from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows better performance compared to the generic exp10m1f.

The code was adapted to glibc style and to use the definition of
math_config.h (to handle errno, overflow, and underflow).  I mostly
fixed some small issues in corner cases (sNaN handling, -INFINITY,
a specific overflow check).

Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1,
gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1):

Latency                      master        patched   improvement
x86_64                      45.4690        49.5845        -9.05%
x86_64v2                    46.1604        36.2665        21.43%
x86_64v3                    37.8442        31.0359        17.99%
i686                        121.367        93.0079        23.37%
aarch64                     21.1126        15.0165        28.87%
power10                     12.7426        8.4929         33.35%

reciprocal-throughput        master        patched   improvement
x86_64                      19.6005        17.4005        11.22%
x86_64v2                    19.6008        11.1977        42.87%
x86_64v3                    17.5427        10.2898        41.34%
i686                        59.4215        60.9675        -2.60%
aarch64                     13.9814        7.9173         43.37%
power10                      6.7814        6.4258          5.24%

The generic implementation calls __ieee754_exp10f which has an
optimized version, although it is not correctly rounded, which is
the main culprit of the the latency difference for x86_64 and
throughp for i686.

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
2024-11-01 11:27:26 -03:00
..
bits Refer to C23 in place of C2X in glibc 2024-02-01 11:02:01 +00:00
fpu LoongArch: Correct {__ieee754, _}_scalb -> {__ieee754, _}_scalbf 2024-03-12 14:07:27 +08:00
lp64 math: Use exp10m1f from CORE-MATH 2024-11-01 11:27:26 -03:00
nofpu Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
nptl nptl: Add <thread_pointer.h> for LoongArch 2024-11-01 10:41:20 +08:00
sys LoongArch: Ensure sp 16-byte aligned for tlsdesc 2024-06-14 10:14:54 +08:00
Implies LoongArch: Build Infrastructure 2022-07-26 12:35:12 -03:00
Makefile LoongArch: Fix tst-gnu2-tls2 compiler error 2024-05-21 11:23:03 +08:00
__longjmp.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
abort-instr.h LoongArch: Linux Syscall Interface 2022-07-26 12:35:12 -03:00
bsd-_setjmp.c
bsd-setjmp.c
configure Convert to autoconf 2.72 (vanilla release, no distribution patches) 2024-06-17 21:15:28 +02:00
configure.ac LoongArch: Fix tst-gnu2-tls2 compiler error 2024-05-21 11:23:03 +08:00
cpu-tunables.c LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00
dl-audit-check.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-get-cpu-features.c LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00
dl-irel.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-link.sym LoongArch: Add lasx/lsx support for _dl_runtime_profile. 2023-09-15 09:07:42 +08:00
dl-machine.h LoongArch: Add cfi instructions for _dl_tlsdesc_dynamic 2024-08-09 09:06:17 +08:00
dl-tls.h LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
dl-tlsdesc-dynamic.h LoongArch: Fix macro redefined warning in tls-desc.S 2024-09-06 15:46:13 +08:00
dl-tlsdesc.S LoongArch: Fix macro redefined warning in tls-desc.S 2024-09-06 15:46:13 +08:00
dl-tlsdesc.h LoongArch: Add cfi instructions for _dl_tlsdesc_dynamic 2024-08-09 09:06:17 +08:00
dl-trampoline.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-trampoline.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
dl-tunables.list LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00
e_sqrtl.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
fpu_control.h LoongArch: Use "$fcsr0" instead of "$r0" in _FPU_{GET,SET}CW 2024-05-28 09:17:05 +08:00
hp-timing.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
jmpbuf-offsets.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
jmpbuf-unwind.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
ldsodefs.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
libc-tls.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
linkmap.h LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
machine-gmon.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
math-use-builtins-ffs.h LoongArch: Use builtins for ffs and ffsll 2024-02-05 15:19:41 -03:00
math_private.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
preconfigure LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
preconfigure.ac LoongArch: Add soft float support. 2022-09-01 09:10:08 +08:00
setjmp.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
sfp-machine.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
sotruss-lib.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
stackinfo.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
start.S Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
tininess.h LoongArch: Generic <math.h> and soft-fp Routines 2022-07-26 12:35:12 -03:00
tlsdesc.c LoongArch: Add support for TLS Descriptors 2024-05-15 10:31:53 +08:00
tlsdesc.sym LoongArch: Add cfi instructions for _dl_tlsdesc_dynamic 2024-08-09 09:06:17 +08:00
tst-audit.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
tst-gnu2-tls2.h LoongArch: Fix tst-gnu2-tls2 test case 2024-06-26 12:02:07 +08:00
tst-hwcap-tunables.c LoongArch: Add glibc.cpu.hwcap support. 2024-04-24 18:22:38 +08:00