glibc/sysdeps/or1k
Adhemerval Zanella d846f4c12d math: Use lgammaf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows better performance to the generic lgammaf.

The code was adapted to glibc style, to use the definition of
math_config.h, to remove errno handling, to use math_narrow_eval
on overflow usage, and to adapt to make it reentrant.

Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1,
gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1):

latency                       master       patched  improvement
x86_64                       86.5609       70.3278       18.75%
x86_64v2                     78.3030       69.9709       10.64%
x86_64v3                     74.7470       59.8457       19.94%
i686                         387.355       229.761       40.68%
aarch64                      40.8341       33.7563       17.33%
power10                      26.5520       16.1672       39.11%
powerpc                      28.3145       17.0625       39.74%

reciprocal-throughput         master       patched  improvement
x86_64                       68.0461       48.3098       29.00%
x86_64v2                     55.3256       47.2476       14.60%
x86_64v3                     52.3015       38.9028       25.62%
i686                         340.848       195.707       42.58%
aarch64                      36.8000       30.5234       17.06%
power10                      20.4043       12.6268       38.12%
powerpc                      22.6588       13.8866       38.71%

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
2024-11-22 10:52:27 -03:00
..
bits Fix femode_t conditionals for arc and or1k 2024-11-19 22:25:39 +00:00
fpu math: Use lgammaf from CORE-MATH 2024-11-22 10:52:27 -03:00
nofpu math: Use lgammaf from CORE-MATH 2024-11-22 10:52:27 -03:00
nptl
Implies
Makefile
__longjmp.S
atomic-machine.h
bsd-_setjmp.S
bsd-setjmp.S
configure
configure.ac
dl-machine.h
dl-start.S
dl-tls.h
dl-trampoline.S
fpu_control.h or1k: Add hard float support 2024-05-03 18:28:18 +01:00
jmpbuf-offsets.h
jmpbuf-unwind.h
ldsodefs.h
libc-tls.c
machine-gmon.h
math-tests-snan-payload.h or1k: Add hard float support 2024-05-03 18:28:18 +01:00
math-tests-trap.h or1k: Add hard float support 2024-05-03 18:28:18 +01:00
memusage.h
preconfigure
setjmp.S
sfp-machine.h or1k: Add hard float support 2024-05-03 18:28:18 +01:00
sotruss-lib.c
stackinfo.h
start.S
sysdep.h
tininess.h
tst-audit.h
utmp-size.h login: Check default sizes of structs utmp, utmpx, lastlog 2024-04-19 14:38:17 +02:00