glibc/sysdeps/i386
Adhemerval Zanella 6f9bacf36b math: Use atan2f from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows slightly better performance than the generic atan2f.

The code was adapted to glibc style and to use the definitions from
math_config.h (to handle errno, overflow, and underflow).

Benchtest on x86_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1,
gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1):

Latency                      master        patched   improvement
x86_64                      68.1175        69.2014        -1.59%
x86_64v2                    66.9884        66.0081         1.46%
x86_64v3                    57.7034        61.6407        -6.82%
i686                       189.8690        152.7560       19.55%
aarch64 (Neoverse)          32.6151        24.5382        24.76%
power10                     21.7282        17.1896        20.89%

reciprocal-throughput        master        patched   improvement
x86_64                      34.5202        31.6155         8.41%
x86_64v2                    32.6379        30.3372         7.05%
x86_64v3                    34.3677        23.6455        31.20%
i686                       157.7290        75.8308        51.92%
aarch64 (Neoverse)          27.7788        16.2671        41.44%
power10                     15.5715         8.1588        47.60%
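
The improvement column in both tables appears to be the relative change from master to patched, (master - patched) / master, expressed as a percentage, so a negative value means the patched build measured slower. A minimal sketch in C, assuming lower numbers are better for both metrics; the helper name `improvement_pct` is hypothetical and is not part of glibc or its benchtests:

```c
/* Hypothetical helper (not part of glibc): relative improvement of
   `patched` over `master`, in percent.  Assumes lower is better, so a
   positive result means the patched build is faster.  */
static double
improvement_pct (double master, double patched)
{
  return 100.0 * (master - patched) / master;
}
```

For example, `improvement_pct (189.8690, 152.7560)` gives roughly 19.55, which matches the i686 latency row, and `improvement_pct (68.1175, 69.2014)` gives roughly -1.59, which matches the x86_64 latency row.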

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
2024-12-18 17:24:43 -03:00
..
fpu math: Use atan2f from CORE-MATH 2024-12-18 17:24:43 -03:00
htl
i586 i586: Fix multiple definitions of __memcpy_chk and __mempcpy_chk 2024-05-02 11:50:21 +01:00
i686 math: Use atan2f from CORE-MATH 2024-12-18 17:24:43 -03:00
i786
nptl
sys
Implies
Makefile
Versions
____longjmp_chk.S
__longjmp.S
abort-instr.h
add_n.S
addmul_1.S
asm-syntax.h
backtrace.c
bsd-_setjmp.S
bsd-setjmp.S
configure Convert to autoconf 2.72 (vanilla release, no distribution patches) 2024-06-17 21:15:28 +02:00
configure.ac
crti.S
crtn.S
dl-fixup-attribute.h
dl-irel.h
dl-machine-rel.h
dl-machine.h
dl-procinfo.c
dl-tls.h
dl-tlsdesc-dynamic.h
dl-tlsdesc.S
dl-tlsdesc.h
dl-trampoline.S
gccframe.h
i386-mcount.S
isa.h
jmpbuf-offsets.h
jmpbuf-unwind.h
link-defines.sym
lshift.S
machine-gmon.h
malloc-alignment.h
math-use-builtins-ffs.h
memchr.S
memcmp.S
memcopy.h
memcpy.S
memcpy_chk.S
memmove.S
memmove_chk.S
mempcpy.S
mempcpy_chk.S
memset.S
memset_chk.S
mp_clz_tab.c
mul_1.S
preconfigure
pthread_spin_trylock.S
rawmemchr.S
rshift.S
setfpucw.c
setjmp.S
stackguard-macros.h
stackinfo.h
start.S
stpcpy.S
stpncpy.S i386: Don't define stpncpy alias when used in IFUNC [BZ #31768] 2024-05-20 19:35:00 -07:00
strcat.S
strchr.S
strchrnul.S
strcspn.S
string-inlines.c
string-opthr.h
strlen.S
strlen.c
strpbrk.S
strrchr.S
strspn.S
sub_n.S
submul_1.S
symbol-hacks.h
sysdep.h
tlsdesc.c
tlsdesc.sym
tst-audit.h
tst-audit3.c
tst-audit3.h
tst-auditmod3a.c
tst-auditmod3b.c
tst-ld-sse-use.sh
unwind-arch.h