glibc

Commit Graph

Author	SHA1	Message	Date
Andreas K. Hüttel	59b9c2b0ef	math: update sparc ulps Linux catbus 6.1.112 #1 SMP Sun Oct 13 10:52:08 PDT 2024 sparc64 sun4v UltraSparc T5 (Niagara5) GNU/Linux Signed-off-by: Andreas K. Hüttel <dilfridge@gentoo.org>	2025-01-03 15:40:06 +01:00
Andreas K. Hüttel	e71b548fac	math: update s390 ulps Linux lgentoo4 6.8.9-gentoo #1 SMP Tue May 7 09:52:48 EDT 2024 s390x 8561 IBM GNU/Linux Signed-off-by: Andreas K. Hüttel <dilfridge@gentoo.org>	2025-01-03 15:36:56 +01:00
H.J. Lu	ed97ef7a4b	not-cancel.h: Support testing fortify build with Clang When Clang is used to test fortify glibc build configured with --enable-fortify-source=N clang issues errors like In file included from tst-rfc3484.c:60: In file included from ./getaddrinfo.c:81: ../sysdeps/unix/sysv/linux/not-cancel.h:36:10: error: reference to overloaded function could not be resolved; did you mean to call it? 36 \| __typeof (open64) __open64_nocancel; \| ^~~~~~~~ ../include/bits/../../io/bits/fcntl2.h:127:1: note: possible target for call 127 \| open64 (__fortify_clang_overload_arg (const char *, ,__path), int __oflag, \| ^ ../include/bits/../../io/bits/fcntl2.h:118:1: note: possible target for call 118 \| open64 (__fortify_clang_overload_arg (const char *, ,__path), int __oflag) \| ^ ../include/bits/../../io/bits/fcntl2.h:114:1: note: possible target for call 114 \| open64 (const char *__path, int __oflag, mode_t __mode, ...) \| ^ ../io/fcntl.h:219:12: note: possible target for call 219 \| extern int open64 (const char *__file, int __oflag, ...) __nonnull ((1)); \| ^ because clang fortify support for functions with variable arguments relies on function overload. Update not-cancel.h to avoid __typeof on functions with variable arguments. Co-Authored-By: Adhemerval Zanella <adhemerval.zanella@linaro.org> Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2025-01-03 04:56:48 +08:00
Aurelien Jarno	d4b16e22e7	RISC-V: Regenerate ULPs Generated on a VisionFive 2 board running Linux version 6.12.6 and GCC 14.2.0. Needed due to: - commit `bbd578b38d` ("math: Use expm1f from CORE-MATH") - commit `8ae9e51376` ("math: Use log1pf from CORE-MATH") - commit `0ae0af68d8` ("Implement C23 cospi") - commit `776938e8b8` ("Implement C23 sinpi") - commit `f9e90e4b4c` ("Implement C23 tanpi") - commit `28d102d15c` ("Implement C23 acospi") - commit `f962932206` ("Implement C23 asinpi") - commit `ffe79c446c` ("Implement C23 atanpi") - commit `3374de9038` ("Implement C23 atan2pi") - commit `a357d6273f` ("math: Use atanf from CORE-MATH") - commit `6f9bacf36b` ("math: Use atan2f from CORE-MATH") - commit `e5ca265a9c` ("new inputs with large errors for [a]cospi, [a]sinpi, [a]tanpi, atan2pi") Signed-off-by: Aurelien Jarno <aurelien@aurel32.net>	2025-01-02 20:46:24 +01:00
Sam James	e9be7701e6	mlock, mlock2, munlock: Use __attr_access_none macro This fixes build failures using GCC 7.5.0 against glibc headers, see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118194#c5. Followup to `013106ae67`. Reported-by: vvinayag@arm.com	2025-01-02 17:58:06 +00:00
Wilco Dijkstra	0ab62fa4f6	AArch64: Update libm-test-ulps Update ulps for (a)cospi, (a)sinpi, (a)tanpi, atan2pi.	2025-01-02 17:53:07 +00:00
Paul Zimmermann	e5ca265a9c	new inputs with large errors for [a]cospi, [a]sinpi, [a]tanpi, atan2pi These inputs were generated with the programs from https://gitlab.inria.fr/zimmerma/math_accuracy, with rounding to nearest: * for univariate binary32 functions by exhaustive search * for other functions with the "threshold" parameter up to 10^6	2025-01-02 18:26:36 +01:00
Florian Weimer	cc74583f23	elf: Remove the remaining uses of GET_ADDR_OFFSET Expand the macro where it is used in static definitions of __tls_get_addr. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-01-02 13:45:27 +01:00
Florian Weimer	91ee75abcf	s390: Define TLS_DTV_OFFSET instead of GET_ADDR_OFFSET This will be used in __tls_get_addr to adjust the returned pointer value. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-01-02 13:45:27 +01:00
Florian Weimer	ceae7e2770	elf: Introduce generic <dl-tls.h> On arc, the definition of TLS_DTV_UNALLOCATED now comes from <dl-dtv.h>. For x86-64 x32, a separate version is needed because unsigned long int is 32 bits on this target. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-01-02 13:45:27 +01:00
Florian Weimer	64d07e117d	powerpc: Update acosf ulps As seen on powerpc64le-linux-gnu with GCC 11 defaulting to POWER9 instructions.	2025-01-02 11:57:39 +01:00
gfleury	396048fa5a	htl: move pthread_mutexattr_setprioceiling, pthread_mutexattr_getprioceiling into libc. Message-ID: <20241231134909.1166440-9-gfleury@disroot.org>	2025-01-02 01:20:21 +01:00
gfleury	4371b11c86	htl: move pthread_mutexattr_{setrobust, setrobust_np}, pthread_mutexattr_{getrobust, getrobust_np} into libc. Message-ID: <20241231134909.1166440-8-gfleury@disroot.org>	2025-01-02 01:20:20 +01:00
gfleury	1e5b39a5e0	htl: move pthread_mutexattr_setpshared, pthread_mutexattr_getpshared into libc. Message-ID: <20241231134909.1166440-7-gfleury@disroot.org>	2025-01-02 01:19:29 +01:00
gfleury	b386295727	htl: move pthread_mutexattr_settype, pthread_mutexattr_gettype into libc. Message-ID: <20241231134909.1166440-6-gfleury@disroot.org>	2025-01-02 00:51:35 +01:00
Samuel Thibault	3cd1cf5fe0	htl: move pthread_mutexattr_setprotocol into libc. Message-ID: <20241231134909.1166440-5-gfleury@disroot.org>	2025-01-02 00:51:17 +01:00
gfleury	15686aa188	htl: move pthread_mutexattr_getprotocol into libc. Message-ID: <20241231134909.1166440-4-gfleury@disroot.org>	2025-01-02 00:51:05 +01:00
gfleury	beabc5dff5	htl: move pthread_mutexattr_destroy into libc. Message-ID: <20241231134909.1166440-3-gfleury@disroot.org>	2025-01-01 23:46:19 +01:00
gfleury	826b1bbcca	htl: move pthread_mutexattr_init into libc. Message-ID: <20241231134909.1166440-2-gfleury@disroot.org>	2025-01-01 23:44:32 +01:00
Samuel Thibault	cf13f740a9	bits/socket.h: Update to recent BSD definition The old BSD 4.4 definition (not used by Linux) was not 64b-proof: the cmsg_data field is supposed to be CMSG_ALIGN'ed (as can also be seen in the CMSG_LEN macro). Suggested-by: Diego Nieto Cid <dnietoc@gmail.com>	2025-01-01 22:11:13 +01:00
Paul Eggert	ad16577ae1	Update copyright in generated files by running "make"	2025-01-01 11:22:09 -08:00
Paul Eggert	2642002380	Update copyright dates with scripts/update-copyrights	2025-01-01 11:22:09 -08:00
Xi Ruoyao	013106ae67	mlock, mlock2, munlock: Tell the compiler we don't dereference the pointer Since https://gcc.gnu.org/r11-959, the compiler emits -Wmaybe-uninitialized if a const pointer to an uninitialized buffer is passed. Tell the compiler we don't dereference the pointer to remove the false alarm. Link: https://gcc.gnu.org/PR118194 Signed-off-by: Xi Ruoyao <xry111@xry111.site> Reviewed-by: Sam James <sam@gentoo.org>	2025-01-01 16:08:36 +01:00
Adhemerval Zanella	0ca8785a28	elf: Do not change stack permission on dlopen/dlmopen If some shared library loaded with dlopen/dlmopen requires an executable stack, either implicitly because of a missing GNU_STACK ELF header (where the ABI default flags imply the executable bit) or explicitly because of the executable bit from GNU_STACK, the loader will try to set both the main thread and all thread stacks (from the pthread cache) as executable. Besides the issue where any __nptl_change_stack_perm failure does not undo the previous executable transition (meaning that if the library fails to load, there can be thread stacks left executable), this behavior was used in a CVE [1] as a vector for RCE. With this patch, if a shared library requires an executable stack and the current stack is not executable, dlopen fails. The change is done only for dynamically loaded modules; if the program or any dependency requires an executable stack, the loader will still change the main thread before program execution and any thread created with default stack configuration. [1] https://www.qualys.com/2023/07/19/cve-2023-38408/rce-openssh-forwarded-ssh-agent.txt Checked on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-12-31 09:04:20 -03:00
Florian Weimer	0ee6e13f7f	x86-64: Reorder dynamic linker list in ldd script (bug 32508) Move the x86-64 loader first, before the i386 and x32 loaders. In most cases, it's the loader the script needs. This avoids an error message if the i386 loader does not work. The effect of this change to the generated ldd script looks like this: -RTLDLIST="/lib/ld-linux.so.2 /lib64/ld-linux-x86-64.so.2 /libx32/ld-linux-x32.so.2" +RTLDLIST="/lib64/ld-linux-x86-64.so.2 /lib/ld-linux.so.2 /libx32/ld-linux-x32.so.2" Reviewed-by: Sam James <sam@gentoo.org>	2024-12-30 13:24:36 +01:00
Michael Jeanson	0852c4aab7	nptl: hppa: replace __get_cr27 with __thread_pointer The addition of the new thread_pointer.h header on HPPA resulted in duplicated inline asm to get the current thread pointer from the cr27 register. Include thread_pointer.h in tls.h and replace __get/set_cr27() with __set_/thread_pointer() with the appropriate casts. Signed-off-by: Michael Jeanson <mjeanson@efficios.com>	2024-12-27 17:41:02 +01:00
Michael Jeanson	6fdb6abeb2	nptl: Add <thread_pointer.h> for hppa This will be required by the rseq extensible ABI implementation on all Linux architectures exposing the '__rseq_size' and '__rseq_offset' symbols to set the initial value of the 'cpu_id' field which can be used by applications to test if rseq is available and registered. As long as the symbols are exposed it is valid for an application to perform this test even if rseq is not yet implemented in libc for this architecture. Compile tested with build-many-glibcs.py but I don't have access to any hardware to run the tests. Signed-off-by: Michael Jeanson <mjeanson@efficios.com>	2024-12-27 17:41:02 +01:00
Florian Weimer	5e249192ca	elf: Remove the GET_ADDR_ARGS and related macros from the TLS code This was used to manage an IA-64 ABI divergence and is no longer needed after the IA-64 removal. (It should be possible to encode all the required information in one machine word, so the pointer indirection is really unnecessary. Technically, none of this is part of the ABI, so perhaps it's possible to do this retroactively. See bug 27404.) Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-12-27 07:29:56 +01:00
Samuel Thibault	7fa9e786b6	hurd: Avoid asm statements which return They are not supposed to change flow control. This fixes miscompilation with gcc 14.2.0 which then drops code, see https://lists.gnu.org/archive/html/bug-hurd/2024-11/msg00145.html	2024-12-27 01:10:58 +01:00
gfleury	f646be6ff6	htl: move pthread_cond_timedwait, pthread_cond_clockwait, pthread_cond_wait into libc. Message-ID: <20241219203727.669825-9-gfleury@disroot.org>	2024-12-22 23:37:30 +01:00
gfleury	ba8522542f	htl: move __pthread_mutex_checklocked into libc. move out __getpid from pt-mutex.h and in pt-mutex-* include <unistd.h> where __getpid was called Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-8-gfleury@disroot.org>	2024-12-22 23:34:28 +01:00
gfleury	a369d567d2	htl: move __pthread_timedblock, __pthread_timedblock_intr, __pthread_block, __pthread_block_intr into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-7-gfleury@disroot.org>	2024-12-22 23:34:28 +01:00
gfleury	f57a277c16	htl: move pthread_cond_signal into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-6-gfleury@disroot.org>	2024-12-22 23:34:28 +01:00
gfleury	3089d23517	htl: move pthread_cond_broadcast into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-5-gfleury@disroot.org>	2024-12-22 23:34:27 +01:00
gfleury	917a131ab9	htl: move pthread_cond_destroy into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-4-gfleury@disroot.org>	2024-12-22 23:34:27 +01:00
gfleury	4ab765c6ba	htl: move __pthread_wakeup into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-3-gfleury@disroot.org>	2024-12-22 23:34:27 +01:00
gfleury	8735ea79ab	htl: move pthread_cond_init into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241219203727.669825-2-gfleury@disroot.org>	2024-12-22 23:34:27 +01:00
Adhemerval Zanella	a2b0ff98a0	include/sys/cdefs.h: Add __attribute_optimization_barrier__ Add __attribute_optimization_barrier__ to disable inlining and cloning on a function. For Clang, expand it to __attribute__ ((optnone)). Otherwise, expand it to __attribute__ ((noinline, noclone)). Co-Authored-By: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-23 06:28:55 +08:00
John David Anglin	325db5ab7f	hppa: Simplify handling of sanity check errors in clone.S. This simplifies the handling of sanity check errors in clone.S. Adjusted a couple of comments to reflect current code. Signed-off-by: John David Anglin <dave.anglin@bell.net>	2024-12-22 09:58:02 -05:00
John David Anglin	9bdb1487c5	hppa: add cacheflush() syscall wrapper The hppa Linux kernel supports the cacheflush() syscall since version 6.5. This adds the glibc syscall wrapper. Signed-off-by: Helge Deller <deller@gmx.de> --- v2: This patch was too late in release cycle for GLIBC_2.40, so update now to GLIBC_2.41 instead.	2024-12-22 09:51:54 -05:00
John David Anglin	4b37fb71e0	hppa: Update libm-test-ulps Signed-off-by: John David Anglin <dave.anglin@bell.net>	2024-12-22 09:45:34 -05:00
Samuel Thibault	faa0c883f6	hurd: make mprotect translate KERN_PROTECTION_FAILURE to EACCES Suggested-by: Sergey Bugaev <bugaevc@gmail.com>	2024-12-22 11:40:24 +01:00
Fangrui Song	d773aff467	x86: Define __HAVE_FLOAT128 for Clang and use __builtin_*f128 code path Clang supports __builtin_fabsf128 (despite not supporting _Float128) but it does not support __builtin_fabsq. Fall back to `typedef __float128 _Float128;` if Clang is used. Originally developed by Fangrui Song <maskray@google.com>. Co-Authored-By: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 16:07:11 +08:00
Adhemerval Zanella	6412d8cc46	x86: Use inhibit_stack_protector on tst-ifunc-isa.h Co-Authored-By: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 13:19:12 +08:00
H.J. Lu	03feea74dc	elf: Compile test modules with -fsemantic-interposition Compiler may default to -fno-semantic-interposition. But some elf test modules must be compiled with -fsemantic-interposition to function properly. Add a TEST_CC check for -fsemantic-interposition and use it on elf test modules. This fixed FAIL: elf/tst-dlclose-lazy FAIL: elf/tst-pie1 FAIL: elf/tst-plt-rewrite1 FAIL: elf/unload4 when Clang 19 is used to test glibc. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 13:15:43 +08:00
Adhemerval Zanella	799e686c88	dirent: Remove variable length array structure for tst-getdents64.c Clang emits the following warnings: ../sysdeps/unix/sysv/linux/tst-getdents64.c:111:18: error: fields must have a constant size: 'variable length array in structure' extension will never be supported char buffer[buffer_size]; ^ Co-Authored-By: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 13:06:02 +08:00
H.J. Lu	f5fb9fa011	x86: Include test-flt-eval-method-387 if -mfpmath=387 works Since Clang doesn't support -mfpmath=387 on x86-64, on x86, include test-flt-eval-method-387 only if -mfpmath=387 works. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 12:54:44 +08:00
H.J. Lu	9151ecbb5e	x86-64: Disable libmvec ABI test for Clang Unlike GCC, libmvec support in Clang is hard-coded. Clang doesn't use macros defined in <bits/libm-simd-decl-stubs.h> to support new libmvec functions added to glibc and can't vectorize all test loops to test libmvec ABI: https://github.com/llvm/llvm-project/issues/120868 disable libmvec ABI test for Clang. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 12:51:56 +08:00
H.J. Lu	88499d61bd	Check if -mamx-tile works for testing Since -mamx-tile is used only for testing, use LIBC_TRY_TEST_CC_COMMAND, instead of LIBC_TRY_CC_AND_TEST_CC_COMMAND to check it and don't check __builtin_ia32_ldtilecfg for Clang. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-22 06:07:17 +08:00
Adhemerval Zanella	b3a7a15d99	cet: Drop '#pragma GCC target' in tst-cet-legacy-10a[-static].c After commit `215447f5cb` Author: H.J. Lu <hjl.tools@gmail.com> Date: Tue Dec 17 06:18:55 2024 +0800 cet: Pass -mshstk to compiler for tst-cet-legacy-10a[-static].c we can remove '#pragma GCC target' in tst-cet-legacy-10a[-static].c. Co-Authored-By: H.J. Lu <hjl.tools@gmail.com>	2024-12-21 06:16:58 +08:00
Aurelien Jarno	6fd215d6ae	posix: fix system when a child cannot be created [BZ #32450 ] POSIX states that "if a child process cannot be created, or if the termination status for the command language interpreter cannot be obtained, system() shall return -1 and set errno to indicate the error." In the glibc implementation it could happen when posix_spawn fails, which happens when the underlying fork, vfork, or clone call fails. They could fail with EAGAIN and ENOMEM. Resolves: BZ #32450 Signed-off-by: Aurelien Jarno <aurelien@aurel32.net> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-12-20 22:57:06 +01:00
H.J. Lu	40bf25b754	Fix elf: Introduce is_rtld_link_map [BZ #32488 ] Also use is_rtld_link_map in dl-cet.c. This fixes BZ #32488. Signed-off-by: H.J. Lu <hjl.tools@gmail.com>	2024-12-21 04:36:18 +08:00
Florian Weimer	ef5823d955	elf: Move _dl_rtld_map, _dl_rtld_audit_state out of GL This avoids immediate GLIBC_PRIVATE ABI issues if the size of struct link_map or struct auditstate changes. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-12-20 15:52:57 +01:00
Florian Weimer	2b1dba3eb3	elf: Introduce is_rtld_link_map Unconditionally define it to false for static builds. This avoids the awkward use of weak_extern for _dl_rtld_map in checks that cannot be possibly true on static builds. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-12-20 15:52:57 +01:00
Joseph Myers	322e9d4e44	Add F_CREATED_QUERY from Linux 6.12 to bits/fcntl-linux.h Linux 6.12 adds a new constant F_CREATED_QUERY. Add it to glibc's bits/fcntl-linux.h. Tested for x86_64.	2024-12-20 11:47:33 +00:00
Joseph Myers	37d9618492	Add HWCAP_LOONGARCH_LSPW from Linux 6.12 to bits/hwcap.h Add the new Linux 6.12 HWCAP_LOONGARCH_LSPW to the corresponding bits/hwcap.h. Tested with build-many-glibcs.py for loongarch64-linux-gnu-lp64d.	2024-12-20 11:47:03 +00:00
Joseph Myers	fbdd8b3fa8	Add MSG_SOCK_DEVMEM from Linux 6.12 to bits/socket.h Linux 6.12 adds a constant MSG_SOCK_DEVMEM (recall that various constants such as this one are defined in the non-uapi linux/socket.h but still form part of the kernel/userspace interface, so that non-uapi header is one that needs checking each release for new such constants). Add it to glibc's bits/socket.h. Tested for x86_64.	2024-12-20 11:46:06 +00:00
Florian Weimer	9a6533429e	i386: Regenerate ulps As seen on an Intel i9-9900K CPU, with glibc built with GCC 11.5, configured with and without --disable-multi-arch.	2024-12-20 12:40:17 +01:00
Florian Weimer	6fba7d6578	x86_64: Regenerate ulps As seen with an AMD 7950X CPU, on a glibc built with GCC 11.5.	2024-12-20 07:22:02 +01:00
Florian Weimer	6a99b4172a	aarch64: Regenerate ulps Results from running on Neoverse-V2, built with GCC 11.5.	2024-12-20 07:12:30 +01:00
Florian Weimer	e79b9e962d	elf: Remove code dependent on __rtld_lock_default_lock_recursive macro Neither NPTL nor Hurd define this macro anymore. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-12-19 21:29:58 +01:00
Florian Weimer	70d0836305	Linux: Accept null arguments for utimensat pathname This matches kernel behavior. With this change, it is possible to use utimensat as a replacement for the futimens interface, similar to what glibc does internally. Reviewed-by: Paul Eggert <eggert@cs.ucla.edu>	2024-12-19 21:21:30 +01:00
Florian Weimer	30d3fd7f4f	x86_64: Remove unused padding from tcbhead_t This padding is difficult to use for preserving the internal GLIBC_PRIVATE ABI. The comment is misleading. Current Address Sanitizer uses heuristics to determine struct pthread size. It does not depend on its precise layout. It merely scans for pointers allocated using malloc. Due to the removal of the padding, the assert for its start is no longer required. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2024-12-19 21:21:30 +01:00
Joseph Myers	29ae632e76	Add SCHED_EXT from Linux 6.12 to bits/sched.h Linux 6.12 adds the SCHED_EXT constant. Add it to glibc's bits/sched.h and update the kernel version in tst-sched-consts.py. Tested for x86_64.	2024-12-19 17:08:38 +00:00
John David Anglin	57256971b0	hppa: Fix strace detach-vfork test This change implements vfork.S for direct support of the vfork syscall. clone.S is revised to correct child support for the vfork case. The main bug was creating a frame prior to the clone syscall. This was done to allow the rp and r4 registers to be saved and restored from the stack frame. r4 was used to save and restore the PIC register, r19, across the system call and the call to set errno. But in the vfork case, it is undefined behavior for the child to return from the function in which vfork was called. It is surprising that this usually worked. Syscalls on hppa save and restore rp and r19, so we don't need to create a frame prior to the clone syscall. We only need a frame when __syscall_error is called. We also don't need to save and restore r19 around the call to $$dyncall as r19 is not used in the code after $$dyncall. This considerably simplifies clone.S. Signed-off-by: John David Anglin <dave.anglin@bell.net>	2024-12-19 11:30:09 -05:00
Joseph Myers	5fcee06dc7	Update kernel version to 6.12 in header constant tests There are no new constants covered by tst-mman-consts.py, tst-mount-consts.py or tst-pidfd-consts.py in Linux 6.12 that need any header changes, so update the kernel version in those tests. (tst-sched-consts.py will need updating separately along with adding SCHED_EXT.) Tested with build-many-glibcs.py.	2024-12-19 15:38:59 +00:00
Adhemerval Zanella	0e0be3ed80	math: Use tanhf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic tanhf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 51.5273 41.0951 20.25% x86_64v2 47.7021 39.1526 17.92% x86_64v3 45.0373 34.2737 23.90% i686 133.9970 83.8596 37.42% aarch64 (Neoverse) 21.5439 14.7961 31.32% power10 13.3301 8.4406 36.68% reciprocal-throughput master patched improvement x86_64 24.9493 12.8547 48.48% x86_64v2 20.7051 12.7761 38.29% x86_64v3 19.2492 11.0851 42.41% i686 78.6498 29.8211 62.08% aarch64 (Neoverse) 11.6026 7.11487 38.68% power10 6.3328 2.8746 54.61% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	1751c0519a	math: Use sinhf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic sinhf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 52.6819 49.1489 6.71% x86_64v2 49.1162 42.9447 12.57% x86_64v3 46.9732 39.9157 15.02% i686 141.1470 129.6410 8.15% aarch64 (Neoverse) 20.8539 17.1288 17.86% power10 14.5258 9.1906 36.73% reciprocal-throughput master patched improvement x86_64 27.5553 23.9395 13.12% x86_64v2 21.6423 20.3219 6.10% x86_64v3 21.4842 16.0224 25.42% i686 87.9709 86.1626 2.06% aarch64 (Neoverse) 15.1919 12.2744 19.20% power10 7.2188 5.2611 27.12% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	9583836785	math: Use coshf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode), although it should worse performance than current one. The current implementation performance comes mainly from the internal usage of the optimize expf implementation, and shows a maximum ULPs of 2 for FE_TONEAREST and 3 for other rounding modes. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 40.6995 49.0737 -20.58% x86_64v2 40.5841 44.3604 -9.30% x86_64v3 39.3879 39.7502 -0.92% i686 112.3380 129.8570 -15.59% aarch64 (Neoverse) 18.6914 17.0946 8.54% power10 11.1343 9.3245 16.25% reciprocal-throughput master patched improvement x86_64 18.6471 24.1077 -29.28% x86_64v2 17.7501 20.2946 -14.34% x86_64v3 17.8262 17.1877 3.58% i686 64.1454 86.5645 -34.95% aarch64 (Neoverse) 9.77226 12.2314 -25.16% power10 4.0200 5.3316 -32.63% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	7cfd8b5698	math: Use atanhf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic atanhf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 59.4930 45.8568 22.92% x86_64v2 59.5705 45.5804 23.48% x86_64v3 53.1838 37.7155 29.08% i686 169.354 133.5940 21.12% aarch64 (Neoverse) 26.0781 16.9829 34.88% power10 15.6591 10.7623 31.27% reciprocal-throughput master patched improvement x86_64 23.5903 18.5766 21.25% x86_64v2 22.6489 18.2683 19.34% x86_64v3 19.0401 13.9474 26.75% i686 97.6034 107.3260 -9.96% aarch64 (Neoverse) 15.3664 9.57846 37.67% power10 6.8877 4.6242 32.86% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	6f9bacf36b	math: Use atan2f from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic atan2f. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 68.1175 69.2014 -1.59% x86_64v2 66.9884 66.0081 1.46% x86_64v3 57.7034 61.6407 -6.82% i686 189.8690 152.7560 19.55% aarch64 (Neoverse) 32.6151 24.5382 24.76% power10 21.7282 17.1896 20.89% reciprocal-throughput master patched improvement x86_64 34.5202 31.6155 8.41% x86_64v2 32.6379 30.3372 7.05% x86_64v3 34.3677 23.6455 31.20% i686 157.7290 75.8308 51.92% aarch64 (Neoverse) 27.7788 16.2671 41.44% power10 15.5715 8.1588 47.60% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	a357d6273f	math: Use atanf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic atanf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 56.8265 53.6842 5.53% x86_64v2 54.8177 53.6842 2.07% x86_64v3 46.2915 48.7034 -5.21% i686 158.3760 108.9560 31.20% aarch64 (Neoverse) 21.687 20.5893 5.06% power10 13.1903 13.5012 -2.36% reciprocal-throughput master patched improvement x86_64 16.6787 16.7601 -0.49% x86_64v2 16.6983 16.7601 -0.37% x86_64v3 16.2268 12.1391 25.19% i686 138.6840 36.0640 74.00% aarch64 (Neoverse) 11.8012 10.3565 12.24% power10 5.3212 4.2894 19.39% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	ed608a40e2	math: Use asinhf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic asinhf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 64.5128 56.9717 11.69% x86_64v2 63.3065 57.2666 9.54% x86_64v3 62.8719 51.4170 18.22% i686 189.1630 137.635 27.24% aarch64 (Neoverse) 25.3551 20.5757 18.85% power10 17.9712 13.3302 25.82% reciprocal-throughput master patched improvement x86_64 20.0844 15.4731 22.96% x86_64v2 19.2919 15.4000 20.17% x86_64v3 18.7226 11.9009 36.44% i686 103.7670 80.2681 22.65% aarch64 (Neoverse) 12.5005 8.68969 30.49% power10 7.2220 5.03617 30.27% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>: Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	5fb4b566ef	math: Use asinf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic asinf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 42.8237 35.2460 17.70% x86_64v2 43.3711 35.9406 17.13% x86_64v3 35.0335 30.5744 12.73% i686 213.8780 104.4710 51.15% aarch64 (Neoverse) 17.2937 13.6025 21.34% power10 12.0227 7.4241 38.25% reciprocal-throughput master patched improvement x86_64 13.6770 15.5231 -13.50% x86_64v2 13.8722 16.0446 -15.66% x86_64v3 13.6211 13.2753 2.54% i686 186.7670 45.4388 75.67% aarch64 (Neoverse) 9.96089 9.39285 5.70% power10 4.9862 3.7819 24.15% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	673e6fe110	math: Use acoshf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slight better performance to the generic acoshf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 61.2471 58.7742 4.04% x86_64-v2 62.6519 59.0523 5.75% x86_64-v3 58.7408 50.1393 14.64% aarch64 24.8580 21.3317 14.19% power10 17.0469 13.1345 22.95% reciprocal-throughput master patched improvement x86_64 16.1618 15.1864 6.04% x86_64-v2 15.7729 14.7563 6.45% x86_64-v3 14.1669 11.9568 15.60% aarch64 10.911 9.5486 12.49% power10 6.38196 5.06734 20.60% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Adhemerval Zanella	66fa7ad437	math: Use acosf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows slightly better performance than the generic acosf. The code was adapted to glibc style and to use the definition of math_config.h (to handle errno, overflow, and underflow). Benchtest on x86_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1, gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1): Latency master patched improvement x86_64 52.5098 36.6312 30.24% x86_64v2 53.0217 37.3091 29.63% x86_64v3 42.8501 32.3977 24.39% i686 207.3960 109.4000 47.25% aarch64 21.3694 13.7871 35.48% power10 14.5542 7.2891 49.92% reciprocal-throughput master patched improvement x86_64 14.1487 15.9508 -12.74% x86_64v2 14.3293 16.1899 -12.98% x86_64v3 13.6563 12.6161 7.62% i686 158.4060 45.7354 71.13% aarch64 12.5515 9.19233 26.76% power10 5.7868 3.3487 42.13% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-12-18 17:24:43 -03:00
Michael Jeanson	eb8fa66d4e	nptl: Add <thread_pointer.h> for sparc This will be required by the rseq extensible ABI implementation on all Linux architectures exposing the '__rseq_size' and '__rseq_offset' symbols to set the initial value of the 'cpu_id' field which can be used by applications to test if rseq is available and registered. As long as the symbols are exposed it is valid for an application to perform this test even if rseq is not yet implemented in libc for this architecture. Compile tested with build-many-glibcs.py but I don't have access to any hardware to run the tests. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-12-18 19:38:58 +00:00
Adhemerval Zanella	849c73fe2b	powerpc: Update libm-test-ulps Regen to add new functions acospi, asinpi, atan2pi, atanpi, and tanpi.	2024-12-18 15:43:09 -03:00
Adhemerval Zanella	2872876d43	arm: Update libm-test-ulps Regen to add new functions acospi, asinpi, atan2pi, atanpi, cospi, sinpi, and tanpi.	2024-12-18 14:20:41 -03:00
Adhemerval Zanella	5a4c99163c	i386: Update libm-test-ulps Regen to add new functions acospi, asinpi, atan2pi, atanpi, cospi, sinpi, and tanpi.	2024-12-18 14:20:41 -03:00
Joseph Myers	e0a0fd64b5	Update syscall lists for Linux 6.12 Linux 6.12 has no new syscalls. Update the version number in syscall-names.list to reflect that it is still current for 6.12. Tested with build-many-glibcs.py.	2024-12-18 15:12:36 +00:00
H.J. Lu	a194871b13	sys/platform/x86.h: Do not depend on _Bool definition in C++ mode Clang does not define _Bool for -std=c++98: /usr/include/bits/platform/features.h:31:19: error: unknown type name '_Bool' 31 \| static __inline__ _Bool \| ^ Change _Bool to bool to silence clang++ error. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-12-18 02:32:27 +08:00
H.J. Lu	54fe008ba6	ldbl-96: Set -1 to "int sign_exponent:16" ieee_long_double_shape_type has typedef union { long double value; struct { ... int sign_exponent:16; ... } parts; } ieee_long_double_shape_type; Clang issues an error: ../sysdeps/ieee754/ldbl-96/test-totalorderl-ldbl-96.c:49:2: error: implicit truncation from 'int' to bit-field changes value from 65535 to -1 [-Werror,-Wbitfield-constant-conversion] 49 \| SET_LDOUBLE_WORDS (ldnx, 0xffff, \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 50 \| tests[i] >> 32, tests[i] & 0xffffffffULL); \| Use -1, instead of 0xffff, to silence Clang. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-18 01:54:26 +08:00
H.J. Lu	d4ee46b0cd	tst-clone3[-internal].c: Add _Atomic to silence Clang Add _Atomic to futex_wait argument and ctid in tst-clone3[-internal].c to silence Clang error: ../sysdeps/unix/sysv/linux/tst-clone3-internal.c:93:3: error: address argument to atomic operation must be a pointer to _Atomic type ('pid_t *' (aka 'int *') invalid) 93 \| wait_tid (&ctid, CTID_INIT_VAL); \| ^ ~~~~~ ../sysdeps/unix/sysv/linux/tst-clone3-internal.c:51:21: note: expanded from macro 'wait_tid' 51 \| while ((__tid = atomic_load_explicit (ctid_ptr, \ \| ^ ~~~~~~~~ /usr/bin/../lib/clang/19/include/stdatomic.h:145:30: note: expanded from macro 'atomic_load_explicit' 145 \| #define atomic_load_explicit __c11_atomic_load \| ^ Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-18 01:54:26 +08:00
Florian Weimer	61c3450db9	x86: Avoid integer truncation with large cache sizes (bug 32470) Some hypervisors report 1 TiB L3 cache size. This results in some variables incorrectly getting zeroed, causing crashes in memcpy/memmove because invariants are violated.	2024-12-17 18:49:50 +01:00
H.J. Lu	0cc88d2327	Silence Clang #include_next error Use "#include <...>" to silence Clang #include_next error: In file included from ../sysdeps/x86_64/fpu/test-double-vlen4-wrappers.c:19: ../sysdeps/x86_64/fpu/test-double-vlen4.h:19:2: error: #include_next in file found relative to primary source file or found by absolute path; will search from start of include path [-Werror,-Winclude-next-absolute-path] 19 \| #include_next <test-double-vlen4.h> \| ^ 1 error generated. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-18 01:22:48 +08:00
H.J. Lu	215447f5cb	cet: Pass -mshstk to compiler for tst-cet-legacy-10a[-static].c Pass -mshstk to compiler to silence Clang: In file included from ../sysdeps/x86_64/tst-cet-legacy-10a.c:2: ../sysdeps/x86_64/tst-cet-legacy-10.c:29:7: error: always_inline function '_get_ssp' requires target feature 'shstk', but would be inlined into function 'do_test' that is compiled without support for 'shstk' 29 \| if (_get_ssp () != 0) \| ^ Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-18 01:20:16 +08:00
Joana Cruz	cff9648d0b	AArch64: Improve codegen of AdvSIMD expf family Load the polynomial evaluation coefficients into 2 vectors and use lanewise MLAs. Also use intrinsics instead of native operations. expf: 3% improvement in throughput microbenchmark on Neoverse V1, exp2f: 5%, exp10f: 13%, coshf: 14%. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2024-12-17 15:28:22 +00:00
Joana Cruz	6914774b9d	AArch64: Improve codegen of AdvSIMD atan(2)(f) Load the polynomial evaluation coefficients into 2 vectors and use lanewise MLAs. 8% improvement in throughput microbenchmark on Neoverse V1. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2024-12-17 15:28:22 +00:00
Joana Cruz	d6e034f5b2	AArch64: Improve codegen of AdvSIMD logf function family Load the polynomial evaluation coefficients into 2 vectors and use lanewise MLAs. 8% improvement in throughput microbenchmark on Neoverse V1 for log2 and log, and 2% for log10. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2024-12-17 15:25:58 +00:00
H.J. Lu	dd413a4d2f	Fix sysdeps/x86/fpu/Makefile: Split and sort tests Signed-off-by: H.J. Lu <hjl.tools@gmail.com>	2024-12-16 05:57:28 +08:00
H.J. Lu	57a44f27c4	sysdeps/x86/fpu/Makefile: Split and sort tests Split and sort tests in sysdeps/x86/fpu/Makefile. Signed-off-by: H.J. Lu <hjl.tools@gmail.com>	2024-12-16 05:51:02 +08:00
H.J. Lu	07e3eb1774	Use empty initializer to silence GCC 4.9 or older Use empty initializer to silence GCC 4.9 or older: getaddrinfo.c: In function ‘gaih_inet’: getaddrinfo.c:1135:24: error: missing braces around initializer [-Werror=missing-braces] / sizeof (struct gaih_typeproto)] = {0}; ^ Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-16 04:06:30 +08:00
Florian Weimer	b933e5cef6	Linux: Check for 0 return value from vDSO getrandom probe As of Linux 6.13, there is no code in the vDSO that declines this initialization request with the special ~0UL state size. If the vDSO has the function, the call succeeds and returns 0. It's expected that the code would follow the “a negative value indicating an error” convention, as indicated in the __cvdso_getrandom_data function comment, so that INTERNAL_SYSCALL_ERROR_P on glibc's side would return true. This commit changes the code to check for zero to indicate success instead, which covers potential future non-zero success return values and error returns. Fixes commit `4f5704ea34` ("powerpc: Use correct procedure call standard for getrandom vDSO call (bug 32440)").	2024-12-15 17:05:25 +01:00
John David Anglin	6f5e1e4e98	hppa: Update libm-test-ulps Signed-off-by: John David Anglin <dave.anglin@bell.net>	2024-12-15 09:24:53 -05:00
H.J. Lu	20f8c5df56	Revert "Add braces in initializers for GCC 4.9 or older" This reverts commit `8aa2a9e033`, as not all targets need braces.	2024-12-15 18:49:52 +08:00
Stafford Horne	afac8b1311	or1k: Update libm-test-ulps Regen to add new functions acospi, asinpi, atan2pi and atanpi.	2024-12-15 00:42:27 +00:00
gfleury	2716bd6b12	htl: move pthread_sigmask into libc. Message-ID: <20241212220612.782313-3-gfleury@disroot.org>	2024-12-14 23:13:14 +01:00
gfleury	79cb83c7f9	htl: move __pthread_sigstate into libc. Message-ID: <20241212220612.782313-2-gfleury@disroot.org>	2024-12-14 23:12:01 +01:00
gfleury	dca0807a4d	htl: move __pthread_sigstate_destroy into libc. Message-ID: <20241212220612.782313-1-gfleury@disroot.org>	2024-12-14 23:11:45 +01:00
H.J. Lu	335ba9b6c1	Return EXIT_UNSUPPORTED if __builtin_add_overflow unavailable Since GCC 4.9 doesn't have __builtin_add_overflow: In file included from tst-stringtable.c:180:0: stringtable.c: In function ‘stringtable_finalize’: stringtable.c:185:7: error: implicit declaration of function ‘__builtin_add_overflow’ [-Werror=implicit-function-declaration] else if (__builtin_add_overflow (previous->offset, ^ return EXIT_UNSUPPORTED for GCC 4.9 or older. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-15 05:24:19 +08:00
H.J. Lu	8aa2a9e033	Add braces in initializers for GCC 4.9 or older Add braces to silence GCC 4.9 or older: getaddrinfo.c: In function ‘gaih_inet’: getaddrinfo.c:1135:24: error: missing braces around initializer [-Werror=missing-braces] / sizeof (struct gaih_typeproto)] = {0}; ^ Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-14 19:26:45 +08:00
Wilco Dijkstra	ca7d48a80f	AArch64: Update libm-test-ulps Update ulps for acospi, asinpi, atanpi, atan2pi.	2024-12-13 17:14:58 +00:00
Stefan Liebler	97b74cbbb0	s390: Simplify elf_machine_{load_address, dynamic} [BZ #31799] If an executable is static PIE and has a non-zero load address (compare to elf/tst-pie-address-static), it segfaults as elf_machine_load_address() returns 0x0 and elf_machine_dynamic() returns the run-time instead of link-time address of _DYNAMIC. Now rely on __ehdr_start and _DYNAMIC as also done on other architectures. Checked back to old arch-levels that this approach works fine: - 31bit: -march=g5 - 64bit: -march=z900 Note that there is no static-PIE support on 31bit, but this approach also cleans it up. Furthermore this cleanup in glibc does not change anything regarding the first GOT-element as the s390 ABI (https://github.com/IBM/s390x-abi) explicitly defines: The doubleword at _GLOBAL_OFFSET_TABLE_[0] is set by the linkage editor to hold the address of the dynamic structure, referenced with the symbol _DYNAMIC. This allows a program, such as the dynamic linker, to find its own dynamic structure without having yet processed its relocation entries. This is especially important for the dynamic linker, because it must initialize itself without relying on other programs to relocate its memory image.	2024-12-13 09:44:38 +01:00
Stafford Horne	e4e49583d9	or1k: Update libm-test-ulps Pick up new functions cospi, "Imaginary part of csin", exp10m1, exp2m1, log10p1, log2p1, sinpi and tanpi.	2024-12-13 07:20:32 +00:00
Michael Jeanson	f2acd75b0e	nptl: Add <thread_pointer.h> for or1k This will be required by the rseq extensible ABI implementation on all Linux architectures exposing the '__rseq_size' and '__rseq_offset' symbols to set the initial value of the 'cpu_id' field which can be used by applications to test if rseq is available and registered. As long as the symbols are exposed it is valid for an application to perform this test even if rseq is not yet implemented in libc for this architecture. Compile tested with build-many-glibcs.py but I don't have access to any hardware to run the tests. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Signed-off-by: Stafford Horne <shorne@gmail.com>	2024-12-13 07:20:32 +00:00
Joseph Myers	3374de9038	Implement C23 atan2pi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the atan2pi functions (atan2(y,x)/pi). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-12 20:57:44 +00:00
Joseph Myers	ffe79c446c	Implement C23 atanpi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the atanpi functions (atan(x)/pi). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-11 21:51:49 +00:00
Peter Bergner	aec85b2557	powerpc64: Fix dl-trampoline.S big-endian / non-ROP build failure Fix a big-endian / non-ROP build failure caused by commit `4d9a4c02` when building dl-trampoline.S. Reported-by: Joseph Myers <josmyers@redhat.com>	2024-12-11 23:15:13 +03:00
Florian Weimer	4f5704ea34	powerpc: Use correct procedure call standard for getrandom vDSO call (bug 32440) A plain indirect function call does not work on POWER because success and failure are signaled through a flag register, and not via the usual Linux negative return value convention. This has potential security impact, in two ways: the return value could be out of bounds (EAGAIN is 11 on powerpc64le), and no random bytes have been written despite the non-error return value. Fixes commit `461cab1de7` ("linux: Add support for getrandom vDSO"). Reported-by: Ján Stanček <jstancek@redhat.com> Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2024-12-11 17:49:04 +01:00
H.J. Lu	b79f257533	Add TEST_CC and TEST_CXX support Support testing glibc build with a different C compiler or a different C++ compiler with $ ../glibc-VERSION/configure TEST_CC="gcc-6.4.1" TEST_CXX="g++-6.4.1" 1. Add LIBC_TRY_CC_AND_TEST_CC_OPTION, LIBC_TRY_CC_AND_TEST_CC_COMMAND and LIBC_TRY_CC_AND_TEST_LINK to test both CC and TEST_CC. 2. Add check and xcheck targets to Makefile.in and override build compiler options with ones from TEST_CC and TEST_CXX. Tested on Fedora 41/x86-64: 1. Building with GCC 14.2.1 and testing with GCC 6.4.1 and GCC 11.2.1. 2. Building with GCC 15 and testing with GCC 6.4.1. Support for GCC versions older than GCC 6.2 may need to change the test sources. Other targets may need to update configure.ac under sysdeps and modify Makefile.in to override target build compiler options. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Sam James <sam@gentoo.org>	2024-12-11 18:31:00 +08:00
Peter Bergner	4d9a4c02f9	powerpc64le: ROP changes for the dl-trampoline functions Add ROP protection for the _dl_runtime_resolve and _dl_profile_resolve functions.	2024-12-10 23:25:56 -05:00
Joseph Myers	f962932206	Implement C23 asinpi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the asinpi functions (asin(x)/pi). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-10 20:42:20 +00:00
Joseph Myers	28d102d15c	Implement C23 acospi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the acospi functions (acos(x)/pi). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-09 23:01:29 +00:00
Sachin Monga	be13e46764	powerpc64le: ROP changes for the *context and setjmp functions Add ROP protection for the getcontext, setcontext, makecontext, swapcontext and __sigsetjmp_symbol functions. Reviewed-by: Peter Bergner <bergner@linux.ibm.com>	2024-12-09 16:49:54 -05:00
Michael Jeanson	9e08698e4c	nptl: Add <thread_pointer.h> for m68k This will be required by the rseq extensible ABI implementation on all Linux architectures exposing the '__rseq_size' and '__rseq_offset' symbols to set the initial value of the 'cpu_id' field which can be used by applications to test if rseq is available and registered. As long as the symbols are exposed it is valid for an application to perform this test even if rseq is not yet implemented in libc for this architecture. Compile tested with build-many-glibcs.py but I don't have access to any hardware to run the tests. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Reviewed-by: Arjun Shankar <arjun@redhat.com>	2024-12-09 20:24:26 +00:00
Michael Jeanson	8dd1588794	nptl: Add <thread_pointer.h> for RISC-V This will be required by the rseq extensible ABI implementation on all Linux architectures exposing the '__rseq_size' and '__rseq_offset' symbols to set the initial value of the 'cpu_id' field which can be used by applications to test if rseq is available and registered. As long as the symbols are exposed it is valid for an application to perform this test even if rseq is not yet implemented in libc for this architecture. Both code paths tested on a Visionfive 2 with Debian sid. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Reviewed-by: Palmer Dabbelt <palmer@rivosinc.com> Acked-by: Palmer Dabbelt <palmer@rivosinc.com>	2024-12-09 13:26:55 -05:00
Michael Jeanson	d3b3a12258	nptl: add RSEQ_SIG for RISC-V Enable RSEQ for RISC-V, support was added in Linux 5.18. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Reviewed-by: Palmer Dabbelt <palmer@rivosinc.com> Acked-by: Palmer Dabbelt <palmer@rivosinc.com>	2024-12-09 13:26:55 -05:00
Pierre Blanchard	13a7ef5999	AArch64: Improve codegen in users of ADVSIMD expm1 helper Add inline helper for expm1 and rearrange operations so MOV is not necessary in reduction or around the special-case handler. Reduce memory access by using more indexed MLAs in polynomial. Speedup on Neoverse V1 for expm1 (19%), sinh (8.5%), and tanh (7.5%).	2024-12-09 16:20:34 +00:00
Pierre Blanchard	ca0c0d0f26	AArch64: Improve codegen in users of ADVSIMD log1p helper Add inline helper for log1p and rearrange operations so MOV is not necessary in reduction or around the special-case handler. Reduce memory access by using more indexed MLAs in polynomial. Speedup on Neoverse V1 for log1p (3.5%), acosh (7.5%) and atanh (10%).	2024-12-09 16:20:34 +00:00
Pierre Blanchard	8eb5ad2ebc	AArch64: Improve codegen in AdvSIMD logs Remove spurious ADRP and a few MOVs. Reduce memory access by using more indexed MLAs in polynomial. Align notation so that algorithms are easier to compare. Speedup on Neoverse V1 for log10 (8%), log (8.5%), and log2 (10%). Update error threshold in AdvSIMD log (now matches SVE log).	2024-12-09 16:20:34 +00:00
Pierre Blanchard	569cfaaf49	AArch64: Improve codegen in AdvSIMD pow Remove spurious ADRP. Improve memory access by shuffling constants and using more indexed MLAs. A few more optimisations with no impact on accuracy: force FMA contraction, and switch from shift-aided rint to the rint instruction. Between 1 and 5% throughput improvement on Neoverse V1 depending on benchmark.	2024-12-09 16:20:34 +00:00
Stefan Liebler	b602f60f5e	s390x: Regenerated ULPs. Needed after: "Implement C23 cospi" commit `0ae0af68d8` and "Implement C23 sinpi" commit `776938e8b8` and "Implement C23 tanpi"	2024-12-09 10:25:24 +01:00
gfleury	a4b4b9a96b	htl: move pthread_condattr_setpshared into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-8-gfleury@disroot.org>	2024-12-09 02:03:18 +01:00
gfleury	5ccb28e65d	htl: move pthread_condattr_setclock into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-7-gfleury@disroot.org>	2024-12-09 02:03:18 +01:00
gfleury	ebd85cdc4a	htl: move pthread_condattr_init into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-6-gfleury@disroot.org>	2024-12-09 02:03:18 +01:00
gfleury	25699c4c3a	htl: move pthread_condattr_getpshared into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-5-gfleury@disroot.org>	2024-12-09 02:03:18 +01:00
gfleury	f1b5041354	htl: move pthread_condattr_getclock into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-4-gfleury@disroot.org>	2024-12-09 02:03:17 +01:00
gfleury	7ded100d36	htl: move __pthread_default_condattr into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-3-gfleury@disroot.org>	2024-12-09 01:49:49 +01:00
gfleury	c982918e3e	htl: move pthread_condattr_destroy into libc. Signed-off-by: gfleury <gfleury@disroot.org> Message-ID: <20241126205329.2215295-2-gfleury@disroot.org>	2024-12-09 01:49:40 +01:00
Andreas K. Hüttel	3a9b4b4aeb	math: Add sinpi,cospi,tanpi sparc64 ulps Linux catbus 6.1.112 #1 SMP Sun Oct 13 10:52:08 PDT 2024 sparc64 sun4v UltraSparc T5 (Niagara5) GNU/Linux gcc (Gentoo 13.3.1_p20240614 p17) 13.3.1 20240614 Signed-off-by: Andreas K. Hüttel <dilfridge@gentoo.org>	2024-12-08 22:01:51 +01:00
Andreas K. Hüttel	80d1e63e90	math: Add tanpi aarch64 ulps Linux dola 5.15.169-gentoo-dist #1 SMP Wed Oct 23 06:25:30 -00 2024 aarch64 GNU/Linux Vendor ID: ARM Model name: Neoverse-N1 gcc (Gentoo Hardened 13.3.1_p20241025 p1) 13.3.1 20241024 Signed-off-by: Andreas K. Hüttel <dilfridge@gentoo.org>	2024-12-08 18:25:05 +01:00
Joseph Myers	f9e90e4b4c	Implement C23 tanpi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the tanpi functions (tan(pi*x)). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-05 21:42:10 +00:00
Adhemerval Zanella	c8d3220e64	powerpc: Update ulps From 'Implement C23 cospi' (`0ae0af68d8`) and 'Implement C23 sinpi' (`776938e8b8`).	2024-12-05 13:35:24 -03:00
Wilco Dijkstra	fa16523c48	AArch64: Update libm-test-ulps Add sinpi/cospi.	2024-12-05 16:19:37 +00:00
H.J. Lu	09d07f16a7	i686: Update libm-test-ulps Update i686 libm-test-ulps to fix FAIL: math/test-float64x-cospi FAIL: math/test-float64x-sinpi FAIL: math/test-ldouble-cospi FAIL: math/test-ldouble-sinpi when building glibc with GCC 7.4. Signed-off-by: H.J. Lu <hjl.tools@gmail.com>	2024-12-05 20:10:58 +08:00
H.J. Lu	0003605a54	x86-64: Update libm-test-ulps Update x86-64 libm-test-ulps to fix FAIL: math/test-float64x-cospi FAIL: math/test-float64x-exp2m1 FAIL: math/test-float64x-sinpi FAIL: math/test-ldouble-cospi FAIL: math/test-ldouble-exp2m1 FAIL: math/test-ldouble-sinpi when building glibc with GCC 7.4. Signed-off-by: H.J. Lu <hjl.tools@gmail.com>	2024-12-05 20:08:36 +08:00
Joseph Myers	776938e8b8	Implement C23 sinpi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the sinpi functions (sin(pi*x)). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-04 20:04:04 +00:00
Joseph Myers	0ae0af68d8	Implement C23 cospi C23 adds various <math.h> function families originally defined in TS 18661-4. Add the cospi functions (cos(pi*x)). Tested for x86_64 and x86, and with build-many-glibcs.py.	2024-12-04 10:20:44 +00:00
H.J. Lu	1c4cebb84b	malloc: Optimize small memory clearing for calloc Add calloc-clear-memory.h to clear memory size up to 36 bytes (72 bytes on 64-bit targets) for calloc. Use repeated stores with 1 branch, instead of up to 3 branches. On x86-64, it is faster than memset since calling memset needs 1 indirect branch, 1 broadcast, and up to 4 branches. Signed-off-by: H.J. Lu <hjl.tools@gmail.com> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2024-12-04 04:28:15 +08:00
Adhemerval Zanella	17a43505b3	elf: Consolidate stackinfo.h And use a sane default for the generic implementation. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-12-02 17:14:58 +00:00
Joseph Myers	3c2b9dc41c	Add threaded test of sem_trywait All the existing glibc tests of sem_trywait are single-threaded. Add one that calls sem_trywait and sem_post in separate threads. Tested for x86_64.	2024-11-29 20:25:04 +00:00
Sergey Kolosov	bde47662b7	nptl: Add new test for pthread_spin_trylock Add a threaded test for pthread_spin_trylock attempting to lock already acquired spin lock and checking for correct return code. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-11-29 15:55:20 +01:00
Wilco Dijkstra	a08d9a52f9	AArch64: Remove zva_128 from memset Remove ZVA 128 support from memset - the new memset no longer guarantees count >= 256, which can result in underflow and a crash if ZVA size is 128 ([1]). Since only one CPU uses a ZVA size of 128 and its memcpy implementation was removed in commit `e162ab2bf1`, remove this special case too. [1] https://sourceware.org/pipermail/libc-alpha/2024-November/161626.html Reviewed-by: Andrew Pinski <quic_apinski@quicinc.com>	2024-11-29 13:27:13 +00:00
Adhemerval Zanella	82a3991a84	Remove nios2-linux-gnu GCC 15 (e876acab6cdd84bb2b32c98fc69fb0ba29c81153) and binutils (e7a16d9fd65098045ef5959bf98d990f12314111) both removed all Nios II support, and the architecture has been EOL'ed by the vendor. The kernel still has support, but without a proper compiler there is not much sense in keeping it in glibc. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-11-28 14:03:25 -03:00
Adhemerval Zanella	3b1c5a539b	math: Add internal roundeven_finite Some CORE-MATH routines use roundeven, and most ISAs do not have a specific instruction for the operation. In this case, the call is routed to the generic implementation. However, if the ISA does support round() and ctz(), there is a better alternative (as used by CORE-MATH). This patch adds that optimization and also enables it on powerpc. On a power10 it shows the following improvement: expm1f master patched improvement latency 9.8574 7.0139 28.85% reciprocal-throughput 4.3742 2.6592 39.21% Checked on powerpc64le-linux-gnu and aarch64-linux-gnu. Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-26 15:07:57 -03:00
Julian Zhu	32445b6dd2	RISC-V: Use builtin for fma and fmaf The built-in functions `__builtin_{fma, fmaf}` are sufficient to generate correct `fmadd.d`/`fmadd.s` instructions on RISC-V. Signed-off-by: Julian Zhu <jz531210@gmail.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-25 16:45:59 -03:00
Julian Zhu	d2264de5db	RISC-V: Use builtin for copysign and copysignf The built-in functions `__builtin_{copysign, copysignf}` are sufficient to generate correct `fsgnj.d`/`fsgnj.s` instructions on RISC-V. Signed-off-by: Julian Zhu <jz531210@gmail.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-25 16:45:59 -03:00
Alejandro Colomar	53fcdf5f74	Silence most -Wzero-as-null-pointer-constant diagnostics Replace 0 by NULL and {0} by {}. Omit a few cases that aren't so trivial to fix. Link: <https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117059> Link: <https://software.codidact.com/posts/292718/292759#answer-292759> Signed-off-by: Alejandro Colomar <alx@kernel.org>	2024-11-25 16:45:59 -03:00
Yannick Le Pennec	83d4b42ded	sysdeps: linux: Fix output of LD_SHOW_AUXV=1 for AT_RSEQ_* The constants themselves were added to elf.h back in `8754a4133e` but the array in _dl_show_auxv wasn't modified accordingly, resulting in the following output when running LD_SHOW_AUXV=1 /bin/true on recent Linux: AT_??? (0x1b): 0x1c AT_??? (0x1c): 0x20 With this patch: AT_RSEQ_FEATURE_SIZE: 28 AT_RSEQ_ALIGN: 32 Tested on Linux 6.11 x86_64 Signed-off-by: Yannick Le Pennec <yannick.lepennec@live.fr> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-25 16:45:59 -03:00
Michael Jeanson	d9f40387d3	nptl: initialize cpu_id_start prior to rseq registration When adding explicit initialization of rseq fields prior to registration, I glossed over the fact that 'cpu_id_start' is also documented as initialized by user-space. While current kernels don't validate the content of this field on registration, future ones could. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Reviewed-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>	2024-11-25 19:42:14 +01:00
Adhemerval Zanella	6976cd3124	math: Fix branch hint for `68d7128942`	2024-11-25 13:37:50 -03:00
Sachin Monga	2062e02772	powerpc64le: ROP Changes for strncpy/ppc-mount Add ROP protect instructions to strncpy and ppc-mount functions. Modify FRAME_MIN_SIZE to 48 bytes for ELFv2 to reserve additional 16 bytes for ROP save slot and padding. Signed-off-by: Sachin Monga <smonga@linux.ibm.com> Reviewed-by: Peter Bergner <bergner@linux.ibm.com>	2024-11-25 10:44:20 -05:00
Vincent Lefevre	68d7128942	math: Fix non-portability in the computation of signgam in lgammaf The k>>31 in signgam = 1 - (((k&(k>>31))&1)<<1); is not portable: * The ISO C standard says "If E1 has a signed type and a negative value, the resulting value is implementation-defined." (this is still in C23). * If the int type is larger than 32 bits (e.g. a 64-bit type), then k = INT_MAX; line 144 will make k>>31 put 1 in bit 0 (thus signgam will be -1) while 0 is expected. Moreover, instead of the fx >= 0x1p31f condition, testing fx >= 0 is probably better for 2 reasons: The signgam expression has more or less a condition on the sign of fx (the goal of k>>31, which can be dropped with this new condition). Since fx ≥ 0 should be the most common case, one can get signgam directly in this case (value 1). And this simplifies the expression for the other case (fx < 0). This new condition may be easier/faster to test on the processor (e.g. by avoiding a load of a constant from the memory). This is commit d41459c731865516318f813cf4c966dafa0eecbf from CORE-MATH. Checked on x86_64-linux-gnu.	2024-11-25 09:20:47 -03:00
Samuel Thibault	d92a5e1dad	hurd: Add MAP_NORESERVE mmap flag This is already the current default behavior, which we will change with overcommit support addition.	2024-11-25 00:55:33 +01:00
Joseph Myers	99671e72bb	Add multithreaded test of sem_getvalue Test coverage of sem_getvalue is fairly limited. Add a test that runs it on threads on each CPU. For this purpose I adapted tst-skeleton-thread-affinity.c; it didn't seem very suitable to use as-is or include directly in a different test doing things per-CPU, but did seem a suitable starting point (thus sharing tst-skeleton-affinity.c) for such testing. Tested for x86_64.	2024-11-22 16:58:51 +00:00
Adhemerval Zanella	bccb0648ea	math: Use tanf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance than the generic tanf. The code was adapted to glibc style, to use the definition of math_config.h, to remove errno handling, and to use a generic 128 bit routine for ABIs that do not support it natively. Benchtest on x86_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (neoverse1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 82.3961 54.8052 33.49% x86_64v2 82.3415 54.8052 33.44% x86_64v3 69.3661 50.4864 27.22% i686 219.271 45.5396 79.23% aarch64 29.2127 19.1951 34.29% power10 19.5060 16.2760 16.56% reciprocal-throughput master patched improvement x86_64 28.3976 19.7334 30.51% x86_64v2 28.4568 19.7334 30.65% x86_64v3 21.1815 16.1811 23.61% i686 105.016 15.1426 85.58% aarch64 18.1573 10.7681 40.70% power10 8.7207 8.7097 0.13% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-22 10:52:27 -03:00
Adhemerval Zanella	d846f4c12d	math: Use lgammaf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance than the generic lgammaf. The code was adapted to glibc style, to use the definition of math_config.h, to remove errno handling, to use math_narrow_eval on overflow usage, and to adapt to make it reentrant. Benchtest on x86_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 86.5609 70.3278 18.75% x86_64v2 78.3030 69.9709 10.64% x86_64v3 74.7470 59.8457 19.94% i686 387.355 229.761 40.68% aarch64 40.8341 33.7563 17.33% power10 26.5520 16.1672 39.11% powerpc 28.3145 17.0625 39.74% reciprocal-throughput master patched improvement x86_64 68.0461 48.3098 29.00% x86_64v2 55.3256 47.2476 14.60% x86_64v3 52.3015 38.9028 25.62% i686 340.848 195.707 42.58% aarch64 36.8000 30.5234 17.06% power10 20.4043 12.6268 38.12% powerpc 22.6588 13.8866 38.71% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-22 10:52:27 -03:00
Adhemerval Zanella	baa495f231	math: Use erfcf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance to the generic erfcf. The code was adapted to glibc style and to use the definition of math_config.h. Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 98.8796 66.2142 33.04% x86_64v2 98.9617 67.4221 31.87% x86_64v3 87.4161 53.1754 39.17% aarch64 33.8336 22.0781 34.75% power10 21.1750 13.5864 35.84% powerpc 21.4694 13.8149 35.65% reciprocal-throughput master patched improvement x86_64 48.5620 27.6731 43.01% x86_64v2 47.9497 28.3804 40.81% x86_64v3 42.0255 18.1355 56.85% aarch64 24.3938 13.4041 45.05% power10 10.4919 6.1881 41.02% powerpc 11.763 6.76468 42.49% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-22 10:52:27 -03:00
Adhemerval Zanella	994fec2397	math: Use erff from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance to the generic erff. The code was adapted to glibc style and to use the definition of math_config.h. Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 85.7363 45.1372 47.35% x86_64v2 86.6337 38.5816 55.47% x86_64v3 71.3810 34.0843 52.25% i686 190.143 97.5014 48.72% aarch64 34.9091 14.9320 57.23% power10 38.6160 8.5188 77.94% powerpc 39.7446 8.45781 78.72% reciprocal-throughput master patched improvement x86_64 35.1739 14.7603 58.04% x86_64v2 34.5976 11.2283 67.55% x86_64v3 27.3260 9.8550 63.94% i686 91.0282 30.8840 66.07% aarch64 22.5831 6.9615 69.17% power10 18.0386 3.0918 82.86% powerpc 20.7277 3.63396 82.47% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-22 10:52:27 -03:00
Adhemerval Zanella	c4c64ba5d1	math: Split s_erfF in erff and erfc So we can eventually replace each implementation. Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-22 10:52:26 -03:00
Adhemerval Zanella	c5d241f06b	math: Use cbrtf from CORE-MATH The CORE-MATH implementation is correctly rounded (for any rounding mode) and shows better performance to the generic cbrtf. The code was adapted to glibc style and to use the definition of math_config.h. Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (M1, gcc 13.2.1), and powerpc (POWER10, gcc 13.2.1): latency master patched improvement x86_64 68.6348 36.8908 46.25% x86_64v2 67.3418 36.6968 45.51% x86_64v3 63.4981 32.7859 48.37% aarch64 29.3172 12.1496 58.56% power10 18.0845 8.8893 50.85% powerpc 18.0859 8.79527 51.37% reciprocal-throughput master patched improvement x86_64 36.4369 13.3565 63.34% x86_64v2 37.3611 13.1149 64.90% x86_64v3 31.6024 11.2102 64.53% aarch64 18.6866 7.3474 60.68% power10 9.4758 3.6329 61.66% powerpc 9.58896 3.90439 59.28% Signed-off-by: Alexei Sibidanov <sibid@uvic.ca> Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-22 10:01:03 -03:00
Siddhesh Poyarekar	713d6d7e78	x86/string: Use `movsl` instead of `movsd` in strncat [BZ #32344] The previous patch missed strncat, so fixed that. Resolves: BZ #32344 Signed-off-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2024-11-21 17:11:01 -05:00
Andrew Pinski	e6590f0c86	aarch64: Remove non-temporal load/stores from oryon-1's memset The hardware architects have a new recommendation not to use non-temporal load/stores for memset. This patch removes this path. I found there was no difference in the memset speed with/without non-temporal load/stores either. Signed-off-by: Andrew Pinski <quic_apinski@quicinc.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-21 11:32:23 -03:00
Andrew Pinski	eb5eeb4740	aarch64: Remove non-temporal load/stores from oryon-1's memcpy The hardware architects have a new recommendation not to use non-temporal load/stores for memcpy. This patch removes this path. I found there was no difference in the memcpy speed with/without non-temporal load/stores either. Signed-off-by: Andrew Pinski <quic_apinski@quicinc.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-21 11:32:17 -03:00
Sachin Monga	3051f3495c	powerpc64le: _init/_fini file changes for ROP The ROP instructions were added in ISA 3.1 (ie, Power10), however they were defined so that if executed on older cpus, they would behave as nops. This allows us to emit them on older cpus and they'd just be ignored, but if run on a Power10, then the binary would be ROP protected. Hash instructions use negative offsets so the default position of ROP pointer is FRAME_ROP_SAVE from caller's SP. Modified FRAME_MIN_SIZE_PARM to 112 for ELFv2 to reserve additional 16 bytes for ROP save slot and padding. Signed-off-by: Sachin Monga <smonga@linux.ibm.com> Reviewed-by: Peter Bergner <bergner@linux.ibm.com>	2024-11-20 16:50:34 -05:00
Yury Khrustalev	f4d00dd60d	AArch64: Add support for memory protection keys This patch adds support for memory protection keys on AArch64 systems with enabled Stage 1 permission overlays feature introduced in Armv8.9 / 9.4 (FEAT_S1POE) [1]. 1. Internal functions "pkey_read" and "pkey_write" to access data associated with memory protection keys. 2. Implementation of API functions "pkey_get" and "pkey_set" for the AArch64 target. 3. AArch64-specific PKEY flags for READ and EXECUTE (see below). 4. New target-specific test that checks behaviour of pkeys on AArch64 targets. 5. This patch also extends existing generic test for pkeys. 6. HWCAP constant for Permission Overlay Extension feature. To support more accurate mapping of underlying permissions to the PKEY flags, we introduce additional AArch64-specific flags. The full list of flags is: - PKEY_UNRESTRICTED: 0x0 (for completeness) - PKEY_DISABLE_ACCESS: 0x1 (existing flag) - PKEY_DISABLE_WRITE: 0x2 (existing flag) - PKEY_DISABLE_EXECUTE: 0x4 (new flag, AArch64 specific) - PKEY_DISABLE_READ: 0x8 (new flag, AArch64 specific) The problem here is that PKEY_DISABLE_ACCESS has unusual semantics as it overlaps with existing PKEY_DISABLE_WRITE and new PKEY_DISABLE_READ. For this reason mapping between permission bits RWX and "restrictions" bits awxr (a for disable access, etc) becomes complicated: - PKEY_DISABLE_ACCESS disables both R and W - PKEY_DISABLE_{WRITE,READ} disables W and R respectively - PKEY_DISABLE_EXECUTE disables X Combinations like the one below are accepted although they are redundant: - PKEY_DISABLE_ACCESS | PKEY_DISABLE_READ | PKEY_DISABLE_WRITE Reverse mapping tries to retain backward compatibility and ORs PKEY_DISABLE_ACCESS whenever both flags PKEY_DISABLE_READ and PKEY_DISABLE_WRITE would be present. This will break code that compares pkey_get output with == instead of using bitwise operations. The latter is more correct since PKEY_* constants are essentially bit flags. It should be noted that PKEY_DISABLE_ACCESS does not prevent execution. [1] https://developer.arm.com/documentation/ddi0487/ka/ section D8.4.1.4 Co-authored-by: Szabolcs Nagy <szabolcs.nagy@arm.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-20 11:30:58 +00:00
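Commit f4d00dd60d recommends bitwise tests over `==` when inspecting pkey restrictions. A minimal sketch of such tests follows, assuming the flag values listed in the message. The `EX_` constants and the three helper functions are hypothetical stand-ins for illustration; they are not the glibc definitions from `<sys/mman.h>`.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical copies of the flag values quoted in the commit message.  */
#define EX_PKEY_DISABLE_ACCESS  0x1u
#define EX_PKEY_DISABLE_WRITE   0x2u
#define EX_PKEY_DISABLE_EXECUTE 0x4u
#define EX_PKEY_DISABLE_READ    0x8u

static_assert ((EX_PKEY_DISABLE_ACCESS & EX_PKEY_DISABLE_WRITE) == 0,
	       "flags occupy distinct bits");

/* Reads are blocked by DISABLE_ACCESS or by DISABLE_READ.  */
static bool
read_blocked (unsigned int restrictions)
{
  return (restrictions
	  & (EX_PKEY_DISABLE_ACCESS | EX_PKEY_DISABLE_READ)) != 0;
}

/* Writes are blocked by DISABLE_ACCESS or by DISABLE_WRITE.  */
static bool
write_blocked (unsigned int restrictions)
{
  return (restrictions
	  & (EX_PKEY_DISABLE_ACCESS | EX_PKEY_DISABLE_WRITE)) != 0;
}

/* Execution is blocked only by DISABLE_EXECUTE; DISABLE_ACCESS does not
   prevent execution.  */
static bool
exec_blocked (unsigned int restrictions)
{
  return (restrictions & EX_PKEY_DISABLE_EXECUTE) != 0;
}
```

A value such as `0xb` (ACCESS, READ and WRITE combined) is not equal to `EX_PKEY_DISABLE_ACCESS`, so an `==` comparison would miss it, while `read_blocked` and `write_blocked` both report true for it.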
Andrew Pinski	e162ab2bf1	AArch64: Remove thunderx{,2} memcpy ThunderX1 and ThunderX2 have been retired for a few years now. So let's remove the thunderx{,2} specific versions of memcpy. The performance gain or them was for medium and large sizes while the generic (aarch64) memcpy will handle just slightly worse. Signed-off-by: Andrew Pinski <quic_apinski@quicinc.com> Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2024-11-20 11:23:53 +00:00
Joseph Myers	d899b48a30	Fix femode_t conditionals for arc and or1k Two of the architecture bits/fenv.h headers define femode_t if __GLIBC_USE (IEC_60559_BFP_EXT), instead of the correct condition __GLIBC_USE (IEC_60559_BFP_EXT_C23) (both were added after commit `0175c9e9be`, but were probably first developed before it and then not updated to take account of its changes). This results in failures of the installed headers check for fenv.h when building with GCC 15 (defaults to -std=gnu23 - we don't yet have an installed-headers test specifically for C23 mode and don't yet require a compiler with such a mode for building glibc) together with a combination of options leaving C23 features enabled, since the declarations of functions using femode_t use the correct conditions; see <https://sourceware.org/pipermail/libc-testresults/2024q4/013163.html>. Fix the conditionals to get <fenv.h> to work correctly in C23 mode again. Tested with build-many-glibcs.py (arc-linux-gnu, arch-linux-gnuhf, or1k-linux-gnu-hard, or1k-linux-gnu-soft).	2024-11-19 22:25:39 +00:00
Mahesh Bodapati	3ef7e42861	powerpc64le: Optimized strcat for POWER10 This patch adds an optimized strcat which makes use of the default strcat function which calls the Power10 strcpy and strlen routines.	2024-11-19 15:59:15 -05:00
Peter Bergner	229265cc2c	powerpc: Improve the inline asm for syscall wrappers Update the inline asm syscall wrappers to match the newer register constraint usage in INTERNAL_VSYSCALL_CALL_TYPE. Use the faster mfocrf instruction when available, rather than the slower mfcr microcoded instruction.	2024-11-19 12:43:57 -05:00
gfleury	7f045c0b48	htl: move pthread_attr_init into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	1a1cedd635	htl: move pthread_attr_setguardsize into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	f26b272a75	htl: move pthread_attr_setschedparam into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	32aa498ceb	htl: move pthread_attr_setscope into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	4a8b7d7e62	htl: move pthread_attr_setstackaddr into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	d69a010e7b	htl: move pthread_attr_setstacksize into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	330c1fad5b	htl: move pthread_attr_getstack into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	1428ae39e8	htl: move pthread_attr_getstackaddr into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:37:35 +01:00
gfleury	993440a260	htl move pthread_attr_getstacksize into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:34:34 +01:00
gfleury	4bcda927fe	htl move pthread_attr_getscope into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:19:00 +01:00
gfleury	6caf24c972	htl move pthread_attr_getguardsize into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:18:59 +01:00
gfleury	f55cf584ff	htl: move __pthread_default_attr into libc Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:08:27 +01:00
gfleury	736befab6c	htl: move pthread_attr_destroy into libc. Signed-off-by: gfleury <gfleury@disroot.org>	2024-11-19 01:08:14 +01:00
Noah Goldstein	c510681a69	x86/string: Use `movsl` instead of `movsd` in strncpy/strncat [BZ #32344] `ld`, starting at 2.40, emits a warning when using `movsd`. There is no change to the actual code produced. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2024-11-13 10:09:30 -06:00
John David Anglin	b919fe1f6d	hppa: Update libm-test-ulps Update imaginary part of csin. Signed-off-by: John David Anglin <dave.anglin@bell.net>	2024-11-12 21:32:54 -05:00
Samuel Thibault	e5c2738f17	Revert "hurd: Stop depending on the default_pager stubs provided by gnumach" This reverts commit `f7f7dd8009`. default_pager is actually also used in e.g. xosview.	2024-11-13 01:34:09 +01:00
Adhemerval Zanella	461cab1de7	linux: Add support for getrandom vDSO Linux 6.11 has getrandom() in vDSO. It operates on a thread-local opaque state allocated with mmap using flags specified by the vDSO. Multiple states are allocated at once, as many as fit into a page, and these are held in an array of available states to be doled out to each thread upon first use, and recycled when a thread terminates. As these states run low, more are allocated. To make this procedure async-signal-safe, a simple guard is used in the LSB of the opaque state address, falling back to the syscall if there's reentrancy contention. Also, _Fork() is handled by blocking signals on opaque state allocation (so _Fork() always sees a consistent state even if it interrupts a getrandom() call) and by iterating over the thread stack cache on reclaim_stack. Each opaque state will be in the free states list (grnd_alloc.states) or allocated to a running thread. The cancellation is handled by always using GRND_NONBLOCK flags while calling the vDSO, and falling back to the cancellable syscall if the kernel returns EAGAIN (would block). Since getrandom is not defined by POSIX and cancellation is supported as an extension, the cancellation is handled as 'may occur' instead of 'shall occur' [1], meaning that if vDSO does not block (the expected behavior) getrandom will not act as a cancellation entrypoint. It avoids a pthread_testcancel call on the fast path (different than 'shall occur' functions, like sem_wait()). It is currently enabled for x86_64, which is available in Linux 6.11, and aarch64, powerpc32, powerpc64, loongarch64, and s390x, which are available in Linux 6.12. Link: https://pubs.opengroup.org/onlinepubs/9799919799/nframe.html [1] Co-developed-by: Jason A. Donenfeld <Jason@zx2c4.com> Tested-by: Jason A. Donenfeld <Jason@zx2c4.com> # x86_64 Tested-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> # x86_64, aarch64 Tested-by: Xi Ruoyao <xry111@xry111.site> # x86_64, aarch64, loongarch64 Tested-by: Stefan Liebler <stli@linux.ibm.com> # s390x	2024-11-12 14:42:12 -03:00
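Commit 461cab1de7 describes a guard kept in the least significant bit of the opaque state address, with a fallback to the syscall on reentrancy. The general technique can be sketched as below. This is an illustration under stated assumptions and not the glibc code: the type `guarded_state` and the functions `guard_init`, `guard_try_acquire` and `guard_release` are hypothetical, and the memory orderings chosen here may differ from what glibc uses.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical holder: a pointer whose lowest bit doubles as an
   "in use" flag.  Requires the pointer to be at least 2-byte aligned.  */
typedef struct
{
  _Atomic uintptr_t tagged;
} guarded_state;

static void
guard_init (guarded_state *g, void *state)
{
  assert (((uintptr_t) state & 1) == 0);	/* LSB must be free */
  atomic_init (&g->tagged, (uintptr_t) state);
}

/* Try to take the state.  Returns false if the bit was already set,
   e.g. when a signal handler re-enters while the interrupted code
   holds the state; the caller would then fall back to a slower path.  */
static bool
guard_try_acquire (guarded_state *g, void **state_out)
{
  uintptr_t old = atomic_fetch_or_explicit (&g->tagged, (uintptr_t) 1,
					    memory_order_acquire);
  if (old & 1)
    return false;
  *state_out = (void *) old;
  return true;
}

/* Clear the in-use bit so the state can be taken again.  */
static void
guard_release (guarded_state *g)
{
  atomic_fetch_and_explicit (&g->tagged, ~(uintptr_t) 1,
			     memory_order_release);
}
```

In this sketch a failed `guard_try_acquire` leaves the bit set for the original holder, so only the holder should call `guard_release`.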
caiyinyu	ab4388f91c	LoongArch: Update ulps Needed for test-float-cacosh, test-float-csin, test-float32-cacosh and test-float32-csin. Signed-off-by: caiyinyu <caiyinyu@loongson.cn> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-11-12 09:19:23 +08:00
Samuel Thibault	d2e65aa7d6	mach: Fix __xpg_strerror_r on in-range but undefined errors [BZ #32350] For instance, 1073741906 leads to system 16, subsystem 0 and code 82, which is in range (max_code is 122), but not defined. Return EINVAL in that case, like	2024-11-09 20:00:40 +01:00
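The decomposition quoted in commit d2e65aa7d6 can be reproduced with plain shifts and masks. The sketch below assumes the usual Mach error layout (6-bit system, 12-bit subsystem, 14-bit code); the struct `mach_err_parts` and the function names are hypothetical, and the field widths are an assumption checked only against the numbers in the message.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical container for the three fields of a Mach error value.  */
typedef struct
{
  unsigned int system;
  unsigned int subsystem;
  unsigned int code;
} mach_err_parts;

/* Split an error value using the assumed layout:
   bits 26-31 system, bits 14-25 subsystem, bits 0-13 code.  */
static mach_err_parts
mach_err_decode (uint32_t err)
{
  mach_err_parts parts;
  parts.system = (err >> 26) & 0x3f;
  parts.subsystem = (err >> 14) & 0xfff;
  parts.code = err & 0x3fff;
  return parts;
}

/* The example from the commit message: 1073741906 is 0x40000052.  */
static void
example_from_commit (void)
{
  mach_err_parts parts = mach_err_decode (UINT32_C (1073741906));
  assert (parts.system == 16);
  assert (parts.subsystem == 0);
  assert (parts.code == 82);
}
```

Code 82 is below the quoted max_code of 122, which is why a range check alone does not reject it.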
Noah Goldstein	6754b5becf	x86/string: Use `movsl` instead of `movsd` [BZ #32344] `ld`, starting at 2.40, emits a warning when using `movsd`. There is no change to the actual code produced. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2024-11-08 17:23:05 -06:00
Joseph Myers	c7dcf594f4	Rename new tst-sem17 test to tst-sem18 As noted by Adhemerval, we already have a tst-sem17 in nptl. Tested for x86_64.	2024-11-08 17:08:09 +00:00
Joseph Myers	f745d78e26	Avoid uninitialized result in sem_open when file does not exist A static analyzer apparently reported an uninitialized use of the variable result in sem_open in the case where the file is required to exist but does not exist. The report appears to be correct; set result to SEM_FAILED in that case, and add a test for it. Note: the test passes for me even without the sem_open fix, I guess because result happens to get value SEM_FAILED (i.e. 0) when uninitialized. Tested for x86_64.	2024-11-08 01:53:48 +00:00
Michael Jeanson	97f60abd25	nptl: initialize rseq area prior to registration Per the rseq syscall documentation, 3 fields are required to be initialized by userspace prior to registration, they are 'cpu_id', 'rseq_cs' and 'flags'. Since we have no guarantee that 'struct pthread' is cleared on all architectures, explicitly set those 3 fields prior to registration. Signed-off-by: Michael Jeanson <mjeanson@efficios.com> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-11-07 22:23:49 +01:00
Mark Wielaard	c18de3b76a	s390x: Update ulps Needed for test-float-cacosh, test-float-csin, test-float32-cacosh and test-float32-csin. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2024-11-07 20:58:05 +01:00
Adhemerval Zanella	12b8dd7718	math: Fix log10f on some ABIs The commit `9247f53219` triggered some regressions on loongarch and riscv: math/test-float-log10 math/test-float32-log10 And it is due a wrong sync with CORE-MATH for special 0.0/-0.0 inputs. Checked on aarch64-linux-gnu and loongarch64-linux-gnu-lp64d.	2024-11-07 07:59:43 -03:00
caiyinyu	1b70a0a024	nptl: fix __builtin_thread_pointer detection on LoongArch Signed-off-by: caiyinyu <caiyinyu@loongson.cn>	2024-11-07 14:08:30 +08:00
Florian Weimer	ba60be8735	math: Fix incorrect results of exp10m1f with some GCC versions On GCC 11 (x86-64), the previous code produced test failures like this one: Failure: Test: exp10m1_towardzero (-0x1.1p+4) Result: is: -1.00000000e+00 -0x1.000000p+0 should be: -9.99999940e-01 -0x1.fffffep-1 difference: 5.96046447e-08 0x1.000000p-24 ulp : 1.0000 max.ulp : 0.0000 Apply a similar fix to exp2m1f. Co-authored-by: Paul Zimmermann <Paul.Zimmermann@inria.fr> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-06 16:09:05 +01:00
Yury Khrustalev	ff254cabd6	misc: Align argument name for pkey_*() functions with the manual Change name of the access_rights argument to access_restrictions of the following functions: - pkey_alloc() - pkey_set() as this argument refers to access restrictions rather than access rights and previous name might have been misleading. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2024-11-06 13:11:33 +00:00
Florian Weimer	f2326c2ec0	elf: Introduce _dl_relocate_object_no_relro And make _dl_protect_relro apply RELRO conditionally. Reviewed-by: DJ Delorie <dj@redhat.com>	2024-11-06 10:33:44 +01:00

16761 Commits