glibc

Commit Graph

Author	SHA1	Message	Date
Adhemerval Zanella	b5a09ebb3f	libio: Fix UB __libio_codecvt_length To avoid a 0 size VLA.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	43ba3b5c3a	iconv: Fix UB on iconv/tst-translit-mchar Building with ubsan, the test triggers: UBSAN: Undefined behaviour in programs/locfile.c:598:3 null pointer passed as argument 2, nonnull attribute declared at unknown:0:0 The obstack_grow is only define for size > 0.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	63c5414e81	iconv: Fix UB on find_derivation The cost addition might overflow since the default value is LONG_INT. Use wrap addition instead.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	78ddfecdd5	math: Fix UB in setayloadf The code can shift the 1U for value larger than 31 depending of the exponent value. Add a check prior the shift.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	beb6d2de65	math: Fix UB in setayload The code can shift the 1ULL for value larger than 63 depending of the exponent value. Add a check prior the shift.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	b26c1af4bd	math: Remove UB from float128 ilogbf The subnormal exponent calculation invokes UB by left shifting the high or lower work. Use unsigned values and stdc_leading_zeros instead.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	31bc3437c9	powerpc: Use generic ilogb/ilogbf and refactor ilogbf128 The powerpc64 leverages the use of xsxexpdp and xsxexpqp for for both ilogb/ilogbf for float, double, and float128 types. However with the new generic ilogb/ilogbf, this is not really a gain anymore. On POWER9 with gcc-13, the xsxexpdp/xsxexpqp shows: $ ./benchtests/bench-ilogb "ilogb": { "subnormal": { "duration": 5.08829e+08, "iterations": 4.4588e+07, "max": 18.761, "min": 6.7005, "mean": 11.4118 }, "normal": { "duration": 5.04674e+08, "iterations": 9.9596e+07, "max": 7.386, "min": 5.0505, "mean": 5.06722 } $ ./benchtests/bench-ilogbf "ilogbf": { "subnormal": { "duration": 5.04918e+08, "iterations": 9.8732e+07, "max": 7.1595, "min": 5.0475, "mean": 5.11402 }, "normal": { "duration": 5.04971e+08, "iterations": 9.8744e+07, "max": 7.771, "min": 5.048, "mean": 5.11394 } } While the new generic implementation shows: $ ./benchtests/bench-ilogb "ilogb": { "subnormal": { "duration": 5.05389e+08, "iterations": 9.2644e+07, "max": 11.0355, "min": 5.4255, "mean": 5.45517 }, "normal": { "duration": 5.04667e+08, "iterations": 1.02388e+08, "max": 9.758, "min": 4.8945, "mean": 4.92897 } }[azanella@cfarm135 powerpc64le-linux-gnu-power9-gcc13]$ ./benchtests/bench-ilogbf "ilogbf": { "subnormal": { "duration": 5.05409e+08, "iterations": 9.238e+07, "max": 7.69, "min": 5.442, "mean": 5.47098 }, "normal": { "duration": 5.0456e+08, "iterations": 1.02012e+08, "max": 6.84, "min": 4.922, "mean": 4.94609 } } The xsxexpdp/xsxexpqp also adds some extra code size overhead since it uses the generic ilogb/ilogbf for 0/inf/NaN handling. It is still kept for float128, and this patch also optimizes it to avoid need to call extra generic symbol to handle not number inputs. On same hardware (POWER9/gcc-13) it shows the improvement: * master "ilogbf128": { "subnormal": { "duration": 5.09608e+08, "iterations": 3.3092e+07, "max": 28.845, "min": 6.824, "mean": 15.3997 }, "normal": { "duration": 5.05148e+08, "iterations": 9.1692e+07, "max": 7.744, "min": 5.377, "mean": 5.50918 } } * patch: "ilogbf128": { "subnormal": { "duration": 5.0586e+08, "iterations": 8.388e+07, "max": 7.3295, "min": 5.952, "mean": 6.03076 }, "normal": { "duration": 5.04783e+08, "iterations": 9.6608e+07, "max": 8.9255, "min": 5.185, "mean": 5.22507 } } Checked on powerpc64le-linux-gnu and powerpc64le-linux-gnu targetting POWER8 and with --disable-multi-arch on POWER9.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	d8e83281aa	math: Remove i386 ilogb/ilogbf/llogb/llogbf The new float and double implementation does not required an extra function call and error handling uses math_err function, which results in better performance on i386 as well. With gcc-14 on AMD AMD Ryzen 9 5900X, master shows: $ ./benchtests/bench-ilogb "ilogb": { "subnormal": { "duration": 3.68863e+09, "iterations": 1.72228e+08, "max": 89.2995, "min": 21.016, "mean": 21.4171 }, "normal": { "duration": 3.68878e+09, "iterations": 1.72948e+08, "max": 78.6065, "min": 21.127, "mean": 21.3288 } } $ ./benchtests/bench-ilogbf "ilogbf": { "subnormal": { "duration": 3.68835e+09, "iterations": 1.66716e+08, "max": 46.953, "min": 21.793, "mean": 22.1236 }, "normal": { "duration": 3.68784e+09, "iterations": 1.66168e+08, "max": 46.9715, "min": 21.904, "mean": 22.1935 } } While with this patch: $ ./benchtests/bench-ilogb "ilogb": { "subnormal": { "duration": 3.68134e+09, "iterations": 4.17516e+08, "max": 32.5045, "min": 8.3245, "mean": 8.81723 }, "normal": { "duration": 3.6677e+09, "iterations": 6.79468e+08, "max": 50.9305, "min": 5.3465, "mean": 5.3979 } } $ ./benchtests/bench-ilogbf "ilogbf": { "subnormal": { "duration": 3.67553e+09, "iterations": 5.11032e+08, "max": 35.927, "min": 7.0485, "mean": 7.19237 }, "normal": { "duration": 3.66877e+09, "iterations": 6.556e+08, "max": 26.3625, "min": 5.5315, "mean": 5.59605 } } Checked on i686-linux-gnu.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	2713ad3e90	math: Optimize float ilogb/llogb It removes the wrapper by moving the error/EDOM handling to an out-of-line implementation (__math_invalidf_i/__math_invalidf_li). Also, __glibc_unlikely is used on errors case since it helps code generation on recent gcc. The code now builds to with gcc-14 on aarch64: 0000000000000000 <__ilogbf>: 0: 1e260000 fmov w0, s0 4: d3577801 ubfx x1, x0, #23, #8 8: 340000e1 cbz w1, 24 <__ilogbf+0x24> c: 5101fc20 sub w0, w1, #0x7f 10: 7103fc3f cmp w1, #0xff 14: 54000040 b.eq 1c <__ilogbf+0x1c> // b.none 18: d65f03c0 ret 1c: 12b00000 mov w0, #0x7fffffff // #2147483647 20: 14000000 b 0 <__math_invalidf_i> 24: 53175800 lsl w0, w0, #9 28: 340000a0 cbz w0, 3c <__ilogbf+0x3c> 2c: 5ac01000 clz w0, w0 30: 12800fc1 mov w1, #0xffffff81 // #-127 34: 4b000020 sub w0, w1, w0 38: d65f03c0 ret 3c: 320107e0 mov w0, #0x80000001 // #-2147483647 40: 14000000 b 0 <__math_invalidf_i> Some ABI requires additional adjustments: * i386 and m68k requires to use the template version, since both provide __ieee754_ilogb implementatations. * loongarch uses a custom implementation as well. * powerpc64le also has a custom implementation for POWER9, which is also used for float and float128 version. The generic e_ilogb.c implementation is moved on powerpc to keep the current code as-is. Checked on aarch64-linux-gnu and x86_64-linux-gnu.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	4977d28d6b	math: Remove UB and optimize double ilogbf The subnormal exponent calculation invokes UB by left shifting the signed expoenent to find the first leading bit. The patch reimplements ilogb using the math_config.h macros and uses the new stdbit.h function to simplify the subnormal handling. On aarch64 it generates better code: * master: 0000000000000000 <__ieee754_ilogbf>: 0: 1e260000 fmov w0, s0 4: 12007801 and w1, w0, #0x7fffffff 8: 72091c1f tst w0, #0x7f800000 c: 54000141 b.ne 34 <__ieee754_ilogbf+0x34> // b.any 10: 34000201 cbz w1, 50 <__ieee754_ilogbf+0x50> 14: 53185c21 lsl w1, w1, #8 18: 12800fa0 mov w0, #0xffffff82 // #-126 1c: d503201f nop 20: 531f7821 lsl w1, w1, #1 24: 51000400 sub w0, w0, #0x1 28: 7100003f cmp w1, #0x0 2c: 54ffffac b.gt 20 <__ieee754_ilogbf+0x20> 30: d65f03c0 ret 34: 13177c20 asr w0, w1, #23 38: 12b01002 mov w2, #0x7f7fffff // #2139095039 3c: 5101fc00 sub w0, w0, #0x7f 40: 6b02003f cmp w1, w2 44: 12b00001 mov w1, #0x7fffffff // #2147483647 48: 1a819000 csel w0, w0, w1, ls // ls = plast 4c: d65f03c0 ret 50: 320107e0 mov w0, #0x80000001 // #-2147483647 54: d65f03c0 ret * patch: 0000000000000000 <__ieee754_ilogbf>: 0: 1e260001 fmov w1, s0 4: d3577820 ubfx x0, x1, #23, #8 8: 350000e0 cbnz w0, 24 <__ieee754_ilogbf+0x24> c: 53175821 lsl w1, w1, #9 10: 34000141 cbz w1, 38 <__ieee754_ilogbf+0x38> 14: 5ac01021 clz w1, w1 18: 12800fc0 mov w0, #0xffffff81 // #-127 1c: 4b010000 sub w0, w0, w1 20: d65f03c0 ret 24: 7103fc1f cmp w0, #0xff 28: 5101fc00 sub w0, w0, #0x7f 2c: 12b00001 mov w1, #0x7fffffff // #2147483647 30: 1a811000 csel w0, w0, w1, ne // ne = any 34: d65f03c0 ret 38: 320107e0 mov w0, #0x80000001 // #-2147483647 3c: d65f03c0 ret Other architecture with support for stdc_leading_zeros and/or __builtin_clzll should have similar improvements. Checked on aarch64-linux-gnu and x86_64-linux-gnu.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	46e4070d56	math: Optimize double ilogb/llogb It removes the wrapper by moving the error/EDOM handling to an out-of-line implementation (__math_invalid_i/__math_invalid_li). Also, __glibc_unlikely is used on errors case since it helps code generation on recent gcc. The code now builds to with gcc-14 on aarch64: 0000000000000000 <__ilogb>: 0: 9e660000 fmov x0, d0 4: d374f801 ubfx x1, x0, #52, #11 8: 340000e1 cbz w1, 24 <__ilogb+0x24> c: 510ffc20 sub w0, w1, #0x3ff 10: 711ffc3f cmp w1, #0x7ff 14: 54000040 b.eq 1c <__ilogb+0x1c> // b.none 18: d65f03c0 ret 1c: 12b00000 mov w0, #0x7fffffff // #2147483647 20: 14000000 b 0 <__math_invalid_i> 24: d374cc00 lsl x0, x0, #12 28: b40000a0 cbz x0, 3c <__ilogb+0x3c> 2c: dac01000 clz x0, x0 30: 12807fc1 mov w1, #0xfffffc01 // #-1023 34: 4b000020 sub w0, w1, w0 38: d65f03c0 ret 3c: 320107e0 mov w0, #0x80000001 // #-2147483647 40: 14000000 b 0 <__math_invalid_i> Some ABI requires additional adjustments: * i386 and m68k requires to use the template version, since both provide __ieee754_ilogb implementatations. * loongarch uses a custom implementation as well. * powerpc64le also has a custom implementation for POWER9, which is also used for float and float128 version. The generic e_ilogb.c implementation is moved on powerpc to keep the current code as-is. Checked on aarch64-linux-gnu and x86_64-linux-gnu.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	ac05570fcd	math: Remove UB and optimize double ilogb The subnormal exponent calculation invokes UB by left shifting the signed expoenent to find the first leading bit. The implementation also uses 32 bits operations, which generates suboptimal code in 64 bits architectures. The patch reimplements ilogb using the math_config.h macros and uses the new stdbit function to simplify the subnormal handling. On aarch64 it generates better code: * master: 0000000000000000 <__ieee754_ilogb>: 0: 9e660000 fmov x0, d0 4: d360fc02 lsr x2, x0, #32 8: d360f801 ubfx x1, x0, #32, #31 c: f26c285f tst x2, #0x7ff00000 10: 540001a1 b.ne 44 <__ieee754_ilogb+0x44> // b.any 14: 2a000022 orr w2, w1, w0 18: 34000322 cbz w2, 7c <__ieee754_ilogb+0x7c> 1c: 35000221 cbnz w1, 60 <__ieee754_ilogb+0x60> 20: 2a0003e1 mov w1, w0 24: 7100001f cmp w0, #0x0 28: 12808240 mov w0, #0xfffffbed // #-1043 2c: 540000ad b.le 40 <__ieee754_ilogb+0x40> 30: 531f7821 lsl w1, w1, #1 34: 51000400 sub w0, w0, #0x1 38: 7100003f cmp w1, #0x0 3c: 54ffffac b.gt 30 <__ieee754_ilogb+0x30> 40: d65f03c0 ret 44: 13147c20 asr w0, w1, #20 48: 12b00202 mov w2, #0x7fefffff // #2146435071 4c: 510ffc00 sub w0, w0, #0x3ff 50: 6b02003f cmp w1, w2 54: 12b00001 mov w1, #0x7fffffff // #2147483647 58: 1a819000 csel w0, w0, w1, ls // ls = plast 5c: d65f03c0 ret 60: 53155021 lsl w1, w1, #11 64: 12807fa0 mov w0, #0xfffffc02 // #-1022 68: 531f7821 lsl w1, w1, #1 6c: 51000400 sub w0, w0, #0x1 70: 7100003f cmp w1, #0x0 74: 54ffffac b.gt 68 <__ieee754_ilogb+0x68> 78: d65f03c0 ret 7c: 320107e0 mov w0, #0x80000001 // #-2147483647 80: d65f03c0 ret * patch: 0000000000000000 <__ieee754_ilogb>: 0: 9e660001 fmov x1, d0 4: d374f820 ubfx x0, x1, #52, #11 8: 350000e0 cbnz w0, 24 <__ieee754_ilogb+0x24> c: d374cc21 lsl x1, x1, #12 10: b4000141 cbz x1, 38 <__ieee754_ilogb+0x38> 14: dac01021 clz x1, x1 18: 12807fc0 mov w0, #0xfffffc01 // #-1023 1c: 4b010000 sub w0, w0, w1 20: d65f03c0 ret 24: 711ffc1f cmp w0, #0x7ff 28: 510ffc00 sub w0, w0, #0x3ff 2c: 12b00001 mov w1, #0x7fffffff // #2147483647 30: 1a811000 csel w0, w0, w1, ne // ne = any 34: d65f03c0 ret 38: 320107e0 mov w0, #0x80000001 // #-2147483647 3c: d65f03c0 ret Other architecture with support for stdc_leading_zeros and/or __builtin_clzll should have similar improvements. Checked on aarch64-linux-gnu and x86_64-linux-gnu.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	b59c34e8f9	math: Fix UB in expm1 Building with ubsan triggers: UBSAN: Undefined behaviour in ../sysdeps/ieee754/dbl-64/s_expm1.c:242:4 left shift of 4294967271 by 20 cannot be represented in type 'int' Since at the time k is always positive, just cast to uint32_t.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	541a425162	aarch64: Fix UB in ifunc resolvers When building with ubsan the ifunc resolvers triggers: UBSAN: Undefined behaviour in ../sysdeps/aarch64/multiarch/memchr.c:34:1 left shift of 255 by 24 cannot be represented in type 'int' The midr is defined as uint64_t, so use UINT64_C to define the masks as well.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	0fd666fb4b	malloc: Fix UB in malloc-debug Multiple tests fail when malloc-debug is built with ubsan: UBSAN: Undefined behaviour in malloc-debug.c:231:24 applying non-zero offset to a NULL pointer The main issue is it tries to apply DUMPED_MAIN_ARENA_CHUNK or for mem2chunk for NULL pointers.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	5197a3a2f7	stdio: Fix UB on snprintf The elf/tst-dl-printf-static test when built with ubsan triggers: UBSAN: Undefined behaviour in vfprintf-process-arg.c:58:36 negation of 9223372036854775808 cannot be represented in type 'long int'	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	7417fcf25a	x86: Fix UB in x86_cpu_present/x86_cpu_active The elf/tst-cpu-features-supports (and other tests that check for CPU features) triggers the following issue with ubsan: UBSAN: Undefined behaviour in ../sysdeps/x86/sys/platform/x86.h:59:42 left shift of 1 by 31 cannot be represented in type 'int' The active_array is unsigned, so use an unsigned constant as well.	2025-05-07 14:21:21 -03:00
Adhemerval Zanella	8cc687cbac	x86_64: Fix UB on plt rewrite The elf/tst-plt-rewrite2 and elf/tst-plt-rewrite2 triggers the failures with ubsan: UBSAN: Undefined behaviour in ../sysdeps/x86_64/dl-machine.h:637:39 store to misaligned address 0x00007adb50230021 for type 'uint32_t'	2025-05-07 14:21:20 -03:00
Adhemerval Zanella	84c6cfb9b6	elf: Fix UB on _dl_early_allocate The ubsan triggers on elf/tst-tls-allocation-failure-static-patched: UBSAN: Undefined behaviour in ../sysdeps/unix/sysv/linux/dl-early_allocate.c:58:16 pointer index expression with base 0x0000555578792000 overflowed to 0x8000555578792cc0 The function is called with a size larger than PTRDIFF_MAX, and the addition than overflow. Fix it by limiting the size up to PTRDIFF_MAX, like all other malloc functions.	2025-05-07 14:21:20 -03:00
Adhemerval Zanella	5c94ce98f7	elf: Fix UB on _dl_map_object_from_fd On 32-bit architecture ubsan triggers: UBSAN: Undefined behaviour in dl-load.c:1345:54 pointer index expression with base 0x00612508 overflowed to 0xf7c3a508 Use explicit uintptr_t operation instead.	2025-05-07 14:20:40 -03:00
Adhemerval Zanella	dc48df09cd	argp: Fix shift bug From gnulib commits 06094e390b0 and 88033d3779362a.	2025-05-07 14:20:40 -03:00
Adhemerval Zanella	d5c20a5be6	locale: Fix UB on add_locale_uint32_array The ubsan triggers: UBSAN: Undefined behaviour in programs/locfile.c:644:3 null pointer passed as argument 2, nonnull attribute declared at unknown:0:0 The obstack_grow is only required if there is extra elements to be inserted (n_elems > 0).	2025-05-07 14:20:40 -03:00
Adhemerval Zanella	8e44f1e0b6	locale: Fix UB in elem_hash The ubsan triggers: UBSAN: Undefined behaviour in ./elem-hash.h:27:14 left shift of 360447856 by 3 cannot be represented in type 'int' Using unsigned shift here zero fill like signed.	2025-05-07 14:20:40 -03:00
Adhemerval Zanella	4c7cf51f83	localte: Fix UB on collate_finish The ubsan triggers: UBSAN: Undefined behaviour in programs/ld-collate.c:1557:7 variable length array bound evaluates to non-positive value 0 nrules is guaranteed to be at most sizeof (((struct element_t ) 0)->used_in_level) 8, so use it instead.	2025-05-07 14:20:20 -03:00
Adhemerval Zanella	915959c55d	locale: Fix UB on insert_weights The ubsan triggers: UBSAN: Undefined behaviour in programs/ld-collate.c:862:5 null pointer passed as argument 2, nonnull attribute declared at unknown:0:0, The memcpy is only requires if current 'weights' is nonnull, so check it before calling it.	2025-05-07 10:41:34 -03:00
Adhemerval Zanella	59d419cf12	locate: Fix UB on memcpy call The ubsan triggers: UBSAN: Undefined behaviour in programs/charmap.c:908:2 null pointer passed as argument 2, nonnull attribute declared at unknown:0:0 This is not an isseu since size is always '0' in this case.	2025-05-07 10:41:34 -03:00
Richard Henderson	73c7242778	elf: Adjust DT_EXTRATAGIDX to avoid undefined shifts When building with --enable-ubsan, the relocation code triggers: UBSAN: Undefined behaviour in get-dynamic-info.h:56:30 left shift of 1879047925 by 1 cannot be represented in type 'int' Originally from https://sourceware.org/pipermail/libc-alpha/2015-August/063015.html.	2025-05-07 10:41:34 -03:00
Adhemerval Zanella	8ab4aa30e5	locale: Fix --enable-ubsan build failure on some ABIs On mips, arc, powerpc, and s390 gcc 14 triggers the warning In function ‘charmap_new_char’, inlined from ‘parse_charmap.isra’ at ../locale/programs/charmap.c:570:6: ../locale/programs/charmap.c:1017:32: error: ‘strncmp’ specified bound [2147483649, 4294967295] exceeds maximum object size 2147483647 [-Werror=stringop-overread] 1017 \| if (cp == &from[len1 - 1] \|\| strncmp (from, to, prefix_len) != 0) \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../locale/programs/charmap.c:1017:32: error: ‘strncmp’ specified bound [2147483649, 4294967295] exceeds maximum object size 2147483647 [-Werror=stringop-overread] cc1: all warnings being treated as errors So move the case to an special function and disable the sanitizer.	2025-05-07 10:41:34 -03:00
Adhemerval Zanella	f13e58f9b1	riscv: Fix --enable-ubsan build failure on riscv With ubsan enable, libc.so fails to build with: [...]linkobj/libc_pic.a(setcontext.os): in function `__start_context': [...]sysdeps/unix/sysv/linux/riscv/setcontext.S:111:(.text+0xc0): relocation truncated to fit: R_RISCV_JAL against symbol `__GI_exit' defined in .text section in [...]/linkobj/libc_pic.a(exit.os) Using 'call' instead of 'j' works regardless whether UBSAN.	2025-05-07 10:41:34 -03:00
Adhemerval Zanella	aacf6238fb	ubsan: Add initial support for -fsanitize=undefined It is enabled through a new configure flag, --enable-ubsan, and should be used for debugging and/or testing. Not all ubsan handlers are implemented, only those generated/required by glibc libraries, programs, and tests. Some extra handlers might be needed in future C++ tests, and __ubsan_handle_dynamic_type_cache_miss also needs a proper implementation. The ubsan handlers are exported from ld.so since they are used on all libraries and tests. This might interfere with ubsan from compiler runtime (when programs are built with libubsan in shared mode), and this is completely untested and/or not supported at the moment. There is no support for the UBSAN_OPTIONS environment variable, although some options are supported through glibc.ubsan tunables. Currently, glibc.ubsan.halt_on_errors can be used to avoid the process halt when any UB handler is issued. Using -fsanitize=undefined enables some extra compiler checks that are not easily enabled through the libc-diag.h macro. For instance on iconv/iconvconfig.c, gcc 14.2.1 shows: In file included from ../include/bits/string_fortified.h:1, from ../string/string.h:548, from ../include/string.h:60, from iconvconfig.c:32: In function ‘strcpy’, inlined from ‘write_output’ at iconvconfig.c:1033:7, inlined from ‘main’ at iconvconfig.c:340:14: ../string/bits/string_fortified.h:81:10: error: ‘__builtin_memcpy’ offset [0, 7] is out of the bounds [0, 0] [-Werror=array-bounds=] 81 \| return __builtin___strcpy_chk (__dest, __src, __glibc_objsize (__dest)); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../string/bits/string_fortified.h:81:10: error: ‘__builtin_memcpy’ offset [0, 7] is out of the bounds [0, 0] [-Werror=array-bounds=] cc1: all warnings being treated as errors Some extra code adjustments are required to fix such cases. This preliminary support is still incomplete: * Not all targets are supported, nor have I checked the test suitei on all successful targets. Also, I only checked with limited gcc versions (only gcc 14.2.1 and for some targets 15.0.0). Currently --enable-ubsan builds on Linux for aarch64, arm, hppa, i686, powerpc64, microblaze, mips64, loongarch64, sparc, s390x, and x86_64. * The instrumentation is disabled on rltd.c, although it is enabled on other loaders functions. * A lot of test cases show failures due to UB. Also, gcc-14 triggers an ICE building math routines. gcc-15 works correctly.	2025-05-07 10:41:28 -03:00
Collin Funk	b4495bd405	nss: remove undefined behavior and optimize getaddrinfo On x86-64 and compiling with -O2 using stdc_leading_zeros compiles to the bsr instruction. The fls function removed by this patch is inlined but still loops while checking each bit individually. * nss/getaddrinfo.c: Include <stdbit.h>. (fls): Remove function. This function contains a left shift of 31 on an 'int' which is undefined. (rfc3484_sort): Use stdc_leading_zeros instead of fls. Signed-off-by: Collin Funk <collin.funk1@gmail.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-05-06 13:31:59 -03:00
Adhemerval Zanella	ac4e838289	powerpc: Remove POWER7 strncasecmp optimization These routines are not extensively used (gnulib documentation even recommend use a replacement [1]), and there is already a POWER8 version that uses proper vectorized instructions. [1] https://www.gnu.org/software/gnulib/manual/gnulib.html#C-strings Checked with a build for some powerpc variations. Reviewed-by: Peter Bergner <bergner@linux.ibm.com>	2025-05-06 13:31:01 -03:00
DJ Delorie	3270c50e48	manual: add more pthread functions Add stubs and partial docs for many undocumented pthreads functions. While neither exhaustive nor complete, gives minimal usage docs for many functions and expands the pthreads chapters, making it easier to continue improving this section in the future. Reviewed-by: Collin Funk <collin.funk1@gmail.com>	2025-05-05 22:42:26 -04:00
Stefan Liebler	4f6dae2195	S390: Add new s390 platform z17. The glibc-hwcaps subdirectories are extended by "z17". Libraries are loaded if the z17 facility bits are active: - Miscellaneous-instruction-extensions facility 4 - Vector-enhancements-facility 3 - Vector-Packed-Decimal-Enhancement Facility 3 - CPU: Concurrent-Functions Facility tst-glibc-hwcaps.c is extended in order to test z17 via new marker6. In case of running on a z17 with a kernel not recognizing z17 yet, AT_PLATFORM will be z900 but vector-bit in AT_HWCAP is set. This situation is now recognized and this testcase does not fail. A fatal glibc error is dumped if glibc was build with architecture level set for z17, but run on an older machine (See dl-hwcap-check.h). Note, you might get an SIGILL before this check if you don't use: configure --with-rtld-early-cflags=-march=<older-machine> ld.so --list-diagnostics now also dumps information about s390.cpu_features. Independent from z17, the s390x kernel won't introduce new HWCAP-Bits if there is no special handling needed in kernel itself. For z17, we don't have new HWCAP flags, but have to check the facility bits retrieved by stfle-instruction. Instead of storing all the stfle-bits (currently four 64bit values) in the cpu_features struct, we now only store those bits, which are needed within glibc itself. Note that we have this list twice, one with original values and the other one which can be filtered with GLIBC_TUNABLES=glibc.cpu.hwcaps. Those new fields are stored in so far reserved space in cpu_features struct. Thus processes started in between the update of glibc package and we e.g. have a new ld.so and an old libc.so, won't crash. The glibc internal ifunc-resolvers would not select the best optimized variant. The users of stfle-bits are also updated: - parsing of GLIBC_TUNABLES=glibc.cpu.hwcaps - glibc internal ifunc-resolvers - __libc_ifunc_impl_list - sysconf	2025-05-05 10:30:55 +02:00
Joseph Myers	59f64a1f4f	Correct test descriptors in libm-test-pown.inc While working on implementing compoundn, I noticed that libm-test-pown.inc was wrongly using TEST_ff_f and AUTO_TESTS_ff_f when the actual types involved meant fL_f should be used instead of ff_f; fix to use the correct descriptor strings for pown. (These strings affect how gen-libm-test.py generates a C file in some cases. The structure type test_fL_f_data for expected results and the use of RUN_TEST_LOOP_fL_f in the ALL_RM_TEST call were already correct.) Tested for x86_64. The generated libm-test-pown.c was actually unchanged, but the old descriptor strings were still logically incorrect.	2025-05-01 22:28:59 +00:00
Wilco Dijkstra	5d10174581	malloc: Inline tcache_try_malloc Inline tcache_try_malloc into calloc since it is the only caller. Also fix usize2tidx and use it in __libc_malloc, __libc_calloc and _mid_memalign. The result is simpler, cleaner code. Reviewed-by: DJ Delorie <dj@redhat.com>	2025-05-01 20:01:53 +00:00
Adhemerval Zanella	84977600da	math: Fix UB on sinpif (BZ 32925) The left shift overflows for 'int', use uint32_t instead. It syncs with CORE-MATH commit bbfabd99. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:20:28 -03:00
Adhemerval Zanella	7a0d7fb25c	math: Fix UB on erfcf (BZ 32924) The left shift overflows for 'int', use uint64_t instead. It syncs with CORE-MATH commit d0a2be200cbc1344d800d9ef0ebee9ad67dd3ad8. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:20:25 -03:00
Adhemerval Zanella	8eeb7de8a2	math: Fix UB on cospif (BZ 32923) The left shift overflows for 'int', use uint32_t instead. It syncs with CORE-MATH commit bbfabd993a71b049c210b0febfd06d18369fadc1. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:20:16 -03:00
Adhemerval Zanella	7619c1b032	math: Fix UB on cbrtf (BZ 32922) The left shift overflows for 'int64_t', use unsigned instead. It syncs with CORE-MATH commit f7c7408d1749ec2859ea249495af699359ae559b. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:20:10 -03:00
Adhemerval Zanella	c8775c0423	math: Fix UB on sinhf (BZ 32921) The left shift overflows for 'int', use uint64_t instead. It syncs with CORE-MATH commit bbfabd99. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:20:04 -03:00
Adhemerval Zanella	de0c4adf94	math: Fix UB on logf (BZ 32920) The left shift overflows for 'int', use a literal instead. It syncs with OPTIMIZED-ROUTINES commit 0f87f607b976820ef41fe64d004fe67dc7af8236. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:19:59 -03:00
Adhemerval Zanella	4a1b96bf52	math: Fix UB on coshf (BZ 32919) The left shift overflows for 'int', use uint64_t instead. It syncs with CORE-MATH commit 4d6192d2. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:19:54 -03:00
Adhemerval Zanella	92f7b6061d	math: Fix UB on atanhf (BZ 32918) The left shift overflows for 'int', use unsigned instead. It syncs with CORE-MATH commit 4d6192d2. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2025-04-29 15:19:42 -03:00
Adhemerval Zanella	0c34259423	nptl: Fix pthread_getattr_np when modules with execstack are allowed (BZ 32897) The BZ 32653 fix (`12a497c716`) kept the stack pointer zeroing from make_main_stack_executable on _dl_make_stack_executable. However, previously the 'stack_endp' pointed to temporary variable created before the call of _dl_map_object_from_fd; while now we use the __libc_stack_end directly. Since pthread_getattr_np relies on correct __libc_stack_end, if _dl_make_stack_executable is called (for instance, when glibc.rtld.execstack=2 is set) __libc_stack_end will be set to zero, and the call will always fail. The __libc_stack_end zero was used a mitigation hardening, but since `52a01100ad` it is used solely on pthread_getattr_np code. So there is no point in zeroing anymore. Checked on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Sam James <sam@gentoo.org>	2025-04-28 10:13:46 -03:00
Julian Zhu	4c966c0780	RISC-V: Use builtin for ffs and ffsll while supported extension available Hardware ctz instructions are available in the RISC-V Zbb and XTheadBb extension. With special `-march` flags defined, we can generate more simplified code compared to the generic implementation of `ffs`/`ffsll`. Signed-off-by: Julian Zhu <julian.oerv@isrc.iscas.ac.cn> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2025-04-28 09:51:59 -03:00
Adhemerval Zanella	2be836fe44	stdio: Remove UB on printf_fp The __printf_fp_buffer_1 issues count_leading_zeros with 0 argument, which might leads to call __builtin_ctz depending of the ABI. Replace with stdbit.h function instead. Checked on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Paul Eggert <eggert@cs.ucla.edu>	2025-04-28 09:51:59 -03:00
Cupertino Miranda	77930e0447	benchtest: Correct shell script related to bench-malloc-thread This patch changes the shell script that selects which arguments are used for the execution of bench-malloc-thread. The problem seems to have been introduced in commit: commit `2d6427a63c` Author: Wangyang Guo <wangyang.guo@intel.com> Date: Fri Nov 29 16:05:35 2024 +0800 benchtests: Add calloc test With current condition, the following error "/bin/sh: 3: [[: not found" occurs when executing `make bench BENCHSET="malloc-thread"` and the else path is taken, using incorrect arguments for bench test execution. Error is reproducible in Debian based distros. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2025-04-25 16:38:25 +02:00
H. Peter Anvin	e04afb7177	linux/termio: remove <termio.h> and struct termio The <termio.h> interface is absolutely ancient: it was obsoleted by <termios.h> already in the first version of POSIX (1988) and thus predates the very first version of Linux. Unfortunately, some constant macros are used both by <termio.h> and <termios.h>; particularly problematic is the baud rate constants since the termio interface requires that the baud rate is set via an enumeration as part of c_cflag. In preparation of revamping the termios interface to support the arbitrary baud rate capability that the Linux kernel has supported since 2008, remove <termio.h> in the hope that no one still uses this archaic interface. Note that there is no actual code in glibc to support termio: it is purely an unabstracted ioctl() interface. Signed-off-by: H. Peter Anvin (Intel) <hpa@zytor.com> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2025-04-25 07:30:59 +02:00
Aurelien Jarno	e78caeb4ff	elf: tst-audit10: split AVX512F code into dedicated functions [BZ #32882 ] "Recent" GCC versions (since commit fc62716fe8d1, backported to stable branches) emit a vzeroupper instruction at the end of functions containing AVX instructions. This causes the tst-audit10 test to fail on CPUs lacking AVX instructions, despite the AVX512F check. The crash occurs in the pltenter function of tst-auditmod10b.c. Fix that by moving the code guarded by the check_avx512 function into specific functions using the target ("avx512f") attribute. Note that since commit `5359c3bc91` ("x86-64: Remove compiler -mavx512f check") it is safe to assume that the compiler has AVX512F support, thus the __AVX512F__ checks can be dropped. Tested on non-AVX, AVX2 and AVX512F machines. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2025-04-22 23:39:59 +02:00

1 2 3 4 5 ...

42371 Commits All Branches Search

42371 Commits

All Branches