Show patches with: Submitter = Noah Goldstein       |    State = Action Required       |    Archived = No       |   888 patches
Patch Series A/F/R/T S/W/F Date Submitter Delegate State
[v1] x86/string: Fixup alignment of main loop in str{n}cmp-evex [BZ #32212] [v1] x86/string: Fixup alignment of main loop in str{n}cmp-evex [BZ #32212] - - 1 - --- 2024-09-27 Noah Goldstein New
[v3,2/2] x86: Add `Avoid_STOSB` tunable to allow NT memset without ERMS [v3,1/2] x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path - - 1 - --- 2024-08-14 Noah Goldstein New
[v3,1/2] x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path [v3,1/2] x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path - - 1 - --- 2024-08-14 Noah Goldstein New
x86: Add `Avoid_STOSB` tunable to allow NT memset without ERMS x86: Add `Avoid_STOSB` tunable to allow NT memset without ERMS - - - - --- 2024-08-13 Noah Goldstein New
x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path - - - - --- 2024-08-13 Noah Goldstein New
x86: Add `Avoid_STOSB` tunable to allow NT memset without ERMS x86: Add `Avoid_STOSB` tunable to allow NT memset without ERMS - - - - --- 2024-08-13 Noah Goldstein New
x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path x86: Use `Avoid_Non_Temporal_Memset` to control non-temporal path - - - - --- 2024-08-13 Noah Goldstein New
[v1] x86: Fix bug in strchrnul-evex512 [BZ #32078] [v1] x86: Fix bug in strchrnul-evex512 [BZ #32078] - - 1 - --- 2024-08-13 Noah Goldstein New
x86: Add new cpu-flag `Prefer_Non_Temporal` x86: Add new cpu-flag `Prefer_Non_Temporal` - - - - --- 2024-08-13 Noah Goldstein New
[v2] x86: Add new cpu-flag `Prefer_Non_Temporal` [v2] x86: Add new cpu-flag `Prefer_Non_Temporal` - - - - --- 2024-08-11 Noah Goldstein New
[v1] stdlib: Mark `abort` as `cold` [v1] stdlib: Mark `abort` as `cold` - - 1 - --- 2024-07-29 Noah Goldstein New
assert: Mark `__assert_fail` as `cold` assert: Mark `__assert_fail` as `cold` - - - - --- 2024-07-26 Noah Goldstein New
assert: Mark `__assert_fail` as `cold` assert: Mark `__assert_fail` as `cold` - - 2 - --- 2024-07-26 Noah Goldstein New
x86: Increase default `rep stosb` threshold for SKX [BZ #32009] x86: Increase default `rep stosb` threshold for SKX [BZ #32009] - - - - --- 2024-07-23 Noah Goldstein New
assert: Wrap __assert_fail in templated function assert: Wrap __assert_fail in templated function - - - - --- 2024-07-22 Noah Goldstein New
x86: Disable non-temporal memset on Skylake Server x86: Disable non-temporal memset on Skylake Server - - - - --- 2024-07-16 Noah Goldstein New
x86: Disable non-temporal memset on Skylake Server x86: Disable non-temporal memset on Skylake Server - - - - --- 2024-07-16 Noah Goldstein New
[v1] x86: Disable non-temporal memset on Skylake Server [v1] x86: Disable non-temporal memset on Skylake Server - - - - --- 2024-07-15 Noah Goldstein New
x86: Add new cpu-flag `Prefer_Non_Temporal` x86: Add new cpu-flag `Prefer_Non_Temporal` - - 1 - --- 2024-07-10 Noah Goldstein New
[v1] x86: Fix value for `x86_memset_non_temporal_threshold` when it is undesirable [v1] x86: Fix value for `x86_memset_non_temporal_threshold` when it is undesirable - - 1 1 --- 2024-06-14 Noah Goldstein New
[v2,2/2] x86: Add seperate non-temporal tunable for memset [v2,1/2] x86: Improve large memset perf with non-temporal stores [RHEL-29312] - - 1 - --- 2024-05-24 Noah Goldstein New
[v2,1/2] x86: Improve large memset perf with non-temporal stores [RHEL-29312] [v2,1/2] x86: Improve large memset perf with non-temporal stores [RHEL-29312] - - 1 - --- 2024-05-24 Noah Goldstein New
[v1] x86: Improve large memset perf with non-temporal stores [RHEL-29312] [v1] x86: Improve large memset perf with non-temporal stores [RHEL-29312] - - 1 - --- 2024-05-19 Noah Goldstein New
x86: Fixup some nits in longjmp asm implementation x86: Fixup some nits in longjmp asm implementation - - - - --- 2024-01-05 Noah Goldstein New
x86: Fixup some nits in longjmp asm implementation x86: Fixup some nits in longjmp asm implementation - - - - --- 2024-01-05 Noah Goldstein New
x86: Fixup some nits in longjmp asm implementation x86: Fixup some nits in longjmp asm implementation - - - - --- 2024-01-04 Noah Goldstein New
x86: Fixup some nits in longjmp asm implementation x86: Fixup some nits in longjmp asm implementation - - - - --- 2024-01-04 Noah Goldstein New
string: Add additional output in test-strchr failure string: Add additional output in test-strchr failure - - - - --- 2023-12-27 Noah Goldstein New
x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S - - 1 - --- 2023-11-12 Noah Goldstein New
[v4] x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S [v4] x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S - - - - --- 2023-11-11 Noah Goldstein New
[v3] x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S [v3] x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S - - - - --- 2023-11-09 Noah Goldstein New
[v2] x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S [v2] x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S - - - - --- 2023-11-08 Noah Goldstein New
x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S x86: Fix unchecked AVX512-VBMI2 usage in strrchr-evex-base.S - - - - --- 2023-11-01 Noah Goldstein New
x86: Only align destination to 1x VEC_SIZE in memset 4x loop x86: Only align destination to 1x VEC_SIZE in memset 4x loop - - 1 - --- 2023-11-01 Noah Goldstein New
x86: Prepare `strrchr-evex` and `strrchr-evex512` for AVX10 x86: Prepare `strrchr-evex` and `strrchr-evex512` for AVX10 - - 1 - --- 2023-10-04 Noah Goldstein New
x86: Prepare `strrchr-evex` and `strrchr-evex512` for AVX10 x86: Prepare `strrchr-evex` and `strrchr-evex512` for AVX10 - - - - --- 2023-09-21 Noah Goldstein New
x86: Add support for AVX10 version in cpu-features x86: Add support for AVX10 version in cpu-features - - - - --- 2023-09-20 Noah Goldstein New
x86: Add support for AVX10 preset and vec size in cpu-features x86: Add support for AVX10 preset and vec size in cpu-features - - - - --- 2023-09-20 Noah Goldstein New
[v2] x86: Add support for AVX10 version and vec size in cpu-features [v2] x86: Add support for AVX10 version and vec size in cpu-features - - - - --- 2023-08-24 Noah Goldstein New
[v2] x86: Add support for AVX10 version and vec size in cpu-features [v2] x86: Add support for AVX10 version and vec size in cpu-features - - - - --- 2023-08-24 Noah Goldstein New
[v1] x86: Add support for AVX10 version and vec size in cpu-features [v1] x86: Add support for AVX10 version and vec size in cpu-features - - - - --- 2023-08-24 Noah Goldstein New
[v3] x86: Fix incorrect scope of setting `shared_per_thread` [BZ# 30745] [v3] x86: Fix incorrect scope of setting `shared_per_thread` [BZ# 30745] - - 1 1 --- 2023-08-11 Noah Goldstein New
[v2] x86: Fix incorrect scope of setting `shared_per_thread` [BZ# 30745] [v2] x86: Fix incorrect scope of setting `shared_per_thread` [BZ# 30745] - - - - --- 2023-08-11 Noah Goldstein New
[v1] x86: Fix incorrect scope of setting `shared_per_thread` [v1] x86: Fix incorrect scope of setting `shared_per_thread` - - - - --- 2023-08-11 Noah Goldstein New
[v3] x86: Use `3/4*sizeof(per-thread-L3)` as low bound for NT threshold. [v3] x86: Use `3/4*sizeof(per-thread-L3)` as low bound for NT threshold. - - - - --- 2023-07-18 Noah Goldstein New
[v1] x86: Fix slight bug in `shared_per_thread` cache size calculation. [v1] x86: Fix slight bug in `shared_per_thread` cache size calculation. - - - - --- 2023-07-18 Noah Goldstein New
[v2] x86: Use `3/4*sizeof(per-thread-L3)` as low bound for NT threshold. [v2] x86: Use `3/4*sizeof(per-thread-L3)` as low bound for NT threshold. - - - - --- 2023-07-17 Noah Goldstein New
[v1] x86: Use `3/4*sizeof(per-thread-L3)` as low bound for NT threshold. [v1] x86: Use `3/4*sizeof(per-thread-L3)` as low bound for NT threshold. - - - - --- 2023-07-14 Noah Goldstein New
[v1,2/2] x86: Implement clock_nanosleep{_time64} syscall for x86 only. [v1,1/2] x86: Implement sched_yield syscall for x86 only. - - - - --- 2023-06-08 Noah Goldstein New
[v1,1/2] x86: Implement sched_yield syscall for x86 only. [v1,1/2] x86: Implement sched_yield syscall for x86 only. - - - - --- 2023-06-08 Noah Goldstein New
[v1,2/2] x86: Add `prepare_context_switch` to initialize register inuse states [v1,1/2] linux: Implement sched_yield in C - - - - --- 2023-06-07 Noah Goldstein New
[v1,1/2] linux: Implement sched_yield in C [v1,1/2] linux: Implement sched_yield in C - - - - --- 2023-06-07 Noah Goldstein New
[v11,3/3] x86: Make the divisor in setting `non_temporal_threshold` cpu specific [v11,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 1 - --- 2023-06-07 Noah Goldstein New
[v11,2/3] x86: Refactor Intel `init_cpu_features` [v11,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 1 - --- 2023-06-07 Noah Goldstein New
[v11,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` [v11,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 2 - --- 2023-06-07 Noah Goldstein New
[v10,3/3] x86: Make the divisor in setting `non_temporal_threshold` cpu specific [v10,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 1 - --- 2023-05-27 Noah Goldstein New
[v10,2/3] x86: Refactor Intel `init_cpu_features` [v10,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 1 - --- 2023-05-27 Noah Goldstein New
[v10,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` [v10,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 2 - --- 2023-05-27 Noah Goldstein New
[v9,3/3] x86: Make the divisor in setting `non_temporal_threshold` cpu specific [v9,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-13 Noah Goldstein New
[v9,2/3] x86: Refactor Intel `init_cpu_features` [v9,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 1 - --- 2023-05-13 Noah Goldstein New
[v9,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` [v9,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - 1 - --- 2023-05-13 Noah Goldstein New
[v8,3/3] x86: Make the divisor in setting `non_temporal_threshold` cpu specific [v1] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` - - - - --- 2023-05-12 Noah Goldstein New
[v8,2/3] x86: Refactor Intel `init_cpu_features` [v8,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-12 Noah Goldstein New
[v8,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` [v8,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-12 Noah Goldstein New
[v7,4/4] x86: Tune 'Saltwell' microarch the same was a 'Bonnell' Untitled series #354409 - - - - --- 2023-05-10 Noah Goldstein New
[v7,3/4] x86: Make the divisor in setting `non_temporal_threshold` cpu specific Untitled series #354409 - - - - --- 2023-05-10 Noah Goldstein New
[v7,2/4] x86: Refactor Intel `init_cpu_features` Untitled series #354409 - - - - --- 2023-05-10 Noah Goldstein New
[v6,4/4] x86: Tune 'Saltwell' microarch the same was a 'Bonnell' [v6,1/4] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-10 Noah Goldstein New
[v6,3/4] x86: Make the divisor in setting `non_temporal_threshold` cpu specific [v6,1/4] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-10 Noah Goldstein New
[v6,2/4] x86: Refactor Intel `init_cpu_features` [v1] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` - - - - --- 2023-05-10 Noah Goldstein New
[v6,1/4] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` [v6,1/4] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-10 Noah Goldstein New
[v5,3/3] x86: Make the divisor in setting `non_temporal_threshold` cpu specific [v5,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-09 Noah Goldstein New
[v5,2/3] x86: Refactor Intel `init_cpu_features` [v5,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-09 Noah Goldstein New
[v5,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` [v5,1/3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` - - - - --- 2023-05-09 Noah Goldstein New
[v1] x86: Use 64MB as nt-store threshold if no cacheinfo [BZ #30429] [v1] x86: Use 64MB as nt-store threshold if no cacheinfo [BZ #30429] - - 1 - --- 2023-05-09 Noah Goldstein New
[v4] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` [v4] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` - - - - --- 2023-04-25 Noah Goldstein New
[v3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` [v3] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` - - - - --- 2023-04-25 Noah Goldstein New
[v2] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` [v2] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` - - - - --- 2023-04-24 Noah Goldstein New
[v1] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` [v1] x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 2` - - - - --- 2023-04-24 Noah Goldstein New
[v1] Benchtest: Add benchtests for {wcs, str}lcpy and {wcs, str}lcat [v1] Benchtest: Add benchtests for {wcs, str}lcpy and {wcs, str}lcat - - - - --- 2023-04-22 Noah Goldstein New
[v1] nptl: Add single-threaded optimization to ADAPTIVE_NP mutex [v1] nptl: Add single-threaded optimization to ADAPTIVE_NP mutex - - - - --- 2023-04-19 Noah Goldstein New
[v2] x86-64: Replace `and %ah` write with `test %ah` read [v2] x86-64: Replace `and %ah` write with `test %ah` read - - - - --- 2023-03-13 Noah Goldstein New
[v1] x86-64: Replace `%ah` write with `%eax` read [v1] x86-64: Replace `%ah` write with `%eax` read - - - - --- 2023-03-10 Noah Goldstein New
[v2] x86: Fix strncat-avx2.S reading past length [BZ #30065] [v2] x86: Fix strncat-avx2.S reading past length [BZ #30065] - - - - --- 2023-01-31 Noah Goldstein New
[v1] x86: Fix strncat-avx2.S when `src` has no null-term [BZ #30065] [v1] x86: Fix strncat-avx2.S when `src` has no null-term [BZ #30065] - - - - --- 2023-01-31 Noah Goldstein New
[v1,2/2] nptl: Add a testcase for inputs racey to {w}memcmp{eq} BZ #29863 [v1,1/2] nptl: Changes tests to be 1/line and sorted - - - - --- 2022-12-19 Noah Goldstein New
[v1,1/2] nptl: Changes tests to be 1/line and sorted [v1,1/2] nptl: Changes tests to be 1/line and sorted - - - - --- 2022-12-19 Noah Goldstein New
[v1] nptl: Changes tests to be 1/line and sorted [v1] nptl: Changes tests to be 1/line and sorted - - - - --- 2022-12-19 Noah Goldstein New
[v4] x86: Prevent SIGSEGV in memcmp-sse2 when data is concurrently modified [BZ #29863] [v4] x86: Prevent SIGSEGV in memcmp-sse2 when data is concurrently modified [BZ #29863] - - - - --- 2022-12-14 Noah Goldstein New
[v3] x86: Prevent SIGSEGV in memcmp-sse2 when data is concurrently modified [BZ #29863] [v3] x86: Prevent SIGSEGV in memcmp-sse2 when data is concurrently modified [BZ #29863] - - 1 - --- 2022-12-14 Noah Goldstein New
[v2] x86: Prevent SIGSEGV in memcmp-sse2 when data is concurrently modified [BZ #29863] [v2] x86: Prevent SIGSEGV in memcmp-sse2 when data is concurrently modified [BZ #29863] - - - - --- 2022-12-14 Noah Goldstein New
[v1] x86: Prevent SIG11 in memcmp-sse2 when data is concurrently modified [BZ #29863] [v1] x86: Prevent SIG11 in memcmp-sse2 when data is concurrently modified [BZ #29863] - - - - --- 2022-12-14 Noah Goldstein New
[v1,27/27] x86/fpu: Remove unused svml_s_logf_data.S file [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,26/27] x86/fpu: Optimize svml_s_logf8_core_avx2.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,25/27] x86/fpu: Optimize svml_s_logf4_core_sse4.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,24/27] x86/fpu: Optimize svml_s_logf16_core_avx512.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,23/27] x86/fpu: Optimize svml_s_log2f8_core_avx2.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,22/27] x86/fpu: Optimize svml_s_log2f4_core_sse4.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,21/27] x86/fpu: Optimize svml_s_log2f16_core_avx512.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New
[v1,20/27] x86/fpu: Optimize svml_s_log10f8_core_avx2.S [v1,01/27] x86/fpu: Create helper file for common data macros - - - - --- 2022-12-07 Noah Goldstein New