Show patches with: Submitter = Liu, Hongtao       |    State = Action Required       |    Archived = No       |   611 patches
« 1 2 3 46 7 »
Patch Series A/F/R/T S/W/F Date Submitter Delegate State
Don't assume it's AVX_U128_CLEAN after call_insn whose abi.mode_clobber(V4DImode) deosn't contains … Don't assume it's AVX_U128_CLEAN after call_insn whose abi.mode_clobber(V4DImode) deosn't contains … - - - - --- 2023-12-08 Liu, Hongtao New
Support udot_prodv*qi with emulation sdot_prodv*hi Support udot_prodv*qi with emulation sdot_prodv*hi - - - - --- 2023-12-04 Liu, Hongtao New
Don't vectorize when vector stmts are only vec_contruct and stores Don't vectorize when vector stmts are only vec_contruct and stores - - - - --- 2023-12-04 Liu, Hongtao New
Take register pressure into account for vec_construct/scalar_to_vec when the components are not loa… Take register pressure into account for vec_construct/scalar_to_vec when the components are not loa… - - - - --- 2023-12-01 Liu, Hongtao New
Use vec_extact_lo instead of subreg in reduc_<code>_scal_m. Use vec_extact_lo instead of subreg in reduc_<code>_scal_m. - - - - --- 2023-11-29 Liu, Hongtao New
[x86] Support sdot_prodv*qi with emulation of sdot_prodv*hi. [x86] Support sdot_prodv*qi with emulation of sdot_prodv*hi. - - - - --- 2023-11-29 Liu, Hongtao New
Take register pressure into account for vec_construct when the components are not loaded from memor… Take register pressure into account for vec_construct when the components are not loaded from memor… - - - - --- 2023-11-28 Liu, Hongtao New
Set AVOID_256FMA_CHAINS TO m_GENERIC as it's generally good to new platforms Set AVOID_256FMA_CHAINS TO m_GENERIC as it's generally good to new platforms - - - - --- 2023-11-22 Liu, Hongtao New
[x86] Support reduc_{and, ior, xor}_scal_m for V4HI/V8QI/V4QImode [x86] Support reduc_{and, ior, xor}_scal_m for V4HI/V8QI/V4QImode - - - - --- 2023-11-20 Liu, Hongtao New
Support cbranchm for Vector HI/QImode. Support cbranchm for Vector HI/QImode. - - - - --- 2023-11-17 Liu, Hongtao New
[2/2] Add i?86-*-* and x86_64-*-* to vect_logical_reduc [1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode. - - - - --- 2023-11-17 Liu, Hongtao New
[1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode. [1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode. - - - - --- 2023-11-17 Liu, Hongtao New
[V2] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC… [V2] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC… - - - - --- 2023-11-16 Liu, Hongtao New
Fix ICE of unrecognizable insn. Fix ICE of unrecognizable insn. - - - - --- 2023-11-16 Liu, Hongtao New
Fix ICE in vectorizable_nonlinear_induction with bitfield. Fix ICE in vectorizable_nonlinear_induction with bitfield. - - - - --- 2023-11-14 Liu, Hongtao New
Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just VCE:((a cmp b) ? (VCE c) : (VCE d)). Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just VCE:((a cmp b) ? (VCE c) : (VCE d)). - - - - --- 2023-11-10 Liu, Hongtao New
Support vec_set/vec_extract/vec_init for V4HF/V2HF. Support vec_set/vec_extract/vec_init for V4HF/V2HF. - - - - --- 2023-11-10 Liu, Hongtao New
Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE:a cmp VCE:b) ? c : d. Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE:a cmp VCE:b) ? c : d. - - - - --- 2023-11-10 Liu, Hongtao New
Fix wrong code due to vec_merge + pcmp to blendvb splitter. Fix wrong code due to vec_merge + pcmp to blendvb splitter. - - - - --- 2023-11-09 Liu, Hongtao New
[V2] Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. [V2] Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. - - - - --- 2023-11-07 Liu, Hongtao New
Avoid generating RTL code when d->testing_p. Avoid generating RTL code when d->testing_p. - - - - --- 2023-11-06 Liu, Hongtao New
Support cmul{_conj}v4hf3/cmla{_conj}v4hf4 with AVX512FP16 instruction. Support cmul{_conj}v4hf3/cmla{_conj}v4hf4 with AVX512FP16 instruction. - - - - --- 2023-11-01 Liu, Hongtao New
Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect. - - - - --- 2023-10-30 Liu, Hongtao New
Fix wrong code due to incorrest define_split Fix wrong code due to incorrest define_split - - - - --- 2023-10-30 Liu, Hongtao New
Improve memcmpeq for 512-bit vector with vpcmpeq + kortest. Improve memcmpeq for 512-bit vector with vpcmpeq + kortest. - - - - --- 2023-10-27 Liu, Hongtao New
[V2,2/2] Support vec_cmpmn/vcondmn for v2hf/v4hf. [V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v… - - - - --- 2023-10-26 Liu, Hongtao New
[V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v… [V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v… - - - - --- 2023-10-26 Liu, Hongtao New
[GCC13,backport] Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_… [GCC13,backport] Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_… - - - - --- 2023-10-24 Liu, Hongtao New
Support vec_cmpmn/vcondmn for v2hf/v4hf. Support vec_cmpmn/vcondmn for v2hf/v4hf. - - - - --- 2023-10-23 Liu, Hongtao New
[x86] Remove unused mmx_pinsrw. [x86] Remove unused mmx_pinsrw. - - - - --- 2023-10-20 Liu, Hongtao New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … - - - - --- 2023-10-20 Liu, Hongtao New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … - - - - --- 2023-10-19 Liu, Hongtao New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when … - - - - --- 2023-10-18 Liu, Hongtao New
Support 32/64-bit vectorization for _Float16 fma related operations. Support 32/64-bit vectorization for _Float16 fma related operations. - - - - --- 2023-10-17 Liu, Hongtao New
[2/2] Support 32/64-bit vectorization for conversion between _Float16 and integer/float. [1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt. - - - - --- 2023-10-12 Liu, Hongtao New
[1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt. [1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt. - - - - --- 2023-10-12 Liu, Hongtao New
[x86] Refine predicate of operands[2] in divv4hf3 with register_operand. [x86] Refine predicate of operands[2] in divv4hf3 with register_operand. - - - - --- 2023-10-10 Liu, Hongtao New
[2/2] Support signbit/xorsign/copysign/abs/neg/and/xor/ior/andn for V2HF/V4HF. [1/2,x86] Support smin/smax for V2HF/V4HF - - - - --- 2023-10-08 Liu, Hongtao New
[1/2,x86] Support smin/smax for V2HF/V4HF [1/2,x86] Support smin/smax for V2HF/V4HF - - - - --- 2023-10-08 Liu, Hongtao New
Remove constraint modifier % for fcmaddcph/fmaddcph/fcmulcph since there're not commutative. Remove constraint modifier % for fcmaddcph/fmaddcph/fcmulcph since there're not commutative. - - - - --- 2023-09-11 Liu, Hongtao New
Remove constraint modifier % for fcmaddcph/fcmulcph since there're not commutative. Remove constraint modifier % for fcmaddcph/fcmulcph since there're not commutative. - - - - --- 2023-09-08 Liu, Hongtao New
Support vpermw/vpermi2w/vpermt2w instructions for vector HF/BFmodes. Support vpermw/vpermi2w/vpermt2w instructions for vector HF/BFmodes. - - - - --- 2023-09-07 Liu, Hongtao New
Generate vmovsh instead of vpblendw for specific vec_merge. Generate vmovsh instead of vpblendw for specific vec_merge. - - - - --- 2023-09-04 Liu, Hongtao New
Adjust costing of emulated vectorized gather/scatter Adjust costing of emulated vectorized gather/scatter - - - - --- 2023-08-30 Liu, Hongtao New
Refactor vector HF/BF mode iterators and patterns. Refactor vector HF/BF mode iterators and patterns. - - - - --- 2023-08-30 Liu, Hongtao New
Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available. Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available. - - - - --- 2023-08-25 Liu, Hongtao New
[x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H [x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H - - - - --- 2023-08-24 Liu, Hongtao New
Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s") Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s") - - - - --- 2023-08-23 Liu, Hongtao New
[x86] Testcase fix. [x86] Testcase fix. - - - - --- 2023-08-22 Liu, Hongtao New
Adjust testcase for Intel GDS. Adjust testcase for Intel GDS. - - - - --- 2023-08-21 Liu, Hongtao New
Mention Intel -march=gracemont for Alderlake-N. Mention Intel -march=gracemont for Alderlake-N. - - - - --- 2023-08-21 Liu, Hongtao New
Support -march=gracemont Support -march=gracemont - - - - --- 2023-08-18 Liu, Hongtao New
Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS. Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS. - - - - --- 2023-08-14 Liu, Hongtao New
[V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in… [V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in… - - - - --- 2023-08-11 Liu, Hongtao New
Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors. Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors. - - - - --- 2023-08-11 Liu, Hongtao New
Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc… Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc… - - - - --- 2023-08-10 Liu, Hongtao New
i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832] i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832] - - - - --- 2023-08-10 Liu, Hongtao New
Rename local variable subleaf_level to max_subleaf_level. Rename local variable subleaf_level to max_subleaf_level. - - - - --- 2023-08-09 Liu, Hongtao New
[V2,X86] Workaround possible CPUID bug in Sandy Bridge. [V2,X86] Workaround possible CPUID bug in Sandy Bridge. - - - - --- 2023-08-09 Liu, Hongtao New
[X86] Workaround possible CPUID bug in Sandy Bridge. [X86] Workaround possible CPUID bug in Sandy Bridge. - - - - --- 2023-08-08 Liu, Hongtao New
i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762] i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762] - - - - --- 2023-08-07 Liu, Hongtao New
Fix ICE in rtl check when bootstrap. Fix ICE in rtl check when bootstrap. - - - - --- 2023-08-07 Liu, Hongtao New
Optimize vlddqu + inserti128 to vbroadcasti128 Optimize vlddqu + inserti128 to vbroadcasti128 - - - - --- 2023-08-02 Liu, Hongtao New
Support vec_fmaddsub/vec_fmsubadd for vector HFmode. Support vec_fmaddsub/vec_fmsubadd for vector HFmode. - - - - --- 2023-08-02 Liu, Hongtao New
Adjust testcase for more optimal codegen. Adjust testcase for more optimal codegen. - - - - --- 2023-08-01 Liu, Hongtao New
[x86] Add UNSPEC_MASKOP to vpbroadcastm pattern. [x86] Add UNSPEC_MASKOP to vpbroadcastm pattern. - - - - --- 2023-07-27 Liu, Hongtao New
Optimize vlddqu to vmovdqu for TARGET_AVX Optimize vlddqu to vmovdqu for TARGET_AVX - - - - --- 2023-07-20 Liu, Hongtao New
Fix fp16 related testcase failure for i686. Fix fp16 related testcase failure for i686. - - - - --- 2023-07-20 Liu, Hongtao New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output. - - - - --- 2023-07-17 Liu, Hongtao New
Fix typo in the testcase. Fix typo in the testcase. - - - - --- 2023-07-11 Liu, Hongtao New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 Liu, Hongtao New
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand… [v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand… - - - - --- 2023-07-11 Liu, Hongtao New
Add peephole to eliminate redundant comparison after cmpccxadd. Add peephole to eliminate redundant comparison after cmpccxadd. - - - - --- 2023-07-11 Liu, Hongtao New
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '… Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '… - - - - --- 2023-07-10 Liu, Hongtao New
[V2,x86] Add pre_reload splitter to detect fp min/max pattern. [V2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-07 Liu, Hongtao New
[2/2] Adjust rtx_cost for DF/SFmode AND/IOR/XOR/ANDN operations. [1/2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-06 Liu, Hongtao New
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern. [1/2,x86] Add pre_reload splitter to detect fp min/max pattern. - - - - --- 2023-07-06 Liu, Hongtao New
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS. Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS. - - - - --- 2023-07-06 Liu, Hongtao New
Break false dependence for vpternlog by inserting vpxor. Break false dependence for vpternlog by inserting vpxor. - - - - --- 2023-07-04 Liu, Hongtao New
[2/2] Make option mvzeroupper independent of optimization level. [1/2] Don't issue vzeroupper for vzeroupper call_insn. - - - - --- 2023-06-27 Liu, Hongtao New
[1/2] Don't issue vzeroupper for vzeroupper call_insn. [1/2] Don't issue vzeroupper for vzeroupper call_insn. - - - - --- 2023-06-27 Liu, Hongtao New
[x86] Refine maskstore patterns with UNSPEC_MASKMOV. [x86] Refine maskstore patterns with UNSPEC_MASKMOV. - - - - --- 2023-06-27 Liu, Hongtao New
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16. Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16. - - - - --- 2023-06-26 Liu, Hongtao New
[3/3,aarch64] Adjust testcase to match assembly output after r14-2007. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 Liu, Hongtao New
[2/3] Don't use intermiediate type for FIX_TRUNC_EXPR when ftrapping-math. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 Liu, Hongtao New
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. [1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest. - - - - --- 2023-06-26 Liu, Hongtao New
Refine maskloadmn pattern with UNSPEC_MASKLOAD. Refine maskloadmn pattern with UNSPEC_MASKLOAD. - - - - --- 2023-06-21 Liu, Hongtao New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… [vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… - - - - --- 2023-06-20 Liu, Hongtao New
[2/2] Refined 256/512-bit vpacksswb/vpackssdw patterns. [1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. - - - - --- 2023-06-16 Liu, Hongtao New
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. [1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate. - - - - --- 2023-06-16 Liu, Hongtao New
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal. [x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal. - - - - --- 2023-06-14 Liu, Hongtao New
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. [1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. - - - - --- 2023-06-06 Liu, Hongtao New
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins. [v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins. - - - - --- 2023-06-06 Liu, Hongtao New
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char. Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char. - - - - --- 2023-06-06 Liu, Hongtao New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE. - - - - --- 2023-06-06 Liu, Hongtao New
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion. [x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion. - - - - --- 2023-06-05 Liu, Hongtao New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… [vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe… - - - - --- 2023-06-02 Liu, Hongtao New
i386: Add missing vector truncate patterns [PR92658]. i386: Add missing vector truncate patterns [PR92658]. - - - - --- 2023-06-02 Liu, Hongtao New
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range. Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range. - - - - --- 2023-06-01 Liu, Hongtao New
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors. Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors. - - - - --- 2023-05-26 Liu, Hongtao New
« 1 2 3 46 7 »