GNU Compiler Collection

Show patches with: Submitter = Liu, Hongtao | State = Action Required | Archived = No | 611 patches

« 1 2 3 4 … 6 7 »

Patch	Series	A/F/R/T	S/W/F	Date	Submitter	State
Don't assume it's AVX_U128_CLEAN after call_insn whose abi.mode_clobber(V4DImode) deosn't contains …	Don't assume it's AVX_U128_CLEAN after call_insn whose abi.mode_clobber(V4DImode) deosn't contains …	- - - -	---	2023-12-08	Liu, Hongtao	New
Support udot_prodvqi with emulation sdot_prodvhi	Support udot_prodvqi with emulation sdot_prodvhi	- - - -	---	2023-12-04	Liu, Hongtao	New
Don't vectorize when vector stmts are only vec_contruct and stores	Don't vectorize when vector stmts are only vec_contruct and stores	- - - -	---	2023-12-04	Liu, Hongtao	New
Take register pressure into account for vec_construct/scalar_to_vec when the components are not loa…	Take register pressure into account for vec_construct/scalar_to_vec when the components are not loa…	- - - -	---	2023-12-01	Liu, Hongtao	New
Use vec_extact_lo instead of subreg in reduc_<code>_scal_m.	Use vec_extact_lo instead of subreg in reduc_<code>_scal_m.	- - - -	---	2023-11-29	Liu, Hongtao	New
[x86] Support sdot_prodvqi with emulation of sdot_prodvhi.	[x86] Support sdot_prodvqi with emulation of sdot_prodvhi.	- - - -	---	2023-11-29	Liu, Hongtao	New
Take register pressure into account for vec_construct when the components are not loaded from memor…	Take register pressure into account for vec_construct when the components are not loaded from memor…	- - - -	---	2023-11-28	Liu, Hongtao	New
Set AVOID_256FMA_CHAINS TO m_GENERIC as it's generally good to new platforms	Set AVOID_256FMA_CHAINS TO m_GENERIC as it's generally good to new platforms	- - - -	---	2023-11-22	Liu, Hongtao	New
[x86] Support reduc_{and, ior, xor}_scal_m for V4HI/V8QI/V4QImode	[x86] Support reduc_{and, ior, xor}_scal_m for V4HI/V8QI/V4QImode	- - - -	---	2023-11-20	Liu, Hongtao	New
Support cbranchm for Vector HI/QImode.	Support cbranchm for Vector HI/QImode.	- - - -	---	2023-11-17	Liu, Hongtao	New
[2/2] Add i?86-- and x86_64-- to vect_logical_reduc	[1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode.	- - - -	---	2023-11-17	Liu, Hongtao	New
[1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode.	[1/2] Support reduc_{plus, xor, and, ior}_scal_m for vector integer mode.	- - - -	---	2023-11-17	Liu, Hongtao	New
[V2] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC…	[V2] Simplify vector ((VCE (a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE ((a cmp b) ? (VCE c) : (VC…	- - - -	---	2023-11-16	Liu, Hongtao	New
Fix ICE of unrecognizable insn.	Fix ICE of unrecognizable insn.	- - - -	---	2023-11-16	Liu, Hongtao	New
Fix ICE in vectorizable_nonlinear_induction with bitfield.	Fix ICE in vectorizable_nonlinear_induction with bitfield.	- - - -	---	2023-11-14	Liu, Hongtao	New
Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just VCE:((a cmp b) ? (VCE c) : (VCE d)).	Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just VCE:((a cmp b) ? (VCE c) : (VCE d)).	- - - -	---	2023-11-10	Liu, Hongtao	New
Support vec_set/vec_extract/vec_init for V4HF/V2HF.	Support vec_set/vec_extract/vec_init for V4HF/V2HF.	- - - -	---	2023-11-10	Liu, Hongtao	New
Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE:a cmp VCE:b) ? c : d.	Simplify vector ((VCE?(a cmp b ? -1 : 0)) < 0) ? c : d to just (VCE:a cmp VCE:b) ? c : d.	- - - -	---	2023-11-10	Liu, Hongtao	New
Fix wrong code due to vec_merge + pcmp to blendvb splitter.	Fix wrong code due to vec_merge + pcmp to blendvb splitter.	- - - -	---	2023-11-09	Liu, Hongtao	New
[V2] Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect.	[V2] Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect.	- - - -	---	2023-11-07	Liu, Hongtao	New
Avoid generating RTL code when d->testing_p.	Avoid generating RTL code when d->testing_p.	- - - -	---	2023-11-06	Liu, Hongtao	New
Support cmul{_conj}v4hf3/cmla{_conj}v4hf4 with AVX512FP16 instruction.	Support cmul{_conj}v4hf3/cmla{_conj}v4hf4 with AVX512FP16 instruction.	- - - -	---	2023-11-01	Liu, Hongtao	New
Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect.	Handle bitop with INTEGER_CST in analyze_and_compute_bitop_with_inv_effect.	- - - -	---	2023-10-30	Liu, Hongtao	New
Fix wrong code due to incorrest define_split	Fix wrong code due to incorrest define_split	- - - -	---	2023-10-30	Liu, Hongtao	New
Improve memcmpeq for 512-bit vector with vpcmpeq + kortest.	Improve memcmpeq for 512-bit vector with vpcmpeq + kortest.	- - - -	---	2023-10-27	Liu, Hongtao	New
[V2,2/2] Support vec_cmpmn/vcondmn for v2hf/v4hf.	[V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v…	- - - -	---	2023-10-26	Liu, Hongtao	New
[V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v…	[V2,1/2] Pass type of comparison operands instead of comparison result to truth_type_for in build_v…	- - - -	---	2023-10-26	Liu, Hongtao	New
[GCC13,backport] Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_…	[GCC13,backport] Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_…	- - - -	---	2023-10-24	Liu, Hongtao	New
Support vec_cmpmn/vcondmn for v2hf/v4hf.	Support vec_cmpmn/vcondmn for v2hf/v4hf.	- - - -	---	2023-10-23	Liu, Hongtao	New
[x86] Remove unused mmx_pinsrw.	[x86] Remove unused mmx_pinsrw.	- - - -	---	2023-10-20	Liu, Hongtao	New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when …	Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when …	- - - -	---	2023-10-20	Liu, Hongtao	New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when …	Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when …	- - - -	---	2023-10-19	Liu, Hongtao	New
Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when …	Avoid compile time hog on vect_peel_nonlinear_iv_init for nonlinear induction vec_step_op_mul when …	- - - -	---	2023-10-18	Liu, Hongtao	New
Support 32/64-bit vectorization for _Float16 fma related operations.	Support 32/64-bit vectorization for _Float16 fma related operations.	- - - -	---	2023-10-17	Liu, Hongtao	New
[2/2] Support 32/64-bit vectorization for conversion between _Float16 and integer/float.	[1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt.	- - - -	---	2023-10-12	Liu, Hongtao	New
[1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt.	[1/2] Enable vectorization for V2HF/V4HF rounding operations and sqrt.	- - - -	---	2023-10-12	Liu, Hongtao	New
[x86] Refine predicate of operands[2] in divv4hf3 with register_operand.	[x86] Refine predicate of operands[2] in divv4hf3 with register_operand.	- - - -	---	2023-10-10	Liu, Hongtao	New
[2/2] Support signbit/xorsign/copysign/abs/neg/and/xor/ior/andn for V2HF/V4HF.	[1/2,x86] Support smin/smax for V2HF/V4HF	- - - -	---	2023-10-08	Liu, Hongtao	New
[1/2,x86] Support smin/smax for V2HF/V4HF	[1/2,x86] Support smin/smax for V2HF/V4HF	- - - -	---	2023-10-08	Liu, Hongtao	New
Remove constraint modifier % for fcmaddcph/fmaddcph/fcmulcph since there're not commutative.	Remove constraint modifier % for fcmaddcph/fmaddcph/fcmulcph since there're not commutative.	- - - -	---	2023-09-11	Liu, Hongtao	New
Remove constraint modifier % for fcmaddcph/fcmulcph since there're not commutative.	Remove constraint modifier % for fcmaddcph/fcmulcph since there're not commutative.	- - - -	---	2023-09-08	Liu, Hongtao	New
Support vpermw/vpermi2w/vpermt2w instructions for vector HF/BFmodes.	Support vpermw/vpermi2w/vpermt2w instructions for vector HF/BFmodes.	- - - -	---	2023-09-07	Liu, Hongtao	New
Generate vmovsh instead of vpblendw for specific vec_merge.	Generate vmovsh instead of vpblendw for specific vec_merge.	- - - -	---	2023-09-04	Liu, Hongtao	New
Adjust costing of emulated vectorized gather/scatter	Adjust costing of emulated vectorized gather/scatter	- - - -	---	2023-08-30	Liu, Hongtao	New
Refactor vector HF/BF mode iterators and patterns.	Refactor vector HF/BF mode iterators and patterns.	- - - -	---	2023-08-30	Liu, Hongtao	New
Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available.	Use vmaskmov{ps, pd} for VI48_128_256 when TARGET_AVX2 is not available.	- - - -	---	2023-08-25	Liu, Hongtao	New
[x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H	[x86] Refactor mode iterator V_128 and V_128H, V_256 and V_256H	- - - -	---	2023-08-24	Liu, Hongtao	New
Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s")	Fix target_clone ("arch=graniterapids-d") and target_clone ("arch=arrowlake-s")	- - - -	---	2023-08-23	Liu, Hongtao	New
[x86] Testcase fix.	[x86] Testcase fix.	- - - -	---	2023-08-22	Liu, Hongtao	New
Adjust testcase for Intel GDS.	Adjust testcase for Intel GDS.	- - - -	---	2023-08-21	Liu, Hongtao	New
Mention Intel -march=gracemont for Alderlake-N.	Mention Intel -march=gracemont for Alderlake-N.	- - - -	---	2023-08-21	Liu, Hongtao	New
Support -march=gracemont	Support -march=gracemont	- - - -	---	2023-08-18	Liu, Hongtao	New
Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS.	Generate vmovapd instead of vmovsd for moving DFmode between SSE_REGS.	- - - -	---	2023-08-14	Liu, Hongtao	New
[V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in…	[V2] Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter in…	- - - -	---	2023-08-11	Liu, Hongtao	New
Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors.	Software mitigation: Disable gather generation in vectorization for GDS affected Intel Processors.	- - - -	---	2023-08-11	Liu, Hongtao	New
Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc…	Support -m[no-]gather -m[no-]scatter to enable/disable vectorization for all gather/scatter instruc…	- - - -	---	2023-08-10	Liu, Hongtao	New
i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832]	i386: Do not sanitize upper part of V2HFmode and V4HFmode reg with -fno-trapping-math [PR110832]	- - - -	---	2023-08-10	Liu, Hongtao	New
Rename local variable subleaf_level to max_subleaf_level.	Rename local variable subleaf_level to max_subleaf_level.	- - - -	---	2023-08-09	Liu, Hongtao	New
[V2,X86] Workaround possible CPUID bug in Sandy Bridge.	[V2,X86] Workaround possible CPUID bug in Sandy Bridge.	- - - -	---	2023-08-09	Liu, Hongtao	New
[X86] Workaround possible CPUID bug in Sandy Bridge.	[X86] Workaround possible CPUID bug in Sandy Bridge.	- - - -	---	2023-08-08	Liu, Hongtao	New
i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762]	i386: Clear upper bits of XMM register for V4HFmode/V2HFmode operations [PR110762]	- - - -	---	2023-08-07	Liu, Hongtao	New
Fix ICE in rtl check when bootstrap.	Fix ICE in rtl check when bootstrap.	- - - -	---	2023-08-07	Liu, Hongtao	New
Optimize vlddqu + inserti128 to vbroadcasti128	Optimize vlddqu + inserti128 to vbroadcasti128	- - - -	---	2023-08-02	Liu, Hongtao	New
Support vec_fmaddsub/vec_fmsubadd for vector HFmode.	Support vec_fmaddsub/vec_fmsubadd for vector HFmode.	- - - -	---	2023-08-02	Liu, Hongtao	New
Adjust testcase for more optimal codegen.	Adjust testcase for more optimal codegen.	- - - -	---	2023-08-01	Liu, Hongtao	New
[x86] Add UNSPEC_MASKOP to vpbroadcastm pattern.	[x86] Add UNSPEC_MASKOP to vpbroadcastm pattern.	- - - -	---	2023-07-27	Liu, Hongtao	New
Optimize vlddqu to vmovdqu for TARGET_AVX	Optimize vlddqu to vmovdqu for TARGET_AVX	- - - -	---	2023-07-20	Liu, Hongtao	New
Fix fp16 related testcase failure for i686.	Fix fp16 related testcase failure for i686.	- - - -	---	2023-07-20	Liu, Hongtao	New
Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output.	Remove # from <mask_codefor>one_cmpl<mode>2<mask_name> assemble output.	- - - -	---	2023-07-17	Liu, Hongtao	New
Fix typo in the testcase.	Fix typo in the testcase.	- - - -	---	2023-07-11	Liu, Hongtao	New
Add peephole to eliminate redundant comparison after cmpccxadd.	Add peephole to eliminate redundant comparison after cmpccxadd.	- - - -	---	2023-07-11	Liu, Hongtao	New
[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand…	[v2] Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand…	- - - -	---	2023-07-11	Liu, Hongtao	New
Add peephole to eliminate redundant comparison after cmpccxadd.	Add peephole to eliminate redundant comparison after cmpccxadd.	- - - -	---	2023-07-11	Liu, Hongtao	New
Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '…	Break false dependence for vpternlog by inserting vpxor or setting constraint of input operand to '…	- - - -	---	2023-07-10	Liu, Hongtao	New
[V2,x86] Add pre_reload splitter to detect fp min/max pattern.	[V2,x86] Add pre_reload splitter to detect fp min/max pattern.	- - - -	---	2023-07-07	Liu, Hongtao	New
[2/2] Adjust rtx_cost for DF/SFmode AND/IOR/XOR/ANDN operations.	[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.	- - - -	---	2023-07-06	Liu, Hongtao	New
[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.	[1/2,x86] Add pre_reload splitter to detect fp min/max pattern.	- - - -	---	2023-07-06	Liu, Hongtao	New
Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS.	Disparage slightly for the alternative which move DFmode between SSE_REGS and GENERAL_REGS.	- - - -	---	2023-07-06	Liu, Hongtao	New
Break false dependence for vpternlog by inserting vpxor.	Break false dependence for vpternlog by inserting vpxor.	- - - -	---	2023-07-04	Liu, Hongtao	New
[2/2] Make option mvzeroupper independent of optimization level.	[1/2] Don't issue vzeroupper for vzeroupper call_insn.	- - - -	---	2023-06-27	Liu, Hongtao	New
[1/2] Don't issue vzeroupper for vzeroupper call_insn.	[1/2] Don't issue vzeroupper for vzeroupper call_insn.	- - - -	---	2023-06-27	Liu, Hongtao	New
[x86] Refine maskstore patterns with UNSPEC_MASKMOV.	[x86] Refine maskstore patterns with UNSPEC_MASKMOV.	- - - -	---	2023-06-27	Liu, Hongtao	New
Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16.	Issue a warning for conversion between short and __bf16 under TARGET_AVX512BF16.	- - - -	---	2023-06-26	Liu, Hongtao	New
[3/3,aarch64] Adjust testcase to match assembly output after r14-2007.	[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	- - - -	---	2023-06-26	Liu, Hongtao	New
[2/3] Don't use intermiediate type for FIX_TRUNC_EXPR when ftrapping-math.	[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	- - - -	---	2023-06-26	Liu, Hongtao	New
[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	[1/3] Use cvt_op to save intermediate type operand instead of "subtle" vec_dest.	- - - -	---	2023-06-26	Liu, Hongtao	New
Refine maskloadmn pattern with UNSPEC_MASKLOAD.	Refine maskloadmn pattern with UNSPEC_MASKLOAD.	- - - -	---	2023-06-21	Liu, Hongtao	New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	- - - -	---	2023-06-20	Liu, Hongtao	New
[2/2] Refined 256/512-bit vpacksswb/vpackssdw patterns.	[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.	- - - -	---	2023-06-16	Liu, Hongtao	New
[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.	[1/2] Reimplement packuswb/packusdw with UNSPEC_US_TRUNCATE instead of original us_truncate.	- - - -	---	2023-06-16	Liu, Hongtao	New
[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal.	[x86] Use x instead of v for alternative 2 (v, BH) in mov<mode>_internal.	- - - -	---	2023-06-14	Liu, Hongtao	New
[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	[1/2] Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	- - - -	---	2023-06-06	Liu, Hongtao	New
[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins.	[v2] Explicitly view_convert_expr mask to signed type when folding pblendvb builtins.	- - - -	---	2023-06-06	Liu, Hongtao	New
Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char.	Don't fold _mm{, 256}_blendv_epi8 into (mask < 0 ? src1 : src2) when -funsigned-char.	- - - -	---	2023-06-06	Liu, Hongtao	New
Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	Fold _mm{, 256, 512}_abs_{epi8, epi16, epi32, epi64} into gimple ABSU_EXPR + VCE.	- - - -	---	2023-06-06	Liu, Hongtao	New
[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion.	[x86] Add missing vec_pack/unpacks patterns for _Float16 <-> int/float conversion.	- - - -	---	2023-06-05	Liu, Hongtao	New
[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	[vect] Use intermiediate integer type for float_expr/fix_trunc_expr when direct optab is not existe…	- - - -	---	2023-06-02	Liu, Hongtao	New
i386: Add missing vector truncate patterns [PR92658].	i386: Add missing vector truncate patterns [PR92658].	- - - -	---	2023-06-02	Liu, Hongtao	New
Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range.	Don't try bswap + rotate when TYPE_PRECISION(n->type) > n->range.	- - - -	---	2023-06-01	Liu, Hongtao	New
Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors.	Disable avoid_false_dep_for_bmi for atom and icelake(and later) core processors.	- - - -	---	2023-05-26	Liu, Hongtao	New

« 1 2 3 4 … 6 7 »