diff mbox series

[v2] combine: Tweak the condition of last_set invalidation

Message ID e81e6734-d0a9-4769-e527-07104ad50a4f@linux.ibm.com
State New
Headers show
Series [v2] combine: Tweak the condition of last_set invalidation | expand

Commit Message

Kewen.Lin June 11, 2021, 1:16 p.m. UTC
Hi Segher,

Thanks for the review!

on 2021/6/10 上午4:17, Segher Boessenkool wrote:
> Hi!
> 
> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>> Currently we have the check:
>>
>>       if (!insn
>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>> 	rsp->last_set_invalid = 1; 
>>
>> which means if we want to record some value for some reg and
>> this reg got refered before in a valid scope,
> 
> If we already know it is *set* in this same extended basic block.
> Possibly by the same instruction btw.
> 
>> we invalidate the
>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>> set for one reg reference, such as the case like:
>>
>>    ... op regX  // this regX could find wrong last_set below
>>    regX = ...   // if we think this set is valid
>>    ... op regX
> 
> Yup, exactly.
> 
>> But because of retry's existence, the last_set_table_tick could
>> be set by some later reference insns, but we see it's set due
>> to retry on the set (for that reg) insn again, such as:
>>
>>    insn 1
>>    insn 2
>>
>>    regX = ...     --> (a)
>>    ... op regX    --> (b)
>>    
>>    insn 3
>>
>>    // assume all in the same BB.
>>
>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>> (3 insns -> 2 insns),
> 
> This will delete insn 1 and write the combined result to insns 2 and 3.
> 
>> retrying from insn1 or insn2 again:
> 
> Always 2, but your point remains valid.
> 
>> it will scan insn (a) again, the below condition holds for regX:
>>
>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>
>> it will mark this set as invalid set.  But actually the
>> last_set_table_tick here is set by insn (b) before retrying, so it
>> should be safe to be taken as valid set.
> 
> Yup.
> 
>> This proposal is to check whether the last_set_table safely happens
>> after the current set, make the set still valid if so.
> 
>> Full SPEC2017 building shows this patch gets more sucessful combines
>> from 1902208 to 1902243 (trivial though).
> 
> Do you have some example, or maybe even a testcase?  :-)
> 

Sorry for the late reply, it took some time to get one reduced case.

typedef struct SA *pa_t;

struct SC {
  int h;
  pa_t elem[];
};

struct SD {
  struct SC *e;
};

struct SA {
  struct {
    struct SD f[1];
  } g;
};

void foo(pa_t *k, char **m) {
  int l, i;
  pa_t a;
  l = (int)a->g.f[5].e;
  i = 0;
  for (; i < l; i++) {
    k[i] = a->g.f[5].e->elem[i];
    m[i] = "";
  }
}

Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
with this patch, the generated assembly can save two rlwinm s.

>> +  /* Record the luid of the insn whose expression involving register n.  */
>> +
>> +  int				last_set_table_luid;
> 
> "Record the luid of the insn for which last_set_table_tick was set",
> right?
> 

But it can be updated later to one smaller luid, how about the wording like:


+  /* Record the luid of the insn which uses register n, the insn should
+     be the first one using register n in that block of the insn which
+     last_set_table_tick was set for.  */


>> -static void update_table_tick (rtx);
>> +static void update_table_tick (rtx, int);
> 
> Please remove this declaration instead, the function is not used until
> after its actual definition :-)
> 

Done.

>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>        for (r = regno; r < endregno; r++)
>>  	{
>>  	  reg_stat_type *rsp = &reg_stat[r];
>> -	  rsp->last_set_table_tick = label_tick;
>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>> +	    {
>> +	      /* Later references should not have lower ticks.  */
>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
> 
> This should be obvious, but checking it won't hurt, okay.
> 
>> +	      /* Should pick up the lowest luid if the references
>> +		 are in the same block.  */
>> +	      if (label_tick == rsp->last_set_table_tick
>> +		  && rsp->last_set_table_luid > insn_luid)
>> +		rsp->last_set_table_luid = insn_luid;
> 
> Why?  Is it conservative for the check you will do later?  Please spell
> this out, it is crucial!
> 

Since later the combinations involving this insn probably make the
register be used in one insn sitting ahead (which has smaller luid than
the one which was recorded before).  Yes, it's very conservative, this
ensure that we always use the luid of the insn which is the first insn
using this register in the block.  The last_set invalidation is going
to catch the case like:

   ... regX  // avoid the set used here ...
   regX = ...
   ...

Once we have the smallest luid one of all insns which use register X,
any unsafe regX sets should be caught.

I updated the comments to:

+              /* Since combination may generate some instructions
+                 to replace some foregoing instructions with the
+                 references to register r (using register r), we
+                 need to make sure we record the first instruction
+                 which is using register r, so always update with
+                 the lowest luid here.  If the given set happens
+                 before this recorded earliest reference, the set
+                 value should be safe to be used.  */

>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>  
>>    /* Mark registers that are being referenced in this value.  */
>>    if (value)
>> -    update_table_tick (value);
>> +    {
>> +      gcc_assert (insn);
>> +      update_table_tick (value, DF_INSN_LUID (insn));
>> +    }
> 
> Don't add that assert please.  If you really want one it should come
> right at the start of the function, not 60 lines later :-)
> 

Exactly, fixed.

> Looks good if I understood this correctly :-)
> 
> 

Thanks again, I also updated the comments in func record_value_for_reg,
the new version is attached.

BR,
Kewen
-----
gcc/ChangeLog:

	* combine.c (struct reg_stat_type): New member
	last_set_table_luid.
	(update_table_tick): Add one argument for insn luid and
	set last_set_table_luid with it, remove its declaration.
	(record_value_for_reg): Adjust the condition to set
	last_set_invalid nonzero.

Comments

Kewen.Lin June 28, 2021, 7 a.m. UTC | #1
Hi!

I'd like to gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html


BR,
Kewen

on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
> Hi Segher,
> 
> Thanks for the review!
> 
> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>> Hi!
>>
>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>> Currently we have the check:
>>>
>>>       if (!insn
>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>> 	rsp->last_set_invalid = 1; 
>>>
>>> which means if we want to record some value for some reg and
>>> this reg got refered before in a valid scope,
>>
>> If we already know it is *set* in this same extended basic block.
>> Possibly by the same instruction btw.
>>
>>> we invalidate the
>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>> set for one reg reference, such as the case like:
>>>
>>>    ... op regX  // this regX could find wrong last_set below
>>>    regX = ...   // if we think this set is valid
>>>    ... op regX
>>
>> Yup, exactly.
>>
>>> But because of retry's existence, the last_set_table_tick could
>>> be set by some later reference insns, but we see it's set due
>>> to retry on the set (for that reg) insn again, such as:
>>>
>>>    insn 1
>>>    insn 2
>>>
>>>    regX = ...     --> (a)
>>>    ... op regX    --> (b)
>>>    
>>>    insn 3
>>>
>>>    // assume all in the same BB.
>>>
>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>> (3 insns -> 2 insns),
>>
>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>
>>> retrying from insn1 or insn2 again:
>>
>> Always 2, but your point remains valid.
>>
>>> it will scan insn (a) again, the below condition holds for regX:
>>>
>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>
>>> it will mark this set as invalid set.  But actually the
>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>> should be safe to be taken as valid set.
>>
>> Yup.
>>
>>> This proposal is to check whether the last_set_table safely happens
>>> after the current set, make the set still valid if so.
>>
>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>> from 1902208 to 1902243 (trivial though).
>>
>> Do you have some example, or maybe even a testcase?  :-)
>>
> 
> Sorry for the late reply, it took some time to get one reduced case.
> 
> typedef struct SA *pa_t;
> 
> struct SC {
>   int h;
>   pa_t elem[];
> };
> 
> struct SD {
>   struct SC *e;
> };
> 
> struct SA {
>   struct {
>     struct SD f[1];
>   } g;
> };
> 
> void foo(pa_t *k, char **m) {
>   int l, i;
>   pa_t a;
>   l = (int)a->g.f[5].e;
>   i = 0;
>   for (; i < l; i++) {
>     k[i] = a->g.f[5].e->elem[i];
>     m[i] = "";
>   }
> }
> 
> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
> with this patch, the generated assembly can save two rlwinm s.
> 
>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>> +
>>> +  int				last_set_table_luid;
>>
>> "Record the luid of the insn for which last_set_table_tick was set",
>> right?
>>
> 
> But it can be updated later to one smaller luid, how about the wording like:
> 
> 
> +  /* Record the luid of the insn which uses register n, the insn should
> +     be the first one using register n in that block of the insn which
> +     last_set_table_tick was set for.  */
> 
> 
>>> -static void update_table_tick (rtx);
>>> +static void update_table_tick (rtx, int);
>>
>> Please remove this declaration instead, the function is not used until
>> after its actual definition :-)
>>
> 
> Done.
> 
>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>        for (r = regno; r < endregno; r++)
>>>  	{
>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>> -	  rsp->last_set_table_tick = label_tick;
>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>> +	    {
>>> +	      /* Later references should not have lower ticks.  */
>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>
>> This should be obvious, but checking it won't hurt, okay.
>>
>>> +	      /* Should pick up the lowest luid if the references
>>> +		 are in the same block.  */
>>> +	      if (label_tick == rsp->last_set_table_tick
>>> +		  && rsp->last_set_table_luid > insn_luid)
>>> +		rsp->last_set_table_luid = insn_luid;
>>
>> Why?  Is it conservative for the check you will do later?  Please spell
>> this out, it is crucial!
>>
> 
> Since later the combinations involving this insn probably make the
> register be used in one insn sitting ahead (which has smaller luid than
> the one which was recorded before).  Yes, it's very conservative, this
> ensure that we always use the luid of the insn which is the first insn
> using this register in the block.  The last_set invalidation is going
> to catch the case like:
> 
>    ... regX  // avoid the set used here ...
>    regX = ...
>    ...
> 
> Once we have the smallest luid one of all insns which use register X,
> any unsafe regX sets should be caught.
> 
> I updated the comments to:
> 
> +              /* Since combination may generate some instructions
> +                 to replace some foregoing instructions with the
> +                 references to register r (using register r), we
> +                 need to make sure we record the first instruction
> +                 which is using register r, so always update with
> +                 the lowest luid here.  If the given set happens
> +                 before this recorded earliest reference, the set
> +                 value should be safe to be used.  */
> 
>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>  
>>>    /* Mark registers that are being referenced in this value.  */
>>>    if (value)
>>> -    update_table_tick (value);
>>> +    {
>>> +      gcc_assert (insn);
>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>> +    }
>>
>> Don't add that assert please.  If you really want one it should come
>> right at the start of the function, not 60 lines later :-)
>>
> 
> Exactly, fixed.
> 
>> Looks good if I understood this correctly :-)
>>
>>
> 
> Thanks again, I also updated the comments in func record_value_for_reg,
> the new version is attached.
> 
> BR,
> Kewen
> -----
> gcc/ChangeLog:
> 
> 	* combine.c (struct reg_stat_type): New member
> 	last_set_table_luid.
> 	(update_table_tick): Add one argument for insn luid and
> 	set last_set_table_luid with it, remove its declaration.
> 	(record_value_for_reg): Adjust the condition to set
> 	last_set_invalid nonzero.
>
Kewen.Lin July 15, 2021, 2 a.m. UTC | #2
Hi,

Gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html

BR,
Kewen

on 2021/6/28 下午3:00, Kewen.Lin via Gcc-patches wrote:
> Hi!
> 
> I'd like to gentle ping this:
> 
> https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html
> 
> 
> BR,
> Kewen
> 
> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>> Hi Segher,
>>
>> Thanks for the review!
>>
>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>> Hi!
>>>
>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>> Currently we have the check:
>>>>
>>>>       if (!insn
>>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>> 	rsp->last_set_invalid = 1; 
>>>>
>>>> which means if we want to record some value for some reg and
>>>> this reg got refered before in a valid scope,
>>>
>>> If we already know it is *set* in this same extended basic block.
>>> Possibly by the same instruction btw.
>>>
>>>> we invalidate the
>>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>>> set for one reg reference, such as the case like:
>>>>
>>>>    ... op regX  // this regX could find wrong last_set below
>>>>    regX = ...   // if we think this set is valid
>>>>    ... op regX
>>>
>>> Yup, exactly.
>>>
>>>> But because of retry's existence, the last_set_table_tick could
>>>> be set by some later reference insns, but we see it's set due
>>>> to retry on the set (for that reg) insn again, such as:
>>>>
>>>>    insn 1
>>>>    insn 2
>>>>
>>>>    regX = ...     --> (a)
>>>>    ... op regX    --> (b)
>>>>    
>>>>    insn 3
>>>>
>>>>    // assume all in the same BB.
>>>>
>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>> (3 insns -> 2 insns),
>>>
>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>
>>>> retrying from insn1 or insn2 again:
>>>
>>> Always 2, but your point remains valid.
>>>
>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>
>>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>
>>>> it will mark this set as invalid set.  But actually the
>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>> should be safe to be taken as valid set.
>>>
>>> Yup.
>>>
>>>> This proposal is to check whether the last_set_table safely happens
>>>> after the current set, make the set still valid if so.
>>>
>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>> from 1902208 to 1902243 (trivial though).
>>>
>>> Do you have some example, or maybe even a testcase?  :-)
>>>
>>
>> Sorry for the late reply, it took some time to get one reduced case.
>>
>> typedef struct SA *pa_t;
>>
>> struct SC {
>>   int h;
>>   pa_t elem[];
>> };
>>
>> struct SD {
>>   struct SC *e;
>> };
>>
>> struct SA {
>>   struct {
>>     struct SD f[1];
>>   } g;
>> };
>>
>> void foo(pa_t *k, char **m) {
>>   int l, i;
>>   pa_t a;
>>   l = (int)a->g.f[5].e;
>>   i = 0;
>>   for (; i < l; i++) {
>>     k[i] = a->g.f[5].e->elem[i];
>>     m[i] = "";
>>   }
>> }
>>
>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>> with this patch, the generated assembly can save two rlwinm s.
>>
>>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>>> +
>>>> +  int				last_set_table_luid;
>>>
>>> "Record the luid of the insn for which last_set_table_tick was set",
>>> right?
>>>
>>
>> But it can be updated later to one smaller luid, how about the wording like:
>>
>>
>> +  /* Record the luid of the insn which uses register n, the insn should
>> +     be the first one using register n in that block of the insn which
>> +     last_set_table_tick was set for.  */
>>
>>
>>>> -static void update_table_tick (rtx);
>>>> +static void update_table_tick (rtx, int);
>>>
>>> Please remove this declaration instead, the function is not used until
>>> after its actual definition :-)
>>>
>>
>> Done.
>>
>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>        for (r = regno; r < endregno; r++)
>>>>  	{
>>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>>> -	  rsp->last_set_table_tick = label_tick;
>>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>> +	    {
>>>> +	      /* Later references should not have lower ticks.  */
>>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>
>>> This should be obvious, but checking it won't hurt, okay.
>>>
>>>> +	      /* Should pick up the lowest luid if the references
>>>> +		 are in the same block.  */
>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>> +		rsp->last_set_table_luid = insn_luid;
>>>
>>> Why?  Is it conservative for the check you will do later?  Please spell
>>> this out, it is crucial!
>>>
>>
>> Since later the combinations involving this insn probably make the
>> register be used in one insn sitting ahead (which has smaller luid than
>> the one which was recorded before).  Yes, it's very conservative, this
>> ensure that we always use the luid of the insn which is the first insn
>> using this register in the block.  The last_set invalidation is going
>> to catch the case like:
>>
>>    ... regX  // avoid the set used here ...
>>    regX = ...
>>    ...
>>
>> Once we have the smallest luid one of all insns which use register X,
>> any unsafe regX sets should be caught.
>>
>> I updated the comments to:
>>
>> +              /* Since combination may generate some instructions
>> +                 to replace some foregoing instructions with the
>> +                 references to register r (using register r), we
>> +                 need to make sure we record the first instruction
>> +                 which is using register r, so always update with
>> +                 the lowest luid here.  If the given set happens
>> +                 before this recorded earliest reference, the set
>> +                 value should be safe to be used.  */
>>
>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>  
>>>>    /* Mark registers that are being referenced in this value.  */
>>>>    if (value)
>>>> -    update_table_tick (value);
>>>> +    {
>>>> +      gcc_assert (insn);
>>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>>> +    }
>>>
>>> Don't add that assert please.  If you really want one it should come
>>> right at the start of the function, not 60 lines later :-)
>>>
>>
>> Exactly, fixed.
>>
>>> Looks good if I understood this correctly :-)
>>>
>>>
>>
>> Thanks again, I also updated the comments in func record_value_for_reg,
>> the new version is attached.
>>
>> BR,
>> Kewen
>> -----
>> gcc/ChangeLog:
>>
>> 	* combine.c (struct reg_stat_type): New member
>> 	last_set_table_luid.
>> 	(update_table_tick): Add one argument for insn luid and
>> 	set last_set_table_luid with it, remove its declaration.
>> 	(record_value_for_reg): Adjust the condition to set
>> 	last_set_invalid nonzero.
>>
Kewen.Lin Sept. 8, 2021, 7:03 a.m. UTC | #3
Hi,

Gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html

BR,
Kewen

on 2021/7/15 上午10:00, Kewen.Lin via Gcc-patches wrote:
> Hi,
> 
> Gentle ping this:
> 
> https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html
> 
> BR,
> Kewen
> 
> on 2021/6/28 下午3:00, Kewen.Lin via Gcc-patches wrote:
>> Hi!
>>
>> I'd like to gentle ping this:
>>
>> https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html
>>
>>
>> BR,
>> Kewen
>>
>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>>> Hi Segher,
>>>
>>> Thanks for the review!
>>>
>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>>> Hi!
>>>>
>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>>> Currently we have the check:
>>>>>
>>>>>       if (!insn
>>>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>>> 	rsp->last_set_invalid = 1; 
>>>>>
>>>>> which means if we want to record some value for some reg and
>>>>> this reg got refered before in a valid scope,
>>>>
>>>> If we already know it is *set* in this same extended basic block.
>>>> Possibly by the same instruction btw.
>>>>
>>>>> we invalidate the
>>>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>>>> set for one reg reference, such as the case like:
>>>>>
>>>>>    ... op regX  // this regX could find wrong last_set below
>>>>>    regX = ...   // if we think this set is valid
>>>>>    ... op regX
>>>>
>>>> Yup, exactly.
>>>>
>>>>> But because of retry's existence, the last_set_table_tick could
>>>>> be set by some later reference insns, but we see it's set due
>>>>> to retry on the set (for that reg) insn again, such as:
>>>>>
>>>>>    insn 1
>>>>>    insn 2
>>>>>
>>>>>    regX = ...     --> (a)
>>>>>    ... op regX    --> (b)
>>>>>    
>>>>>    insn 3
>>>>>
>>>>>    // assume all in the same BB.
>>>>>
>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>>> (3 insns -> 2 insns),
>>>>
>>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>>
>>>>> retrying from insn1 or insn2 again:
>>>>
>>>> Always 2, but your point remains valid.
>>>>
>>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>>
>>>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>
>>>>> it will mark this set as invalid set.  But actually the
>>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>>> should be safe to be taken as valid set.
>>>>
>>>> Yup.
>>>>
>>>>> This proposal is to check whether the last_set_table safely happens
>>>>> after the current set, make the set still valid if so.
>>>>
>>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>>> from 1902208 to 1902243 (trivial though).
>>>>
>>>> Do you have some example, or maybe even a testcase?  :-)
>>>>
>>>
>>> Sorry for the late reply, it took some time to get one reduced case.
>>>
>>> typedef struct SA *pa_t;
>>>
>>> struct SC {
>>>   int h;
>>>   pa_t elem[];
>>> };
>>>
>>> struct SD {
>>>   struct SC *e;
>>> };
>>>
>>> struct SA {
>>>   struct {
>>>     struct SD f[1];
>>>   } g;
>>> };
>>>
>>> void foo(pa_t *k, char **m) {
>>>   int l, i;
>>>   pa_t a;
>>>   l = (int)a->g.f[5].e;
>>>   i = 0;
>>>   for (; i < l; i++) {
>>>     k[i] = a->g.f[5].e->elem[i];
>>>     m[i] = "";
>>>   }
>>> }
>>>
>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>>> with this patch, the generated assembly can save two rlwinm s.
>>>
>>>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>>>> +
>>>>> +  int				last_set_table_luid;
>>>>
>>>> "Record the luid of the insn for which last_set_table_tick was set",
>>>> right?
>>>>
>>>
>>> But it can be updated later to one smaller luid, how about the wording like:
>>>
>>>
>>> +  /* Record the luid of the insn which uses register n, the insn should
>>> +     be the first one using register n in that block of the insn which
>>> +     last_set_table_tick was set for.  */
>>>
>>>
>>>>> -static void update_table_tick (rtx);
>>>>> +static void update_table_tick (rtx, int);
>>>>
>>>> Please remove this declaration instead, the function is not used until
>>>> after its actual definition :-)
>>>>
>>>
>>> Done.
>>>
>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>>        for (r = regno; r < endregno; r++)
>>>>>  	{
>>>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>>>> -	  rsp->last_set_table_tick = label_tick;
>>>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>> +	    {
>>>>> +	      /* Later references should not have lower ticks.  */
>>>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>>
>>>> This should be obvious, but checking it won't hurt, okay.
>>>>
>>>>> +	      /* Should pick up the lowest luid if the references
>>>>> +		 are in the same block.  */
>>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>>> +		rsp->last_set_table_luid = insn_luid;
>>>>
>>>> Why?  Is it conservative for the check you will do later?  Please spell
>>>> this out, it is crucial!
>>>>
>>>
>>> Since later the combinations involving this insn probably make the
>>> register be used in one insn sitting ahead (which has smaller luid than
>>> the one which was recorded before).  Yes, it's very conservative, this
>>> ensure that we always use the luid of the insn which is the first insn
>>> using this register in the block.  The last_set invalidation is going
>>> to catch the case like:
>>>
>>>    ... regX  // avoid the set used here ...
>>>    regX = ...
>>>    ...
>>>
>>> Once we have the smallest luid one of all insns which use register X,
>>> any unsafe regX sets should be caught.
>>>
>>> I updated the comments to:
>>>
>>> +              /* Since combination may generate some instructions
>>> +                 to replace some foregoing instructions with the
>>> +                 references to register r (using register r), we
>>> +                 need to make sure we record the first instruction
>>> +                 which is using register r, so always update with
>>> +                 the lowest luid here.  If the given set happens
>>> +                 before this recorded earliest reference, the set
>>> +                 value should be safe to be used.  */
>>>
>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>>  
>>>>>    /* Mark registers that are being referenced in this value.  */
>>>>>    if (value)
>>>>> -    update_table_tick (value);
>>>>> +    {
>>>>> +      gcc_assert (insn);
>>>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>>>> +    }
>>>>
>>>> Don't add that assert please.  If you really want one it should come
>>>> right at the start of the function, not 60 lines later :-)
>>>>
>>>
>>> Exactly, fixed.
>>>
>>>> Looks good if I understood this correctly :-)
>>>>
>>>>
>>>
>>> Thanks again, I also updated the comments in func record_value_for_reg,
>>> the new version is attached.
>>>
>>> BR,
>>> Kewen
>>> -----
>>> gcc/ChangeLog:
>>>
>>> 	* combine.c (struct reg_stat_type): New member
>>> 	last_set_table_luid.
>>> 	(update_table_tick): Add one argument for insn luid and
>>> 	set last_set_table_luid with it, remove its declaration.
>>> 	(record_value_for_reg): Adjust the condition to set
>>> 	last_set_invalid nonzero.
>>>
Kewen.Lin Oct. 13, 2021, 2:27 a.m. UTC | #4
Hi,

Gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html

BR,
Kewen

>>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>>>> Hi Segher,
>>>>
>>>> Thanks for the review!
>>>>
>>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>>>> Hi!
>>>>>
>>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>>>> Currently we have the check:
>>>>>>
>>>>>>       if (!insn
>>>>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>>>> 	rsp->last_set_invalid = 1; 
>>>>>>
>>>>>> which means if we want to record some value for some reg and
>>>>>> this reg got refered before in a valid scope,
>>>>>
>>>>> If we already know it is *set* in this same extended basic block.
>>>>> Possibly by the same instruction btw.
>>>>>
>>>>>> we invalidate the
>>>>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>>>>> set for one reg reference, such as the case like:
>>>>>>
>>>>>>    ... op regX  // this regX could find wrong last_set below
>>>>>>    regX = ...   // if we think this set is valid
>>>>>>    ... op regX
>>>>>
>>>>> Yup, exactly.
>>>>>
>>>>>> But because of retry's existence, the last_set_table_tick could
>>>>>> be set by some later reference insns, but we see it's set due
>>>>>> to retry on the set (for that reg) insn again, such as:
>>>>>>
>>>>>>    insn 1
>>>>>>    insn 2
>>>>>>
>>>>>>    regX = ...     --> (a)
>>>>>>    ... op regX    --> (b)
>>>>>>    
>>>>>>    insn 3
>>>>>>
>>>>>>    // assume all in the same BB.
>>>>>>
>>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>>>> (3 insns -> 2 insns),
>>>>>
>>>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>>>
>>>>>> retrying from insn1 or insn2 again:
>>>>>
>>>>> Always 2, but your point remains valid.
>>>>>
>>>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>>>
>>>>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>
>>>>>> it will mark this set as invalid set.  But actually the
>>>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>>>> should be safe to be taken as valid set.
>>>>>
>>>>> Yup.
>>>>>
>>>>>> This proposal is to check whether the last_set_table safely happens
>>>>>> after the current set, make the set still valid if so.
>>>>>
>>>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>>>> from 1902208 to 1902243 (trivial though).
>>>>>
>>>>> Do you have some example, or maybe even a testcase?  :-)
>>>>>
>>>>
>>>> Sorry for the late reply, it took some time to get one reduced case.
>>>>
>>>> typedef struct SA *pa_t;
>>>>
>>>> struct SC {
>>>>   int h;
>>>>   pa_t elem[];
>>>> };
>>>>
>>>> struct SD {
>>>>   struct SC *e;
>>>> };
>>>>
>>>> struct SA {
>>>>   struct {
>>>>     struct SD f[1];
>>>>   } g;
>>>> };
>>>>
>>>> void foo(pa_t *k, char **m) {
>>>>   int l, i;
>>>>   pa_t a;
>>>>   l = (int)a->g.f[5].e;
>>>>   i = 0;
>>>>   for (; i < l; i++) {
>>>>     k[i] = a->g.f[5].e->elem[i];
>>>>     m[i] = "";
>>>>   }
>>>> }
>>>>
>>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>>>> with this patch, the generated assembly can save two rlwinm s.
>>>>
>>>>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>>>>> +
>>>>>> +  int				last_set_table_luid;
>>>>>
>>>>> "Record the luid of the insn for which last_set_table_tick was set",
>>>>> right?
>>>>>
>>>>
>>>> But it can be updated later to one smaller luid, how about the wording like:
>>>>
>>>>
>>>> +  /* Record the luid of the insn which uses register n, the insn should
>>>> +     be the first one using register n in that block of the insn which
>>>> +     last_set_table_tick was set for.  */
>>>>
>>>>
>>>>>> -static void update_table_tick (rtx);
>>>>>> +static void update_table_tick (rtx, int);
>>>>>
>>>>> Please remove this declaration instead, the function is not used until
>>>>> after its actual definition :-)
>>>>>
>>>>
>>>> Done.
>>>>
>>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>>>        for (r = regno; r < endregno; r++)
>>>>>>  	{
>>>>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>>>>> -	  rsp->last_set_table_tick = label_tick;
>>>>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>> +	    {
>>>>>> +	      /* Later references should not have lower ticks.  */
>>>>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>>>
>>>>> This should be obvious, but checking it won't hurt, okay.
>>>>>
>>>>>> +	      /* Should pick up the lowest luid if the references
>>>>>> +		 are in the same block.  */
>>>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>>>> +		rsp->last_set_table_luid = insn_luid;
>>>>>
>>>>> Why?  Is it conservative for the check you will do later?  Please spell
>>>>> this out, it is crucial!
>>>>>
>>>>
>>>> Since later the combinations involving this insn probably make the
>>>> register be used in one insn sitting ahead (which has smaller luid than
>>>> the one which was recorded before).  Yes, it's very conservative, this
>>>> ensure that we always use the luid of the insn which is the first insn
>>>> using this register in the block.  The last_set invalidation is going
>>>> to catch the case like:
>>>>
>>>>    ... regX  // avoid the set used here ...
>>>>    regX = ...
>>>>    ...
>>>>
>>>> Once we have the smallest luid one of all insns which use register X,
>>>> any unsafe regX sets should be caught.
>>>>
>>>> I updated the comments to:
>>>>
>>>> +              /* Since combination may generate some instructions
>>>> +                 to replace some foregoing instructions with the
>>>> +                 references to register r (using register r), we
>>>> +                 need to make sure we record the first instruction
>>>> +                 which is using register r, so always update with
>>>> +                 the lowest luid here.  If the given set happens
>>>> +                 before this recorded earliest reference, the set
>>>> +                 value should be safe to be used.  */
>>>>
>>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>>>  
>>>>>>    /* Mark registers that are being referenced in this value.  */
>>>>>>    if (value)
>>>>>> -    update_table_tick (value);
>>>>>> +    {
>>>>>> +      gcc_assert (insn);
>>>>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>>>>> +    }
>>>>>
>>>>> Don't add that assert please.  If you really want one it should come
>>>>> right at the start of the function, not 60 lines later :-)
>>>>>
>>>>
>>>> Exactly, fixed.
>>>>
>>>>> Looks good if I understood this correctly :-)
>>>>>
>>>>>
>>>>
>>>> Thanks again, I also updated the comments in func record_value_for_reg,
>>>> the new version is attached.
>>>>
>>>> BR,
>>>> Kewen
>>>> -----
>>>> gcc/ChangeLog:
>>>>
>>>> 	* combine.c (struct reg_stat_type): New member
>>>> 	last_set_table_luid.
>>>> 	(update_table_tick): Add one argument for insn luid and
>>>> 	set last_set_table_luid with it, remove its declaration.
>>>> 	(record_value_for_reg): Adjust the condition to set
>>>> 	last_set_invalid nonzero.
>>>>
Kewen.Lin Oct. 20, 2021, 9:28 a.m. UTC | #5
Hi,

Gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html

BR,
Kewen

> 
>>>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>>>>> Hi Segher,
>>>>>
>>>>> Thanks for the review!
>>>>>
>>>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>>>>> Hi!
>>>>>>
>>>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>>>>> Currently we have the check:
>>>>>>>
>>>>>>>       if (!insn
>>>>>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>>>>> 	rsp->last_set_invalid = 1; 
>>>>>>>
>>>>>>> which means if we want to record some value for some reg and
>>>>>>> this reg got refered before in a valid scope,
>>>>>>
>>>>>> If we already know it is *set* in this same extended basic block.
>>>>>> Possibly by the same instruction btw.
>>>>>>
>>>>>>> we invalidate the
>>>>>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>>>>>> set for one reg reference, such as the case like:
>>>>>>>
>>>>>>>    ... op regX  // this regX could find wrong last_set below
>>>>>>>    regX = ...   // if we think this set is valid
>>>>>>>    ... op regX
>>>>>>
>>>>>> Yup, exactly.
>>>>>>
>>>>>>> But because of retry's existence, the last_set_table_tick could
>>>>>>> be set by some later reference insns, but we see it's set due
>>>>>>> to retry on the set (for that reg) insn again, such as:
>>>>>>>
>>>>>>>    insn 1
>>>>>>>    insn 2
>>>>>>>
>>>>>>>    regX = ...     --> (a)
>>>>>>>    ... op regX    --> (b)
>>>>>>>    
>>>>>>>    insn 3
>>>>>>>
>>>>>>>    // assume all in the same BB.
>>>>>>>
>>>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>>>>> (3 insns -> 2 insns),
>>>>>>
>>>>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>>>>
>>>>>>> retrying from insn1 or insn2 again:
>>>>>>
>>>>>> Always 2, but your point remains valid.
>>>>>>
>>>>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>>>>
>>>>>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>>
>>>>>>> it will mark this set as invalid set.  But actually the
>>>>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>>>>> should be safe to be taken as valid set.
>>>>>>
>>>>>> Yup.
>>>>>>
>>>>>>> This proposal is to check whether the last_set_table safely happens
>>>>>>> after the current set, make the set still valid if so.
>>>>>>
>>>>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>>>>> from 1902208 to 1902243 (trivial though).
>>>>>>
>>>>>> Do you have some example, or maybe even a testcase?  :-)
>>>>>>
>>>>>
>>>>> Sorry for the late reply, it took some time to get one reduced case.
>>>>>
>>>>> typedef struct SA *pa_t;
>>>>>
>>>>> struct SC {
>>>>>   int h;
>>>>>   pa_t elem[];
>>>>> };
>>>>>
>>>>> struct SD {
>>>>>   struct SC *e;
>>>>> };
>>>>>
>>>>> struct SA {
>>>>>   struct {
>>>>>     struct SD f[1];
>>>>>   } g;
>>>>> };
>>>>>
>>>>> void foo(pa_t *k, char **m) {
>>>>>   int l, i;
>>>>>   pa_t a;
>>>>>   l = (int)a->g.f[5].e;
>>>>>   i = 0;
>>>>>   for (; i < l; i++) {
>>>>>     k[i] = a->g.f[5].e->elem[i];
>>>>>     m[i] = "";
>>>>>   }
>>>>> }
>>>>>
>>>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>>>>> with this patch, the generated assembly can save two rlwinm s.
>>>>>
>>>>>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>>>>>> +
>>>>>>> +  int				last_set_table_luid;
>>>>>>
>>>>>> "Record the luid of the insn for which last_set_table_tick was set",
>>>>>> right?
>>>>>>
>>>>>
>>>>> But it can be updated later to one smaller luid, how about the wording like:
>>>>>
>>>>>
>>>>> +  /* Record the luid of the insn which uses register n, the insn should
>>>>> +     be the first one using register n in that block of the insn which
>>>>> +     last_set_table_tick was set for.  */
>>>>>
>>>>>
>>>>>>> -static void update_table_tick (rtx);
>>>>>>> +static void update_table_tick (rtx, int);
>>>>>>
>>>>>> Please remove this declaration instead, the function is not used until
>>>>>> after its actual definition :-)
>>>>>>
>>>>>
>>>>> Done.
>>>>>
>>>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>>>>        for (r = regno; r < endregno; r++)
>>>>>>>  	{
>>>>>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>>>>>> -	  rsp->last_set_table_tick = label_tick;
>>>>>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>> +	    {
>>>>>>> +	      /* Later references should not have lower ticks.  */
>>>>>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>>>>
>>>>>> This should be obvious, but checking it won't hurt, okay.
>>>>>>
>>>>>>> +	      /* Should pick up the lowest luid if the references
>>>>>>> +		 are in the same block.  */
>>>>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>>>>> +		rsp->last_set_table_luid = insn_luid;
>>>>>>
>>>>>> Why?  Is it conservative for the check you will do later?  Please spell
>>>>>> this out, it is crucial!
>>>>>>
>>>>>
>>>>> Since later the combinations involving this insn probably make the
>>>>> register be used in one insn sitting ahead (which has smaller luid than
>>>>> the one which was recorded before).  Yes, it's very conservative, this
>>>>> ensure that we always use the luid of the insn which is the first insn
>>>>> using this register in the block.  The last_set invalidation is going
>>>>> to catch the case like:
>>>>>
>>>>>    ... regX  // avoid the set used here ...
>>>>>    regX = ...
>>>>>    ...
>>>>>
>>>>> Once we have the smallest luid one of all insns which use register X,
>>>>> any unsafe regX sets should be caught.
>>>>>
>>>>> I updated the comments to:
>>>>>
>>>>> +              /* Since combination may generate some instructions
>>>>> +                 to replace some foregoing instructions with the
>>>>> +                 references to register r (using register r), we
>>>>> +                 need to make sure we record the first instruction
>>>>> +                 which is using register r, so always update with
>>>>> +                 the lowest luid here.  If the given set happens
>>>>> +                 before this recorded earliest reference, the set
>>>>> +                 value should be safe to be used.  */
>>>>>
>>>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>>>>  
>>>>>>>    /* Mark registers that are being referenced in this value.  */
>>>>>>>    if (value)
>>>>>>> -    update_table_tick (value);
>>>>>>> +    {
>>>>>>> +      gcc_assert (insn);
>>>>>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>>>>>> +    }
>>>>>>
>>>>>> Don't add that assert please.  If you really want one it should come
>>>>>> right at the start of the function, not 60 lines later :-)
>>>>>>
>>>>>
>>>>> Exactly, fixed.
>>>>>
>>>>>> Looks good if I understood this correctly :-)
>>>>>>
>>>>>>
>>>>>
>>>>> Thanks again, I also updated the comments in func record_value_for_reg,
>>>>> the new version is attached.
>>>>>
>>>>> BR,
>>>>> Kewen
>>>>> -----
>>>>> gcc/ChangeLog:
>>>>>
>>>>> 	* combine.c (struct reg_stat_type): New member
>>>>> 	last_set_table_luid.
>>>>> 	(update_table_tick): Add one argument for insn luid and
>>>>> 	set last_set_table_luid with it, remove its declaration.
>>>>> 	(record_value_for_reg): Adjust the condition to set
>>>>> 	last_set_invalid nonzero.
>>>>>
Kewen.Lin Nov. 4, 2021, 10:56 a.m. UTC | #6
Hi,

Gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html

BR,
Kewen

>>>>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>>>>>> Hi Segher,
>>>>>>
>>>>>> Thanks for the review!
>>>>>>
>>>>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>>>>>> Hi!
>>>>>>>
>>>>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>>>>>> Currently we have the check:
>>>>>>>>
>>>>>>>>       if (!insn
>>>>>>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>>>>>> 	rsp->last_set_invalid = 1; 
>>>>>>>>
>>>>>>>> which means if we want to record some value for some reg and
>>>>>>>> this reg got refered before in a valid scope,
>>>>>>>
>>>>>>> If we already know it is *set* in this same extended basic block.
>>>>>>> Possibly by the same instruction btw.
>>>>>>>
>>>>>>>> we invalidate the
>>>>>>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>>>>>>> set for one reg reference, such as the case like:
>>>>>>>>
>>>>>>>>    ... op regX  // this regX could find wrong last_set below
>>>>>>>>    regX = ...   // if we think this set is valid
>>>>>>>>    ... op regX
>>>>>>>
>>>>>>> Yup, exactly.
>>>>>>>
>>>>>>>> But because of retry's existence, the last_set_table_tick could
>>>>>>>> be set by some later reference insns, but we see it's set due
>>>>>>>> to retry on the set (for that reg) insn again, such as:
>>>>>>>>
>>>>>>>>    insn 1
>>>>>>>>    insn 2
>>>>>>>>
>>>>>>>>    regX = ...     --> (a)
>>>>>>>>    ... op regX    --> (b)
>>>>>>>>    
>>>>>>>>    insn 3
>>>>>>>>
>>>>>>>>    // assume all in the same BB.
>>>>>>>>
>>>>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>>>>>> (3 insns -> 2 insns),
>>>>>>>
>>>>>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>>>>>
>>>>>>>> retrying from insn1 or insn2 again:
>>>>>>>
>>>>>>> Always 2, but your point remains valid.
>>>>>>>
>>>>>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>>>>>
>>>>>>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>>>
>>>>>>>> it will mark this set as invalid set.  But actually the
>>>>>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>>>>>> should be safe to be taken as valid set.
>>>>>>>
>>>>>>> Yup.
>>>>>>>
>>>>>>>> This proposal is to check whether the last_set_table safely happens
>>>>>>>> after the current set, make the set still valid if so.
>>>>>>>
>>>>>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>>>>>> from 1902208 to 1902243 (trivial though).
>>>>>>>
>>>>>>> Do you have some example, or maybe even a testcase?  :-)
>>>>>>>
>>>>>>
>>>>>> Sorry for the late reply, it took some time to get one reduced case.
>>>>>>
>>>>>> typedef struct SA *pa_t;
>>>>>>
>>>>>> struct SC {
>>>>>>   int h;
>>>>>>   pa_t elem[];
>>>>>> };
>>>>>>
>>>>>> struct SD {
>>>>>>   struct SC *e;
>>>>>> };
>>>>>>
>>>>>> struct SA {
>>>>>>   struct {
>>>>>>     struct SD f[1];
>>>>>>   } g;
>>>>>> };
>>>>>>
>>>>>> void foo(pa_t *k, char **m) {
>>>>>>   int l, i;
>>>>>>   pa_t a;
>>>>>>   l = (int)a->g.f[5].e;
>>>>>>   i = 0;
>>>>>>   for (; i < l; i++) {
>>>>>>     k[i] = a->g.f[5].e->elem[i];
>>>>>>     m[i] = "";
>>>>>>   }
>>>>>> }
>>>>>>
>>>>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>>>>>> with this patch, the generated assembly can save two rlwinm s.
>>>>>>
>>>>>>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>>>>>>> +
>>>>>>>> +  int				last_set_table_luid;
>>>>>>>
>>>>>>> "Record the luid of the insn for which last_set_table_tick was set",
>>>>>>> right?
>>>>>>>
>>>>>>
>>>>>> But it can be updated later to one smaller luid, how about the wording like:
>>>>>>
>>>>>>
>>>>>> +  /* Record the luid of the insn which uses register n, the insn should
>>>>>> +     be the first one using register n in that block of the insn which
>>>>>> +     last_set_table_tick was set for.  */
>>>>>>
>>>>>>
>>>>>>>> -static void update_table_tick (rtx);
>>>>>>>> +static void update_table_tick (rtx, int);
>>>>>>>
>>>>>>> Please remove this declaration instead, the function is not used until
>>>>>>> after its actual definition :-)
>>>>>>>
>>>>>>
>>>>>> Done.
>>>>>>
>>>>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>>>>>        for (r = regno; r < endregno; r++)
>>>>>>>>  	{
>>>>>>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>>>>>>> -	  rsp->last_set_table_tick = label_tick;
>>>>>>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>>> +	    {
>>>>>>>> +	      /* Later references should not have lower ticks.  */
>>>>>>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>>>>>
>>>>>>> This should be obvious, but checking it won't hurt, okay.
>>>>>>>
>>>>>>>> +	      /* Should pick up the lowest luid if the references
>>>>>>>> +		 are in the same block.  */
>>>>>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>>>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>>>>>> +		rsp->last_set_table_luid = insn_luid;
>>>>>>>
>>>>>>> Why?  Is it conservative for the check you will do later?  Please spell
>>>>>>> this out, it is crucial!
>>>>>>>
>>>>>>
>>>>>> Since later the combinations involving this insn probably make the
>>>>>> register be used in one insn sitting ahead (which has smaller luid than
>>>>>> the one which was recorded before).  Yes, it's very conservative, this
>>>>>> ensure that we always use the luid of the insn which is the first insn
>>>>>> using this register in the block.  The last_set invalidation is going
>>>>>> to catch the case like:
>>>>>>
>>>>>>    ... regX  // avoid the set used here ...
>>>>>>    regX = ...
>>>>>>    ...
>>>>>>
>>>>>> Once we have the smallest luid one of all insns which use register X,
>>>>>> any unsafe regX sets should be caught.
>>>>>>
>>>>>> I updated the comments to:
>>>>>>
>>>>>> +              /* Since combination may generate some instructions
>>>>>> +                 to replace some foregoing instructions with the
>>>>>> +                 references to register r (using register r), we
>>>>>> +                 need to make sure we record the first instruction
>>>>>> +                 which is using register r, so always update with
>>>>>> +                 the lowest luid here.  If the given set happens
>>>>>> +                 before this recorded earliest reference, the set
>>>>>> +                 value should be safe to be used.  */
>>>>>>
>>>>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>>>>>  
>>>>>>>>    /* Mark registers that are being referenced in this value.  */
>>>>>>>>    if (value)
>>>>>>>> -    update_table_tick (value);
>>>>>>>> +    {
>>>>>>>> +      gcc_assert (insn);
>>>>>>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>>>>>>> +    }
>>>>>>>
>>>>>>> Don't add that assert please.  If you really want one it should come
>>>>>>> right at the start of the function, not 60 lines later :-)
>>>>>>>
>>>>>>
>>>>>> Exactly, fixed.
>>>>>>
>>>>>>> Looks good if I understood this correctly :-)
>>>>>>>
>>>>>>>
>>>>>>
>>>>>> Thanks again, I also updated the comments in func record_value_for_reg,
>>>>>> the new version is attached.
>>>>>>
>>>>>> BR,
>>>>>> Kewen
>>>>>> -----
>>>>>> gcc/ChangeLog:
>>>>>>
>>>>>> 	* combine.c (struct reg_stat_type): New member
>>>>>> 	last_set_table_luid.
>>>>>> 	(update_table_tick): Add one argument for insn luid and
>>>>>> 	set last_set_table_luid with it, remove its declaration.
>>>>>> 	(record_value_for_reg): Adjust the condition to set
>>>>>> 	last_set_invalid nonzero.
>>>>>>
Kewen.Lin Nov. 22, 2021, 2:22 a.m. UTC | #7
Hi,

Gentle ping this:

https://gcc.gnu.org/pipermail/gcc-patches/2021-June/572555.html

BR,
Kewen

>>>>>> on 2021/6/11 下午9:16, Kewen.Lin via Gcc-patches wrote:
>>>>>>> Hi Segher,
>>>>>>>
>>>>>>> Thanks for the review!
>>>>>>>
>>>>>>> on 2021/6/10 上午4:17, Segher Boessenkool wrote:
>>>>>>>> Hi!
>>>>>>>>
>>>>>>>> On Wed, Dec 16, 2020 at 04:49:49PM +0800, Kewen.Lin wrote:
>>>>>>>>> Currently we have the check:
>>>>>>>>>
>>>>>>>>>       if (!insn
>>>>>>>>> 	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
>>>>>>>>> 	rsp->last_set_invalid = 1; 
>>>>>>>>>
>>>>>>>>> which means if we want to record some value for some reg and
>>>>>>>>> this reg got refered before in a valid scope,
>>>>>>>>
>>>>>>>> If we already know it is *set* in this same extended basic block.
>>>>>>>> Possibly by the same instruction btw.
>>>>>>>>
>>>>>>>>> we invalidate the
>>>>>>>>> set of reg (last_set_invalid to 1).  It avoids to find the wrong
>>>>>>>>> set for one reg reference, such as the case like:
>>>>>>>>>
>>>>>>>>>    ... op regX  // this regX could find wrong last_set below
>>>>>>>>>    regX = ...   // if we think this set is valid
>>>>>>>>>    ... op regX
>>>>>>>>
>>>>>>>> Yup, exactly.
>>>>>>>>
>>>>>>>>> But because of retry's existence, the last_set_table_tick could
>>>>>>>>> be set by some later reference insns, but we see it's set due
>>>>>>>>> to retry on the set (for that reg) insn again, such as:
>>>>>>>>>
>>>>>>>>>    insn 1
>>>>>>>>>    insn 2
>>>>>>>>>
>>>>>>>>>    regX = ...     --> (a)
>>>>>>>>>    ... op regX    --> (b)
>>>>>>>>>    
>>>>>>>>>    insn 3
>>>>>>>>>
>>>>>>>>>    // assume all in the same BB.
>>>>>>>>>
>>>>>>>>> Assuming we combine 1, 2 -> 3 sucessfully and replace them as two
>>>>>>>>> (3 insns -> 2 insns),
>>>>>>>>
>>>>>>>> This will delete insn 1 and write the combined result to insns 2 and 3.
>>>>>>>>
>>>>>>>>> retrying from insn1 or insn2 again:
>>>>>>>>
>>>>>>>> Always 2, but your point remains valid.
>>>>>>>>
>>>>>>>>> it will scan insn (a) again, the below condition holds for regX:
>>>>>>>>>
>>>>>>>>>   (value && rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>>>>
>>>>>>>>> it will mark this set as invalid set.  But actually the
>>>>>>>>> last_set_table_tick here is set by insn (b) before retrying, so it
>>>>>>>>> should be safe to be taken as valid set.
>>>>>>>>
>>>>>>>> Yup.
>>>>>>>>
>>>>>>>>> This proposal is to check whether the last_set_table safely happens
>>>>>>>>> after the current set, make the set still valid if so.
>>>>>>>>
>>>>>>>>> Full SPEC2017 building shows this patch gets more sucessful combines
>>>>>>>>> from 1902208 to 1902243 (trivial though).
>>>>>>>>
>>>>>>>> Do you have some example, or maybe even a testcase?  :-)
>>>>>>>>
>>>>>>>
>>>>>>> Sorry for the late reply, it took some time to get one reduced case.
>>>>>>>
>>>>>>> typedef struct SA *pa_t;
>>>>>>>
>>>>>>> struct SC {
>>>>>>>   int h;
>>>>>>>   pa_t elem[];
>>>>>>> };
>>>>>>>
>>>>>>> struct SD {
>>>>>>>   struct SC *e;
>>>>>>> };
>>>>>>>
>>>>>>> struct SA {
>>>>>>>   struct {
>>>>>>>     struct SD f[1];
>>>>>>>   } g;
>>>>>>> };
>>>>>>>
>>>>>>> void foo(pa_t *k, char **m) {
>>>>>>>   int l, i;
>>>>>>>   pa_t a;
>>>>>>>   l = (int)a->g.f[5].e;
>>>>>>>   i = 0;
>>>>>>>   for (; i < l; i++) {
>>>>>>>     k[i] = a->g.f[5].e->elem[i];
>>>>>>>     m[i] = "";
>>>>>>>   }
>>>>>>> }
>>>>>>>
>>>>>>> Baseline is r12-0 and the option is "-O3 -mcpu=power9 -fno-strict-aliasing",
>>>>>>> with this patch, the generated assembly can save two rlwinm s.
>>>>>>>
>>>>>>>>> +  /* Record the luid of the insn whose expression involving register n.  */
>>>>>>>>> +
>>>>>>>>> +  int				last_set_table_luid;
>>>>>>>>
>>>>>>>> "Record the luid of the insn for which last_set_table_tick was set",
>>>>>>>> right?
>>>>>>>>
>>>>>>>
>>>>>>> But it can be updated later to one smaller luid, how about the wording like:
>>>>>>>
>>>>>>>
>>>>>>> +  /* Record the luid of the insn which uses register n, the insn should
>>>>>>> +     be the first one using register n in that block of the insn which
>>>>>>> +     last_set_table_tick was set for.  */
>>>>>>>
>>>>>>>
>>>>>>>>> -static void update_table_tick (rtx);
>>>>>>>>> +static void update_table_tick (rtx, int);
>>>>>>>>
>>>>>>>> Please remove this declaration instead, the function is not used until
>>>>>>>> after its actual definition :-)
>>>>>>>>
>>>>>>>
>>>>>>> Done.
>>>>>>>
>>>>>>>>> @@ -13243,7 +13247,21 @@ update_table_tick (rtx x)
>>>>>>>>>        for (r = regno; r < endregno; r++)
>>>>>>>>>  	{
>>>>>>>>>  	  reg_stat_type *rsp = &reg_stat[r];
>>>>>>>>> -	  rsp->last_set_table_tick = label_tick;
>>>>>>>>> +	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
>>>>>>>>> +	    {
>>>>>>>>> +	      /* Later references should not have lower ticks.  */
>>>>>>>>> +	      gcc_assert (label_tick >= rsp->last_set_table_tick);
>>>>>>>>
>>>>>>>> This should be obvious, but checking it won't hurt, okay.
>>>>>>>>
>>>>>>>>> +	      /* Should pick up the lowest luid if the references
>>>>>>>>> +		 are in the same block.  */
>>>>>>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>>>>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>>>>>>> +		rsp->last_set_table_luid = insn_luid;
>>>>>>>>
>>>>>>>> Why?  Is it conservative for the check you will do later?  Please spell
>>>>>>>> this out, it is crucial!
>>>>>>>>
>>>>>>>
>>>>>>> Since later the combinations involving this insn probably make the
>>>>>>> register be used in one insn sitting ahead (which has smaller luid than
>>>>>>> the one which was recorded before).  Yes, it's very conservative, this
>>>>>>> ensure that we always use the luid of the insn which is the first insn
>>>>>>> using this register in the block.  The last_set invalidation is going
>>>>>>> to catch the case like:
>>>>>>>
>>>>>>>    ... regX  // avoid the set used here ...
>>>>>>>    regX = ...
>>>>>>>    ...
>>>>>>>
>>>>>>> Once we have the smallest luid one of all insns which use register X,
>>>>>>> any unsafe regX sets should be caught.
>>>>>>>
>>>>>>> I updated the comments to:
>>>>>>>
>>>>>>> +              /* Since combination may generate some instructions
>>>>>>> +                 to replace some foregoing instructions with the
>>>>>>> +                 references to register r (using register r), we
>>>>>>> +                 need to make sure we record the first instruction
>>>>>>> +                 which is using register r, so always update with
>>>>>>> +                 the lowest luid here.  If the given set happens
>>>>>>> +                 before this recorded earliest reference, the set
>>>>>>> +                 value should be safe to be used.  */
>>>>>>>
>>>>>>>>> @@ -13359,7 +13378,10 @@ record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
>>>>>>>>>  
>>>>>>>>>    /* Mark registers that are being referenced in this value.  */
>>>>>>>>>    if (value)
>>>>>>>>> -    update_table_tick (value);
>>>>>>>>> +    {
>>>>>>>>> +      gcc_assert (insn);
>>>>>>>>> +      update_table_tick (value, DF_INSN_LUID (insn));
>>>>>>>>> +    }
>>>>>>>>
>>>>>>>> Don't add that assert please.  If you really want one it should come
>>>>>>>> right at the start of the function, not 60 lines later :-)
>>>>>>>>
>>>>>>>
>>>>>>> Exactly, fixed.
>>>>>>>
>>>>>>>> Looks good if I understood this correctly :-)
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>> Thanks again, I also updated the comments in func record_value_for_reg,
>>>>>>> the new version is attached.
>>>>>>>
>>>>>>> BR,
>>>>>>> Kewen
>>>>>>> -----
>>>>>>> gcc/ChangeLog:
>>>>>>>
>>>>>>> 	* combine.c (struct reg_stat_type): New member
>>>>>>> 	last_set_table_luid.
>>>>>>> 	(update_table_tick): Add one argument for insn luid and
>>>>>>> 	set last_set_table_luid with it, remove its declaration.
>>>>>>> 	(record_value_for_reg): Adjust the condition to set
>>>>>>> 	last_set_invalid nonzero.
>>>>>>>


BR,
Kewen
Segher Boessenkool Nov. 29, 2021, 10:28 p.m. UTC | #8
Hi!

On Fri, Jun 11, 2021 at 09:16:21PM +0800, Kewen.Lin wrote:
> >> +	      /* Should pick up the lowest luid if the references
> >> +		 are in the same block.  */
> >> +	      if (label_tick == rsp->last_set_table_tick
> >> +		  && rsp->last_set_table_luid > insn_luid)
> >> +		rsp->last_set_table_luid = insn_luid;
> > 
> > Why?  Is it conservative for the check you will do later?  Please spell
> > this out, it is crucial!
> 
> Since later the combinations involving this insn probably make the
> register be used in one insn sitting ahead (which has smaller luid than
> the one which was recorded before).  Yes, it's very conservative, this
> ensure that we always use the luid of the insn which is the first insn
> using this register in the block.

Why would that be correct?!

> The last_set invalidation is going
> to catch the case like:
> 
>    ... regX  // avoid the set used here ...
>    regX = ...
>    ...
> 
> Once we have the smallest luid one of all insns which use register X,
> any unsafe regX sets should be caught.

Yes, you invalidate more, but because you put lies in the table :-(

> 	* combine.c (struct reg_stat_type): New member
> 	last_set_table_luid.

This fits on one line.

> 	(update_table_tick): Add one argument for insn luid and
> 	set last_set_table_luid with it, remove its declaration.
> 	(record_value_for_reg): Adjust the condition to set
> 	last_set_invalid nonzero.

These lines break earlier than they should as well.

> +  /* Record the luid of the insn which uses register n, the insn should
> +     be the first one using register n in that block of the insn which
> +     last_set_table_tick was set for.  */
> +
> +  int				last_set_table_luid;

I'm not sure what this variable is for.  The comment says something
else than the variable name does, and now I don't know what to
believe :-)

The name says it is for a SET, the explanation says it is for a USE.


Segher
Kewen.Lin Nov. 30, 2021, 8:47 a.m. UTC | #9
Hi Segher,

Thanks for the review!

on 2021/11/30 上午6:28, Segher Boessenkool wrote:
> Hi!
> 
> On Fri, Jun 11, 2021 at 09:16:21PM +0800, Kewen.Lin wrote:
>>>> +	      /* Should pick up the lowest luid if the references
>>>> +		 are in the same block.  */
>>>> +	      if (label_tick == rsp->last_set_table_tick
>>>> +		  && rsp->last_set_table_luid > insn_luid)
>>>> +		rsp->last_set_table_luid = insn_luid;
>>>
>>> Why?  Is it conservative for the check you will do later?  Please spell
>>> this out, it is crucial!
>>
>> Since later the combinations involving this insn probably make the
>> register be used in one insn sitting ahead (which has smaller luid than
>> the one which was recorded before).  Yes, it's very conservative, this
>> ensure that we always use the luid of the insn which is the first insn
>> using this register in the block.
> 
> Why would that be correct?!
> 

The later check has:

       if (!insn
-          || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
+          || (value && rsp->last_set_table_tick >= label_tick_ebb_start
+              && !(label_tick == rsp->last_set_table_tick
+                   && DF_INSN_LUID (insn) < rsp->last_set_table_luid)))
         rsp->last_set_invalid = 1;

For "label_tick != rsp->last_set_table_tick", it's the same as before.

For "label_tick == rsp->last_set_table_tick", we have the below:

+              if (label_tick == rsp->last_set_table_tick
+                  && rsp->last_set_table_luid > insn_luid)
+                rsp->last_set_table_luid = insn_luid;

It keeps checking and updating with the smallest LUID of the insns which
have the expression involving register n are placed in last_set_value.

The updating here aims to ensure we always the LUID of first INSN which
uses register n (or saying that having one expression involving register n
is placed in last_set_value).

For the first time we set last_set_table_tick for register n, we will also
set last_set_table_luid.  For below case, we record x for LUID.  Assuming
we combining 1,2,x to 2,x and regX is updated to be used in insn2.  Then
the first INSN using regX has become to insn 2.

  ... reg1 // insn 1
  ...
  ... reg2 // insn 2
  ...
  ... regX // insn x
  ...
  regX     // insn y
  ...

Later whether combining moves regX setting upward or not, the LUID which it
compares with is always the updated smallest one (insn 2 here), not the one
which is set at the beginning.  So I think it's conservative.

>> The last_set invalidation is going
>> to catch the case like:
>>
>>    ... regX  // avoid the set used here ...
>>    regX = ...
>>    ...
>>
>> Once we have the smallest luid one of all insns which use register X,
>> any unsafe regX sets should be caught.
> 
> Yes, you invalidate more, but because you put lies in the table :-(
> 

This patch tries to relax some restrictions, it seems there are no lies.  :)
Could you help to explain this comment more?

>> 	* combine.c (struct reg_stat_type): New member
>> 	last_set_table_luid.
> 
> This fits on one line.
> 
>> 	(update_table_tick): Add one argument for insn luid and
>> 	set last_set_table_luid with it, remove its declaration.
>> 	(record_value_for_reg): Adjust the condition to set
>> 	last_set_invalid nonzero.
> 
> These lines break earlier than they should as well.
> 
>> +  /* Record the luid of the insn which uses register n, the insn should
>> +     be the first one using register n in that block of the insn which
>> +     last_set_table_tick was set for.  */
>> +
>> +  int				last_set_table_luid;
> 
> I'm not sure what this variable is for.  The comment says something
> else than the variable name does, and now I don't know what to
> believe :-)
> 
> The name says it is for a SET, the explanation says it is for a USE.
> 

Good point.  :)  For the existing last_set_table_tick,
  /* Record the value of label_tick when an expression involving register n
     is placed in last_set_value.  */

  int				last_set_table_tick;

it seems it has the set in the name too, but "an expression involving
register n " is actually a reference (use) to register n?

How about the below one referring to "last_set_table_tick"?

/* Record the smallest luid of the insns whose expressions involving
   register n are placed in last_set_value, meanwhile the insns locate
   in the same block of the insn which last_set_table_tick was set for.  */


BR,
Kewen
diff mbox series

Patch

diff --git a/gcc/combine.c b/gcc/combine.c
index 62bf4aeaaba..6ef6a08d54f 100644
--- a/gcc/combine.c
+++ b/gcc/combine.c
@@ -202,6 +202,12 @@  struct reg_stat_type {
 
   int				last_set_table_tick;
 
+  /* Record the luid of the insn which uses register n, the insn should
+     be the first one using register n in that block of the insn which
+     last_set_table_tick was set for.  */
+
+  int				last_set_table_luid;
+
   /* Record the value of label_tick when the value for register n is placed in
      last_set_value.  */
 
@@ -476,7 +482,6 @@  static rtx gen_lowpart_for_combine (machine_mode, rtx);
 static enum rtx_code simplify_compare_const (enum rtx_code, machine_mode,
 					     rtx, rtx *);
 static enum rtx_code simplify_comparison (enum rtx_code, rtx *, rtx *);
-static void update_table_tick (rtx);
 static void record_value_for_reg (rtx, rtx_insn *, rtx);
 static void check_promoted_subreg (rtx_insn *, rtx);
 static void record_dead_and_set_regs_1 (rtx, const_rtx, void *);
@@ -13179,7 +13184,7 @@  count_rtxs (rtx x)
    for each register mentioned.  Similar to mention_regs in cse.c  */
 
 static void
-update_table_tick (rtx x)
+update_table_tick (rtx x, int insn_luid)
 {
   enum rtx_code code = GET_CODE (x);
   const char *fmt = GET_RTX_FORMAT (code);
@@ -13194,7 +13199,27 @@  update_table_tick (rtx x)
       for (r = regno; r < endregno; r++)
 	{
 	  reg_stat_type *rsp = &reg_stat[r];
-	  rsp->last_set_table_tick = label_tick;
+	  if (rsp->last_set_table_tick >= label_tick_ebb_start)
+	    {
+	      /* Later references should not have lower ticks.  */
+	      gcc_assert (label_tick >= rsp->last_set_table_tick);
+	      /* Since combination may generate some instructions
+		 to replace some foregoing instructions with the
+		 references to register r (using register r), we
+		 need to make sure we record the first instruction
+		 which is using register r, so always update with
+		 the lowest luid here.  If the given set happens
+		 before this recorded earliest reference, the set
+		 value should be safe to be used.  */
+	      if (label_tick == rsp->last_set_table_tick
+		  && rsp->last_set_table_luid > insn_luid)
+		rsp->last_set_table_luid = insn_luid;
+	    }
+	  else
+	    {
+	      rsp->last_set_table_tick = label_tick;
+	      rsp->last_set_table_luid = insn_luid;
+	    }
 	}
 
       return;
@@ -13230,16 +13255,17 @@  update_table_tick (rtx x)
 	    if (ARITHMETIC_P (x0)
 		&& (x1 == XEXP (x0, 0) || x1 == XEXP (x0, 1)))
 	      {
-		update_table_tick (XEXP (x0, x1 == XEXP (x0, 0) ? 1 : 0));
+		update_table_tick (XEXP (x0, x1 == XEXP (x0, 0) ? 1 : 0),
+				   insn_luid);
 		break;
 	      }
 	  }
 
-	update_table_tick (XEXP (x, i));
+	update_table_tick (XEXP (x, i), insn_luid);
       }
     else if (fmt[i] == 'E')
       for (j = 0; j < XVECLEN (x, i); j++)
-	update_table_tick (XVECEXP (x, i, j));
+	update_table_tick (XVECEXP (x, i, j), insn_luid);
 }
 
 /* Record that REG is set to VALUE in insn INSN.  If VALUE is zero, we
@@ -13310,21 +13336,26 @@  record_value_for_reg (rtx reg, rtx_insn *insn, rtx value)
 
   /* Mark registers that are being referenced in this value.  */
   if (value)
-    update_table_tick (value);
+    update_table_tick (value, DF_INSN_LUID (insn));
 
   /* Now update the status of each register being set.
      If someone is using this register in this block, set this register
      to invalid since we will get confused between the two lives in this
      basic block.  This makes using this register always invalid.  In cse, we
      scan the table to invalidate all entries using this register, but this
-     is too much work for us.  */
+     is too much work for us.  If we know this register set and its register
+     uses are in the same block, and the set always happens before any uses,
+     we don't need to make it invalid.  */
 
   for (i = regno; i < endregno; i++)
     {
       rsp = &reg_stat[i];
       rsp->last_set_label = label_tick;
+      gcc_assert (label_tick >= rsp->last_set_table_tick);
       if (!insn
-	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start))
+	  || (value && rsp->last_set_table_tick >= label_tick_ebb_start
+	      && !(label_tick == rsp->last_set_table_tick
+		   && DF_INSN_LUID (insn) < rsp->last_set_table_luid)))
 	rsp->last_set_invalid = 1;
       else
 	rsp->last_set_invalid = 0;