PR71275 ira.c bb_loop_depth

Message ID	20160526110205.GJ3300@bubble.grove.modra.org
State	New
Headers	show Return-Path: <gcc-patches-return-428346-incoming=patchwork.ozlabs.org@gcc.gnu.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:date :from:to:subject:message-id:mime-version:content-type; q=dns; s= default; b=ZHfTyWe7WDuKJT5AFZqEIAtf1wYbuXt7VKmYWQsgurCwNx4yGCuCP TFw/FnHZNV8+P7YugNOsph2ZtOH8tAhSktgjWcp2B3Mbt0zEIuPEtnovFnuZPgYG H1kzuu4PH4LP6TYan3fmfZT/qsxg+bug/rHoSDwI6y7l7ZKn2mNbPo= Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk Sender: gcc-patches-owner@gcc.gnu.org Date: Thu, 26 May 2016 20:32:06 +0930 From: Alan Modra <amodra@gmail.com> To: gcc-patches@gcc.gnu.org Subject: [PATCH] PR71275 ira.c bb_loop_depth Message-ID: <20160526110205.GJ3300@bubble.grove.modra.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.24 (2015-08-30)

Alan Modra May 26, 2016, 11:02 a.m. UTC

This fixes lack of bb_loop_depth info in some of the early parts of
ira, which has been the case for quite some time.  All active branches
return 0 from bb_loop_depth() in update_equiv_regs, but whether that
actually causes mis-optimization anywhere but trunk is yet to be
determined.

I played a little with trying to consolidate this loop_optimizer_init
call with one that occurs a little later, but ran into ICEs.  (We now
have four calls to loop_optimizer_init in ira.c.)

Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
OK to apply?

	PR rtl-optimization/71275
	* ira.c (ira): Call loop_optimizer_init to set up bb_loop_depth
	for update_equiv_regs and combine_and_move_insns.

Vladimir Makarov May 26, 2016, 2:12 p.m. UTC | #1

On 05/26/2016 07:02 AM, Alan Modra wrote:
> This fixes lack of bb_loop_depth info in some of the early parts of
> ira, which has been the case for quite some time.  All active branches
> return 0 from bb_loop_depth() in update_equiv_regs, but whether that
> actually causes mis-optimization anywhere but trunk is yet to be
> determined.
>
> I played a little with trying to consolidate this loop_optimizer_init
> call with one that occurs a little later, but ran into ICEs.  (We now
> have four calls to loop_optimizer_init in ira.c.)
>
> Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
> OK to apply?
>
Yes.  Thank you, Alan.

Alan Modra May 27, 2016, 2:14 a.m. UTC | #2

On Thu, May 26, 2016 at 10:12:14AM -0400, Vladimir Makarov wrote:
> On 05/26/2016 07:02 AM, Alan Modra wrote:
> >This fixes lack of bb_loop_depth info in some of the early parts of
> >ira, which has been the case for quite some time.  All active branches
> >return 0 from bb_loop_depth() in update_equiv_regs, but whether that
> >actually causes mis-optimization anywhere but trunk is yet to be
> >determined.
> >
> >I played a little with trying to consolidate this loop_optimizer_init
> >call with one that occurs a little later, but ran into ICEs.  (We now
> >have four calls to loop_optimizer_init in ira.c.)
> >
> >Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
> >OK to apply?
> >
> Yes.  Thank you, Alan.

Hi Vlad,
Sorry to do this to you and others, but the patch (committed as
r236789) may be wrong.  I didn't see any problems on trunk but when
I backported to gcc-5, I hit an error in stage2 compiling
insn-recog.c "dominator of 10 status unknown" from if_after_reload.

On gcc-5, the error disappears by adding a call to
  free_dominance_info (CDI_DOMINATORS);
after the newly added call to loop_optimizer_finalize.

I'm not sure yet what is going on.  Does anyone know whether the
free_dominance_info call is needed on trunk?

Vladimir Makarov May 27, 2016, 3:04 a.m. UTC | #3

On 05/26/2016 10:14 PM, Alan Modra wrote:
> On Thu, May 26, 2016 at 10:12:14AM -0400, Vladimir Makarov wrote:
>> On 05/26/2016 07:02 AM, Alan Modra wrote:
>>> This fixes lack of bb_loop_depth info in some of the early parts of
>>> ira, which has been the case for quite some time.  All active branches
>>> return 0 from bb_loop_depth() in update_equiv_regs, but whether that
>>> actually causes mis-optimization anywhere but trunk is yet to be
>>> determined.
>>>
>>> I played a little with trying to consolidate this loop_optimizer_init
>>> call with one that occurs a little later, but ran into ICEs.  (We now
>>> have four calls to loop_optimizer_init in ira.c.)
>>>
>>> Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
>>> OK to apply?
>>>
>> Yes.  Thank you, Alan.
> Hi Vlad,
> Sorry to do this to you and others, but the patch (committed as
> r236789) may be wrong.  I didn't see any problems on trunk but when
> I backported to gcc-5, I hit an error in stage2 compiling
> insn-recog.c "dominator of 10 status unknown" from if_after_reload.
>
> On gcc-5, the error disappears by adding a call to
>    free_dominance_info (CDI_DOMINATORS);
> after the newly added call to loop_optimizer_finalize.
>
> I'm not sure yet what is going on.  Does anyone know whether the
> free_dominance_info call is needed on trunk?
>
That is ok.  It is always a discovery.  I am not sure but I think I saw 
this problem when I wrote IRA.

Looking at the dominance code, I seems to me that it can reuse the 
previous info if it was not cleared.  So I guess free_dominance_info is 
important.

H.J. Lu June 14, 2016, 4:26 p.m. UTC | #4

On Thu, May 26, 2016 at 4:02 AM, Alan Modra <amodra@gmail.com> wrote:
> This fixes lack of bb_loop_depth info in some of the early parts of
> ira, which has been the case for quite some time.  All active branches
> return 0 from bb_loop_depth() in update_equiv_regs, but whether that
> actually causes mis-optimization anywhere but trunk is yet to be
> determined.
>
> I played a little with trying to consolidate this loop_optimizer_init
> call with one that occurs a little later, but ran into ICEs.  (We now
> have four calls to loop_optimizer_init in ira.c.)
>
> Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
> OK to apply?
>
>         PR rtl-optimization/71275
>         * ira.c (ira): Call loop_optimizer_init to set up bb_loop_depth
>         for update_equiv_regs and combine_and_move_insns.
>

This caused:

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71531

Alan Modra June 15, 2016, 1:30 a.m. UTC | #5

On Tue, Jun 14, 2016 at 09:26:19AM -0700, H.J. Lu wrote:
> On Thu, May 26, 2016 at 4:02 AM, Alan Modra <amodra@gmail.com> wrote:
> > This fixes lack of bb_loop_depth info in some of the early parts of
> > ira, which has been the case for quite some time.  All active branches
> > return 0 from bb_loop_depth() in update_equiv_regs, but whether that
> > actually causes mis-optimization anywhere but trunk is yet to be
> > determined.
> >
> > I played a little with trying to consolidate this loop_optimizer_init
> > call with one that occurs a little later, but ran into ICEs.  (We now
> > have four calls to loop_optimizer_init in ira.c.)
> >
> > Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
> > OK to apply?
> >
> >         PR rtl-optimization/71275
> >         * ira.c (ira): Call loop_optimizer_init to set up bb_loop_depth
> >         for update_equiv_regs and combine_and_move_insns.
> >
> 
> This caused:
> 
> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71531

I have an x86 ABI question.  Is it correct according to the ABI that
parameter passing stack slots always have the same value on function
exit as they had on entry?

Between these two calls to _gfortran_string_verify,
  if (verify(c4, "A", back = .true.) .ne. 3) call abort
  if (verify(c4, "AB") .ne. 0) call abort
it seems that gfortran is assuming that parameters passed on the stack
are unchanged.

_gfortran_string_verify is a C function, declared as follows in
libgfortran/intrinsics/string_intrinsics_inc.c:
gfc_charlen_type
string_verify (gfc_charlen_type slen, const CHARTYPE *str,
	       gfc_charlen_type setlen, const CHARTYPE *set,
	       GFC_LOGICAL_4 back)

This function happens to modify the stack slot for "slen" to -1 when
"back" is true.

Alan Modra June 15, 2016, 6:01 a.m. UTC | #6

On Wed, Jun 15, 2016 at 11:00:22AM +0930, Alan Modra wrote:
> On Tue, Jun 14, 2016 at 09:26:19AM -0700, H.J. Lu wrote:
> > On Thu, May 26, 2016 at 4:02 AM, Alan Modra <amodra@gmail.com> wrote:
> > > This fixes lack of bb_loop_depth info in some of the early parts of
> > > ira, which has been the case for quite some time.  All active branches
> > > return 0 from bb_loop_depth() in update_equiv_regs, but whether that
> > > actually causes mis-optimization anywhere but trunk is yet to be
> > > determined.
> > >
> > > I played a little with trying to consolidate this loop_optimizer_init
> > > call with one that occurs a little later, but ran into ICEs.  (We now
> > > have four calls to loop_optimizer_init in ira.c.)
> > >
> > > Bootstrapped and regression tested powerpc64le-linux and x86_64-linux.
> > > OK to apply?
> > >
> > >         PR rtl-optimization/71275
> > >         * ira.c (ira): Call loop_optimizer_init to set up bb_loop_depth
> > >         for update_equiv_regs and combine_and_move_insns.
> > >
> > 
> > This caused:
> > 
> > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71531

So, not guilty.  Or at least only guilty of exposing another bug.
You have a problem with parameter passing.

cat > pass.c <<EOF
extern int __attribute__ ((pure)) f1 (int, int);

int main (void)
{
  if (f1 (1, 0) != 2)
    __builtin_abort ();
  if (f1 (1, 1) != 0)
    __builtin_abort ();
  return 0;
}
EOF
cat > pass2.c <<EOF
int __attribute__ ((pure)) f2 (int);

int __attribute__ ((pure)) f1 (int a, int b)
{
  int x = a;
  if (b)
    x = -a;
  return f2 (x + 1);
}
EOF
cat > pass3.c <<EOF
int __attribute__ ((pure)) f2 (int a)
{
  return a;
}
EOF
gcc -m32 -march=corei7 -mtune=slm -O2 pass.c pass2.c pass3.c
./a.out
Aborted
gcc -O2 pass.c pass2.c pass3.c
./a.out

> I have an x86 ABI question.  Is it correct according to the ABI that
> parameter passing stack slots always have the same value on function
> exit as they had on entry?
> 
> Between these two calls to _gfortran_string_verify,
>   if (verify(c4, "A", back = .true.) .ne. 3) call abort
>   if (verify(c4, "AB") .ne. 0) call abort
> it seems that gfortran is assuming that parameters passed on the stack
> are unchanged.
> 
> _gfortran_string_verify is a C function, declared as follows in
> libgfortran/intrinsics/string_intrinsics_inc.c:
> gfc_charlen_type
> string_verify (gfc_charlen_type slen, const CHARTYPE *str,
> 	       gfc_charlen_type setlen, const CHARTYPE *set,
> 	       GFC_LOGICAL_4 back)
> 
> This function happens to modify the stack slot for "slen" to -1 when
> "back" is true.
> 
> -- 
> Alan Modra
> Australia Development Lab, IBM

Bernd Schmidt June 15, 2016, 9:49 a.m. UTC | #7

On 06/15/2016 03:30 AM, Alan Modra wrote:
> Between these two calls to _gfortran_string_verify,
>   if (verify(c4, "A", back = .true.) .ne. 3) call abort
>   if (verify(c4, "AB") .ne. 0) call abort
> it seems that gfortran is assuming that parameters passed on the stack
> are unchanged.

How? Is this something in the Fortran frontend, or is there CSE going on 
for stores to the argument area?


Bernd

H.J. Lu June 15, 2016, 12:18 p.m. UTC | #8

On Wed, Jun 15, 2016 at 2:49 AM, Bernd Schmidt <bschmidt@redhat.com> wrote:
> On 06/15/2016 03:30 AM, Alan Modra wrote:
>>
>> Between these two calls to _gfortran_string_verify,
>>   if (verify(c4, "A", back = .true.) .ne. 3) call abort
>>   if (verify(c4, "AB") .ne. 0) call abort
>> it seems that gfortran is assuming that parameters passed on the stack
>> are unchanged.
>
>
> How? Is this something in the Fortran frontend, or is there CSE going on for
> stores to the argument area?

Is this related to

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71532

Alan Modra June 15, 2016, 2:03 p.m. UTC | #9

On Wed, Jun 15, 2016 at 11:49:50AM +0200, Bernd Schmidt wrote:
> On 06/15/2016 03:30 AM, Alan Modra wrote:
> >Between these two calls to _gfortran_string_verify,
> >  if (verify(c4, "A", back = .true.) .ne. 3) call abort
> >  if (verify(c4, "AB") .ne. 0) call abort
> >it seems that gfortran is assuming that parameters passed on the stack
> >are unchanged.
> 
> How? Is this something in the Fortran frontend, or is there CSE going on for
> stores to the argument area?

It's not a fortran problem, and I answered my own question about the
ABI by the testcase in
https://gcc.gnu.org/ml/gcc-patches/2016-06/msg01098.html.  If tail
calls are allowed, then the stack parameter area must be overwritten.
Thus an ABI can't allow tail calls if it specifies the stack parameter
area is preserved.  x86 allows tail calls.  I didn't look further into
where the problem in arg setup occurs, sorry, except to note that
gcc-5 does not have the problem.

Bernd Schmidt June 15, 2016, 2:06 p.m. UTC | #10

On 06/15/2016 04:03 PM, Alan Modra wrote:
> On Wed, Jun 15, 2016 at 11:49:50AM +0200, Bernd Schmidt wrote:
>> On 06/15/2016 03:30 AM, Alan Modra wrote:
>>> Between these two calls to _gfortran_string_verify,
>>>  if (verify(c4, "A", back = .true.) .ne. 3) call abort
>>>  if (verify(c4, "AB") .ne. 0) call abort
>>> it seems that gfortran is assuming that parameters passed on the stack
>>> are unchanged.
>>
>> How? Is this something in the Fortran frontend, or is there CSE going on for
>> stores to the argument area?
>
> It's not a fortran problem, and I answered my own question about the
> ABI by the testcase in
> https://gcc.gnu.org/ml/gcc-patches/2016-06/msg01098.html.  If tail
> calls are allowed, then the stack parameter area must be overwritten.
> Thus an ABI can't allow tail calls if it specifies the stack parameter
> area is preserved.  x86 allows tail calls.

Well yes. The problem isn't that the stack area is overwritten, the 
problem is that something expects that it isn't, and it's not clear to 
me yet where that problem occurs.


Bernd

H.J. Lu June 15, 2016, 3:14 p.m. UTC | #11

On Wed, Jun 15, 2016 at 7:06 AM, Bernd Schmidt <bschmidt@redhat.com> wrote:
>
>
> On 06/15/2016 04:03 PM, Alan Modra wrote:
>>
>> On Wed, Jun 15, 2016 at 11:49:50AM +0200, Bernd Schmidt wrote:
>>>
>>> On 06/15/2016 03:30 AM, Alan Modra wrote:
>>>>
>>>> Between these two calls to _gfortran_string_verify,
>>>>  if (verify(c4, "A", back = .true.) .ne. 3) call abort
>>>>  if (verify(c4, "AB") .ne. 0) call abort
>>>> it seems that gfortran is assuming that parameters passed on the stack
>>>> are unchanged.
>>>
>>>
>>> How? Is this something in the Fortran frontend, or is there CSE going on
>>> for
>>> stores to the argument area?
>>
>>
>> It's not a fortran problem, and I answered my own question about the
>> ABI by the testcase in
>> https://gcc.gnu.org/ml/gcc-patches/2016-06/msg01098.html.  If tail
>> calls are allowed, then the stack parameter area must be overwritten.
>> Thus an ABI can't allow tail calls if it specifies the stack parameter
>> area is preserved.  x86 allows tail calls.
>
>
> Well yes. The problem isn't that the stack area is overwritten, the problem
> is that something expects that it isn't, and it's not clear to me yet where
> that problem occurs.

Will this patch

https://gcc.gnu.org/bugzilla/attachment.cgi?id=38705

also fix this

PR71275 ira.c bb_loop_depth

Commit Message

Comments

Patch