Fix tree-optimization/91169

Message ID	1764855.0oU1eeNi03@arcturus.home
State	New
Headers	show Return-Path: <gcc-patches-return-505961-incoming=patchwork.ozlabs.org@gcc.gnu.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:from :to:subject:date:message-id:mime-version:content-type :content-transfer-encoding; q=dns; s=default; b=FXHJwj45Ys74zpfv p/p1ZDCbI0bsAKrzdohXTCeP96eD7ELp6IbAbcRg8mLzd34CfkncDD8WXt+P415E HqhzgxXW/SY6KUwai9K5pcOZlpL7GT5eqEWY/++fM+mpWqUCAhBMfXkk+VyGmCN2 GqxD6lx4OsA4Rg/ZumgVsFt5EKI= Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk Sender: gcc-patches-owner@gcc.gnu.org From: Eric Botcazou <ebotcazou@adacore.com> To: gcc-patches@gcc.gnu.org Subject: [patch] Fix tree-optimization/91169 Date: Thu, 01 Aug 2019 10:27:43 +0200 Message-ID: <1764855.0oU1eeNi03@arcturus.home> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="nextPart1592953.AfkLepBaZp" Content-Transfer-Encoding: 7Bit
Series	Fix tree-optimization/91169 \| expand Fix tree-optimization/91169

Eric Botcazou Aug. 1, 2019, 8:27 a.m. UTC

Hi,

this fixes the cd2a31a regression in the ACATS testsuite on 32-bit targets 
introduced by the recent change to get_array_ctor_element_at_index:

	* fold-const.h (get_array_ctor_element_at_index): Adjust.
	* fold-const.c (get_array_ctor_element_at_index): Add
	ctor_idx output parameter informing the caller where in
	the constructor the element was (not) found.  Add early exit
	for when the ctor is sorted.

This change overlooks that the index can wrap around during the traversal of 
the CONSTRUCTOR and therefore causes the function to return bogus values as 
soon as this happens.  Moreover, given this chunk of added code:

      else if (in_gimple_form)
	/* We're past the element we search for.  Note during parsing
	   the elements might not be sorted.
	   ???  We should use a binary search and a flag on the
	   CONSTRUCTOR as to whether elements are sorted in declaration
	   order.  */
	break;

I would respectfully suggest that the author thinks about redoing things from 
scratch here.  In the meantime, the attached patch kludges around the issue.

Tested on x86_64-suse-linux, OK for the mainline?


2019-08-01  Eric Botcazou  <ebotcazou@adacore.com>

	PR tree-optimization/91169
	* fold-const.c (get_array_ctor_element_at_index): Remove early exit and
	do not return a bogus ctor_idx when the index wraps around.

Richard Biener Aug. 1, 2019, 9:20 a.m. UTC | #1

On Thu, Aug 1, 2019 at 10:27 AM Eric Botcazou <ebotcazou@adacore.com> wrote:
>
> Hi,
>
> this fixes the cd2a31a regression in the ACATS testsuite on 32-bit targets
> introduced by the recent change to get_array_ctor_element_at_index:
>
>         * fold-const.h (get_array_ctor_element_at_index): Adjust.
>         * fold-const.c (get_array_ctor_element_at_index): Add
>         ctor_idx output parameter informing the caller where in
>         the constructor the element was (not) found.  Add early exit
>         for when the ctor is sorted.
>
> This change overlooks that the index can wrap around during the traversal of
> the CONSTRUCTOR and therefore causes the function to return bogus values as
> soon as this happens.  Moreover, given this chunk of added code:

So isn't this an issue with the code before when we have a RANGE_EXPR
that covers the wrapping point?  Then index > max_index and may not
catch the found element with

    /* Do we have match?  */
    if (wi::cmpu (access_index, index) >= 0
        && wi::cmpu (access_index, max_index) <= 0)
      return cval;

?  We seem to be careful to convert the indices to the index type but
above use unsigned compares - isn't that the underlying issue?  So
isn't

Index: gcc/fold-const.c
===================================================================
--- gcc/fold-const.c    (revision 273968)
+++ gcc/fold-const.c    (working copy)
@@ -11864,9 +11864,13 @@ get_array_ctor_element_at_index (tree ct
        }
     }

+  signop index_sgn = UNSIGNED;
   if (index_type)
-    access_index = wi::ext (access_index, TYPE_PRECISION (index_type),
-                           TYPE_SIGN (index_type));
+    {
+      index_sgn = TYPE_SIGN (index_type);
+      access_index = wi::ext (access_index, TYPE_PRECISION (index_type),
+                             TYPE_SIGN (index_type));
+    }

   offset_int index = low_bound - 1;
   if (index_type)
@@ -11903,9 +11907,9 @@ get_array_ctor_element_at_index (tree ct
        }

       /* Do we have match?  */
-      if (wi::cmpu (access_index, index) >= 0)
+      if (wi::cmp (access_index, index, index_sgn) >= 0)
        {
-         if (wi::cmpu (access_index, max_index) <= 0)
+         if (wi::cmp (access_index, max_index, index_sgn) <= 0)
            {
              if (ctor_idx)
                *ctor_idx = cnt;

>       else if (in_gimple_form)
>         /* We're past the element we search for.  Note during parsing
>            the elements might not be sorted.
>            ???  We should use a binary search and a flag on the
>            CONSTRUCTOR as to whether elements are sorted in declaration
>            order.  */
>         break;
>
> I would respectfully suggest that the author thinks about redoing things from
> scratch here.  In the meantime, the attached patch kludges around the issue.

I'm not sure how "doing from scratch" would fix things - we simply rely
on reliably detecting an element match.

I wonder how much we can rely on the array domain type to be in sync
with the constructor field entries.

Also we're using offset_int here - doesn't that break when with
Ada we have an array type with domain [uint128_t_max - 1, uint128_t_max + 2]
in a type larger than int128_t?  Or, since we're interpreting it
inconsistently signed/unsigned, a 128bit integer typed domain wraps
around the sign?

If that's not an issue then using signed compare unconditionally is
a valid fix as well I guess.

But in reality 'index' should never really wrap, that is, if the domain
is of 'int' type and starts with INT_MAX then the array shall not have
more than 1 element.  No?

The patch you posted isn't a real fix btw, since iff the break was
hit we either missed an index that is present or we found a gap
but then we can break.

So I'd appreciate more explanation on how the index "wraps".

Richard.

> Tested on x86_64-suse-linux, OK for the mainline?
>
>
> 2019-08-01  Eric Botcazou  <ebotcazou@adacore.com>
>
>         PR tree-optimization/91169
>         * fold-const.c (get_array_ctor_element_at_index): Remove early exit and
>         do not return a bogus ctor_idx when the index wraps around.
>
> --
> Eric Botcazou

Eric Botcazou Aug. 1, 2019, 5:12 p.m. UTC | #2

> So isn't this an issue with the code before when we have a RANGE_EXPR
> that covers the wrapping point?

No, see the reduced testcase attached in the PR.

> Then index > max_index and may not catch the found element with
> 
>     /* Do we have match?  */
>     if (wi::cmpu (access_index, index) >= 0
>         && wi::cmpu (access_index, max_index) <= 0)
>       return cval;
> 
> ?  We seem to be careful to convert the indices to the index type but
> above use unsigned compares - isn't that the underlying issue?

Not really, everything is properly unsigned at this point, so the traversal 
wraps around in an unsigned type.

> I'm not sure how "doing from scratch" would fix things - we simply rely
> on reliably detecting an element match.
> 
> I wonder how much we can rely on the array domain type to be in sync
> with the constructor field entries.

At least we need to take into account the wraparound.

> Also we're using offset_int here - doesn't that break when with
> Ada we have an array type with domain [uint128_t_max - 1, uint128_t_max + 2]
> in a type larger than int128_t?  Or, since we're interpreting it
> inconsistently signed/unsigned, a 128bit integer typed domain wraps
> around the sign?

Integer types are limited to 64 bits in GNAT.

> If that's not an issue then using signed compare unconditionally is
> a valid fix as well I guess.

But all the types are unsigned here so this doesn't seem appealing.

> But in reality 'index' should never really wrap, that is, if the domain
> is of 'int' type and starts with INT_MAX then the array shall not have
> more than 1 element.  No?

The range is [-1, 1] converted to sizetype, which is unsigned, so the low 
bound is UINT32_MAX and the high bound is 1.

> The patch you posted isn't a real fix btw, since iff the break was
> hit we either missed an index that is present or we found a gap
> but then we can break.

No, we cannot break if the index wraps around, that's the bug.

> So I'd appreciate more explanation on how the index "wraps".

See above.  I can rewind the entire chain of events if need be, but this is a 
long one starting a long time ago with the initial blunder with PLUS_EXPR and 
sizetype.

Richard Biener Aug. 2, 2019, 8:24 a.m. UTC | #3

On Thu, Aug 1, 2019 at 7:11 PM Eric Botcazou <ebotcazou@adacore.com> wrote:
>
> > So isn't this an issue with the code before when we have a RANGE_EXPR
> > that covers the wrapping point?
>
> No, see the reduced testcase attached in the PR.
>
> > Then index > max_index and may not catch the found element with
> >
> >     /* Do we have match?  */
> >     if (wi::cmpu (access_index, index) >= 0
> >         && wi::cmpu (access_index, max_index) <= 0)
> >       return cval;
> >
> > ?  We seem to be careful to convert the indices to the index type but
> > above use unsigned compares - isn't that the underlying issue?
>
> Not really, everything is properly unsigned at this point, so the traversal
> wraps around in an unsigned type.
>
> > I'm not sure how "doing from scratch" would fix things - we simply rely
> > on reliably detecting an element match.
> >
> > I wonder how much we can rely on the array domain type to be in sync
> > with the constructor field entries.
>
> At least we need to take into account the wraparound.
>
> > Also we're using offset_int here - doesn't that break when with
> > Ada we have an array type with domain [uint128_t_max - 1, uint128_t_max + 2]
> > in a type larger than int128_t?  Or, since we're interpreting it
> > inconsistently signed/unsigned, a 128bit integer typed domain wraps
> > around the sign?
>
> Integer types are limited to 64 bits in GNAT.
>
> > If that's not an issue then using signed compare unconditionally is
> > a valid fix as well I guess.
>
> But all the types are unsigned here so this doesn't seem appealing.
>
> > But in reality 'index' should never really wrap, that is, if the domain
> > is of 'int' type and starts with INT_MAX then the array shall not have
> > more than 1 element.  No?
>
> The range is [-1, 1] converted to sizetype, which is unsigned, so the low
> bound is UINT32_MAX and the high bound is 1.

Oh.  OK, now I see.  Still, if we have this domain and a CONSTRUCTOR
with a RANGE_EXPR UINT32_MAX, 1 and we are asking for access_index
zero then the old code did

    /* Do we have match?  */
    if (wi::cmpu (access_index, index) >= 0
        && wi::cmpu (access_index, max_index) <= 0)
      return cval;

and index == UINT32_MAX, max_index == 1.  So we don't match it
but return NULL, assuming the constructor value is zero?

Not sure if the Ada FE ever creates RANGE_EXPRs for CONSTRUCTOR
entries, for single-valued entries the check of course works.  But as it
was written above I assumed certain ordering conditions must hold...
(grepping shows no traces of RANGE_EXPR in gigi)

I'm hoping for a GCC C extension to specify array domains so the
GIMPLE FE can be used to play with these kind of things...

> > The patch you posted isn't a real fix btw, since iff the break was
> > hit we either missed an index that is present or we found a gap
> > but then we can break.
>
> No, we cannot break if the index wraps around, that's the bug.
>
> > So I'd appreciate more explanation on how the index "wraps".
>
> See above.  I can rewind the entire chain of events if need be, but this is a
> long one starting a long time ago with the initial blunder with PLUS_EXPR and
> sizetype.

Yeah.  Note that you can very well use signed TYPE_DOMAIN nowadays
(OK, no guarantees...).

I think we should kludge around similar to how we do in stor-layout.c
which does

                /* ???  When it is obvious that the range is signed
                   represent it using ssizetype.  */
                if (TREE_CODE (lb) == INTEGER_CST
                    && TREE_CODE (ub) == INTEGER_CST
                    && TYPE_UNSIGNED (TREE_TYPE (lb))
                    && tree_int_cst_lt (ub, lb))
                  {
                    lb = wide_int_to_tree (ssizetype,
                                           offset_int::from (wi::to_wide (lb),
                                                             SIGNED));
                    ub = wide_int_to_tree (ssizetype,
                                           offset_int::from (wi::to_wide (ub),
                                                             SIGNED));
                  }

So I am testing the attached.

Richard.

>
> --
> Eric Botcazou

Richard Biener Aug. 2, 2019, 10:59 a.m. UTC | #4

On Fri, Aug 2, 2019 at 10:24 AM Richard Biener
<richard.guenther@gmail.com> wrote:
>
> On Thu, Aug 1, 2019 at 7:11 PM Eric Botcazou <ebotcazou@adacore.com> wrote:
> >
> > > So isn't this an issue with the code before when we have a RANGE_EXPR
> > > that covers the wrapping point?
> >
> > No, see the reduced testcase attached in the PR.
> >
> > > Then index > max_index and may not catch the found element with
> > >
> > >     /* Do we have match?  */
> > >     if (wi::cmpu (access_index, index) >= 0
> > >         && wi::cmpu (access_index, max_index) <= 0)
> > >       return cval;
> > >
> > > ?  We seem to be careful to convert the indices to the index type but
> > > above use unsigned compares - isn't that the underlying issue?
> >
> > Not really, everything is properly unsigned at this point, so the traversal
> > wraps around in an unsigned type.
> >
> > > I'm not sure how "doing from scratch" would fix things - we simply rely
> > > on reliably detecting an element match.
> > >
> > > I wonder how much we can rely on the array domain type to be in sync
> > > with the constructor field entries.
> >
> > At least we need to take into account the wraparound.
> >
> > > Also we're using offset_int here - doesn't that break when with
> > > Ada we have an array type with domain [uint128_t_max - 1, uint128_t_max + 2]
> > > in a type larger than int128_t?  Or, since we're interpreting it
> > > inconsistently signed/unsigned, a 128bit integer typed domain wraps
> > > around the sign?
> >
> > Integer types are limited to 64 bits in GNAT.
> >
> > > If that's not an issue then using signed compare unconditionally is
> > > a valid fix as well I guess.
> >
> > But all the types are unsigned here so this doesn't seem appealing.
> >
> > > But in reality 'index' should never really wrap, that is, if the domain
> > > is of 'int' type and starts with INT_MAX then the array shall not have
> > > more than 1 element.  No?
> >
> > The range is [-1, 1] converted to sizetype, which is unsigned, so the low
> > bound is UINT32_MAX and the high bound is 1.
>
> Oh.  OK, now I see.  Still, if we have this domain and a CONSTRUCTOR
> with a RANGE_EXPR UINT32_MAX, 1 and we are asking for access_index
> zero then the old code did
>
>     /* Do we have match?  */
>     if (wi::cmpu (access_index, index) >= 0
>         && wi::cmpu (access_index, max_index) <= 0)
>       return cval;
>
> and index == UINT32_MAX, max_index == 1.  So we don't match it
> but return NULL, assuming the constructor value is zero?
>
> Not sure if the Ada FE ever creates RANGE_EXPRs for CONSTRUCTOR
> entries, for single-valued entries the check of course works.  But as it
> was written above I assumed certain ordering conditions must hold...
> (grepping shows no traces of RANGE_EXPR in gigi)
>
> I'm hoping for a GCC C extension to specify array domains so the
> GIMPLE FE can be used to play with these kind of things...
>
> > > The patch you posted isn't a real fix btw, since iff the break was
> > > hit we either missed an index that is present or we found a gap
> > > but then we can break.
> >
> > No, we cannot break if the index wraps around, that's the bug.
> >
> > > So I'd appreciate more explanation on how the index "wraps".
> >
> > See above.  I can rewind the entire chain of events if need be, but this is a
> > long one starting a long time ago with the initial blunder with PLUS_EXPR and
> > sizetype.
>
> Yeah.  Note that you can very well use signed TYPE_DOMAIN nowadays
> (OK, no guarantees...).
>
> I think we should kludge around similar to how we do in stor-layout.c
> which does
>
>                 /* ???  When it is obvious that the range is signed
>                    represent it using ssizetype.  */
>                 if (TREE_CODE (lb) == INTEGER_CST
>                     && TREE_CODE (ub) == INTEGER_CST
>                     && TYPE_UNSIGNED (TREE_TYPE (lb))
>                     && tree_int_cst_lt (ub, lb))
>                   {
>                     lb = wide_int_to_tree (ssizetype,
>                                            offset_int::from (wi::to_wide (lb),
>                                                              SIGNED));
>                     ub = wide_int_to_tree (ssizetype,
>                                            offset_int::from (wi::to_wide (ub),
>                                                              SIGNED));
>                   }
>
> So I am testing the attached.

Testing went OK but it looks like acats doesn't honor
RUNTESTFLAGS so I got no multilib testing for it :/
And the PR didn't contain sth I could plug into gnat.dg so I checked
with visual inspection of dumps on the reduced testcase.
I notice pretty-printing is also confused about the wrapping ;)

  static struct p__intarray___PAD intarray = {.F={-100, [0]=0, 100}};

the [0]= is not necessary.

Anyway, I can see bogus IL before and still after the proposed patch :/
This is because I forgot to adjust cfield handling of setting index.

So I'm testing the adjusted patch attached which fixes the IL
and for testing have patched gcc/testsuite/ada/acats/run_all.sh
to add -m32.

Richard.

> Richard.
>
> >
> > --
> > Eric Botcazou

Rainer Orth Aug. 2, 2019, 11:25 a.m. UTC | #5

Hi Richard,

> Testing went OK but it looks like acats doesn't honor
> RUNTESTFLAGS so I got no multilib testing for it :/

indeed, see PR testsuite/37703.  I had the beginnings of a patch

	http://gcc.gnu.org/ml/gcc-patches/2011-01/msg02310.html
        http://gcc.gnu.org/ml/gcc-patches/2011-02/msg01469.html

but that got stuck years ago.

	Rainer

Eric Botcazou Aug. 5, 2019, 8:28 a.m. UTC | #6

> Testing went OK but it looks like acats doesn't honor
> RUNTESTFLAGS so I got no multilib testing for it :/
> And the PR didn't contain sth I could plug into gnat.dg so I checked
> with visual inspection of dumps on the reduced testcase.

Sorry about that, gnat.dg/array37.adb now attached.

> I notice pretty-printing is also confused about the wrapping ;)
> 
>   static struct p__intarray___PAD intarray = {.F={-100, [0]=0, 100}};
> 
> the [0]= is not necessary.
> 
> Anyway, I can see bogus IL before and still after the proposed patch :/
> This is because I forgot to adjust cfield handling of setting index.
> 
> So I'm testing the adjusted patch attached which fixes the IL
> and for testing have patched gcc/testsuite/ada/acats/run_all.sh
> to add -m32.

Thanks!

Richard Biener Aug. 5, 2019, 12:28 p.m. UTC | #7

On Mon, Aug 5, 2019 at 10:28 AM Eric Botcazou <ebotcazou@adacore.com> wrote:
>
> > Testing went OK but it looks like acats doesn't honor
> > RUNTESTFLAGS so I got no multilib testing for it :/
> > And the PR didn't contain sth I could plug into gnat.dg so I checked
> > with visual inspection of dumps on the reduced testcase.
>
> Sorry about that, gnat.dg/array37.adb now attached.
>
> > I notice pretty-printing is also confused about the wrapping ;)
> >
> >   static struct p__intarray___PAD intarray = {.F={-100, [0]=0, 100}};
> >
> > the [0]= is not necessary.
> >
> > Anyway, I can see bogus IL before and still after the proposed patch :/
> > This is because I forgot to adjust cfield handling of setting index.
> >
> > So I'm testing the adjusted patch attached which fixes the IL
> > and for testing have patched gcc/testsuite/ada/acats/run_all.sh
> > to add -m32.
>
> Thanks!

Fixed as follows - I've added checking asserts that wrapping doesn't
occur and also fixed another bug which treated

 { .el = { RANGE [1, 4], 0 }, .el = { NULL_TREE, 1 } }

as setting [2] to 1.

Bootstrapped and tested on x86_64-unknown-linux-gnu, applied.

Richard.

Fix tree-optimization/91169

Commit Message

Comments

Patch