Fix PR81038 - Patchwork

Message ID	a8bf4899-aed9-f675-69f0-051ca986b73a@linux.vnet.ibm.com
State	New
Headers	show Return-Path: <gcc-patches-return-472569-incoming=patchwork.ozlabs.org@gcc.gnu.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id :list-unsubscribe:list-archive:list-post:list-help:sender:to:cc :from:subject:date:mime-version:content-type :content-transfer-encoding:message-id; q=dns; s=default; b=xs1b2 u6zNJbElgJeoQOW7+LkEXP0HbVQZmiWgBN8xRxbXxLKkqVkzn6pTnKvE9EzQRzpM xBnEKRioxjuwb2s+KMAx+5xWF8sVh3fzujtYQhUw6t28xdl6IyKzoWdr9hrYBpZG xWRzecGgQoJOeijij26wQ5gJtPjCR1XVubras0= Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk Sender: gcc-patches-owner@gcc.gnu.org Gateway: Authorized Use Only! Violators will be prosecuted for <gcc-patches@gcc.gnu.org> from <wschmidt@linux.vnet.ibm.com>; Fri, 2 Feb 2018 18:30:49 -0500 Gateway: Authorized Use Only! Violators will be prosecuted; Fri, 2 Feb 2018 18:30:46 -0500 To: GCC Patches <gcc-patches@gcc.gnu.org> Cc: Richard Biener <richard.guenther@gmail.com> From: Bill Schmidt <wschmidt@linux.vnet.ibm.com> Subject: [PATCH] Fix PR81038 Date: Fri, 2 Feb 2018 17:30:45 -0600 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:52.0) Gecko/20100101 Thunderbird/52.5.2 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Message-Id: <a8bf4899-aed9-f675-69f0-051ca986b73a@linux.vnet.ibm.com>
Series	Fix PR81038 \| expand Fix PR81038

Message ID

a8bf4899-aed9-f675-69f0-051ca986b73a@linux.vnet.ibm.com

State

New

Headers

DomainKey-Signature: a=rsa-sha1; c=nofws; d=gcc.gnu.org; h=list-id
	:list-unsubscribe:list-archive:list-post:list-help:sender:to:cc
	:from:subject:date:mime-version:content-type
	:content-transfer-encoding:message-id; q=dns; s=default; b=xs1b2
	u6zNJbElgJeoQOW7+LkEXP0HbVQZmiWgBN8xRxbXxLKkqVkzn6pTnKvE9EzQRzpM
	xBnEKRioxjuwb2s+KMAx+5xWF8sVh3fzujtYQhUw6t28xdl6IyKzoWdr9hrYBpZG
	xWRzecGgQoJOeijij26wQ5gJtPjCR1XVubras0=
Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm
Precedence: bulk
Sender: gcc-patches-owner@gcc.gnu.org
To: GCC Patches <gcc-patches@gcc.gnu.org>
Cc: Richard Biener <richard.guenther@gmail.com>
From: Bill Schmidt <wschmidt@linux.vnet.ibm.com>
Subject: [PATCH] Fix PR81038
Date: Fri, 2 Feb 2018 17:30:45 -0600
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13;
	rv:52.0) Gecko/20100101 Thunderbird/52.5.2
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
Message-Id: <a8bf4899-aed9-f675-69f0-051ca986b73a@linux.vnet.ibm.com>

Series

Fix PR81038 | expand

Commit Message

Bill Schmidt Feb. 2, 2018, 11:30 p.m. UTC

Hi,

The test g++.dg/vect/slp-pr56812.cc is somewhat fragile and is currently failing
on several targets.  PR81038 notes that this began with r248678, which stopped
some inferior peeling solutions from preventing vectorization that could be done
without peeling.  I observed that for powerpc64le, r248677 vectorizes the code 
during SLP, but r248678 vectorizes it during the loop vectorization pass.  Which
pass does the vectorization is quite dependent on cost model, which for us is a
quite close decision.  In any case, the important thing is that the code is 
vectorized, not which pass does it.

This patch prevents the test from flipping in and out of failure status depending
on which pass does the vectorization, by testing the final "optimized" dump for
the expected vectorized output instead of relying on a specific vectorization
pass dump.

By the way, the test case somehow had gotten DOS/Windows newlines into it, so
I removed those.  The ^M characters disappeared when I pasted into this mailer,
unfortunately.  Anyway, that's the reason for the full replacement of the file.
The only real changes are the dg-final directives and the documentation of the
expected output.

Verified on powerpc64le-unknown-linux-gnu.  Is this okay for trunk?

Thanks,
Bill


2018-02-02  Bill Schmidt  <wschmidt@linux.vnet.ibm.com>

	* g++.dg/vect/slp-pr56812.cc: Convert from DOS newline characters
	to utf-8-unix.  Change to scan "optimized" dump for indications
	that the code was vectorized.

Comments

Richard Biener Feb. 8, 2018, 10:52 a.m. UTC | #1

On Sat, Feb 3, 2018 at 12:30 AM, Bill Schmidt
<wschmidt@linux.vnet.ibm.com> wrote:
> Hi,
>
> The test g++.dg/vect/slp-pr56812.cc is somewhat fragile and is currently failing
> on several targets.  PR81038 notes that this began with r248678, which stopped
> some inferior peeling solutions from preventing vectorization that could be done
> without peeling.  I observed that for powerpc64le, r248677 vectorizes the code
> during SLP, but r248678 vectorizes it during the loop vectorization pass.  Which
> pass does the vectorization is quite dependent on cost model, which for us is a
> quite close decision.  In any case, the important thing is that the code is
> vectorized, not which pass does it.
>
> This patch prevents the test from flipping in and out of failure status depending
> on which pass does the vectorization, by testing the final "optimized" dump for
> the expected vectorized output instead of relying on a specific vectorization
> pass dump.
>
> By the way, the test case somehow had gotten DOS/Windows newlines into it, so
> I removed those.  The ^M characters disappeared when I pasted into this mailer,
> unfortunately.  Anyway, that's the reason for the full replacement of the file.
> The only real changes are the dg-final directives and the documentation of the
> expected output.
>
> Verified on powerpc64le-unknown-linux-gnu.  Is this okay for trunk?

Hmm.  That removes the existing XFAIL.  Also wouldn't it be more elegant to do
the following?  This makes the testcase pass on x86_64, thus committed ;)

Richard.

2018-02-08  Richard Biener  <rguenther@suse.de>

        * g++.dg/vect/slp-pr56812.cc: Allow either basic-block or
        loop vectorization to happen.

Index: gcc/testsuite/g++.dg/vect/slp-pr56812.cc
===================================================================
--- gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (revision 257477)
+++ gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (working copy)
@@ -1,7 +1,7 @@
 /* { dg-do compile } */
 /* { dg-require-effective-target vect_float } */
 /* { dg-require-effective-target vect_hw_misalign } */
-/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic" } */
+/* { dg-additional-options "-O3 -funroll-loops
-fvect-cost-model=dynamic -fopt-info-vec" } */

 class mydata {
 public:
@@ -13,10 +13,7 @@ public:

 void mydata::Set (float x)
 {
-  for (int i=0; i<upper(); i++)
+  /* We want to vectorize this either as loop or basic-block.  */
+  for (int i=0; i<upper(); i++) /* { dg-message "note: \[^\n\]*
vectorized" } */
     data[i] = x;
 }
-
-/* For targets without vector loop peeling the loop becomes cheap
-   enough to be vectorized.  */
-/* { dg-final { scan-tree-dump-times "basic block vectorized" 1
"slp1" { xfail { ! vect_peeling_profitable } } } } */

> Thanks,
> Bill
>
>
> 2018-02-02  Bill Schmidt  <wschmidt@linux.vnet.ibm.com>
>
>         * g++.dg/vect/slp-pr56812.cc: Convert from DOS newline characters
>         to utf-8-unix.  Change to scan "optimized" dump for indications
>         that the code was vectorized.
>
>
> Index: gcc/testsuite/g++.dg/vect/slp-pr56812.cc
> ===================================================================
> --- gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (revision 257352)
> +++ gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (working copy)
> @@ -1,22 +1,31 @@
> -/* { dg-do compile } */
> -/* { dg-require-effective-target vect_float } */
> -/* { dg-require-effective-target vect_hw_misalign } */
> -/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic" } */
> -
> -class mydata {
> -public:
> -    mydata() {Set(-1.0);}
> -    void Set (float);
> -    static int upper() {return 8;}
> -    float data[8];
> -};
> -
> -void mydata::Set (float x)
> -{
> -  for (int i=0; i<upper(); i++)
> -    data[i] = x;
> -}
> -
> -/* For targets without vector loop peeling the loop becomes cheap
> -   enough to be vectorized.  */
> -/* { dg-final { scan-tree-dump-times "basic block vectorized" 1 "slp1" { xfail { ! vect_peeling_profitable } } } } */
> +/* { dg-do compile } */
> +/* { dg-require-effective-target vect_float } */
> +/* { dg-require-effective-target vect_hw_misalign } */
> +/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic -fdump-tree-optimized" } */
> +
> +class mydata {
> +public:
> +    mydata() {Set(-1.0);}
> +    void Set (float);
> +    static int upper() {return 8;}
> +    float data[8];
> +};
> +
> +void mydata::Set (float x)
> +{
> +  for (int i=0; i<upper(); i++)
> +    data[i] = x;
> +}
> +
> +/* { dg-final { scan-tree-dump "vect_cst__\[0-9\]* = " "optimized" } } */
> +/* { dg-final { scan-tree-dump-times "= vect_cst__\[0-9\]*;" 2 "optimized" } } */
> +
> +/* Expected vectorized output is something like:
> +
> +  <bb 2> [11.11%]:
> +  vect_cst__10 = {x_5(D), x_5(D), x_5(D), x_5(D)};
> +  MEM[(float *)this_4(D)] = vect_cst__10;
> +  MEM[(float *)this_4(D) + 16B] = vect_cst__10;
> +  return;
> +
> +  Could be vectorized either by the "vect" or the "slp" pass.  */
>

Bill Schmidt Feb. 8, 2018, 2:11 p.m. UTC | #2

> On Feb 8, 2018, at 4:52 AM, Richard Biener <richard.guenther@gmail.com> wrote:
> 
> On Sat, Feb 3, 2018 at 12:30 AM, Bill Schmidt
> <wschmidt@linux.vnet.ibm.com> wrote:
>> Hi,
>> 
>> The test g++.dg/vect/slp-pr56812.cc is somewhat fragile and is currently failing
>> on several targets.  PR81038 notes that this began with r248678, which stopped
>> some inferior peeling solutions from preventing vectorization that could be done
>> without peeling.  I observed that for powerpc64le, r248677 vectorizes the code
>> during SLP, but r248678 vectorizes it during the loop vectorization pass.  Which
>> pass does the vectorization is quite dependent on cost model, which for us is a
>> quite close decision.  In any case, the important thing is that the code is
>> vectorized, not which pass does it.
>> 
>> This patch prevents the test from flipping in and out of failure status depending
>> on which pass does the vectorization, by testing the final "optimized" dump for
>> the expected vectorized output instead of relying on a specific vectorization
>> pass dump.
>> 
>> By the way, the test case somehow had gotten DOS/Windows newlines into it, so
>> I removed those.  The ^M characters disappeared when I pasted into this mailer,
>> unfortunately.  Anyway, that's the reason for the full replacement of the file.
>> The only real changes are the dg-final directives and the documentation of the
>> expected output.
>> 
>> Verified on powerpc64le-unknown-linux-gnu.  Is this okay for trunk?
> 
> Hmm.  That removes the existing XFAIL.  Also wouldn't it be more elegant to do
> the following?  This makes the testcase pass on x86_64, thus committed ;)

Terrific, thanks!

Bill
> 
> Richard.
> 
> 2018-02-08  Richard Biener  <rguenther@suse.de>
> 
>        * g++.dg/vect/slp-pr56812.cc: Allow either basic-block or
>        loop vectorization to happen.
> 
> Index: gcc/testsuite/g++.dg/vect/slp-pr56812.cc
> ===================================================================
> --- gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (revision 257477)
> +++ gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (working copy)
> @@ -1,7 +1,7 @@
> /* { dg-do compile } */
> /* { dg-require-effective-target vect_float } */
> /* { dg-require-effective-target vect_hw_misalign } */
> -/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic" } */
> +/* { dg-additional-options "-O3 -funroll-loops
> -fvect-cost-model=dynamic -fopt-info-vec" } */
> 
> class mydata {
> public:
> @@ -13,10 +13,7 @@ public:
> 
> void mydata::Set (float x)
> {
> -  for (int i=0; i<upper(); i++)
> +  /* We want to vectorize this either as loop or basic-block.  */
> +  for (int i=0; i<upper(); i++) /* { dg-message "note: \[^\n\]*
> vectorized" } */
>     data[i] = x;
> }
> -
> -/* For targets without vector loop peeling the loop becomes cheap
> -   enough to be vectorized.  */
> -/* { dg-final { scan-tree-dump-times "basic block vectorized" 1
> "slp1" { xfail { ! vect_peeling_profitable } } } } */
> 
>> Thanks,
>> Bill
>> 
>> 
>> 2018-02-02  Bill Schmidt  <wschmidt@linux.vnet.ibm.com>
>> 
>>        * g++.dg/vect/slp-pr56812.cc: Convert from DOS newline characters
>>        to utf-8-unix.  Change to scan "optimized" dump for indications
>>        that the code was vectorized.
>> 
>> 
>> Index: gcc/testsuite/g++.dg/vect/slp-pr56812.cc
>> ===================================================================
>> --- gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (revision 257352)
>> +++ gcc/testsuite/g++.dg/vect/slp-pr56812.cc    (working copy)
>> @@ -1,22 +1,31 @@
>> -/* { dg-do compile } */
>> -/* { dg-require-effective-target vect_float } */
>> -/* { dg-require-effective-target vect_hw_misalign } */
>> -/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic" } */
>> -
>> -class mydata {
>> -public:
>> -    mydata() {Set(-1.0);}
>> -    void Set (float);
>> -    static int upper() {return 8;}
>> -    float data[8];
>> -};
>> -
>> -void mydata::Set (float x)
>> -{
>> -  for (int i=0; i<upper(); i++)
>> -    data[i] = x;
>> -}
>> -
>> -/* For targets without vector loop peeling the loop becomes cheap
>> -   enough to be vectorized.  */
>> -/* { dg-final { scan-tree-dump-times "basic block vectorized" 1 "slp1" { xfail { ! vect_peeling_profitable } } } } */
>> +/* { dg-do compile } */
>> +/* { dg-require-effective-target vect_float } */
>> +/* { dg-require-effective-target vect_hw_misalign } */
>> +/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic -fdump-tree-optimized" } */
>> +
>> +class mydata {
>> +public:
>> +    mydata() {Set(-1.0);}
>> +    void Set (float);
>> +    static int upper() {return 8;}
>> +    float data[8];
>> +};
>> +
>> +void mydata::Set (float x)
>> +{
>> +  for (int i=0; i<upper(); i++)
>> +    data[i] = x;
>> +}
>> +
>> +/* { dg-final { scan-tree-dump "vect_cst__\[0-9\]* = " "optimized" } } */
>> +/* { dg-final { scan-tree-dump-times "= vect_cst__\[0-9\]*;" 2 "optimized" } } */
>> +
>> +/* Expected vectorized output is something like:
>> +
>> +  <bb 2> [11.11%]:
>> +  vect_cst__10 = {x_5(D), x_5(D), x_5(D), x_5(D)};
>> +  MEM[(float *)this_4(D)] = vect_cst__10;
>> +  MEM[(float *)this_4(D) + 16B] = vect_cst__10;
>> +  return;
>> +
>> +  Could be vectorized either by the "vect" or the "slp" pass.  */

Index: gcc/testsuite/g++.dg/vect/slp-pr56812.cc
===================================================================
--- gcc/testsuite/g++.dg/vect/slp-pr56812.cc	(revision 257352)
+++ gcc/testsuite/g++.dg/vect/slp-pr56812.cc	(working copy)
@@ -1,22 +1,31 @@ 
-/* { dg-do compile } */
-/* { dg-require-effective-target vect_float } */
-/* { dg-require-effective-target vect_hw_misalign } */
-/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic" } */
-
-class mydata {
-public:
-    mydata() {Set(-1.0);}
-    void Set (float);
-    static int upper() {return 8;}
-    float data[8];
-};
-
-void mydata::Set (float x)
-{
-  for (int i=0; i<upper(); i++)
-    data[i] = x;
-}
-
-/* For targets without vector loop peeling the loop becomes cheap
-   enough to be vectorized.  */
-/* { dg-final { scan-tree-dump-times "basic block vectorized" 1 "slp1" { xfail { ! vect_peeling_profitable } } } } */
+/* { dg-do compile } */
+/* { dg-require-effective-target vect_float } */
+/* { dg-require-effective-target vect_hw_misalign } */
+/* { dg-additional-options "-O3 -funroll-loops -fvect-cost-model=dynamic -fdump-tree-optimized" } */
+
+class mydata {
+public:
+    mydata() {Set(-1.0);}
+    void Set (float);
+    static int upper() {return 8;}
+    float data[8];
+};
+
+void mydata::Set (float x)
+{
+  for (int i=0; i<upper(); i++)
+    data[i] = x;
+}
+
+/* { dg-final { scan-tree-dump "vect_cst__\[0-9\]* = " "optimized" } } */
+/* { dg-final { scan-tree-dump-times "= vect_cst__\[0-9\]*;" 2 "optimized" } } */
+
+/* Expected vectorized output is something like:
+
+  <bb 2> [11.11%]:
+  vect_cst__10 = {x_5(D), x_5(D), x_5(D), x_5(D)};
+  MEM[(float *)this_4(D)] = vect_cst__10;
+  MEM[(float *)this_4(D) + 16B] = vect_cst__10;
+  return;
+
+  Could be vectorized either by the "vect" or the "slp" pass.  */