[committed,nvptx] Fix ASM_SPEC workaround for sm_30

Message ID	20220331074045.GA29630@delia.home
State	New
Headers	show Return-Path: <gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org> DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 73D103858C50 Date: Thu, 31 Mar 2022 09:40:47 +0200 To: gcc-patches@gcc.gnu.org Subject: [committed][nvptx] Fix ASM_SPEC workaround for sm_30 Message-ID: <20220331074045.GA29630@delia.home> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: list From: Tom de Vries via Gcc-patches <gcc-patches@gcc.gnu.org> Reply-To: Tom de Vries <tdevries@suse.de> Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Sender: "Gcc-patches" <gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org>
Series	[committed,nvptx] Fix ASM_SPEC workaround for sm_30 \| expand [committed,nvptx] Fix ASM_SPEC workaround for sm_30

Message ID

20220331074045.GA29630@delia.home

State

New

Headers

DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 73D103858C50
Date: Thu, 31 Mar 2022 09:40:47 +0200
To: gcc-patches@gcc.gnu.org
Subject: [committed][nvptx] Fix ASM_SPEC workaround for sm_30
Message-ID: <20220331074045.GA29630@delia.home>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.10.1 (2018-07-13)
Precedence: list
From: Tom de Vries via Gcc-patches <gcc-patches@gcc.gnu.org>
Reply-To: Tom de Vries <tdevries@suse.de>
Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org
Sender: "Gcc-patches"
 <gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org>

Series

[committed,nvptx] Fix ASM_SPEC workaround for sm_30 | expand

Commit Message

Tom de Vries March 31, 2022, 7:40 a.m. UTC

Hi,

Newer versions of CUDA no longer support sm_30, and nvptx-tools as
currently doesn't handle that gracefully when verifying
( https://github.com/MentorEmbedded/nvptx-tools/issues/30 ).

There's a --no-verify work-around in place in ASM_SPEC, but that one doesn't
work when using -Wa,--verify on the command line.

Use a more robust workaround: verify using sm_35 when misa=sm_30 is specified
(either implicitly or explicitly).

Tested on nvptx.

Committed to trunk.

Thanks,
- Tom

[nvptx] Fix ASM_SPEC workaround for sm_30

gcc/ChangeLog:

2022-03-30  Tom de Vries  <tdevries@suse.de>

	* config/nvptx/nvptx.h (ASM_SPEC): Use "-m sm_35" for -misa=sm_30.

---
 gcc/config/nvptx/nvptx.h | 22 ++++++++++++++++++----
 1 file changed, 18 insertions(+), 4 deletions(-)

Comments

Thomas Schwinge April 7, 2022, 2:17 p.m. UTC | #1

Hi!

On 2022-03-31T09:40:47+0200, Tom de Vries via Gcc-patches <gcc-patches@gcc.gnu.org> wrote:
> Newer versions of CUDA no longer support sm_30, and nvptx-tools as
> currently doesn't handle that gracefully when verifying
> ( https://github.com/MentorEmbedded/nvptx-tools/issues/30 ).

There's now <https://github.com/MentorEmbedded/nvptx-tools/pull/33>
'as: Deal with CUDA 11.0, "Support for Kepler 'sm_30' and 'sm_32'
architecture based products is dropped"' available for comment/testing.

> There's a --no-verify work-around in place in ASM_SPEC, but that one doesn't
> work when using -Wa,--verify on the command line.

With that resolved in nvptx-tools, we may then revert these GCC-level
workarounds, GCC commit bf4832d6fa817f66009f100a9cd68953062add7d
"[nvptx] Fix ASM_SPEC workaround for sm_30", and
GCC commit 12fa7641ceed9c9139e2ea7b62c11f3dc5b6f6f4
"[nvptx] Use --no-verify for sm_30".  OK to push, once nvptx-tools ready?

> Use a more robust workaround: verify using sm_35 when misa=sm_30 is specified
> (either implicitly or explicitly).

Thanks for that suggestion!

Grüße
 Thomas
-----------------
Siemens Electronic Design Automation GmbH; Anschrift: Arnulfstraße 201, 80634 München; Gesellschaft mit beschränkter Haftung; Geschäftsführer: Thomas Heurung, Frank Thürauf; Sitz der Gesellschaft: München; Registergericht München, HRB 106955

Tom de Vries April 11, 2022, 9:10 a.m. UTC | #2

On 4/7/22 16:17, Thomas Schwinge wrote:
> Hi!
> 
> On 2022-03-31T09:40:47+0200, Tom de Vries via Gcc-patches <gcc-patches@gcc.gnu.org> wrote:
>> Newer versions of CUDA no longer support sm_30, and nvptx-tools as
>> currently doesn't handle that gracefully when verifying
>> ( https://github.com/MentorEmbedded/nvptx-tools/issues/30 ).
> 
> There's now <https://github.com/MentorEmbedded/nvptx-tools/pull/33>
> 'as: Deal with CUDA 11.0, "Support for Kepler 'sm_30' and 'sm_32'
> architecture based products is dropped"' available for comment/testing.
> 
>> There's a --no-verify work-around in place in ASM_SPEC, but that one doesn't
>> work when using -Wa,--verify on the command line.
> 
> With that resolved in nvptx-tools, we may then revert these GCC-level
> workarounds, GCC commit bf4832d6fa817f66009f100a9cd68953062add7d
> "[nvptx] Fix ASM_SPEC workaround for sm_30", and
> GCC commit 12fa7641ceed9c9139e2ea7b62c11f3dc5b6f6f4
> "[nvptx] Use --no-verify for sm_30".  OK to push, once nvptx-tools ready?
> 
>> Use a more robust workaround: verify using sm_35 when misa=sm_30 is specified
>> (either implicitly or explicitly).
> 
> Thanks for that suggestion!
> 

Hi,

I've tested the nvptx-tools patch in combination with a patch that 
remote ASM_SPEC, and that went fine.

[ Well apart from a new	libgomp FAIL:
...
FAIL: libgomp.oacc-fortran/private-variables.f90 
-DACC_DEVICE_TYPE_nvidia=1 -DACC_MEM_SHARED=0 -foffload=nvptx-none  -O1 
  at line 142 (test for bogus messages, line 131)
...
but I assume that's unrelated ]

So, patch that removes ASM_SPEC pre-approved.

Thanks,
- Tom

diff --git a/gcc/config/nvptx/nvptx.h b/gcc/config/nvptx/nvptx.h
index 75ac7a666b1..3b06f33032f 100644
--- a/gcc/config/nvptx/nvptx.h
+++ b/gcc/config/nvptx/nvptx.h
@@ -29,10 +29,24 @@ 
 
 #define STARTFILE_SPEC "%{mmainkernel:crt0.o}"
 
-/* Default needs to be in sync with default for misa in nvptx.opt.
-   We add a default here to work around a hard-coded sm_30 default in
-   nvptx-as.  */
-#define ASM_SPEC "%{misa=*:-m %*; :-m sm_35}%{misa=sm_30:--no-verify}"
+/* Newer versions of CUDA no longer support sm_30, and nvptx-tools as
+   currently doesn't handle that gracefully when verifying
+   ( https://github.com/MentorEmbedded/nvptx-tools/issues/30 ).  Work around
+   this by verifying with sm_35 when having misa=sm_30 (either implicitly
+   or explicitly).  */
+#define ASM_SPEC				\
+  "%{"						\
+  /* Explict misa=sm_30.  */			\
+  "misa=sm_30:-m sm_35"				\
+  /* Separator.	 */				\
+  "; "						\
+  /* Catch-all.	 */				\
+  "misa=*:-m %*"				\
+  /* Separator.	 */				\
+  "; "						\
+  /* Implicit misa=sm_30.  */			\
+  ":-m sm_35"					\
+  "}"
 
 #define TARGET_CPU_CPP_BUILTINS() nvptx_cpu_cpp_builtins ()

[committed,nvptx] Fix ASM_SPEC workaround for sm_30

Commit Message

Comments

Patch