From patchwork Tue Mar 5 13:00:41 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Xi Ruoyao X-Patchwork-Id: 1908218 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; dkim=fail reason="signature verification failed" (1024-bit key; unprotected) header.d=xry111.site header.i=@xry111.site header.a=rsa-sha256 header.s=default header.b=SCeddB5P; dkim-atps=neutral Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=2620:52:3:1:0:246e:9693:128c; helo=server2.sourceware.org; envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=patchwork.ozlabs.org) Received: from server2.sourceware.org (server2.sourceware.org [IPv6:2620:52:3:1:0:246e:9693:128c]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (secp384r1) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4Tpwh24yLPz23cm for ; Wed, 6 Mar 2024 00:02:10 +1100 (AEDT) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 815583858C50 for ; Tue, 5 Mar 2024 13:02:08 +0000 (GMT) X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from xry111.site (xry111.site [89.208.246.23]) by sourceware.org (Postfix) with ESMTPS id 2C4B43858D20 for ; Tue, 5 Mar 2024 13:01:45 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 2C4B43858D20 Authentication-Results: sourceware.org; dmarc=pass (p=reject dis=none) header.from=xry111.site Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=xry111.site ARC-Filter: OpenARC Filter v1.0.0 sourceware.org 2C4B43858D20 Authentication-Results: server2.sourceware.org; arc=none smtp.remote-ip=89.208.246.23 ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1709643709; cv=none; b=T8PTE8/ij9lWE0ibVT9M76g1zgrjLTdqAwqqfU2bsPwTnjKVE0Gcl1+9mqV+0HUQ6gKHvS1kYR5mTkv5NiJ/FqSjMubch5reQvJQPqVrpS26qG+RzQvx0Pgm0XskE5YvkhVQsXnGW+LzC+167I0Ayzt7i4JYNGmMYGk1E3Xwosk= ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1709643709; c=relaxed/simple; bh=9Ae5QhZAEv0dUTtUs91YY8E/WjR2gFu6AUokhPHKnik=; h=DKIM-Signature:From:To:Subject:Date:Message-ID:MIME-Version; b=KyBQa0ZS8ym4USE6dHk0wz33DyBufFS4Zzphn0d1hVAi98bExS1jOxyOs5bfBO6JkdtnQwWeu9EwVzMcCGNB1aLlRFlYZ21dsxml6J11i4GdmShEBzzZGJcyWoL/eVnJ7o2rbGNJ1DqRuOl2sARpgzZXG7ExZlVBx8LXqbwCVIs= ARC-Authentication-Results: i=1; server2.sourceware.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=xry111.site; s=default; t=1709643700; bh=9Ae5QhZAEv0dUTtUs91YY8E/WjR2gFu6AUokhPHKnik=; h=From:To:Cc:Subject:Date:From; b=SCeddB5Pvm69/FWENIcpOBwYtcCJUq7j17wlgXGv1lKheYNJap25RqafRKG0YJqVg BOlAbhpc1HHFqvF9YmdZKbfmWbOKJAAXL1gUs6FndwiA5kEORwppJ1KiaWevFLQ0Kz BR+kdy+OLYfkCvl/0U8jakOXy/r6+aOAjVUJb/Ss= Received: from stargazer.. (unknown [IPv6:240e:358:115a:2100:dc73:854d:832e:6]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-384) server-digest SHA384) (Client did not present a certificate) (Authenticated sender: xry111@xry111.site) by xry111.site (Postfix) with ESMTPSA id 6C5E266AA5; Tue, 5 Mar 2024 08:01:36 -0500 (EST) From: Xi Ruoyao To: gcc-patches@gcc.gnu.org Cc: chenglulu , i@xen0n.name, xuchenghua@loongson.cn, Xi Ruoyao Subject: [PATCH] LoongArch: testsuite: Rewrite {x, }vfcmp-{d, f}.c to avoid named registers Date: Tue, 5 Mar 2024 21:00:41 +0800 Message-ID: <20240305130114.373076-1-xry111@xry111.site> X-Mailer: git-send-email 2.44.0 MIME-Version: 1.0 X-Spam-Status: No, score=-9.2 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, GIT_PATCH_0, KAM_SHORT, LIKELY_SPAM_FROM, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org Loops on named vector register are not vectorized (see comment 11 of PR113622), so the these test cases have been failing for a while. Rewrite them using check-function-bodies to remove hard coding register names. A barrier is needed to always load the first operand before the second operand. gcc/testsuite/ChangeLog: * gcc.target/loongarch/vfcmp-f.c: Rewrite to avoid named registers. * gcc.target/loongarch/vfcmp-d.c: Likewise. * gcc.target/loongarch/xvfcmp-f.c: Likewise. * gcc.target/loongarch/xvfcmp-d.c: Likewise. --- Tested on loongarch64-linux-gnu. Ok for trunk? gcc/testsuite/gcc.target/loongarch/vfcmp-d.c | 202 ++++++++-- gcc/testsuite/gcc.target/loongarch/vfcmp-f.c | 347 ++++++++++++++---- gcc/testsuite/gcc.target/loongarch/xvfcmp-d.c | 202 ++++++++-- gcc/testsuite/gcc.target/loongarch/xvfcmp-f.c | 204 ++++++++-- 4 files changed, 816 insertions(+), 139 deletions(-) diff --git a/gcc/testsuite/gcc.target/loongarch/vfcmp-d.c b/gcc/testsuite/gcc.target/loongarch/vfcmp-d.c index 8b870ef38a0..87e4ed19e96 100644 --- a/gcc/testsuite/gcc.target/loongarch/vfcmp-d.c +++ b/gcc/testsuite/gcc.target/loongarch/vfcmp-d.c @@ -1,28 +1,188 @@ /* { dg-do compile } */ -/* { dg-options "-O2 -mlsx -ffixed-f0 -ffixed-f1 -ffixed-f2 -fno-vect-cost-model" } */ +/* { dg-options "-O2 -mlsx -fno-vect-cost-model" } */ +/* { dg-final { check-function-bodies "**" "" } } */ #define F double #define I long long #include "vfcmp-f.c" -/* { dg-final { scan-assembler "compare_quiet_equal:.*\tvfcmp\\.ceq\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_equal:.*\tvfcmp\\.cune\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_not_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater:.*\tvfcmp\\.slt\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_equal:.*\tvfcmp\\.sle\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less:.*\tvfcmp\\.slt\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_equal:.*\tvfcmp\\.sle\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_greater:.*\tvfcmp\\.sule\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_unordered:.*\tvfcmp\\.sult\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_less:.*\tvfcmp\\.sule\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_unordered:.*\tvfcmp\\.sult\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less:.*\tvfcmp\\.clt\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_equal:.*\tvfcmp\\.cle\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater:.*\tvfcmp\\.clt\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_equal:.*\tvfcmp\\.cle\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_less:.*\tvfcmp\\.cule\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_unordered:.*\tvfcmp\\.cult\\.d\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_greater:.*\tvfcmp\\.cule\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_unordered:.*\tvfcmp\\.cult\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_unordered:.*\tvfcmp\\.cun\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_ordered:.*\tvfcmp\\.cor\\.d\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_ordered\n" } } */ +/* +** compare_quiet_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.ceq.d (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cune.d (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.slt.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sle.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.slt.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sle.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sule.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sult.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sule.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sult.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.clt.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cle.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.clt.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cle.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cule.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cult.d (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cule.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cult.d (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cun.d (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_ordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cor.d (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ diff --git a/gcc/testsuite/gcc.target/loongarch/vfcmp-f.c b/gcc/testsuite/gcc.target/loongarch/vfcmp-f.c index b9110b90cb5..8d2671998ec 100644 --- a/gcc/testsuite/gcc.target/loongarch/vfcmp-f.c +++ b/gcc/testsuite/gcc.target/loongarch/vfcmp-f.c @@ -2,7 +2,8 @@ For details read C23 Annex F.3 and LoongArch Vol. 1 section 3.2.2.1. */ /* { dg-do compile } */ -/* { dg-options "-O2 -mlsx -ffixed-f0 -ffixed-f1 -ffixed-f2 -fno-vect-cost-model" } */ +/* { dg-options "-O2 -mlsx -fno-vect-cost-model" } */ +/* { dg-final { check-function-bodies "**" "" } } */ #ifndef F #define F float @@ -19,160 +20,354 @@ typedef F VF __attribute__ ((vector_size (VL))); typedef I VI __attribute__ ((vector_size (VL))); -register VF a asm ("f0"); -register VF b asm ("f1"); -register VI c asm ("f2"); +#define ARGS const VF *a, const VF *b, VI *c void -compare_quiet_equal (void) +compare_quiet_equal (ARGS) { - c = (a == b); + VF _a = *a; + asm("" ::: "memory"); + *c = (_a == *b); } void -compare_quiet_not_equal (void) +compare_quiet_not_equal (ARGS) { - c = (a != b); + VF _a = *a; + asm("" ::: "memory"); + *c = (_a != *b); } void -compare_signaling_greater (void) +compare_signaling_greater (ARGS) { - c = (a > b); + VF _a = *a; + asm("" ::: "memory"); + *c = (_a > *b); } void -compare_signaling_greater_equal (void) +compare_signaling_greater_equal (ARGS) { - c = (a >= b); + VF _a = *a; + asm("" ::: "memory"); + *c = (_a >= *b); } void -compare_signaling_less (void) +compare_signaling_less (ARGS) { - c = (a < b); + VF _a = *a; + asm("" ::: "memory"); + *c = (_a < *b); } void -compare_signaling_less_equal (void) +compare_signaling_less_equal (ARGS) { - c = (a <= b); + VF _a = *a; + asm("" ::: "memory"); + *c = (_a <= *b); } void -compare_signaling_not_greater (void) +compare_signaling_not_greater (ARGS) { - c = ~(a > b); + VF _a = *a; + asm("" ::: "memory"); + *c = ~(_a > *b); } void -compare_signaling_less_unordered (void) +compare_signaling_less_unordered (ARGS) { - c = ~(a >= b); + VF _a = *a; + asm("" ::: "memory"); + *c = ~(_a >= *b); } void -compare_signaling_not_less (void) +compare_signaling_not_less (ARGS) { - c = ~(a < b); + VF _a = *a; + asm("" ::: "memory"); + *c = ~(_a < *b); } void -compare_signaling_greater_unordered (void) +compare_signaling_greater_unordered (ARGS) { - c = ~(a <= b); + VF _a = *a; + asm("" ::: "memory"); + *c = ~(_a <= *b); } void -compare_quiet_less (void) +compare_quiet_less (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isless (a[i], b[i]) ? -1 : 0; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isless (_a[i], (*b)[i]) ? -1 : 0; } void -compare_quiet_less_equal (void) +compare_quiet_less_equal (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_islessequal (a[i], b[i]) ? -1 : 0; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_islessequal (_a[i], (*b)[i]) ? -1 : 0; } void -compare_quiet_greater (void) +compare_quiet_greater (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isgreater (a[i], b[i]) ? -1 : 0; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isgreater (_a[i], (*b)[i]) ? -1 : 0; } void -compare_quiet_greater_equal (void) +compare_quiet_greater_equal (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isgreaterequal (a[i], b[i]) ? -1 : 0; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isgreaterequal (_a[i], (*b)[i]) ? -1 : 0; } void -compare_quiet_not_less (void) +compare_quiet_not_less (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isless (a[i], b[i]) ? 0 : -1; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isless (_a[i], (*b)[i]) ? 0 : -1; } void -compare_quiet_greater_unordered (void) +compare_quiet_greater_unordered (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_islessequal (a[i], b[i]) ? 0 : -1; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_islessequal (_a[i], (*b)[i]) ? 0 : -1; } void -compare_quiet_not_greater (void) +compare_quiet_not_greater (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isgreater (a[i], b[i]) ? 0 : -1; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isgreater (_a[i], (*b)[i]) ? 0 : -1; } void -compare_quiet_less_unordered (void) +compare_quiet_less_unordered (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isgreaterequal (a[i], b[i]) ? 0 : -1; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isgreaterequal (_a[i], (*b)[i]) ? 0 : -1; } void -compare_quiet_unordered (void) +compare_quiet_unordered (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isunordered (a[i], b[i]) ? -1 : 0; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isunordered (_a[i], (*b)[i]) ? -1 : 0; } void -compare_quiet_ordered (void) +compare_quiet_ordered (ARGS) { - for (int i = 0; i < sizeof (c) / sizeof (c[0]); i++) - c[i] = __builtin_isunordered (a[i], b[i]) ? 0 : -1; + VF _a = *a; + asm("" ::: "memory"); + for (int i = 0; i < sizeof (*c) / sizeof ((*c)[0]); i++) + (*c)[i] = __builtin_isunordered (_a[i], (*b)[i]) ? 0 : -1; } -/* The "-" matches the .size directive after the function - body, so we can ensure the instruction is in the correct function. */ +/* +** compare_quiet_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.ceq.s (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ -/* { dg-final { scan-assembler "compare_quiet_equal:.*\tvfcmp\\.ceq\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_equal:.*\tvfcmp\\.cune\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_not_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater:.*\tvfcmp\\.slt\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_equal:.*\tvfcmp\\.sle\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less:.*\tvfcmp\\.slt\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_equal:.*\tvfcmp\\.sle\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_greater:.*\tvfcmp\\.sule\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_unordered:.*\tvfcmp\\.sult\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_signaling_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_less:.*\tvfcmp\\.sule\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_unordered:.*\tvfcmp\\.sult\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_signaling_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less:.*\tvfcmp\\.clt\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_equal:.*\tvfcmp\\.cle\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater:.*\tvfcmp\\.clt\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_equal:.*\tvfcmp\\.cle\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_less:.*\tvfcmp\\.cule\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_unordered:.*\tvfcmp\\.cult\\.s\t\\\$vr2,\\\$vr1,\\\$vr0.*-compare_quiet_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_greater:.*\tvfcmp\\.cule\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_unordered:.*\tvfcmp\\.cult\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_unordered:.*\tvfcmp\\.cun\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_ordered:.*\tvfcmp\\.cor\\.s\t\\\$vr2,\\\$vr0,\\\$vr1.*-compare_quiet_ordered\n" } } */ +/* +** compare_quiet_not_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cune.s (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.slt.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sle.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.slt.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sle.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sule.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sult.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sule.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.sult.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.clt.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cle.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.clt.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_equal: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cle.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_less: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cule.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cult.s (\$vr[0-9]+),\2,\1 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_greater: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cule.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cult.s (\$vr[0-9]+),\1,\2 +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_unordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cun.s (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_ordered: +** vld (\$vr[0-9]+),\$r4,0 +** vld (\$vr[0-9]+),\$r5,0 +** vfcmp.cor.s (\$vr[0-9]+),(\1,\2|\2,\1) +** vst \3,\$r6,0 +** jr \$r1 +*/ diff --git a/gcc/testsuite/gcc.target/loongarch/xvfcmp-d.c b/gcc/testsuite/gcc.target/loongarch/xvfcmp-d.c index d8017caaa01..b27efebad8c 100644 --- a/gcc/testsuite/gcc.target/loongarch/xvfcmp-d.c +++ b/gcc/testsuite/gcc.target/loongarch/xvfcmp-d.c @@ -1,5 +1,6 @@ /* { dg-do compile } */ -/* { dg-options "-O2 -mlasx -ffixed-f0 -ffixed-f1 -ffixed-f2 -fno-vect-cost-model" } */ +/* { dg-options "-O2 -mlasx -fno-vect-cost-model" } */ +/* { dg-final { check-function-bodies "**" "" } } */ #define F double #define I long long @@ -7,23 +8,182 @@ #include "vfcmp-f.c" -/* { dg-final { scan-assembler "compare_quiet_equal:.*\txvfcmp\\.ceq\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_equal:.*\txvfcmp\\.cune\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_not_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater:.*\txvfcmp\\.slt\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_equal:.*\txvfcmp\\.sle\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less:.*\txvfcmp\\.slt\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_equal:.*\txvfcmp\\.sle\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_greater:.*\txvfcmp\\.sule\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_unordered:.*\txvfcmp\\.sult\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_less:.*\txvfcmp\\.sule\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_unordered:.*\txvfcmp\\.sult\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less:.*\txvfcmp\\.clt\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_equal:.*\txvfcmp\\.cle\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater:.*\txvfcmp\\.clt\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_equal:.*\txvfcmp\\.cle\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_less:.*\txvfcmp\\.cule\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_unordered:.*\txvfcmp\\.cult\\.d\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_greater:.*\txvfcmp\\.cule\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_unordered:.*\txvfcmp\\.cult\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_unordered:.*\txvfcmp\\.cun\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_ordered:.*\txvfcmp\\.cor\\.d\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_ordered\n" } } */ +/* +** compare_quiet_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.ceq.d (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cune.d (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.slt.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sle.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.slt.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sle.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sule.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sult.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sule.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sult.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.clt.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cle.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.clt.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cle.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cule.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cult.d (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cule.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cult.d (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cun.d (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_ordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cor.d (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ diff --git a/gcc/testsuite/gcc.target/loongarch/xvfcmp-f.c b/gcc/testsuite/gcc.target/loongarch/xvfcmp-f.c index b5455647554..1ca1e6c8b69 100644 --- a/gcc/testsuite/gcc.target/loongarch/xvfcmp-f.c +++ b/gcc/testsuite/gcc.target/loongarch/xvfcmp-f.c @@ -1,27 +1,189 @@ /* { dg-do compile } */ -/* { dg-options "-O2 -mlasx -ffixed-f0 -ffixed-f1 -ffixed-f2" } */ +/* { dg-options "-O2 -mlasx -fno-vect-cost-model" } */ +/* { dg-final { check-function-bodies "**" "" } } */ +#define F float +#define I int #define VL 32 #include "vfcmp-f.c" -/* { dg-final { scan-assembler "compare_quiet_equal:.*\txvfcmp\\.ceq\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_equal:.*\txvfcmp\\.cune\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_not_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater:.*\txvfcmp\\.slt\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_equal:.*\txvfcmp\\.sle\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less:.*\txvfcmp\\.slt\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_equal:.*\txvfcmp\\.sle\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_greater:.*\txvfcmp\\.sule\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_less_unordered:.*\txvfcmp\\.sult\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_signaling_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_not_less:.*\txvfcmp\\.sule\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_signaling_greater_unordered:.*\txvfcmp\\.sult\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_signaling_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less:.*\txvfcmp\\.clt\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_equal:.*\txvfcmp\\.cle\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_less_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater:.*\txvfcmp\\.clt\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_equal:.*\txvfcmp\\.cle\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_greater_equal\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_less:.*\txvfcmp\\.cule\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_not_less\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_greater_unordered:.*\txvfcmp\\.cult\\.s\t\\\$xr2,\\\$xr1,\\\$xr0.*-compare_quiet_greater_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_not_greater:.*\txvfcmp\\.cule\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_not_greater\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_less_unordered:.*\txvfcmp\\.cult\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_less_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_unordered:.*\txvfcmp\\.cun\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_unordered\n" } } */ -/* { dg-final { scan-assembler "compare_quiet_ordered:.*\txvfcmp\\.cor\\.s\t\\\$xr2,\\\$xr0,\\\$xr1.*-compare_quiet_ordered\n" } } */ +/* +** compare_quiet_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.ceq.s (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cune.s (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.slt.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sle.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.slt.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sle.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sule.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_less_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sult.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_not_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sule.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_signaling_greater_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.sult.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.clt.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cle.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.clt.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_equal: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cle.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_less: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cule.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_greater_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cult.s (\$xr[0-9]+),\2,\1 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_not_greater: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cule.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_less_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cult.s (\$xr[0-9]+),\1,\2 +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_unordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cun.s (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/ + +/* +** compare_quiet_ordered: +** xvld (\$xr[0-9]+),\$r4,0 +** xvld (\$xr[0-9]+),\$r5,0 +** xvfcmp.cor.s (\$xr[0-9]+),(\1,\2|\2,\1) +** xvst \3,\$r6,0 +** jr \$r1 +*/