From patchwork Wed May 25 03:39:20 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: "Liu, Hongtao" <hongtao.liu@intel.com>
X-Patchwork-Id: 1635273
Return-Path: <gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org>
X-Original-To: incoming@patchwork.ozlabs.org
Delivered-To: patchwork-incoming@bilbo.ozlabs.org
Authentication-Results: bilbo.ozlabs.org;
	dkim=pass (1024-bit key;
 unprotected) header.d=gcc.gnu.org header.i=@gcc.gnu.org header.a=rsa-sha256
 header.s=default header.b=XirN6VvG;
	dkim-atps=neutral
Authentication-Results: ozlabs.org;
 spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org
 (client-ip=8.43.85.97; helo=sourceware.org;
 envelope-from=gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org;
 receiver=<UNKNOWN>)
Received: from sourceware.org (ip-8-43-85-97.sourceware.org [8.43.85.97])
	(using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
	 key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest
 SHA256)
	(No client certificate requested)
	by bilbo.ozlabs.org (Postfix) with ESMTPS id 4L7Gz94Bmbz9sGl
	for <incoming@patchwork.ozlabs.org>; Wed, 25 May 2022 13:39:48 +1000 (AEST)
Received: from server2.sourceware.org (localhost [IPv6:::1])
	by sourceware.org (Postfix) with ESMTP id D66F9385608B
	for <incoming@patchwork.ozlabs.org>; Wed, 25 May 2022 03:39:44 +0000 (GMT)
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org D66F9385608B
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org;
	s=default; t=1653449984;
	bh=GNeIibhCRI8LRXEFCRXOFwdcQ5fVveB9Wv/qAFv566Y=;
	h=To:Subject:Date:List-Id:List-Unsubscribe:List-Archive:List-Post:
	 List-Help:List-Subscribe:From:Reply-To:From;
	b=XirN6VvGRCEkws5qEoRUz5MfyTbxYVKmiNRAS7XccZWX7jH09PBYhKF9HLn1POUVZ
	 nTZWkIs5airbRDPg3971YPIiDJ1c2A2Ul0lwy9YIETRdtGYena3BxJEnX2BCZeXNwx
	 LfdYW17UN6V335CxDQ89rM9A8hMVb8aHo59rORUA=
X-Original-To: gcc-patches@gcc.gnu.org
Delivered-To: gcc-patches@gcc.gnu.org
Received: from mga03.intel.com (mga03.intel.com [134.134.136.65])
 by sourceware.org (Postfix) with ESMTPS id 9F861385608A
 for <gcc-patches@gcc.gnu.org>; Wed, 25 May 2022 03:39:23 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 9F861385608A
X-IronPort-AV: E=McAfee;i="6400,9594,10357"; a="273703254"
X-IronPort-AV: E=Sophos;i="5.91,250,1647327600"; d="scan'208";a="273703254"
Received: from orsmga001.jf.intel.com ([10.7.209.18])
 by orsmga103.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384;
 24 May 2022 20:39:22 -0700
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="5.91,250,1647327600"; d="scan'208";a="608943802"
Received: from scymds01.sc.intel.com ([10.148.94.138])
 by orsmga001.jf.intel.com with ESMTP; 24 May 2022 20:39:22 -0700
Received: from shliclel051.sh.intel.com (shliclel051.sh.intel.com
 [10.239.236.51]) by scymds01.sc.intel.com
 with ESMTP id 24P3dLii021178; Tue, 24 May 2022 20:39:21 -0700
To: gcc-patches@gcc.gnu.org
Subject: [PATCH] Add a bit dislike for separate mem alternative when op is
 REG_P.
Date: Wed, 25 May 2022 11:39:20 +0800
Message-Id: <20220525033920.77449-1-hongtao.liu@intel.com>
X-Mailer: git-send-email 2.18.1
X-Spam-Status: No, score=-12.6 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH,
 DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, GIT_PATCH_0,
 KAM_SHORT,
 SPF_HELO_NONE, SPF_NONE, TXREP,
 T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on
 server2.sourceware.org
X-BeenThere: gcc-patches@gcc.gnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Gcc-patches mailing list <gcc-patches.gcc.gnu.org>
List-Unsubscribe: <https://gcc.gnu.org/mailman/options/gcc-patches>,
 <mailto:gcc-patches-request@gcc.gnu.org?subject=unsubscribe>
List-Archive: <https://gcc.gnu.org/pipermail/gcc-patches/>
List-Post: <mailto:gcc-patches@gcc.gnu.org>
List-Help: <mailto:gcc-patches-request@gcc.gnu.org?subject=help>
List-Subscribe: <https://gcc.gnu.org/mailman/listinfo/gcc-patches>,
 <mailto:gcc-patches-request@gcc.gnu.org?subject=subscribe>
X-Patchwork-Original-From: liuhongt via Gcc-patches <gcc-patches@gcc.gnu.org>
From: "Liu, Hongtao" <hongtao.liu@intel.com>
Reply-To: liuhongt <hongtao.liu@intel.com>
Errors-To: gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org
Sender: "Gcc-patches"
 <gcc-patches-bounces+incoming=patchwork.ozlabs.org@gcc.gnu.org>

Rigt now, mem_cost for separate mem alternative is 1 * frequency which
is pretty small and caused the unnecessary SSE spill in the PR, I've tried
to rework backend cost model, but RA still not happy with that(regress
somewhere else). I think the root cause of this is cost for separate 'm'
alternative cost is too small, especially considering that the mov cost
of gpr are 2(default for REGISTER_MOVE_COST). So this patch increase mem_cost
to 2*frequency, also increase 1 for reg_class cost when m alternative.


Bootstrapped and regtested on x86_64-pc-linux-gnu{-m32,}.
Ok for trunk?

gcc/ChangeLog:

	PR target/105513
	* ira-costs.cc (record_reg_classes): Increase both mem_cost
	and reg class cost by 1 for separate mem alternative when
	REG_P (op).

gcc/testsuite/ChangeLog:

	* gcc.target/i386/pr105513-1.c: New test.
---
 gcc/ira-costs.cc                           | 26 +++++++++++++---------
 gcc/testsuite/gcc.target/i386/pr105513-1.c | 16 +++++++++++++
 2 files changed, 31 insertions(+), 11 deletions(-)
 create mode 100644 gcc/testsuite/gcc.target/i386/pr105513-1.c
diff --git a/gcc/ira-costs.cc b/gcc/ira-costs.cc
index 964c94a06ef..f7b8325e195 100644
--- a/gcc/ira-costs.cc
+++ b/gcc/ira-costs.cc
@@ -625,7 +625,8 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 			  for (k = cost_classes_ptr->num - 1; k >= 0; k--)
 			    {
 			      rclass = cost_classes[k];
-			      pp_costs[k] = mem_cost[rclass][0] * frequency;
+			      pp_costs[k] = (mem_cost[rclass][0]
+					     + 1) * frequency;
 			    }
 			}
 		      else
@@ -648,7 +649,8 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 			  for (k = cost_classes_ptr->num - 1; k >= 0; k--)
 			    {
 			      rclass = cost_classes[k];
-			      pp_costs[k] = mem_cost[rclass][1] * frequency;
+			      pp_costs[k] = (mem_cost[rclass][1]
+					     + 1) * frequency;
 			    }
 			}
 		      else
@@ -670,9 +672,9 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 			  for (k = cost_classes_ptr->num - 1; k >= 0; k--)
 			    {
 			      rclass = cost_classes[k];
-			      pp_costs[k] = ((mem_cost[rclass][0]
-					      + mem_cost[rclass][1])
-					     * frequency);
+			      pp_costs[k] = (mem_cost[rclass][0]
+					     + mem_cost[rclass][1]
+					     + 2) * frequency;
 			    }
 			}
 		      else
@@ -861,7 +863,8 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 			  for (k = cost_classes_ptr->num - 1; k >= 0; k--)
 			    {
 			      rclass = cost_classes[k];
-			      pp_costs[k] = mem_cost[rclass][0] * frequency;
+			      pp_costs[k] = (mem_cost[rclass][0]
+					     + 1) * frequency;
 			    }
 			}
 		      else
@@ -884,7 +887,8 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 			  for (k = cost_classes_ptr->num - 1; k >= 0; k--)
 			    {
 			      rclass = cost_classes[k];
-			      pp_costs[k] = mem_cost[rclass][1] * frequency;
+			      pp_costs[k] = (mem_cost[rclass][1]
+					     + 1) * frequency;
 			    }
 			}
 		      else
@@ -906,9 +910,9 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 			  for (k = cost_classes_ptr->num - 1; k >= 0; k--)
 			    {
 			      rclass = cost_classes[k];
-			      pp_costs[k] = ((mem_cost[rclass][0]
-					      + mem_cost[rclass][1])
-					     * frequency);
+			      pp_costs[k] = (mem_cost[rclass][0]
+					     + mem_cost[rclass][1]
+					     + 2) * frequency;
 			    }
 			}
 		      else
@@ -929,7 +933,7 @@ record_reg_classes (int n_alts, int n_ops, rtx *ops,
 		    /* Although we don't need insn to reload from
 		       memory, still accessing memory is usually more
 		       expensive than a register.  */
-		    pp->mem_cost = frequency;
+		    pp->mem_cost = 2 * frequency;
 		  else
 		    /* If the alternative actually allows memory, make
 		       things a bit cheaper since we won't need an
diff --git a/gcc/testsuite/gcc.target/i386/pr105513-1.c b/gcc/testsuite/gcc.target/i386/pr105513-1.c
new file mode 100644
index 00000000000..530f5292252
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr105513-1.c
@@ -0,0 +1,16 @@
+/* { dg-do compile { target { ! ia32 } } } */
+/* { dg-options "-O2 -msse2 -mtune=skylake -mfpmath=sse" } */
+/* { dg-final { scan-assembler-not "\\(%rsp\\)" } } */
+
+static int as_int(float x)
+{
+    return (union{float x; int i;}){x}.i;
+}
+
+float f(double y, float x)
+{
+    int i = as_int(x);
+    if (__builtin_expect(i > 99, 0)) return 0;
+    if (i*2u < 77) if (i==2) return 0;
+    return y*x;
+}