From patchwork Thu May 23 06:37:39 2024
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: "Hu, Lin1"
X-Patchwork-Id: 1938196
From: "Hu, Lin1"
To: gcc-patches@gcc.gnu.org
Cc: hongtao.liu@intel.com, ubizjak@gmail.com, rguenther@suse.de
Subject: [PATCH 0/3] Optimize __builtin_convertvector for x86-64-v4 and
Date: Thu, 23 May 2024 14:37:39 +0800
Message-Id: <20240523063742.2333446-1-lin1.hu@intel.com>
X-Mailer: git-send-email 2.31.1
List-Id: Gcc-patches mailing list

This series optimizes __builtin_convertvector for x86-64-v4 and
x86-64-v3.  I revised the first patch following Richard's suggestion
and am sending all three patches together to show the complete set of
changes to the function.

The series has been bootstrapped and regtested on x86_64-pc-linux-gnu.

BRs,
Lin

Hu, Lin1 (3):
  vect: generate suitable convert insn for int -> int, float -> float
    and int <-> float.
  vect: Support v4hi -> v4qi.
  vect: support direct conversion under x86-64-v3.

 gcc/config/i386/i386-expand.cc             |  47 +++-
 gcc/config/i386/i386-protos.h              |   3 +
 gcc/config/i386/mmx.md                     |  10 +
 gcc/config/i386/sse.md                     |  87 ++++++--
 gcc/testsuite/gcc.target/i386/pr107432-1.c | 244 +++++++++++++++++++++
 gcc/testsuite/gcc.target/i386/pr107432-2.c | 105 +++++++++
 gcc/testsuite/gcc.target/i386/pr107432-3.c |  55 +++++
 gcc/testsuite/gcc.target/i386/pr107432-4.c |  56 +++++
 gcc/testsuite/gcc.target/i386/pr107432-5.c |  72 ++++++
 gcc/testsuite/gcc.target/i386/pr107432-6.c | 152 +++++++++++++
 gcc/testsuite/gcc.target/i386/pr107432-7.c | 156 +++++++++++++
 gcc/testsuite/gcc.target/i386/pr107432-8.c |  73 ++++++
 gcc/testsuite/gcc.target/i386/pr107432-9.c | 121 ++++++++++
 gcc/testsuite/gcc.target/i386/pr92645-4.c  |   2 -
 gcc/tree-vect-generic.cc                   | 157 ++++++++++++-
 15 files changed, 1305 insertions(+), 35 deletions(-)
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-1.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-2.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-3.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-4.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-5.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-6.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-7.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-8.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr107432-9.c
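
For readers unfamiliar with the builtin, here is a minimal sketch of the kinds of conversions the patch titles refer to (int -> int narrowing such as v4hi -> v4qi, and int <-> float).  It is not taken from the new pr107432-*.c tests; the typedef and function names are made up for illustration, and the sketch makes no claim about which instructions GCC emits with or without this series.

```c
#include <assert.h>

/* Illustrative vector types built with GCC's vector_size attribute.
   The names are hypothetical, not the ones used in the testsuite.  */
typedef short       v4hi __attribute__ ((vector_size (8)));
typedef signed char v4qi __attribute__ ((vector_size (4)));
typedef int         v4si __attribute__ ((vector_size (16)));
typedef float       v4sf __attribute__ ((vector_size (16)));

/* int -> int narrowing: each short lane is converted as if by a
   (signed char) cast.  */
v4qi
narrow_v4hi_to_v4qi (v4hi x)
{
  return __builtin_convertvector (x, v4qi);
}

/* int -> float: each int lane is converted as if by a (float) cast.  */
v4sf
convert_v4si_to_v4sf (v4si x)
{
  return __builtin_convertvector (x, v4sf);
}

/* float -> int: each float lane is converted as if by an (int) cast,
   i.e. truncated toward zero.  */
v4si
convert_v4sf_to_v4si (v4sf x)
{
  return __builtin_convertvector (x, v4si);
}

/* Small self-check using values that fit the destination type.  */
void
check_conversions (void)
{
  v4hi h = { 1, -2, 3, 127 };
  v4qi q = narrow_v4hi_to_v4qi (h);
  assert (q[0] == 1 && q[1] == -2 && q[2] == 3 && q[3] == 127);

  v4si i = { 0, 1, -1, 1024 };
  v4sf f = convert_v4si_to_v4sf (i);
  assert (f[0] == 0.0f && f[1] == 1.0f && f[2] == -1.0f && f[3] == 1024.0f);

  v4sf g = { 1.5f, -1.5f, 2.0f, 0.25f };
  v4si t = convert_v4sf_to_v4si (g);
  assert (t[0] == 1 && t[1] == -1 && t[2] == 2 && t[3] == 0);
}
```

The result of each conversion is the same regardless of target options; options such as -march=x86-64-v3 or -march=x86-64-v4 only affect how the compiler implements it.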