From patchwork Mon Aug 26 06:42:29 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Haochen Jiang X-Patchwork-Id: 1976624 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; dkim=pass (2048-bit key; unprotected) header.d=intel.com header.i=@intel.com header.a=rsa-sha256 header.s=Intel header.b=JJEz9/DD; dkim-atps=neutral Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=2620:52:3:1:0:246e:9693:128c; helo=server2.sourceware.org; envelope-from=gcc-patches-bounces~incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=patchwork.ozlabs.org) Received: from server2.sourceware.org (server2.sourceware.org [IPv6:2620:52:3:1:0:246e:9693:128c]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (secp384r1) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4Wsh2T6pYrz1yYl for ; Mon, 26 Aug 2024 16:43:12 +1000 (AEST) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id CF672384AB4F for ; Mon, 26 Aug 2024 06:43:09 +0000 (GMT) X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.16]) by sourceware.org (Postfix) with ESMTPS id 970F93858D29 for ; Mon, 26 Aug 2024 06:42:42 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 970F93858D29 Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=intel.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=intel.com ARC-Filter: OpenARC Filter v1.0.0 sourceware.org 970F93858D29 Authentication-Results: server2.sourceware.org; arc=none smtp.remote-ip=198.175.65.16 ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1724654564; cv=none; b=Pxvj4j/9sCmpL0DRDRH8+BP0UKP1Uve1t+ei+slmLtDR59f7Q4hvvQPQd/0wgM9VXZGp4icqiCnFWlvXdUyEWHF4RAZtKLFRM3Buww3h/RiQosFLFcD8XxHh0v1HjGRImz/PFTWqdyVnYRaG6rDgCub7z7KDlIozPj6U7Ap472I= ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1724654564; c=relaxed/simple; bh=nHVJhIEnucrid0J7+pOorLU4DFMK2N3ISGGXx+iKCIw=; h=DKIM-Signature:From:To:Subject:Date:Message-Id:MIME-Version; b=T+1Ry6N0mMNOzVJUWt+/U2Jm3zNHVtSiUgfI9p5Yw2nJjx772U0Bwsi37UJrgY7iNXZ74wbx3v6EiPkzR6Zf8oUFSD6lYSKAvDCQ09R0+LHb3KDWWSwffbWj+KCCdJ4IvBDRsv0ZChfgSD20UYHh0ojidBTU4RtkmFh/Ln/EUbs= ARC-Authentication-Results: i=1; server2.sourceware.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1724654563; x=1756190563; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=nHVJhIEnucrid0J7+pOorLU4DFMK2N3ISGGXx+iKCIw=; b=JJEz9/DDV+M3KYyVyUNiCdflgXk8k9Apkne812gVizGQq05Vd2pC2BTG zEMCV7uvebF4RXYhLhibdtjQyNAjMwq2s3auwiD3FZx/+DGiDZ636iFIN u/XuiYLMttGDuQB2AW9NWc16vhk5c7stDVIhW8rRkieB0t+iQK7H625OY 0LNBnjrDUor12T6IfMI+r+Rpm2Lp9y25XFk1xSWvk7wDtSpmiD1hV7vLC tvbnovx8N2ANKbfzY7LsscubcgcvYqCZrinslb/bJ97wwZFFrZ07OifBW reXmETPtbYQMQJ/Q93kae2nm5s0KkPQebGITpjZLLOjPUe9VpKSRzkODy w==; X-CSE-ConnectionGUID: pWVuqYWxSVeYOgsv/eVBOQ== X-CSE-MsgGUID: yl+uEFTNQNWzawaOoyE6fg== X-IronPort-AV: E=McAfee;i="6700,10204,11175"; a="23188149" X-IronPort-AV: E=Sophos;i="6.10,176,1719903600"; d="scan'208";a="23188149" Received: from orviesa005.jf.intel.com ([10.64.159.145]) by orvoesa108.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 25 Aug 2024 23:42:41 -0700 X-CSE-ConnectionGUID: tP+b1z9LT5SDF4A7gPqY+A== X-CSE-MsgGUID: HKJboLRJQqyz12b9ULCQyw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.10,176,1719903600"; d="scan'208";a="67308056" Received: from shliclel4217.sh.intel.com ([10.239.240.127]) by orviesa005.jf.intel.com with ESMTP; 25 Aug 2024 23:42:41 -0700 From: Haochen Jiang To: gcc-patches@gcc.gnu.org Cc: hongtao.liu@intel.com, ubizjak@gmail.com Subject: [PATCH 0/8] i386: Opmitize code with AVX10.2 new instructions Date: Mon, 26 Aug 2024 14:42:29 +0800 Message-Id: <20240826064238.2268967-1-haochen.jiang@intel.com> X-Mailer: git-send-email 2.31.1 MIME-Version: 1.0 X-Spam-Status: No, score=-4.6 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, SPF_HELO_NONE, SPF_NONE, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: gcc-patches-bounces~incoming=patchwork.ozlabs.org@gcc.gnu.org Hi all, I have just commited AVX10.2 new instructions patches into trunk hours ago. The next and final part for AVX10.2 upstream is to optimize code with AVX10.2 new instructions. In this patch series, it will contain the following optimizations: - VNNI instruction auto vectorize (PATCH 1). - Codegen optimization with new scalar comparison instructions to eliminate redundant code (PATCH 2-3). - BF16 instruction auto vectorize (PATCH 4-8). This will finish the upstream for AVX10.2 series. Afterwards, we may add V2BF/V4BF in another thread just like what we have done for V2HF/V4HF when AVX512FP16 upstreamed. Bootstrapped on x86-64-pc-linux-gnu. Ok for trunk? Thx, Haochen