From patchwork Mon Nov 4 13:09:41 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Craig Blackmore X-Patchwork-Id: 2006239 Return-Path: X-Original-To: incoming@patchwork.ozlabs.org Delivered-To: patchwork-incoming@legolas.ozlabs.org Authentication-Results: legolas.ozlabs.org; dkim=pass (2048-bit key; unprotected) header.d=embecosm.com header.i=@embecosm.com header.a=rsa-sha256 header.s=google header.b=IKA736I1; dkim-atps=neutral Authentication-Results: legolas.ozlabs.org; spf=pass (sender SPF authorized) smtp.mailfrom=gcc.gnu.org (client-ip=2620:52:3:1:0:246e:9693:128c; helo=server2.sourceware.org; envelope-from=gcc-patches-bounces~incoming=patchwork.ozlabs.org@gcc.gnu.org; receiver=patchwork.ozlabs.org) Received: from server2.sourceware.org (server2.sourceware.org [IPv6:2620:52:3:1:0:246e:9693:128c]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (secp384r1) server-digest SHA384) (No client certificate requested) by legolas.ozlabs.org (Postfix) with ESMTPS id 4XhsKH6xmHz1xxN for ; Tue, 5 Nov 2024 00:10:43 +1100 (AEDT) Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 708B03857B96 for ; Mon, 4 Nov 2024 13:10:41 +0000 (GMT) X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from mail-wr1-x42d.google.com (mail-wr1-x42d.google.com [IPv6:2a00:1450:4864:20::42d]) by sourceware.org (Postfix) with ESMTPS id C1496385841D for ; Mon, 4 Nov 2024 13:10:17 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org C1496385841D Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=embecosm.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=embecosm.com ARC-Filter: OpenARC Filter v1.0.0 sourceware.org C1496385841D Authentication-Results: server2.sourceware.org; arc=none smtp.remote-ip=2a00:1450:4864:20::42d ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1730725820; cv=none; b=ZPqE0fWQd2aGIafbSD9pGIbuLm8/T4weFpjDhuP9wffOLB+hEfyweh4CSuoeRs33Ks9j1aTmEmvTSd/7WYMBdUQcwlsWQskZzkTMWicI6xd0QhFku75TC8EtXlnHbPRr4Zu41/Jid4V9AxLaHxoXj71DUhGUSMaURxWu/eSb1f4= ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1730725820; c=relaxed/simple; bh=+y7IO8riog5RC+NRlom4aoTOEjWq+EFI5uRlYvtSMb8=; h=DKIM-Signature:From:To:Subject:Date:Message-ID:MIME-Version; b=gdscXO8CIFUXUL2OK7ErRskiXUwYhpyMdYQbuFUPlSMAyMzB3T18JrFAuOx7qOtz/ANHTAZ9ycxFEvbVUy4ikB+jxFsv0rIc87+AI33OYBae1Czvo3CCJzRKruk7s912b2ixiTbkW9U1JJdQ9lGvo5okd66qeZvnnhyVB7m3NPc= ARC-Authentication-Results: i=1; server2.sourceware.org Received: by mail-wr1-x42d.google.com with SMTP id ffacd0b85a97d-37d4fd00574so2592596f8f.0 for ; Mon, 04 Nov 2024 05:10:17 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=embecosm.com; s=google; t=1730725816; x=1731330616; darn=gcc.gnu.org; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=+lrbjFv9Fz45dU0ga8/oOwnhHUibfp+mRijAGH56Dm4=; b=IKA736I1p2vnFoplnVugUkNFqZecbTJrsMceFV9ab8kSohbhkbe2jqAKYAMD8YbGmi Jb70lpR3iDWBx2XgbWGdZ2NCo9dFcHVoZy63Sr65u+whB0EbXaF2lG1yhae+uf+VEc3y vhFOKImNtx8t02GSm8MKjcEAgTSYJae983UjcbqKg0QyARKhGmSurY/8SfZhEhGoUccd QbSHTzv+Fc51QPZoY0qDZ397nCb3vyJgCC7aU6Yh6nGVNInQgSt3SHaK5CPnOdB8UxRg DI+AhPqRlFZQn/poRkhM5pZyPsK7aobiaUc+My+YCzpYX7H2fEK+mlb6JkiWINtMgQpo xoSg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1730725816; x=1731330616; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=+lrbjFv9Fz45dU0ga8/oOwnhHUibfp+mRijAGH56Dm4=; b=CVoqSibISyO6FG8Tmfp7suxMiWVAD2JBXkmnq22W0kSX55jCUXEgPjiNtSmN+4cRGK GBXeIR2DP8DLEt1qCkai0pkjjKSFwlmkbmep/3cHumjU5JPYqXSiHMhaZdijHq3ZAfIk 4xwQSEOGMMT2kTUikXSWG5r9C/UTZGQAsCcLEpYBOmpt0V5w74mqTSvP+eQCnA7SQ8f1 UiHQdYoqfnJSj2RfrV5Cy9Z1hSExqP3Gx/vJ36SOlM+H5Y14+5yAjlOqq7zBVnTUDhgR T7RsYf0F8/mVty/1GjQTNpsHMG/qq1ID/D6lHPdfRvPkUn1KmugqbYkXo1KCV3HVyqDd WAhw== X-Gm-Message-State: AOJu0YxMe+q3u2NdswJrIC1CkPYGIiYUxo9z2SnbRKekh+uXj+8DaHGI os+/r6vvMFAhCW0eSGd9KGByl8SkRnH8bd7NKkse1DmKng+4wghcM+Jtx/ossJh3NvZ2jV8NJni I X-Google-Smtp-Source: AGHT+IGotNedyUxX1K3KMmcVTG/4XLrFKF8grPTkR92GsEHZ9Zyy48sfznJaxAjjLbVOjiXFieOIwQ== X-Received: by 2002:a05:6000:2a2:b0:376:dbb5:10c2 with SMTP id ffacd0b85a97d-381c14ef2afmr12422477f8f.29.1730725816410; Mon, 04 Nov 2024 05:10:16 -0800 (PST) Received: from dorian.sou.embecosm-corp.com ([212.69.42.53]) by smtp.gmail.com with ESMTPSA id ffacd0b85a97d-381c10d4983sm13118154f8f.33.2024.11.04.05.10.15 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 04 Nov 2024 05:10:16 -0800 (PST) From: Craig Blackmore To: gcc-patches@gcc.gnu.org Cc: jeffreyalaw@gmail.com, Craig Blackmore Subject: [PATCH v2 0/2] RISC-V: Vector memcpy/memset fixes and improvements Date: Mon, 4 Nov 2024 13:09:41 +0000 Message-ID: <20241104130943.4041719-1-craig.blackmore@embecosm.com> X-Mailer: git-send-email 2.43.0 In-Reply-To: <20241018131300.1150819-1-craig.blackmore@embecosm.com> References: <20241018131300.1150819-1-craig.blackmore@embecosm.com> MIME-Version: 1.0 X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: gcc-patches-bounces~incoming=patchwork.ozlabs.org@gcc.gnu.org Patch 1-5 of v1 have already been pushed. This is v2 of patch 6 and 7 of that series. Changes since v1: RISC-V: Make vectorized memset handle more cases * Removed vector memset loop generation. RISC-V: Disable by pieces for vector setmem length > UNITS_PER_WORD * No changes. gcc/config/riscv/riscv-string.cc | 37 ++++++++++--------- gcc/config/riscv/riscv.cc | 19 ++++++++++ .../gcc.target/riscv/rvv/autovec/pr113469.c | 3 +- .../gcc.target/riscv/rvv/base/setmem-2.c | 12 +++--- .../gcc.target/riscv/rvv/base/setmem-3.c | 18 +++++---- 5 files changed, 57 insertions(+), 32 deletions(-)