[PULL,11/87] target/ppc: Implement vmsumcud instruction

Message ID	20220302110803.849505-12-clg@kaod.org
State	Handled Elsewhere

Headers

From: Cédric Le Goater <clg@kaod.org>
To: qemu-ppc@nongnu.org, qemu-devel@nongnu.org
Subject: [PULL 11/87] target/ppc: Implement vmsumcud instruction
Date: Wed,  2 Mar 2022 12:06:47 +0100
Message-Id: <20220302110803.849505-12-clg@kaod.org>
In-Reply-To: <20220302110803.849505-1-clg@kaod.org>
References: <20220302110803.849505-1-clg@kaod.org>
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Cc: Víctor Colombo <victor.colombo@eldorado.org.br>,
 Peter Maydell <peter.maydell@linaro.org>,
 Richard Henderson <richard.henderson@linaro.org>,
 Matheus Ferst <matheus.ferst@eldorado.org.br>,
 Cédric Le Goater <clg@kaod.org>

Series

[PULL,01/87] hw/ppc/pnv: Determine ns16550's IRQ number from QOM property

Commit Message

Cédric Le Goater March 2, 2022, 11:06 a.m. UTC

From: Víctor Colombo <victor.colombo@eldorado.org.br>

Based on [1] by Lijun Pan <ljp@linux.ibm.com>, which was never merged
into master.

[1]: https://lists.gnu.org/archive/html/qemu-ppc/2020-07/msg00419.html

Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
Signed-off-by: Víctor Colombo <victor.colombo@eldorado.org.br>
Signed-off-by: Matheus Ferst <matheus.ferst@eldorado.org.br>
Message-Id: <20220225210936.1749575-6-matheus.ferst@eldorado.org.br>
Signed-off-by: Cédric Le Goater <clg@kaod.org>
---
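Note (not part of the commit): the carry chain in trans_VMSUMCUD can be
hard to follow from the TCG calls alone, so here is a minimal host-side
sketch of the same arithmetic. It is a reading of the translation code in
this patch, not the ISA pseudocode and not QEMU code. The names vec128,
mulu2, add2 and vmsumcud_model are hypothetical, and the sketch assumes a
compiler that provides unsigned __int128 (GCC or Clang on a 64-bit host).

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical 128-bit vector: hi is doubleword 0, lo is doubleword 1. */
typedef struct { uint64_t hi, lo; } vec128;

/* 64x64 -> 128 unsigned multiply, in the role of tcg_gen_mulu2_i64. */
static void mulu2(uint64_t *lo, uint64_t *hi, uint64_t a, uint64_t b)
{
    unsigned __int128 p = (unsigned __int128)a * b; /* compiler extension */
    *lo = (uint64_t)p;
    *hi = (uint64_t)(p >> 64);
}

/* (ah:al) + (bh:bl) -> (hi:lo), in the role of tcg_gen_add2_i64. */
static void add2(uint64_t *lo, uint64_t *hi,
                 uint64_t al, uint64_t ah, uint64_t bl, uint64_t bh)
{
    uint64_t l = al + bl;
    *hi = ah + bh + (l < al);   /* (l < al) is the carry out of the low half */
    *lo = l;
}

/* Returns (a.hi * b.hi + a.lo * b.lo + c) >> 128, truncated to 128 bits. */
static vec128 vmsumcud_model(vec128 a, vec128 b, vec128 c)
{
    uint64_t p0l, p0h, p1l, p1h, sum, carry;

    mulu2(&p1l, &p1h, a.lo, b.lo);          /* prod1 = dw[1] * dw[1] */
    mulu2(&p0l, &p0h, a.hi, b.hi);          /* prod0 = dw[0] * dw[0] */

    /* Bits 0..63: only the carry into bit 64 is kept afterwards. */
    add2(&sum, &carry, c.lo, 0, p1l, 0);
    add2(&sum, &carry, sum, carry, p0l, 0);

    /* Bits 64..127: start from that carry, then add the high halves. */
    add2(&sum, &carry, carry, 0, c.hi, 0);
    add2(&sum, &carry, sum, carry, p1h, 0);
    add2(&sum, &carry, sum, carry, p0h, 0);

    /* The full sum is below 3 * 2^128, so the carry-out is at most 2. */
    assert(carry <= 2);

    vec128 r = { 0, carry };                /* dw[0] = 0, dw[1] = carry */
    return r;
}
```

The sketch writes the carry to the low doubleword and zero to the high
doubleword, matching the two set_avr64() calls at the end of the function
in the patch.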
 target/ppc/insn32.decode            |  4 +++
 target/ppc/translate/vmx-impl.c.inc | 53 +++++++++++++++++++++++++++++
 2 files changed, 57 insertions(+)

diff --git a/target/ppc/insn32.decode b/target/ppc/insn32.decode
index d817e44c7104..e85a75db2ff7 100644
--- a/target/ppc/insn32.decode
+++ b/target/ppc/insn32.decode
@@ -468,6 +468,10 @@  VMULHSD         000100 ..... ..... ..... 01111001001    @VX
 VMULHUD         000100 ..... ..... ..... 01011001001    @VX
 VMULLD          000100 ..... ..... ..... 00111001001    @VX
 
+## Vector Multiply-Sum Instructions
+
+VMSUMCUD        000100 ..... ..... ..... ..... 010111   @VA
+
 # VSX Load/Store Instructions
 
 LXV             111101 ..... ..... ............ . 001   @DQ_TSX
diff --git a/target/ppc/translate/vmx-impl.c.inc b/target/ppc/translate/vmx-impl.c.inc
index 97a075efd1ef..4f528dc82018 100644
--- a/target/ppc/translate/vmx-impl.c.inc
+++ b/target/ppc/translate/vmx-impl.c.inc
@@ -2081,6 +2081,59 @@  static bool trans_VPEXTD(DisasContext *ctx, arg_VX *a)
     return true;
 }
 
+static bool trans_VMSUMCUD(DisasContext *ctx, arg_VA *a)
+{
+    TCGv_i64 tmp0, tmp1, prod1h, prod1l, prod0h, prod0l, zero;
+
+    REQUIRE_INSNS_FLAGS2(ctx, ISA310);
+    REQUIRE_VECTOR(ctx);
+
+    tmp0 = tcg_temp_new_i64();
+    tmp1 = tcg_temp_new_i64();
+    prod1h = tcg_temp_new_i64();
+    prod1l = tcg_temp_new_i64();
+    prod0h = tcg_temp_new_i64();
+    prod0l = tcg_temp_new_i64();
+    zero = tcg_constant_i64(0);
+
+    /* prod1 = vsr[vra+32].dw[1] * vsr[vrb+32].dw[1] */
+    get_avr64(tmp0, a->vra, false);
+    get_avr64(tmp1, a->vrb, false);
+    tcg_gen_mulu2_i64(prod1l, prod1h, tmp0, tmp1);
+
+    /* prod0 = vsr[vra+32].dw[0] * vsr[vrb+32].dw[0] */
+    get_avr64(tmp0, a->vra, true);
+    get_avr64(tmp1, a->vrb, true);
+    tcg_gen_mulu2_i64(prod0l, prod0h, tmp0, tmp1);
+
+    /* Sum the lower 64-bit elements */
+    get_avr64(tmp1, a->rc, false);
+    tcg_gen_add2_i64(tmp1, tmp0, tmp1, zero, prod1l, zero);
+    tcg_gen_add2_i64(tmp1, tmp0, tmp1, tmp0, prod0l, zero);
+
+    /*
+     * Discard the lower 64 bits, leaving the carry into bit 64.
+     * Then sum the higher 64-bit elements.
+     */
+    get_avr64(tmp1, a->rc, true);
+    tcg_gen_add2_i64(tmp1, tmp0, tmp0, zero, tmp1, zero);
+    tcg_gen_add2_i64(tmp1, tmp0, tmp1, tmp0, prod1h, zero);
+    tcg_gen_add2_i64(tmp1, tmp0, tmp1, tmp0, prod0h, zero);
+
+    /* Discard 64 more bits to complete the CHOP128(temp >> 128) */
+    set_avr64(a->vrt, tmp0, false);
+    set_avr64(a->vrt, zero, true);
+
+    tcg_temp_free_i64(tmp0);
+    tcg_temp_free_i64(tmp1);
+    tcg_temp_free_i64(prod1h);
+    tcg_temp_free_i64(prod1l);
+    tcg_temp_free_i64(prod0h);
+    tcg_temp_free_i64(prod0l);
+
+    return true;
+}
+
 static bool do_vx_helper(DisasContext *ctx, arg_VX *a,
                          void (*gen_helper)(TCGv_ptr, TCGv_ptr, TCGv_ptr))
 {
