diff mbox series

[v2,20/46] target/loongarch: Implement vext2xv

Message ID 20230630075904.45940-21-gaosong@loongson.cn
State New
Headers show
Series Add LoongArch LASX instructions | expand

Commit Message

Song Gao June 30, 2023, 7:58 a.m. UTC
This patch includes:
- VEXT2XV.{H/W/D}.B, VEXT2XV.{HU/WU/DU}.BU;
- VEXT2XV.{W/D}.H, VEXT2XV.{WU/DU}.HU;
- VEXT2XV.D.W, VEXT2XV.DU.WU.

Signed-off-by: Song Gao <gaosong@loongson.cn>
---
 target/loongarch/disas.c                     | 13 +++++++++
 target/loongarch/helper.h                    | 13 +++++++++
 target/loongarch/insn_trans/trans_lasx.c.inc | 13 +++++++++
 target/loongarch/insns.decode                | 13 +++++++++
 target/loongarch/vec_helper.c                | 28 ++++++++++++++++++++
 5 files changed, 80 insertions(+)

Comments

Richard Henderson July 7, 2023, 9:19 p.m. UTC | #1
On 6/30/23 08:58, Song Gao wrote:
> +#define VEXT2XV(NAME, BIT, E1, E2)                        \
> +void HELPER(NAME)(CPULoongArchState *env, uint32_t oprsz, \
> +                  uint32_t vd, uint32_t vj)               \
> +{                                                         \
> +    int i;                                                \
> +    VReg *Vd = &(env->fpr[vd].vreg);                      \
> +    VReg *Vj = &(env->fpr[vj].vreg);                      \
> +    VReg temp;                                            \
> +                                                          \
> +    for (i = 0; i < LASX_LEN / BIT; i++) {                \
> +        temp.E1(i) = Vj->E2(i);                           \
> +    }                                                     \
> +    *Vd = temp;                                           \
> +}

So unlike VEXT(H), this does compress in order?

Anyway, function signature and iteration without LASX_LEN.
Isn't there a 128-bit helper to merge this with?


r~
Song Gao July 8, 2023, 7:24 a.m. UTC | #2
Hi, Richard

在 2023/7/8 上午5:19, Richard Henderson 写道:
> On 6/30/23 08:58, Song Gao wrote:
>> +#define VEXT2XV(NAME, BIT, E1, E2)                        \
>> +void HELPER(NAME)(CPULoongArchState *env, uint32_t oprsz, \
>> +                  uint32_t vd, uint32_t vj)               \
>> +{                                                         \
>> +    int i;                                                \
>> +    VReg *Vd = &(env->fpr[vd].vreg);                      \
>> +    VReg *Vj = &(env->fpr[vj].vreg);                      \
>> +    VReg temp;                                            \
>> +                                                          \
>> +    for (i = 0; i < LASX_LEN / BIT; i++) {                \
>> +        temp.E1(i) = Vj->E2(i);                           \
>> +    }                                                     \
>> +    *Vd = temp;                                           \
>> +}
>
> So unlike VEXT(H), this does compress in order?
Yes.
>
> Anyway, function signature and iteration without LASX_LEN.
> Isn't there a 128-bit helper to merge this with?
>
There are no similar 128-bit instructions.

Thanks.
Song Gao
diff mbox series

Patch

diff --git a/target/loongarch/disas.c b/target/loongarch/disas.c
index 6ca545956d..975ea018da 100644
--- a/target/loongarch/disas.c
+++ b/target/loongarch/disas.c
@@ -1997,6 +1997,19 @@  INSN_LASX(xvexth_wu_hu,      vv)
 INSN_LASX(xvexth_du_wu,      vv)
 INSN_LASX(xvexth_qu_du,      vv)
 
+INSN_LASX(vext2xv_h_b,       vv)
+INSN_LASX(vext2xv_w_b,       vv)
+INSN_LASX(vext2xv_d_b,       vv)
+INSN_LASX(vext2xv_w_h,       vv)
+INSN_LASX(vext2xv_d_h,       vv)
+INSN_LASX(vext2xv_d_w,       vv)
+INSN_LASX(vext2xv_hu_bu,     vv)
+INSN_LASX(vext2xv_wu_bu,     vv)
+INSN_LASX(vext2xv_du_bu,     vv)
+INSN_LASX(vext2xv_wu_hu,     vv)
+INSN_LASX(vext2xv_du_hu,     vv)
+INSN_LASX(vext2xv_du_wu,     vv)
+
 INSN_LASX(xvreplgr2vr_b,     vr)
 INSN_LASX(xvreplgr2vr_h,     vr)
 INSN_LASX(xvreplgr2vr_w,     vr)
diff --git a/target/loongarch/helper.h b/target/loongarch/helper.h
index b7eece8d43..81d0f06cc0 100644
--- a/target/loongarch/helper.h
+++ b/target/loongarch/helper.h
@@ -339,6 +339,19 @@  DEF_HELPER_4(vexth_wu_hu, void, env, i32, i32, i32)
 DEF_HELPER_4(vexth_du_wu, void, env, i32, i32, i32)
 DEF_HELPER_4(vexth_qu_du, void, env, i32, i32, i32)
 
+DEF_HELPER_4(vext2xv_h_b, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_w_b, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_d_b, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_w_h, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_d_h, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_d_w, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_hu_bu, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_wu_bu, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_du_bu, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_wu_hu, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_du_hu, void, env, i32, i32, i32)
+DEF_HELPER_4(vext2xv_du_wu, void, env, i32, i32, i32)
+
 DEF_HELPER_FLAGS_4(vsigncov_b, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
 DEF_HELPER_FLAGS_4(vsigncov_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
 DEF_HELPER_FLAGS_4(vsigncov_w, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
diff --git a/target/loongarch/insn_trans/trans_lasx.c.inc b/target/loongarch/insn_trans/trans_lasx.c.inc
index f100a4a27c..096f7856c4 100644
--- a/target/loongarch/insn_trans/trans_lasx.c.inc
+++ b/target/loongarch/insn_trans/trans_lasx.c.inc
@@ -379,6 +379,19 @@  TRANS(xvexth_wu_hu, gen_vv, 32, gen_helper_vexth_wu_hu)
 TRANS(xvexth_du_wu, gen_vv, 32, gen_helper_vexth_du_wu)
 TRANS(xvexth_qu_du, gen_vv, 32, gen_helper_vexth_qu_du)
 
+TRANS(vext2xv_h_b, gen_vv, 32, gen_helper_vext2xv_h_b)
+TRANS(vext2xv_w_b, gen_vv, 32, gen_helper_vext2xv_w_b)
+TRANS(vext2xv_d_b, gen_vv, 32, gen_helper_vext2xv_d_b)
+TRANS(vext2xv_w_h, gen_vv, 32, gen_helper_vext2xv_w_h)
+TRANS(vext2xv_d_h, gen_vv, 32, gen_helper_vext2xv_d_h)
+TRANS(vext2xv_d_w, gen_vv, 32, gen_helper_vext2xv_d_w)
+TRANS(vext2xv_hu_bu, gen_vv, 32, gen_helper_vext2xv_hu_bu)
+TRANS(vext2xv_wu_bu, gen_vv, 32, gen_helper_vext2xv_wu_bu)
+TRANS(vext2xv_du_bu, gen_vv, 32, gen_helper_vext2xv_du_bu)
+TRANS(vext2xv_wu_hu, gen_vv, 32, gen_helper_vext2xv_wu_hu)
+TRANS(vext2xv_du_hu, gen_vv, 32, gen_helper_vext2xv_du_hu)
+TRANS(vext2xv_du_wu, gen_vv, 32, gen_helper_vext2xv_du_wu)
+
 TRANS(xvreplgr2vr_b, gvec_dup, 32, MO_8)
 TRANS(xvreplgr2vr_h, gvec_dup, 32, MO_16)
 TRANS(xvreplgr2vr_w, gvec_dup, 32, MO_32)
diff --git a/target/loongarch/insns.decode b/target/loongarch/insns.decode
index 7491f295a5..db1a6689f0 100644
--- a/target/loongarch/insns.decode
+++ b/target/loongarch/insns.decode
@@ -1580,6 +1580,19 @@  xvexth_wu_hu     0111 01101001 11101 11101 ..... .....    @vv
 xvexth_du_wu     0111 01101001 11101 11110 ..... .....    @vv
 xvexth_qu_du     0111 01101001 11101 11111 ..... .....    @vv
 
+vext2xv_h_b      0111 01101001 11110 00100 ..... .....    @vv
+vext2xv_w_b      0111 01101001 11110 00101 ..... .....    @vv
+vext2xv_d_b      0111 01101001 11110 00110 ..... .....    @vv
+vext2xv_w_h      0111 01101001 11110 00111 ..... .....    @vv
+vext2xv_d_h      0111 01101001 11110 01000 ..... .....    @vv
+vext2xv_d_w      0111 01101001 11110 01001 ..... .....    @vv
+vext2xv_hu_bu    0111 01101001 11110 01010 ..... .....    @vv
+vext2xv_wu_bu    0111 01101001 11110 01011 ..... .....    @vv
+vext2xv_du_bu    0111 01101001 11110 01100 ..... .....    @vv
+vext2xv_wu_hu    0111 01101001 11110 01101 ..... .....    @vv
+vext2xv_du_hu    0111 01101001 11110 01110 ..... .....    @vv
+vext2xv_du_wu    0111 01101001 11110 01111 ..... .....    @vv
+
 xvreplgr2vr_b    0111 01101001 11110 00000 ..... .....    @vr
 xvreplgr2vr_h    0111 01101001 11110 00001 ..... .....    @vr
 xvreplgr2vr_w    0111 01101001 11110 00010 ..... .....    @vr
diff --git a/target/loongarch/vec_helper.c b/target/loongarch/vec_helper.c
index 76c8cda563..3fa689bd94 100644
--- a/target/loongarch/vec_helper.c
+++ b/target/loongarch/vec_helper.c
@@ -737,6 +737,34 @@  VEXTH(vexth_hu_bu, 16, UH, UB)
 VEXTH(vexth_wu_hu, 32, UW, UH)
 VEXTH(vexth_du_wu, 64, UD, UW)
 
+#define VEXT2XV(NAME, BIT, E1, E2)                        \
+void HELPER(NAME)(CPULoongArchState *env, uint32_t oprsz, \
+                  uint32_t vd, uint32_t vj)               \
+{                                                         \
+    int i;                                                \
+    VReg *Vd = &(env->fpr[vd].vreg);                      \
+    VReg *Vj = &(env->fpr[vj].vreg);                      \
+    VReg temp;                                            \
+                                                          \
+    for (i = 0; i < LASX_LEN / BIT; i++) {                \
+        temp.E1(i) = Vj->E2(i);                           \
+    }                                                     \
+    *Vd = temp;                                           \
+}
+
+VEXT2XV(vext2xv_h_b, 16, H, B)
+VEXT2XV(vext2xv_w_b, 32, W, B)
+VEXT2XV(vext2xv_d_b, 64, D, B)
+VEXT2XV(vext2xv_w_h, 32, W, H)
+VEXT2XV(vext2xv_d_h, 64, D, H)
+VEXT2XV(vext2xv_d_w, 64, D, W)
+VEXT2XV(vext2xv_hu_bu, 16, UH, UB)
+VEXT2XV(vext2xv_wu_bu, 32, UW, UB)
+VEXT2XV(vext2xv_du_bu, 64, UD, UB)
+VEXT2XV(vext2xv_wu_hu, 32, UW, UH)
+VEXT2XV(vext2xv_du_hu, 64, UD, UH)
+VEXT2XV(vext2xv_du_wu, 64, UD, UW)
+
 #define DO_SIGNCOV(a, b)  (a == 0 ? 0 : a < 0 ? -b : b)
 
 DO_3OP(vsigncov_b, 8, B, DO_SIGNCOV)