[v4,06/48] target/loongarch: Implement xvreplgr2vr

Message ID	20230830084902.2113960-7-gaosong@loongson.cn
State	New
Headers	show Return-Path: <qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org> From: Song Gao <gaosong@loongson.cn> To: qemu-devel@nongnu.org Cc: richard.henderson@linaro.org Subject: [PATCH v4 06/48] target/loongarch: Implement xvreplgr2vr Date: Wed, 30 Aug 2023 16:48:20 +0800 Message-Id: <20230830084902.2113960-7-gaosong@loongson.cn> In-Reply-To: <20230830084902.2113960-1-gaosong@loongson.cn> References: <20230830084902.2113960-1-gaosong@loongson.cn> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=114.242.206.163; envelope-from=gaosong@loongson.cn; helo=mail.loongson.cn X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org
Series	Add LoongArch LASX instructions \| expand [v4,00/48] Add LoongArch LASX instructions [v4,01/48] target/loongarch: Add LASX data support [v4,02/48] target/loongarch: meson.build support build LASX [v4,03/48] target/loongarch: Add CHECK_ASXE maccro for check LASX enable [v4,04/48] target/loongarch: Add avail_LASX to check LASX instructions [v4,05/48] target/loongarch: Implement xvadd/xvsub [v4,06/48] target/loongarch: Implement xvreplgr2vr [v4,07/48] target/loongarch: Implement xvaddi/xvsubi [v4,08/48] target/loongarch: Implement xvneg [v4,09/48] target/loongarch: Implement xvsadd/xvssub [v4,10/48] target/loongarch: rename lsx_helper.c to vec_helper.c [v4,11/48] target/loongarch: Implement xvhaddw/xvhsubw [v4,12/48] target/loongarch: Implement xvaddw/xvsubw [v4,13/48] target/loongarch: Implement xavg/xvagr [v4,14/48] target/loongarch: Implement xvabsd [v4,15/48] target/loongarch: Implement xvadda [v4,16/48] target/loongarch: Implement xvmax/xvmin [v4,17/48] target/loongarch: Implement xvmul/xvmuh/xvmulw{ev/od} [v4,18/48] target/loongarch: Implement xvmadd/xvmsub/xvmaddw{ev/od} [v4,19/48] target/loongarch; Implement xvdiv/xvmod [v4,20/48] target/loongarch: Implement xvsat [v4,21/48] target/loongarch: Implement xvexth [v4,22/48] target/loongarch: Implement vext2xv [v4,23/48] target/loongarch: Implement xvsigncov [v4,24/48] target/loongarch: Implement xvmskltz/xvmskgez/xvmsknz [v4,25/48] target/loognarch: Implement xvldi [v4,26/48] target/loongarch: Implement LASX logic instructions [v4,27/48] target/loongarch: Implement xvsll xvsrl xvsra xvrotr [v4,28/48] target/loongarch: Implement xvsllwil xvextl [v4,29/48] target/loongarch: Implement xvsrlr xvsrar [v4,30/48] target/loongarch: Implement xvsrln xvsran [v4,31/48] target/loongarch: Implement xvsrlrn xvsrarn [v4,32/48] target/loongarch: Implement xvssrln xvssran [v4,33/48] target/loongarch: Implement xvssrlrn xvssrarn [v4,34/48] target/loongarch: Implement xvclo xvclz [v4,35/48] target/loongarch: Implement xvpcnt [v4,36/48] target/loongarch: Implement xvbitclr xvbitset xvbitrev [v4,37/48] target/loongarch: Implement xvfrstp [v4,38/48] target/loongarch: Implement LASX fpu arith instructions [v4,39/48] target/loongarch: Implement LASX fpu fcvt instructions [v4,40/48] target/loongarch: Implement xvseq xvsle xvslt [v4,41/48] target/loongarch: Implement xvfcmp [v4,42/48] target/loongarch: Implement xvbitsel xvset [v4,43/48] target/loongarch: Implement xvinsgr2vr xvpickve2gr [v4,44/48] target/loongarch: Implement xvreplve xvinsve0 xvpickve xvb{sll/srl}v [v4,45/48] target/loongarch: Implement xvpack xvpick xvilv{l/h} [v4,46/48] target/loongarch: Implement xvshuf xvperm{i} xvshuf4i xvextrins [v4,47/48] target/loongarch: Implement xvld xvst [v4,48/48] target/loongarch: CPUCFG support LASX

Message ID

20230830084902.2113960-7-gaosong@loongson.cn

State

New

Headers

From: Song Gao <gaosong@loongson.cn>
To: qemu-devel@nongnu.org
Cc: richard.henderson@linaro.org
Subject: [PATCH v4 06/48] target/loongarch: Implement xvreplgr2vr
Date: Wed, 30 Aug 2023 16:48:20 +0800
Message-Id: <20230830084902.2113960-7-gaosong@loongson.cn>
In-Reply-To: <20230830084902.2113960-1-gaosong@loongson.cn>
References: <20230830084902.2113960-1-gaosong@loongson.cn>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=114.242.206.163;
 envelope-from=gaosong@loongson.cn;
 helo=mail.loongson.cn
X-Spam_score_int: -18
X-Spam_score: -1.9
X-Spam_bar: -
X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org
Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org

Series

Add LoongArch LASX instructions | expand

Commit Message

Song Gao Aug. 30, 2023, 8:48 a.m. UTC

This patch includes:
- XVREPLGR2VR.{B/H/W/D}.

Signed-off-by: Song Gao <gaosong@loongson.cn>
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/loongarch/insns.decode                |  5 +++++
 target/loongarch/disas.c                     | 10 ++++++++++
 target/loongarch/insn_trans/trans_lasx.c.inc |  5 +++++
 target/loongarch/insn_trans/trans_lsx.c.inc  | 12 ++++++------
 4 files changed, 26 insertions(+), 6 deletions(-)

Comments

Richard Henderson Aug. 30, 2023, 4:09 p.m. UTC | #1

On 8/30/23 01:48, Song Gao wrote:
> This patch includes:
> - XVREPLGR2VR.{B/H/W/D}.
> 
> Signed-off-by: Song Gao <gaosong@loongson.cn>
> Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
> ---
>   target/loongarch/insns.decode                |  5 +++++
>   target/loongarch/disas.c                     | 10 ++++++++++
>   target/loongarch/insn_trans/trans_lasx.c.inc |  5 +++++
>   target/loongarch/insn_trans/trans_lsx.c.inc  | 12 ++++++------
>   4 files changed, 26 insertions(+), 6 deletions(-)
> 
> diff --git a/target/loongarch/insns.decode b/target/loongarch/insns.decode
> index bcc18fb6c5..04bd238995 100644
> --- a/target/loongarch/insns.decode
> +++ b/target/loongarch/insns.decode
> @@ -1310,3 +1310,8 @@ xvsub_h          0111 01000000 11001 ..... ..... .....    @vvv
>   xvsub_w          0111 01000000 11010 ..... ..... .....    @vvv
>   xvsub_d          0111 01000000 11011 ..... ..... .....    @vvv
>   xvsub_q          0111 01010010 11011 ..... ..... .....    @vvv
> +
> +xvreplgr2vr_b    0111 01101001 11110 00000 ..... .....    @vr
> +xvreplgr2vr_h    0111 01101001 11110 00001 ..... .....    @vr
> +xvreplgr2vr_w    0111 01101001 11110 00010 ..... .....    @vr
> +xvreplgr2vr_d    0111 01101001 11110 00011 ..... .....    @vr
> diff --git a/target/loongarch/disas.c b/target/loongarch/disas.c
> index d8b62ba532..c47f455ed0 100644
> --- a/target/loongarch/disas.c
> +++ b/target/loongarch/disas.c
> @@ -1708,6 +1708,11 @@ static void output_vvv_x(DisasContext *ctx, arg_vvv * a, const char *mnemonic)
>       output(ctx, mnemonic, "x%d, x%d, x%d", a->vd, a->vj, a->vk);
>   }
>   
> +static void output_vr_x(DisasContext *ctx, arg_vr *a, const char *mnemonic)
> +{
> +    output(ctx, mnemonic, "x%d, r%d", a->vd, a->rj);
> +}
> +
>   INSN_LASX(xvadd_b,           vvv)
>   INSN_LASX(xvadd_h,           vvv)
>   INSN_LASX(xvadd_w,           vvv)
> @@ -1718,3 +1723,8 @@ INSN_LASX(xvsub_h,           vvv)
>   INSN_LASX(xvsub_w,           vvv)
>   INSN_LASX(xvsub_d,           vvv)
>   INSN_LASX(xvsub_q,           vvv)
> +
> +INSN_LASX(xvreplgr2vr_b,     vr)
> +INSN_LASX(xvreplgr2vr_h,     vr)
> +INSN_LASX(xvreplgr2vr_w,     vr)
> +INSN_LASX(xvreplgr2vr_d,     vr)
> diff --git a/target/loongarch/insn_trans/trans_lasx.c.inc b/target/loongarch/insn_trans/trans_lasx.c.inc
> index 218b8dc648..66b5abc790 100644
> --- a/target/loongarch/insn_trans/trans_lasx.c.inc
> +++ b/target/loongarch/insn_trans/trans_lasx.c.inc
> @@ -50,3 +50,8 @@ TRANS(xvsub_b, LASX, gvec_vvv, 32, MO_8, tcg_gen_gvec_sub)
>   TRANS(xvsub_h, LASX, gvec_vvv, 32, MO_16, tcg_gen_gvec_sub)
>   TRANS(xvsub_w, LASX, gvec_vvv, 32, MO_32, tcg_gen_gvec_sub)
>   TRANS(xvsub_d, LASX, gvec_vvv, 32, MO_64, tcg_gen_gvec_sub)
> +
> +TRANS(xvreplgr2vr_b, LASX, gvec_dup, 32, MO_8)
> +TRANS(xvreplgr2vr_h, LASX, gvec_dup, 32, MO_16)
> +TRANS(xvreplgr2vr_w, LASX, gvec_dup, 32, MO_32)
> +TRANS(xvreplgr2vr_d, LASX, gvec_dup, 32, MO_64)
> diff --git a/target/loongarch/insn_trans/trans_lsx.c.inc b/target/loongarch/insn_trans/trans_lsx.c.inc
> index 0e12213e8b..c0e7a9a372 100644
> --- a/target/loongarch/insn_trans/trans_lsx.c.inc
> +++ b/target/loongarch/insn_trans/trans_lsx.c.inc
> @@ -4161,7 +4161,7 @@ static bool trans_vpickve2gr_du(DisasContext *ctx, arg_rv_i *a)
>       return true;
>   }
>   
> -static bool gvec_dup(DisasContext *ctx, arg_vr *a, MemOp mop)
> +static bool gvec_dup(DisasContext *ctx, arg_vr *a, uint32_t oprsz, MemOp mop)
>   {
>       TCGv src = gpr_src(ctx, a->rj, EXT_NONE);
>   
> @@ -4172,14 +4172,14 @@ static bool gvec_dup(DisasContext *ctx, arg_vr *a, MemOp mop)
>       CHECK_VEC;
>   
>       tcg_gen_gvec_dup_i64(mop, vec_full_offset(a->vd),
> -                         16, ctx->vl/8, src);
> +                         oprsz, ctx->vl / 8, src);
>       return true;
>   }
>   
> -TRANS(vreplgr2vr_b, LSX, gvec_dup, MO_8)
> -TRANS(vreplgr2vr_h, LSX, gvec_dup, MO_16)
> -TRANS(vreplgr2vr_w, LSX, gvec_dup, MO_32)
> -TRANS(vreplgr2vr_d, LSX, gvec_dup, MO_64)
> +TRANS(vreplgr2vr_b, LSX, gvec_dup, 16, MO_8)
> +TRANS(vreplgr2vr_h, LSX, gvec_dup, 16, MO_16)
> +TRANS(vreplgr2vr_w, LSX, gvec_dup, 16, MO_32)
> +TRANS(vreplgr2vr_d, LSX, gvec_dup, 16, MO_64)

Hmm.

Ok, so revising the advice I gave versus the previous patch, I can see how having a common 
CHECK_VEC is helpful.  But it still needs to use oprsz not vl for the size check.

It would be better to replace with a function, like

     if (!check_vec(ctx, oprsz)) {
         return true;
     }

rather than a macro with a hidden return.  The replacement should be done in a patch by 
itself, probably using check_vec(ctx, 16) for all of the existing LSX code until, step by 
step, oprsz is plumbed into all of the places required.

I still think having separate minimal gen_vvv and gen_xxx helpers will help reduce the 
possibility of typos, when there are a lot of instructions within an instruction format. 
But when there are just 8, like here, just adding oprsz certainly looks simpler.

I wonder if it is really clearer having the LASX instructions in a separate file?  Perhaps 
it be better to keep all of the similar patterns together, e.g.

static bool gvec_dup(...)
{
...
}

TRANS(vreplgr2vr_b, LSX, gvec_dup, 16, MO_8)
TRANS(vreplgr2vr_h, LSX, gvec_dup, 16, MO_16)
TRANS(vreplgr2vr_w, LSX, gvec_dup, 16, MO_32)
TRANS(vreplgr2vr_d, LSX, gvec_dup, 16, MO_64)

TRANS(xvreplgr2vr_b, LASX, gvec_dup, 32, MO_8)
TRANS(xvreplgr2vr_h, LASX, gvec_dup, 32, MO_16)
TRANS(xvreplgr2vr_w, LASX, gvec_dup, 32, MO_32)
TRANS(xvreplgr2vr_d, LASX, gvec_dup, 32, MO_64)


r~

Song Gao Aug. 31, 2023, 7:17 a.m. UTC | #2

在 2023/8/31 上午12:09, Richard Henderson 写道:
> On 8/30/23 01:48, Song Gao wrote:
>> This patch includes:
>> - XVREPLGR2VR.{B/H/W/D}.
>>
>> Signed-off-by: Song Gao <gaosong@loongson.cn>
>> Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
>> ---
>>   target/loongarch/insns.decode                |  5 +++++
>>   target/loongarch/disas.c                     | 10 ++++++++++
>>   target/loongarch/insn_trans/trans_lasx.c.inc |  5 +++++
>>   target/loongarch/insn_trans/trans_lsx.c.inc  | 12 ++++++------
>>   4 files changed, 26 insertions(+), 6 deletions(-)
>>
>> diff --git a/target/loongarch/insns.decode 
>> b/target/loongarch/insns.decode
>> index bcc18fb6c5..04bd238995 100644
>> --- a/target/loongarch/insns.decode
>> +++ b/target/loongarch/insns.decode
>> @@ -1310,3 +1310,8 @@ xvsub_h          0111 01000000 11001 ..... ..... 
>> .....    @vvv
>>   xvsub_w          0111 01000000 11010 ..... ..... .....    @vvv
>>   xvsub_d          0111 01000000 11011 ..... ..... .....    @vvv
>>   xvsub_q          0111 01010010 11011 ..... ..... .....    @vvv
>> +
>> +xvreplgr2vr_b    0111 01101001 11110 00000 ..... .....    @vr
>> +xvreplgr2vr_h    0111 01101001 11110 00001 ..... .....    @vr
>> +xvreplgr2vr_w    0111 01101001 11110 00010 ..... .....    @vr
>> +xvreplgr2vr_d    0111 01101001 11110 00011 ..... .....    @vr
>> diff --git a/target/loongarch/disas.c b/target/loongarch/disas.c
>> index d8b62ba532..c47f455ed0 100644
>> --- a/target/loongarch/disas.c
>> +++ b/target/loongarch/disas.c
>> @@ -1708,6 +1708,11 @@ static void output_vvv_x(DisasContext *ctx, 
>> arg_vvv * a, const char *mnemonic)
>>       output(ctx, mnemonic, "x%d, x%d, x%d", a->vd, a->vj, a->vk);
>>   }
>> +static void output_vr_x(DisasContext *ctx, arg_vr *a, const char 
>> *mnemonic)
>> +{
>> +    output(ctx, mnemonic, "x%d, r%d", a->vd, a->rj);
>> +}
>> +
>>   INSN_LASX(xvadd_b,           vvv)
>>   INSN_LASX(xvadd_h,           vvv)
>>   INSN_LASX(xvadd_w,           vvv)
>> @@ -1718,3 +1723,8 @@ INSN_LASX(xvsub_h,           vvv)
>>   INSN_LASX(xvsub_w,           vvv)
>>   INSN_LASX(xvsub_d,           vvv)
>>   INSN_LASX(xvsub_q,           vvv)
>> +
>> +INSN_LASX(xvreplgr2vr_b,     vr)
>> +INSN_LASX(xvreplgr2vr_h,     vr)
>> +INSN_LASX(xvreplgr2vr_w,     vr)
>> +INSN_LASX(xvreplgr2vr_d,     vr)
>> diff --git a/target/loongarch/insn_trans/trans_lasx.c.inc 
>> b/target/loongarch/insn_trans/trans_lasx.c.inc
>> index 218b8dc648..66b5abc790 100644
>> --- a/target/loongarch/insn_trans/trans_lasx.c.inc
>> +++ b/target/loongarch/insn_trans/trans_lasx.c.inc
>> @@ -50,3 +50,8 @@ TRANS(xvsub_b, LASX, gvec_vvv, 32, MO_8, 
>> tcg_gen_gvec_sub)
>>   TRANS(xvsub_h, LASX, gvec_vvv, 32, MO_16, tcg_gen_gvec_sub)
>>   TRANS(xvsub_w, LASX, gvec_vvv, 32, MO_32, tcg_gen_gvec_sub)
>>   TRANS(xvsub_d, LASX, gvec_vvv, 32, MO_64, tcg_gen_gvec_sub)
>> +
>> +TRANS(xvreplgr2vr_b, LASX, gvec_dup, 32, MO_8)
>> +TRANS(xvreplgr2vr_h, LASX, gvec_dup, 32, MO_16)
>> +TRANS(xvreplgr2vr_w, LASX, gvec_dup, 32, MO_32)
>> +TRANS(xvreplgr2vr_d, LASX, gvec_dup, 32, MO_64)
>> diff --git a/target/loongarch/insn_trans/trans_lsx.c.inc 
>> b/target/loongarch/insn_trans/trans_lsx.c.inc
>> index 0e12213e8b..c0e7a9a372 100644
>> --- a/target/loongarch/insn_trans/trans_lsx.c.inc
>> +++ b/target/loongarch/insn_trans/trans_lsx.c.inc
>> @@ -4161,7 +4161,7 @@ static bool trans_vpickve2gr_du(DisasContext 
>> *ctx, arg_rv_i *a)
>>       return true;
>>   }
>> -static bool gvec_dup(DisasContext *ctx, arg_vr *a, MemOp mop)
>> +static bool gvec_dup(DisasContext *ctx, arg_vr *a, uint32_t oprsz, 
>> MemOp mop)
>>   {
>>       TCGv src = gpr_src(ctx, a->rj, EXT_NONE);
>> @@ -4172,14 +4172,14 @@ static bool gvec_dup(DisasContext *ctx, arg_vr 
>> *a, MemOp mop)
>>       CHECK_VEC;
>>       tcg_gen_gvec_dup_i64(mop, vec_full_offset(a->vd),
>> -                         16, ctx->vl/8, src);
>> +                         oprsz, ctx->vl / 8, src);
>>       return true;
>>   }
>> -TRANS(vreplgr2vr_b, LSX, gvec_dup, MO_8)
>> -TRANS(vreplgr2vr_h, LSX, gvec_dup, MO_16)
>> -TRANS(vreplgr2vr_w, LSX, gvec_dup, MO_32)
>> -TRANS(vreplgr2vr_d, LSX, gvec_dup, MO_64)
>> +TRANS(vreplgr2vr_b, LSX, gvec_dup, 16, MO_8)
>> +TRANS(vreplgr2vr_h, LSX, gvec_dup, 16, MO_16)
>> +TRANS(vreplgr2vr_w, LSX, gvec_dup, 16, MO_32)
>> +TRANS(vreplgr2vr_d, LSX, gvec_dup, 16, MO_64)
> 
> Hmm.
> 
> Ok, so revising the advice I gave versus the previous patch, I can see 
> how having a common CHECK_VEC is helpful.  But it still needs to use 
> oprsz not vl for the size check.
> 
> It would be better to replace with a function, like
> 
>      if (!check_vec(ctx, oprsz)) {
>          return true;
>      }
> 
> rather than a macro with a hidden return.  The replacement should be 
> done in a patch by itself, probably using check_vec(ctx, 16) for all of 
> the existing LSX code until, step by step, oprsz is plumbed into all of 
> the places required.
> 
> I still think having separate minimal gen_vvv and gen_xxx helpers will 
> help reduce the possibility of typos, when there are a lot of 
> instructions within an instruction format. But when there are just 8, 
> like here, just adding oprsz certainly looks simpler.
>
Thanks for you suggestions. I will correct them on v5.

> I wonder if it is really clearer having the LASX instructions in a 
> separate file?  Perhaps it be better to keep all of the similar patterns 
> together, e.g.
> 
OK.

I can merge LSX and LASX in a new file(trans_vec.c.inc or ..).
It seems need more time do this. I will send V5 a few days later.

Thanks.
Song Gao

> static bool gvec_dup(...)
> {
> ...
> }
> 
> TRANS(vreplgr2vr_b, LSX, gvec_dup, 16, MO_8)
> TRANS(vreplgr2vr_h, LSX, gvec_dup, 16, MO_16)
> TRANS(vreplgr2vr_w, LSX, gvec_dup, 16, MO_32)
> TRANS(vreplgr2vr_d, LSX, gvec_dup, 16, MO_64)
> 
> TRANS(xvreplgr2vr_b, LASX, gvec_dup, 32, MO_8)
> TRANS(xvreplgr2vr_h, LASX, gvec_dup, 32, MO_16)
> TRANS(xvreplgr2vr_w, LASX, gvec_dup, 32, MO_32)
> TRANS(xvreplgr2vr_d, LASX, gvec_dup, 32, MO_64)
> 
> 
> r~

diff --git a/target/loongarch/insns.decode b/target/loongarch/insns.decode
index bcc18fb6c5..04bd238995 100644
--- a/target/loongarch/insns.decode
+++ b/target/loongarch/insns.decode
@@ -1310,3 +1310,8 @@  xvsub_h          0111 01000000 11001 ..... ..... .....    @vvv
 xvsub_w          0111 01000000 11010 ..... ..... .....    @vvv
 xvsub_d          0111 01000000 11011 ..... ..... .....    @vvv
 xvsub_q          0111 01010010 11011 ..... ..... .....    @vvv
+
+xvreplgr2vr_b    0111 01101001 11110 00000 ..... .....    @vr
+xvreplgr2vr_h    0111 01101001 11110 00001 ..... .....    @vr
+xvreplgr2vr_w    0111 01101001 11110 00010 ..... .....    @vr
+xvreplgr2vr_d    0111 01101001 11110 00011 ..... .....    @vr
diff --git a/target/loongarch/disas.c b/target/loongarch/disas.c
index d8b62ba532..c47f455ed0 100644
--- a/target/loongarch/disas.c
+++ b/target/loongarch/disas.c
@@ -1708,6 +1708,11 @@  static void output_vvv_x(DisasContext *ctx, arg_vvv * a, const char *mnemonic)
     output(ctx, mnemonic, "x%d, x%d, x%d", a->vd, a->vj, a->vk);
 }
 
+static void output_vr_x(DisasContext *ctx, arg_vr *a, const char *mnemonic)
+{
+    output(ctx, mnemonic, "x%d, r%d", a->vd, a->rj);
+}
+
 INSN_LASX(xvadd_b,           vvv)
 INSN_LASX(xvadd_h,           vvv)
 INSN_LASX(xvadd_w,           vvv)
@@ -1718,3 +1723,8 @@  INSN_LASX(xvsub_h,           vvv)
 INSN_LASX(xvsub_w,           vvv)
 INSN_LASX(xvsub_d,           vvv)
 INSN_LASX(xvsub_q,           vvv)
+
+INSN_LASX(xvreplgr2vr_b,     vr)
+INSN_LASX(xvreplgr2vr_h,     vr)
+INSN_LASX(xvreplgr2vr_w,     vr)
+INSN_LASX(xvreplgr2vr_d,     vr)
diff --git a/target/loongarch/insn_trans/trans_lasx.c.inc b/target/loongarch/insn_trans/trans_lasx.c.inc
index 218b8dc648..66b5abc790 100644
--- a/target/loongarch/insn_trans/trans_lasx.c.inc
+++ b/target/loongarch/insn_trans/trans_lasx.c.inc
@@ -50,3 +50,8 @@  TRANS(xvsub_b, LASX, gvec_vvv, 32, MO_8, tcg_gen_gvec_sub)
 TRANS(xvsub_h, LASX, gvec_vvv, 32, MO_16, tcg_gen_gvec_sub)
 TRANS(xvsub_w, LASX, gvec_vvv, 32, MO_32, tcg_gen_gvec_sub)
 TRANS(xvsub_d, LASX, gvec_vvv, 32, MO_64, tcg_gen_gvec_sub)
+
+TRANS(xvreplgr2vr_b, LASX, gvec_dup, 32, MO_8)
+TRANS(xvreplgr2vr_h, LASX, gvec_dup, 32, MO_16)
+TRANS(xvreplgr2vr_w, LASX, gvec_dup, 32, MO_32)
+TRANS(xvreplgr2vr_d, LASX, gvec_dup, 32, MO_64)
diff --git a/target/loongarch/insn_trans/trans_lsx.c.inc b/target/loongarch/insn_trans/trans_lsx.c.inc
index 0e12213e8b..c0e7a9a372 100644
--- a/target/loongarch/insn_trans/trans_lsx.c.inc
+++ b/target/loongarch/insn_trans/trans_lsx.c.inc
@@ -4161,7 +4161,7 @@  static bool trans_vpickve2gr_du(DisasContext *ctx, arg_rv_i *a)
     return true;
 }
 
-static bool gvec_dup(DisasContext *ctx, arg_vr *a, MemOp mop)
+static bool gvec_dup(DisasContext *ctx, arg_vr *a, uint32_t oprsz, MemOp mop)
 {
     TCGv src = gpr_src(ctx, a->rj, EXT_NONE);
 
@@ -4172,14 +4172,14 @@  static bool gvec_dup(DisasContext *ctx, arg_vr *a, MemOp mop)
     CHECK_VEC;
 
     tcg_gen_gvec_dup_i64(mop, vec_full_offset(a->vd),
-                         16, ctx->vl/8, src);
+                         oprsz, ctx->vl / 8, src);
     return true;
 }
 
-TRANS(vreplgr2vr_b, LSX, gvec_dup, MO_8)
-TRANS(vreplgr2vr_h, LSX, gvec_dup, MO_16)
-TRANS(vreplgr2vr_w, LSX, gvec_dup, MO_32)
-TRANS(vreplgr2vr_d, LSX, gvec_dup, MO_64)
+TRANS(vreplgr2vr_b, LSX, gvec_dup, 16, MO_8)
+TRANS(vreplgr2vr_h, LSX, gvec_dup, 16, MO_16)
+TRANS(vreplgr2vr_w, LSX, gvec_dup, 16, MO_32)
+TRANS(vreplgr2vr_d, LSX, gvec_dup, 16, MO_64)
 
 static bool trans_vreplvei_b(DisasContext *ctx, arg_vv_i *a)
 {

[v4,06/48] target/loongarch: Implement xvreplgr2vr

Commit Message

Comments

Patch