[4/13] AArch64: Cleanup fenv implementation

Message ID	000e01cfeee7$99383f10$cba8bd30$@com
State	New
Headers	show Return-Path: <libc-alpha-return-53718-incoming=patchwork.ozlabs.org@sourceware.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:from:to:subject:date:message-id:mime-version :content-type:content-transfer-encoding; q=dns; s=default; b=N2I tPhKgaSma5fTGl38942kg3f70v1MZQ6nxW6pWxe/D3BLhIOBb9PguQyXvrG6XAZS NDUpuyHc+Mwl6lSpEZUh+0ZIxTrijRLQqflAS/Kv8/SFhyNjxOVmK0oS/iSoskPU Y8EAsa2FoqhIWQPnv9zE4kuPOmpKluAMvSwlnIfA= Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk Sender: libc-alpha-owner@sourceware.org From: "Wilco Dijkstra" <wdijkstr@arm.com> To: <libc-alpha@sourceware.org> Subject: [PATCH 4/13] AArch64: Cleanup fenv implementation Date: Thu, 23 Oct 2014 18:34:28 +0100 Message-ID: <000e01cfeee7$99383f10$cba8bd30$@com> MIME-Version: 1.0 Content-Type: text/plain; charset=WINDOWS-1252 Content-Transfer-Encoding: quoted-printable

Message ID

000e01cfeee7$99383f10$cba8bd30$@com

State

New

Headers

DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id
	:list-unsubscribe:list-subscribe:list-archive:list-post
	:list-help:sender:from:to:subject:date:message-id:mime-version
	:content-type:content-transfer-encoding; q=dns; s=default; b=N2I
	tPhKgaSma5fTGl38942kg3f70v1MZQ6nxW6pWxe/D3BLhIOBb9PguQyXvrG6XAZS
	NDUpuyHc+Mwl6lSpEZUh+0ZIxTrijRLQqflAS/Kv8/SFhyNjxOVmK0oS/iSoskPU
	Y8EAsa2FoqhIWQPnv9zE4kuPOmpKluAMvSwlnIfA=
Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm
Precedence: bulk
Sender: libc-alpha-owner@sourceware.org
From: "Wilco Dijkstra" <wdijkstr@arm.com>
To: <libc-alpha@sourceware.org>
Subject: [PATCH 4/13] AArch64: Cleanup fenv implementation
Date: Thu, 23 Oct 2014 18:34:28 +0100
Message-ID: <000e01cfeee7$99383f10$cba8bd30$@com>
MIME-Version: 1.0
Content-Type: text/plain; charset=WINDOWS-1252
Content-Transfer-Encoding: quoted-printable

Commit Message

Wilco Oct. 23, 2014, 5:34 p.m. UTC

Cleanup feclearexcept to use the same logic as the ARM version. No functional changes.

ChangeLog:
2014-10-23  Wilco Dijkstra  <wdijkstr@arm.com>

	* sysdeps/aarch64/fpu/fclrexcpt.c (feclearexcept):
	Simplify logic.

---
 sysdeps/aarch64/fpu/fclrexcpt.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Comments

Carlos O'Donell Oct. 23, 2014, 11 p.m. UTC | #1

On 10/23/2014 01:34 PM, Wilco Dijkstra wrote:
> Cleanup feclearexcept to use the same logic as the ARM version. No functional changes.
> 
> ChangeLog:
> 2014-10-23  Wilco Dijkstra  <wdijkstr@arm.com>
> 
> 	* sysdeps/aarch64/fpu/fclrexcpt.c (feclearexcept):
> 	Simplify logic.

Looks good to me.

Reviewed-by: Carlos O'Donell <carlos@redhat.com>

> ---
>  sysdeps/aarch64/fpu/fclrexcpt.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/sysdeps/aarch64/fpu/fclrexcpt.c b/sysdeps/aarch64/fpu/fclrexcpt.c
> index b24f0ff..4471373 100644
> --- a/sysdeps/aarch64/fpu/fclrexcpt.c
> +++ b/sysdeps/aarch64/fpu/fclrexcpt.c
> @@ -28,7 +28,7 @@ feclearexcept (int excepts)
>    excepts &= FE_ALL_EXCEPT;
>  
>    _FPU_GETFPSR (fpsr);
> -  fpsr_new = (fpsr & ~FE_ALL_EXCEPT) | (fpsr & FE_ALL_EXCEPT & ~excepts);
> +  fpsr_new = fpsr & ~excepts;

OK.

The logic does seem to collapse down nicely. No need to assembly the final
fpsr_new from the two halves.

Is the generated code better?

>  
>    if (fpsr != fpsr_new)
>      _FPU_SETFPSR (fpsr_new);
> 

Cheers,
Carlos.

Wilco Oct. 24, 2014, 3:27 p.m. UTC | #2

> Carlos O'Donell wrote:
> > ---
> >  sysdeps/aarch64/fpu/fclrexcpt.c | 2 +-
> >  1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/sysdeps/aarch64/fpu/fclrexcpt.c b/sysdeps/aarch64/fpu/fclrexcpt.c
> > index b24f0ff..4471373 100644
> > --- a/sysdeps/aarch64/fpu/fclrexcpt.c
> > +++ b/sysdeps/aarch64/fpu/fclrexcpt.c
> > @@ -28,7 +28,7 @@ feclearexcept (int excepts)
> >    excepts &= FE_ALL_EXCEPT;
> >
> >    _FPU_GETFPSR (fpsr);
> > -  fpsr_new = (fpsr & ~FE_ALL_EXCEPT) | (fpsr & FE_ALL_EXCEPT & ~excepts);
> > +  fpsr_new = fpsr & ~excepts;
> 
> OK.
> 
> The logic does seem to collapse down nicely. No need to assembly the final
> fpsr_new from the two halves.
> 
> Is the generated code better?

Absolutely - it saves 3 instructions. GCC understands ((X & ~Y) | (X & Y)) == X,
but it doesn't do (X & ~Y) | (X & Y & Z) -> X & (Z | ~Y). In any case the new
version is much easier to understand as you no longer have to figure out what
it is trying to achieve!

Thanks for the link btw, I know what to do in the future for trivial patches.
Patch 1-4 have been committed.

Wilco

Carlos O'Donell Oct. 24, 2014, 3:34 p.m. UTC | #3

On 10/24/2014 11:27 AM, Wilco Dijkstra wrote:
>> Carlos O'Donell wrote:
>>> ---
>>>  sysdeps/aarch64/fpu/fclrexcpt.c | 2 +-
>>>  1 file changed, 1 insertion(+), 1 deletion(-)
>>>
>>> diff --git a/sysdeps/aarch64/fpu/fclrexcpt.c b/sysdeps/aarch64/fpu/fclrexcpt.c
>>> index b24f0ff..4471373 100644
>>> --- a/sysdeps/aarch64/fpu/fclrexcpt.c
>>> +++ b/sysdeps/aarch64/fpu/fclrexcpt.c
>>> @@ -28,7 +28,7 @@ feclearexcept (int excepts)
>>>    excepts &= FE_ALL_EXCEPT;
>>>
>>>    _FPU_GETFPSR (fpsr);
>>> -  fpsr_new = (fpsr & ~FE_ALL_EXCEPT) | (fpsr & FE_ALL_EXCEPT & ~excepts);
>>> +  fpsr_new = fpsr & ~excepts;
>>
>> OK.
>>
>> The logic does seem to collapse down nicely. No need to assembly the final
>> fpsr_new from the two halves.
>>
>> Is the generated code better?
> 
> Absolutely - it saves 3 instructions. GCC understands ((X & ~Y) | (X & Y)) == X,
> but it doesn't do (X & ~Y) | (X & Y & Z) -> X & (Z | ~Y). In any case the new
> version is much easier to understand as you no longer have to figure out what
> it is trying to achieve!

Agreed. Thanks for verifying.
 
> Thanks for the link btw, I know what to do in the future for trivial patches.
> Patch 1-4 have been committed.

My pleasure. I want to make developing for glibc as painless as possible, but
no less ;-)

Cheers,
Carlos.

diff --git a/sysdeps/aarch64/fpu/fclrexcpt.c b/sysdeps/aarch64/fpu/fclrexcpt.c
index b24f0ff..4471373 100644
--- a/sysdeps/aarch64/fpu/fclrexcpt.c
+++ b/sysdeps/aarch64/fpu/fclrexcpt.c
@@ -28,7 +28,7 @@  feclearexcept (int excepts)
   excepts &= FE_ALL_EXCEPT;
 
   _FPU_GETFPSR (fpsr);
-  fpsr_new = (fpsr & ~FE_ALL_EXCEPT) | (fpsr & FE_ALL_EXCEPT & ~excepts);
+  fpsr_new = fpsr & ~excepts;
 
   if (fpsr != fpsr_new)
     _FPU_SETFPSR (fpsr_new);