hppa: Fix stack alignment passed to clone

Message ID	d04fea6f-9898-339c-05b2-602ac4148bc1@bell.net
State	New
Headers	show Return-Path: <libc-alpha-return-106132-incoming=patchwork.ozlabs.org@sourceware.org> DomainKey-Signature: a=rsa-sha1; c=nofws; d=sourceware.org; h=list-id :list-unsubscribe:list-subscribe:list-archive:list-post :list-help:sender:to:cc:from:subject:message-id:date :mime-version:content-type:content-transfer-encoding; q=dns; s= default; b=X1QD6DzbmbVBJlBakysrUTgMDd3vVuh0HzmvF4cfDQ7FlouaYbtQp dAUI5QUd3nIqKKE/hodiXlUZ5Z9QSa6gMY6OkShu6z4dGfbcH14KO9oFR25oq/Pv 6jF9PTh+XZZbMKNHZuuu6cnO6oZhm9vK2EqPBXhQA1nbYBQQdead5Q= Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk Sender: libc-alpha-owner@sourceware.org To: GNU C Library <libc-alpha@sourceware.org> Cc: Carlos O'Donell <carlos@redhat.com> From: John David Anglin <dave.anglin@bell.net> Subject: [PATCH] hppa: Fix stack alignment passed to clone Openpgp: preference=signencrypt Message-ID: <d04fea6f-9898-339c-05b2-602ac4148bc1@bell.net> Date: Sat, 19 Oct 2019 14:33:59 -0400 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.9.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit
Series	hppa: Fix stack alignment passed to clone \| expand hppa: Fix stack alignment passed to clone

John David Anglin Oct. 19, 2019, 6:33 p.m. UTC

The hppa architecture requires strict alignment for loads and stores.  As a result, the
minimum stack alignment that will work is 8 bytes.  This patch adjust __clone() to align
the stack argument passed to it.  It also adjusts slightly some formatting.

This fixes the nptl/tst-tls1 test.

Okay?

Dave

 sysdeps/unix/sysv/linux/hppa/clone.S | 9 +++++++--
 1 file changed, 7 insertions(+), 2 deletions(-)

Florian Weimer Oct. 19, 2019, 7:17 p.m. UTC | #1

* John David Anglin:

> The hppa architecture requires strict alignment for loads and
> stores.  As a result, the minimum stack alignment that will work is
> 8 bytes.  This patch adjust __clone() to align the stack argument
> passed to it.  It also adjusts slightly some formatting.
>
> This fixes the nptl/tst-tls1 test.

I wonder if we should change this in nptl/tst-tls1

  xpthread_attr_setstack (&a, thr_stack + 1, STACK_SIZE);

to:

  xpthread_attr_setstack (&a, thr_stack + _Alignof (max_align_t), STACK_SIZE);

This should fix the test on hppa.  Since the alignment of the test TLS
variables is much larger than that of max_align_t, I don't think it
interferes with the objective of the test.

John David Anglin Oct. 19, 2019, 7:59 p.m. UTC | #2

On 2019-10-19 3:17 p.m., Florian Weimer wrote:
> * John David Anglin:
>
>> The hppa architecture requires strict alignment for loads and
>> stores.  As a result, the minimum stack alignment that will work is
>> 8 bytes.  This patch adjust __clone() to align the stack argument
>> passed to it.  It also adjusts slightly some formatting.
>>
>> This fixes the nptl/tst-tls1 test.
> I wonder if we should change this in nptl/tst-tls1
>
>   xpthread_attr_setstack (&a, thr_stack + 1, STACK_SIZE);
>
> to:
>
>   xpthread_attr_setstack (&a, thr_stack + _Alignof (max_align_t), STACK_SIZE);
>
> This should fix the test on hppa.  Since the alignment of the test TLS
> variables is much larger than that of max_align_t, I don't think it
> interferes with the objective of the test.
Yes, that should fix the test.  However, I still think the hppa fix should be applied because it
avoids misaligned accesses and then a segmentation fault in ld.so.1.  We have this code in
__clone():

        /* Ensure stack argument is 8-byte aligned.  */
        ldo             7(%r25),%r25
        depi            0,31,3,%r25

        /* Save the function pointer, arg, and flags on the new stack.  */
        stwm    %r26, 64(%r25)
        stw     %r23, -60(%r25)
        stw     %r24, -56(%r25)

The stwm instruction adds 64 to %r25.  The preceding two instructions round the stack argument
to an 8-byte aligned value.  So, we are just adding an additional 0 to 7.  Not a big deal.

If %r25 is not aligned, the stores for %r23 and %r24 generated unaligned traps.  Then, the kernel mucks
with the stack argument because the PSW W bit is stored in the bottom of sp:

        /* for now we can *always* set the W bit on entry to the syscall
         * since we don't support wide userland processes.  We could
         * also save the current SM other than in r0 and restore it on
         * exit from the syscall, and also use that value to know
         * whether to do narrow or wide syscalls. -PB
         */
        ssm     PSW_SM_W, %r1
        extrd,u %r1,PSW_W_BIT,1,%r1
        /* sp must be aligned on 4, so deposit the W bit setting into
         * the bottom of sp temporarily */
        or,ev   %r1,%r30,%r30

The child crashes when clone returns.  With the alignment, the code runs correctly.  So, I think we
shouldn't worry about the extra two instructions.

The other alternative is to generate an error when passed a misaligned stack argument.  I looked at
the opengroup spec but it didn't clarify the situations where arguments are invalid.

Dave

Florian Weimer Oct. 19, 2019, 8:07 p.m. UTC | #3

* John David Anglin:

> The other alternative is to generate an error when passed a
> misaligned stack argument.  I looked at the opengroup spec but it
> didn't clarify the situations where arguments are invalid.

I think an unaligned stack triggers undefined behavior, so a crash
would be acceptable in this context.  We could maybe add an assert to
the thread creation (although that will cause backwards compatibility
concerns on i386 due to the silent ABI change for SSE2).

John David Anglin Oct. 19, 2019, 11:26 p.m. UTC | #4

On 2019-10-19 2:33 p.m., John David Anglin wrote:
> This fixes the nptl/tst-tls1 test.
This is bug 25066.

Dave

John David Anglin Oct. 21, 2019, 2:12 p.m. UTC | #5

On 2019-10-19 4:07 p.m., Florian Weimer wrote:
> I think an unaligned stack triggers undefined behavior, so a crash
> would be acceptable in this context.  We could maybe add an assert to
> the thread creation (although that will cause backwards compatibility
> concerns on i386 due to the silent ABI change for SSE2).
The pthread_create() function is supposed to return EINVAL if the attributes specified by
the attr argument are invalid.  The Open Group specification doesn't say a crash is acceptable
behavior.  We don't actually have a "stack" at this point.  We have a pointer to memory to
be used as the stack for the new thread.

As I see it, we have two alternatives in the hppa __clone() implementation:

1) Return EINVAL when passed a misaligned pointer, or
2) Align the pointer.

Dave

hppa: Fix stack alignment passed to clone

Commit Message

Comments

Patch