[1/2] Add futex wrappers with error checking

On Thu, 2014-12-11 at 17:18 -0800, Roland McGrath wrote:
> > The second is that I haven't looked through all the lowlevellock cases
> > yet, so didn't want to touch that; it seemed moving lll_futex* callers
> > over to futex* callers wouldn't be an issue later on.
> 
> Fair enough.  I'm happy as long as the end state has no duplication of
> code/logic (one internal futex interface to rule them all) and you commit
> to driving all the corners of the cleanup so we get to that end state in
> this cycle.

I want to push back on this a little for two reasons.  First, while I
agree that we don't want to have two interfaces for the same thing, this
isn't exactly the case: lll_futex_ has no error checking and is just a
wrapper for whatever the underlying kernel/... provides.  The futex
wrappers I introduced do have error checking.
Second, it's hard to easily commit to doing something that isn't clearly
defined, especially so close to the freeze.  I agree that we shouldn't
let introduce junk or duplicated functionality, but it may be
unrealistic to try to get all-or-nothing instead of incremental changes.

I've introduced the futex wrappers because I wanted to do proper error
checking for the semaphores, and doing so with a reusable header seemed
to be The Right Thing.

> > Third, some of the lll_futex* definitions are in headers that are also
> > used from asm files; I guess that would mean I'd need to use macros
> > instead of C functions.
> 
> #ifdef __ASSEMBLY__ if need be.

Right.  Thanks.

> But we can also just clean things up so
> that's no longer the case.  There's no reason I can see why assembly code
> should want lowlevellock-futex.h.

There is assembly that calls futex syscalls, which needs at least the
macros for the different futex ops, and the syscall number in certain
cases.  Those things are currently in lowlevellock.h.
lowlevellock-futex.h needs the same information, and it should not
include lowlevellock.h.

We have other #ifndef __ASSEMBLY__ in lowlelvellock.h for a similar
reason already, it seems.

> > Fourth, I need some way to get to the arch-specific futex syscalls.  I
> > didn't know whether sysdeps/unix/sysv/linux/lowlevellock-futex.h would
> > work on every arch, so I just used what works for the locks.
> 
> What's an arch-specific futex syscall?  AFAIK
> sysdeps/unix/sysv/linux/lowlevellock-futex.h should be fine for all
> machines.

It doesn't work currently on i386 due to six-argument syscalls not being
supported, AFAIU.  If someone can add that I'd appreciate it; it would
save me finding out how to do that properly.

> Indeed sysdeps/unix/sysv/linux/lowlevellock.h should be fine for
> all machines too, and the only reason we still have any machine-specific
> files is conservatism about making sure that removing each one doesn't
> degrade any performance or semantics (so someone just needs to look at the
> generated code and compare for each machine).

I agree that this is what we want to do in the long run.

> > > I don't
> > > think we want to have both layers as such in the long run,
> > 
> > Maybe not.  If we want to expose our own futex abstraction to users,
> > we'd need a separate version that does less of the error checking we do,
> > as there may be cases where certain errors would need to be handled
> > differently.  You point out something similar below; checking that the
> > kernel (or whatever below provides the futex functionality) didn't
> > return errors we haven't specified in our futex abstraction.
> 
> I think the best approach for now is not to think about any new user API.
> Just do all the cleanup of our internal futex use thoroughly so we think
> it's very good and very maintainable.  When/if we come up with a new user
> API later, we can refactor as needed to implement it.

OK.

> > I didn't think about clean-up as much.  What I wanted is something we
> > can use today to get the futex error handling correct in pthread_once
> > and the the semaphores I'm about to submit, for example.
> 
> I'm going to insist on cleanup so we aren't growing redundant internal
> APIs.

Understood, but see above (e.g., I disagree it's fully redundant; more
details below).

> > I think I have a pretty good understanding for what the futex semantics
> > of the abstraction that we use internally should be.  I don't have a
> > good feel for how to best clean up all the existing code we have related
> > to that.
> 
> If you start with a strawman proposal for the complete new internal API,
> then we can all work together to figure out how to clean up existing code.

I'd start with just futex (timed)wait and wake, so what's in my patch.
That covers most of the uses.  The other ops are mostly for the
low-level locks, and I need to make a pass over the error handling for
these (but this was already discussed in the futex error handling
thread).

> > > > +#include <lowlevellock.h>
> > > 
> > > Include only what you need: lowlevellock-futex.h here.  That changes
> > > which code you're getting today, because all the machine-specific
> > > lowlevellock.h files still need to be removed.  But we should be
> > > finishing that cleanup this cycle anyway (though everyone seems to
> > > have forgotten).
> > 
> > I tried that now, but that doesn't work because it redefines lll_futex*,
> > and it's hard to avoid including lowlevellock.h through some other
> > header.  Therefore, I left this unchanged for now.
> 
> OK.  Perhaps you'd like to take on eliminating at least the x86 versions of
> lowlevellock.h?  (I think we'll really need to eliminate all of them before
> all futex-related cleanup is done.)

I agree that the existance custom lowlevellock.h and the related
assembly files is an issue we want to fix.  But I think we can start
making the futex facility more generic independently of that.

> > > So, now I'm seeing a potential reason to have this layer exist
> > > distinct from the OS-encapsulation layer.  Perhaps we should have the
> > > checks for expected errno values be in an OS-independent layer rather
> > > than just saying in the specification of the OS-encapsulation layer
> > > that it must yield only the particular set.
> > 
> > I'm not sure I can quite follow you.  I could see why the
> > OS-encapsulation layer would want to check that the set of return values
> > is only those we support in higher layers, but that's not what you're
> > after, or is it?
> 
> If the generic (higher) layers require a certain protocol about errno
> values and we want code to enforce/sanity-check the underlying OS calls for
> that (which seems to be the consensus for Linux), then it is duplicative
> and error-prone for each OS-specific layer to repeat that checking logic.

Yes.  That what I implicitly had in my mind too when thinking about
exposing futex to users; we'd need the OS call to expose it as-is, and
we'd need an internal interface that does the common style of error
checking, which should be OS-independent (assuming the OS calls on
different OSes return compatible sets of errors).

> > Updated patch is attached.  Is this one okay, or do you want to see
> > further changes to it and/or more of the full problem being addressed?
> 
> I guess I'd like to be closer to a full plan for cleaning it all up--that
> is, at least a more full sense of what the complete end state will look
> like, if not all the details about how to reach it--before we start
> committing.

OK.  So, what I'd suggest is this:

1) Have an OS-call interface that just does that.  This is what
lowlevellock-futex.h is currently, AFAIU.  This is implemented
differently on each OS.
Make sure that there is a lowlevellock-futex.h on each arch.  The
attached patch does this for x86_64 and i386.  More details below.

2) Have an OS-independent internal futex interface, with error checking.
That's what's in my patch.  This uses the interface from 1).

3) Move generic, non-low-level-lock code over to using the interface
from 2).  The new semaphore does that.  I have a new condvar
implementation which is just missing the PI handling, which would do the
same.  I'm working on a new rwlock that would use it. Etc.

4) Use the interface from 2) in generic low-level lock.  This should be
fine and without significant performance implications because all futex
ops are on the slow path, with the exception of the PI mutexes.  But if
you're doing a syscall anyway, doing a few more instructions more or
less won't matter that much.

5) Remove custom low-level lock implementations after reviewing the
performance implications of such removals.

Does this sound reasonable?  Are you OK with doing steps 1 and 2 before
the freeze, and do as much of 3) as possible before the freeze?

Regarding the attached patch:

This uses generic lowlevellock-futex.h for x86_64.  It also adds a few
#ifdef __ASSEMBLER__ to the generic Linux one to make this work.

With i386, this doesn't work because of lack of support for six-argument
syscalls (see above); thus, this patch just splits out all the
lll_futex_* calls and related stuff into a i386-specific
lowlevellock-futex.h file.  This has fewer features than the generic
one, but the only users are the current interface from 2), which just
have futex (timed)wait and wake.  And it works currently, so this should
be fine and we can add stuff as necessary later on (or, better, move to
the generic lowlevellock-futex.h).

There are a few more archs with custom futex features (not handled in
the attached patch, but I could add them if this approach is fine for
you):
* microblaze, s390, hppa all use INTERNAL_SYSCALL;  I believe those
could just use generic Linux lowlevellock-futex.h.
* ia64 uses DO_INLINE_SYSCALL.  Not sure whether that could use generic
Linux futexes too.
* sh has custom asm.  Not sure what to do about that.
* sparc uses INTERNAL_SYSCALL, so could be moved to generic Linux, but
there is a change (whose purpose/motivation I currently don't
understand):
  /* Returns non-zero if error happened, zero if success.  */
  #ifdef __sparc32_atomic_do_lock
  /* Avoid FUTEX_WAKE_OP if supporting pre-v9 CPUs.  */
  # define lll_futex_wake_unlock(futexp, nr_wake, nr_wake2, futexp2,\
     private) 1
  #else
  # define lll_futex_wake_unlock(futexp, nr_wake, nr_wake2, futexp2,\
  ...

Thoughts?

[1/2] Add futex wrappers with error checking

Commit Message

Comments

Patch