rework daemonizing logic in qemu-nbd

Message ID	20120114124210.E43483DE@gandalf.tls.msk.ru
State	New
Headers	show Return-Path: <qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org> From: Michael Tokarev <mjt@tls.msk.ru> Date: Sat, 14 Jan 2012 16:39:28 +0400 To: qemu-devel@nongnu.org Message-Id: <20120114124210.E43483DE@gandalf.tls.msk.ru> Cc: Paolo Bonzini <pbonzini@redhat.com>, mjt@tls.msk.ru Subject: [Qemu-devel] [PATCH] rework daemonizing logic in qemu-nbd Precedence: list Errors-To: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org Sender: qemu-devel-bounces+incoming=patchwork.ozlabs.org@nongnu.org

Michael Tokarev Jan. 14, 2012, 12:39 p.m. UTC

qemu-nbd uses daemon(3) routine to daemonize, and while
at it, it uses several hacks to make daemon(3) to work
as intended.  Instead of all these hacks, implement
daemon(3) functionality (which is a very simple function)
directly but in a way which is much more suitable for the
use case. It lets us to remove several hacks completely,
and stop using daemon() which is marked as deprecated
on e.g. MacOS.  Some more hacks around daemon(3) will
be removed in subsequent series.

Signed-Off-By: Michael Tokarev <mjt@tls.msk.ru>
---
 qemu-nbd.c |   83 ++++++++++++++++++++++++++++++++++++------------------------
 1 files changed, 50 insertions(+), 33 deletions(-)

Paolo Bonzini Jan. 15, 2012, 10:42 a.m. UTC | #1

On 01/14/2012 01:39 PM, Michael Tokarev wrote:
>           if (pid == 0) {
> -            close(stderr_fd[0]);
> -            ret = qemu_daemon(0, 0);
> -
> -            /* Temporarily redirect stderr to the parent's pipe...  */
> -            dup2(stderr_fd[1], STDERR_FILENO);
> -            if (ret == -1) {
> +            int nullfd = open("/dev/null", O_RDWR);
> +            if (nullfd<  0 || setsid()<  0) {
>                   err(EXIT_FAILURE, "Failed to daemonize");
>               }

This is forking only once.

> -
> -            /* ... close the descriptor we inherited and go on.  */
> -            close(stderr_fd[1]);
> -        } else {
> -            bool errors = false;
> -            char *buf;
> -
> -            /* In the parent.  Print error messages from the child until
> -             * it closes the pipe.
> +            /* redirect stdin from /dev/null,
> +             * stdout (temporarily) to the pipe to parent,

This is a bit of a hack.

> +    /* now complete the daemonizing procedure.
> +     */
> +    if (device && !verbose) {
> +        if (chdir("/") < 0) {
> +            err(EXIT_FAILURE, "unable to chdir to /");
> +        }
> +        /* this redirects stderr to /dev/null */
> +        dup2(STDIN_FILENO, STDERR_FILENO);
> +        /* this redirects stdout to /dev/null too, and closes parent pipe */
> +        dup2(STDIN_FILENO, STDOUT_FILENO);
> +    }
> +

Half of this is already done in client_thread, and that would be the 
place where you should add dup2(0, 1).  Also, the chdir can be moved 
earlier, after bdrv_open.

Paolo

Michael Tokarev Jan. 15, 2012, 12:50 p.m. UTC | #2

On 15.01.2012 14:42, Paolo Bonzini wrote:
> On 01/14/2012 01:39 PM, Michael Tokarev wrote:
>>           if (pid == 0) {
>> -            close(stderr_fd[0]);
>> -            ret = qemu_daemon(0, 0);
>> -
>> -            /* Temporarily redirect stderr to the parent's pipe...  */
>> -            dup2(stderr_fd[1], STDERR_FILENO);
>> -            if (ret == -1) {
>> +            int nullfd = open("/dev/null", O_RDWR);
>> +            if (nullfd<  0 || setsid()<  0) {
>>                   err(EXIT_FAILURE, "Failed to daemonize");
>>               }
> 
> This is forking only once.

Is it good or bad?  There's no need to fork twice.  Second
fork (to the one which is already done in daemon(3)) has
been done to work around lack of proper communication between
parent and child in case of using plain daemon(3).  I.e., due
to daemon(3) interface being unflexible/unsuitable for the
current use case.

>> -
>> -            /* ... close the descriptor we inherited and go on.  */
>> -            close(stderr_fd[1]);
>> -        } else {
>> -            bool errors = false;
>> -            char *buf;
>> -
>> -            /* In the parent.  Print error messages from the child until
>> -             * it closes the pipe.
>> +            /* redirect stdin from /dev/null,
>> +             * stdout (temporarily) to the pipe to parent,
> 
> This is a bit of a hack.

There's another way -- to keep the writing pipe end in some
local variable and use that one instead of STDOUT_FILENO.
I can do it that way for sure, just thought it's already
using too much local variables.

>> +    /* now complete the daemonizing procedure.
>> +     */
>> +    if (device && !verbose) {
>> +        if (chdir("/") < 0) {
>> +            err(EXIT_FAILURE, "unable to chdir to /");
>> +        }
>> +        /* this redirects stderr to /dev/null */
>> +        dup2(STDIN_FILENO, STDERR_FILENO);
>> +        /* this redirects stdout to /dev/null too, and closes parent pipe */
>> +        dup2(STDIN_FILENO, STDOUT_FILENO);
>> +    }
>> +
> 
> Half of this is already done in client_thread, and that would be the place where you should add dup2(0, 1).

I partly disagree.

I wanted to de-couple -c (device) case with daemonizing.
client_thread only works in -c case, but daemonizing in
that case is wrong as I already pointed out in another
email - we should either stop daemonizing here at all
or have a separate option for it.

>  Also, the chdir can be moved earlier, after bdrv_open.

There's no need to, afiacs.  We complete init process and
enter main loop.  Chdir should be done befor entering main
loop, the rest makes no difference (as long as the files
we open will be accessible from cwd).

Thanks,

/mjt

Paolo Bonzini Jan. 15, 2012, 4:11 p.m. UTC | #3

On 01/15/2012 01:50 PM, Michael Tokarev wrote:
> On 15.01.2012 14:42, Paolo Bonzini wrote:
>> On 01/14/2012 01:39 PM, Michael Tokarev wrote:
>>>            if (pid == 0) {
>>> -            close(stderr_fd[0]);
>>> -            ret = qemu_daemon(0, 0);
>>> -
>>> -            /* Temporarily redirect stderr to the parent's pipe...  */
>>> -            dup2(stderr_fd[1], STDERR_FILENO);
>>> -            if (ret == -1) {
>>> +            int nullfd = open("/dev/null", O_RDWR);
>>> +            if (nullfd<   0 || setsid()<   0) {
>>>                    err(EXIT_FAILURE, "Failed to daemonize");
>>>                }
>>
>> This is forking only once.
>
> Is it good or bad?  There's no need to fork twice.  Second
> fork (to the one which is already done in daemon(3)) has
> been done to work around lack of proper communication between
> parent and child in case of using plain daemon(3).  I.e., due
> to daemon(3) interface being unflexible/unsuitable for the
> current use case.

daemon(3) forks twice (so qemu-nbd is effectively forking three times, 
one of which is unnecessary).

See 
http://stackoverflow.com/questions/881388/what-is-the-reason-for-performing-a-double-fork-when-creating-a-daemon 
for why there is a fork before setsid and one after.

>>> -
>>> -            /* ... close the descriptor we inherited and go on.  */
>>> -            close(stderr_fd[1]);
>>> -        } else {
>>> -            bool errors = false;
>>> -            char *buf;
>>> -
>>> -            /* In the parent.  Print error messages from the child until
>>> -             * it closes the pipe.
>>> +            /* redirect stdin from /dev/null,
>>> +             * stdout (temporarily) to the pipe to parent,
>>
>> This is a bit of a hack.
>
> There's another way -- to keep the writing pipe end in some
> local variable and use that one instead of STDOUT_FILENO.
> I can do it that way for sure, just thought it's already
> using too much local variables.

Yes, that would be better.

>>> +    /* now complete the daemonizing procedure.
>>> +     */
>>> +    if (device&&  !verbose) {
>>> +        if (chdir("/")<  0) {
>>> +            err(EXIT_FAILURE, "unable to chdir to /");
>>> +        }
>>> +        /* this redirects stderr to /dev/null */
>>> +        dup2(STDIN_FILENO, STDERR_FILENO);
>>> +        /* this redirects stdout to /dev/null too, and closes parent pipe */
>>> +        dup2(STDIN_FILENO, STDOUT_FILENO);
>>> +    }
>>> +
>>
>> Half of this is already done in client_thread, and that would be
>> theplace where you should add dup2(0, 1).
>
> I partly disagree.
>
> I wanted to de-couple -c (device) case with daemonizing.
> client_thread only works in -c case, but daemonizing in
> that case is wrong as I already pointed out in another
> email - we should either stop daemonizing here at all
> or have a separate option for it.

We can only clean up standard file descriptors after all initialization 
tasks have been done.  nbd_client_thread could still write error 
messages.  Your patch introduces a race.

>>   Also, the chdir can be moved earlier, after bdrv_open.
>
> There's no need to, afiacs.  We complete init process and
> enter main loop.  Chdir should be done befor entering main
> loop, the rest makes no difference (as long as the files
> we open will be accessible from cwd).

Yes, but I prefer to have the chdir done unconditionally as soon as 
possible.

Paolo

Michael Tokarev Jan. 15, 2012, 4:44 p.m. UTC | #4

On 15.01.2012 20:11, Paolo Bonzini wrote:
> On 01/15/2012 01:50 PM, Michael Tokarev wrote:
>> On 15.01.2012 14:42, Paolo Bonzini wrote:
>>> On 01/14/2012 01:39 PM, Michael Tokarev wrote:
>>>>            if (pid == 0) {
>>>> -            close(stderr_fd[0]);
>>>> -            ret = qemu_daemon(0, 0);
>>>> -
>>>> -            /* Temporarily redirect stderr to the parent's pipe...  */
>>>> -            dup2(stderr_fd[1], STDERR_FILENO);
>>>> -            if (ret == -1) {
>>>> +            int nullfd = open("/dev/null", O_RDWR);
>>>> +            if (nullfd<   0 || setsid()<   0) {
>>>>                    err(EXIT_FAILURE, "Failed to daemonize");
>>>>                }
>>>
>>> This is forking only once.
>>
>> Is it good or bad?  There's no need to fork twice.  Second
>> fork (to the one which is already done in daemon(3)) has
>> been done to work around lack of proper communication between
>> parent and child in case of using plain daemon(3).  I.e., due
>> to daemon(3) interface being unflexible/unsuitable for the
>> current use case.
> 
> daemon(3) forks twice (so qemu-nbd is effectively forking three times, one of which is unnecessary).
> 
> See http://stackoverflow.com/questions/881388/what-is-the-reason-for-performing-a-double-fork-when-creating-a-daemon for why there is a fork before setsid and one after.

Daemon(3) on linux (glibc) does not try to fork twice, just
one time is sufficient.  Yes in old times there was some
portability issues on some unixes with controling terminal
and what not.  That thread summaries it up almost nicely at
the end: "So I suppose it all just boils down to tradition
in the end - a single fork is sufficient as long as the
parent dies in short order anyway," and "...think of the
setsid( ) call as the "new" way to do thing (disassociate
from the terminal) and the [second] fork( ) call after it
as redundancy to deal with the SVr4..."

[]
>>>> +             * stdout (temporarily) to the pipe to parent,
>>>
>>> This is a bit of a hack.
>>
>> There's another way -- to keep the writing pipe end in some
>> local variable and use that one instead of STDOUT_FILENO.
>> I can do it that way for sure, just thought it's already
>> using too much local variables.
> 
> Yes, that would be better.

Done in a v2 version I sent you.

>>>> +    /* now complete the daemonizing procedure.
>>>> +     */
>>>> +    if (device&&  !verbose) {
>>>> +        if (chdir("/")<  0) {
>>>> +            err(EXIT_FAILURE, "unable to chdir to /");
>>>> +        }
>>>> +        /* this redirects stderr to /dev/null */
>>>> +        dup2(STDIN_FILENO, STDERR_FILENO);
>>>> +        /* this redirects stdout to /dev/null too, and closes parent pipe */
>>>> +        dup2(STDIN_FILENO, STDOUT_FILENO);
>>>> +    }
>>>> +
>>>
>>> Half of this is already done in client_thread, and that would be
>>> theplace where you should add dup2(0, 1).

Um, I missed that "half of this" part.  Indeed, nbd_client_thread()
does dup2(STDOUT_FILENO, STDERR_FILENO) which should go away, but
it is harmless for now, and can be addressed in a separate patch.

>> I partly disagree.
>>
>> I wanted to de-couple -c (device) case with daemonizing.
>> client_thread only works in -c case, but daemonizing in
>> that case is wrong as I already pointed out in another
>> email - we should either stop daemonizing here at all
>> or have a separate option for it.
> 
> We can only clean up standard file descriptors after all initialization tasks have been done.  nbd_client_thread could still write error messages.  Your patch introduces a race.

Please elaborate where the race is.  Do you mean one
thread can write error message while another at the
same time is closing the filedescriptor in question, --
that race?  We're doomed anyway, and it is even good
we've a small remote chance for our error message to
be seen.  Currently it just goes to /dev/null.

>>>   Also, the chdir can be moved earlier, after bdrv_open.
>>
>> There's no need to, afiacs.  We complete init process and
>> enter main loop.  Chdir should be done befor entering main
>> loop, the rest makes no difference (as long as the files
>> we open will be accessible from cwd).
> 
> Yes, but I prefer to have the chdir done unconditionally as soon as possible.

That's not a bad intention.  I'm fixing existing logic without
introducing new logical changes.  If you want to fix other
stuff, it is better be done in a separate commit/change.

Thanks,

/mjt

Paolo Bonzini Jan. 15, 2012, 5:31 p.m. UTC | #5

On 01/15/2012 05:44 PM, Michael Tokarev wrote:
>>>>> +             * stdout (temporarily) to the pipe to parent,
>>>>
>>>> This is a bit of a hack.
>>>
>>> There's another way -- to keep the writing pipe end in some
>>> local variable and use that one instead of STDOUT_FILENO.
>>> I can do it that way for sure, just thought it's already
>>> using too much local variables.
>>
>> Yes, that would be better.
>
> Done in a v2 version I sent you.

Please stay on the list.

>>>>> +    /* now complete the daemonizing procedure.
>>>>> +     */
>>>>> +    if (device&&   !verbose) {
>>>>> +        if (chdir("/")<   0) {
>>>>> +            err(EXIT_FAILURE, "unable to chdir to /");
>>>>> +        }
>>>>> +        /* this redirects stderr to /dev/null */
>>>>> +        dup2(STDIN_FILENO, STDERR_FILENO);
>>>>> +        /* this redirects stdout to /dev/null too, and closes parent pipe */
>>>>> +        dup2(STDIN_FILENO, STDOUT_FILENO);
>>>>> +    }
>>>>> +
>>>>
>>>> Half of this is already done in client_thread, and that would be
>>>> theplace where you should add dup2(0, 1).
>
> Um, I missed that "half of this" part.  Indeed, nbd_client_thread()
> does dup2(STDOUT_FILENO, STDERR_FILENO) which should go away, but
> it is harmless for now, and can be addressed in a separate patch.

Again, _the client thread_ is the right place to do this!  See below.

>>> I partly disagree.
>>>
>>> I wanted to de-couple -c (device) case with daemonizing.
>>> client_thread only works in -c case, but daemonizing in
>>> that case is wrong as I already pointed out in another
>>> email - we should either stop daemonizing here at all
>>> or have a separate option for it.
>>
>> We can only clean up standard file descriptors after all
>> initialization tasks have been done. nbd_client_thread could still write
>> error messages. Your patch introduces a race.
>
> Please elaborate where the race is.  Do you mean one
> thread can write error message while another at the
> same time is closing the filedescriptor in question, --
> that race?

Yes.

> We're doomed anyway, and it is even good
> we've a small remote chance for our error message to
> be seen.  Currently it just goes to /dev/null.

No, currently it is sent from the daemon to the parent through the pipe, 
the parent prints it and exits with status code 1.  With your patch, if 
the dup2 wins the race you exit with status code 0; if the client thread 
wins the race it is the same as master.

> That's not a bad intention.  I'm fixing existing logic without
> introducing new logical changes.  If you want to fix other
> stuff, it is better be done in a separate commit/change.

AFAIK the only known bug (besides the devfd/sockfd mixup) is the missing 
chdir, and that should be fixed first.

Paolo

Paolo Bonzini Jan. 15, 2012, 5:46 p.m. UTC | #6

On 01/15/2012 06:31 PM, Paolo Bonzini wrote:
>
>
>> We're doomed anyway, and it is even good
>> we've a small remote chance for our error message to
>> be seen.  Currently it just goes to /dev/null.
>
> No, currently it is sent from the daemon to the parent through the pipe,
> the parent prints it and exits with status code 1.  With your patch, if
> the dup2 wins the race you exit with status code 0; if the client thread
> wins the race it is the same as master.

Actually, the dup2 will always win the race.  Until the main loop starts 
and accepts the connection from the client thread, the client thread 
will be stuck connect()ing to the server socket.  So, the client thread 
will never be able to report problems connecting /dev/nbd (for example 
you won't get an error if you chose a device that is already busy).  So 
it looks like there is no race, but there is a bug. :)

Please disprove me if I'm wrong, of course.

Paolo

Michael Tokarev Jan. 16, 2012, 7:22 a.m. UTC | #7

On 15.01.2012 21:31, Paolo Bonzini wrote:
> On 01/15/2012 05:44 PM, Michael Tokarev wrote:
>>>>>> +             * stdout (temporarily) to the pipe to parent,
>>>>>
>>>>> This is a bit of a hack.
>>>>
>>>> There's another way -- to keep the writing pipe end in some
>>>> local variable and use that one instead of STDOUT_FILENO.
>>>> I can do it that way for sure, just thought it's already
>>>> using too much local variables.
>>>
>>> Yes, that would be better.
>>
>> Done in a v2 version I sent you.
> 
> Please stay on the list.

Sorry?  I sent it to you and to the list, here's the command
line from my .bash_history:

 git format-patch --subject-prefix="PATCH v2" --stdout --to 'qemu-devel@nongnu.org' --cc "Paolo Bonzini <pbonzini@redhat.com>" --cc mjt@tls.msk.ru HEAD^ | /usr/sbin/sendmail -t -i

On which list I shoult stay?

[]
>> Um, I missed that "half of this" part.  Indeed, nbd_client_thread()
>> does dup2(STDOUT_FILENO, STDERR_FILENO) which should go away, but
>> it is harmless for now, and can be addressed in a separate patch.
> 
> Again, _the client thread_ is the right place to do this!  See below.

[]
>> We're doomed anyway, and it is even good
>> we've a small remote chance for our error message to
>> be seen.  Currently it just goes to /dev/null.
> 
> No, currently it is sent from the daemon to the parent through the pipe, the parent prints it and exits with status code 1.  With your patch, if the dup2 wins the race you exit with status code 0; if the client thread wins the race it is the same as master.

Aha. I finally see what you mean.

I still disagree, -- all the operations done in the client
thread can be done before forking a new thread, syncronously,
and _that_ will be the easiest and cleanest solution here.

>> That's not a bad intention.  I'm fixing existing logic without
>> introducing new logical changes.  If you want to fix other
>> stuff, it is better be done in a separate commit/change.
> 
> AFAIK the only known bug (besides the devfd/sockfd mixup) is the missing chdir, and that should be fixed first.

It all looked so ugly to me that I didn't even want to think
about just adding a chdir() instead of getting rid of daemon().
But ok, I can go that ugly route too.

Thanks,

/mjt

Paolo Bonzini Jan. 16, 2012, 7:41 a.m. UTC | #8

On 01/16/2012 08:22 AM, Michael Tokarev wrote:
> On 15.01.2012 21:31, Paolo Bonzini wrote:
>> On 01/15/2012 05:44 PM, Michael Tokarev wrote:
>>>>>>> +             * stdout (temporarily) to the pipe to parent,
>>>>>>
>>>>>> This is a bit of a hack.
>>>>>
>>>>> There's another way -- to keep the writing pipe end in some
>>>>> local variable and use that one instead of STDOUT_FILENO.
>>>>> I can do it that way for sure, just thought it's already
>>>>> using too much local variables.
>>>>
>>>> Yes, that would be better.
>>>
>>> Done in a v2 version I sent you.
>>
>> Please stay on the list.
>
> Sorry?  I sent it to you and to the list, here's the command
> line from my .bash_history:
>
>   git format-patch --subject-prefix="PATCH v2" --stdout --to 'qemu-devel@nongnu.org' --cc "Paolo Bonzini<pbonzini@redhat.com>" --cc mjt@tls.msk.ru HEAD^ | /usr/sbin/sendmail -t -i
>
> On which list I shoult stay?

Ah, sorry, I don't connect to the VPN usually during the weekend so I 
thought it was sent privately.

> I still disagree, -- all the operations done in the client
> thread can be done before forking a new thread, syncronously,
> and _that_ will be the easiest and cleanest solution here.

It's a chicken-and-egg problem.  To connect a socket to an NBD device, 
you need to have negotiated with the server already, so the socket must 
be connected already.  Since the other side of the socket is also 
handled by qemu-nbd, you need threads.

It's true that in principle you could use the main loop and do the 
NBD_DO_IT in a new thread.  However, qemu-sockets.c does not support 
asynchronous connect(2).  You could run that code already inside the 
main loop, for example in a coroutine, but then patches pile up quickly.

>>> That's not a bad intention.  I'm fixing existing logic without
>>> introducing new logical changes.  If you want to fix other
>>> stuff, it is better be done in a separate commit/change.
>>
>> AFAIK the only known bug (besides the devfd/sockfd mixup) is the
>> missing chdir, and that should be fixed first.
>
> It all looked so ugly to me that I didn't even want to think
> about just adding a chdir() instead of getting rid of daemon().
> But ok, I can go that ugly route too.

I'm not saying it's particularly pretty, but it's also not as easy to 
improve as you'd think.  There are a lot of constraints (fork/daemonize 
before initializing the block layer, report errors properly, avoid 
races, ...).

Paolo

rework daemonizing logic in qemu-nbd

Commit Message

Comments

Patch