
Hypnotoad / Prefork dump cores on small values of accepts and disabled keep-alive #1449

Closed
melhesedek opened this issue Dec 11, 2019 · 13 comments


@melhesedek

melhesedek commented Dec 11, 2019

  • Mojolicious version: 8.27
  • Perl version: 5.30.1
  • Operating system: Linux 5.3.13-arch1-1

Steps to reproduce the behavior

This is the TL;DR version; scroll down and see the attached archive for code and more details.
I run a simple synchronous server

get '/' => sub {
    usleep 1_000_000 * rand;
    shift->render(text => "i'm ok");
};

with a command

./server.pl prefork --listen 'http://*:3002' \
  --accepts 1 \
  --clients 1 \
  --requests 1 \
  --workers 10

and a client in a second terminal to generate some load.

./client.pl

Expected behavior

I expect each worker process to exit normally after it served one request.

Actual behavior

Eventually a core dump of a worker process is generated. gdb shows Program terminated with signal SIGQUIT, Quit.

Setting up a clean environment

I use a Linux box, plenv to compile a recent Perl version with debug symbols, and cpanm to install Mojolicious locally.

plenv install --as=5.30.1-debugging 5.30.1 -DDEBUGGING
plenv local 5.30.1-debugging
plenv install-cpanm
cpanm -L local Mojolicious@8.27 --quiet

Then make sure core dumps will end up in the current working directory (you may want to back up the previous value, but it won't persist across system boots). This step isn't strictly necessary, but the client.pl and gdb commands here assume it's done.

echo 'core' | sudo tee /proc/sys/kernel/core_pattern

Start the server

PERL5LIB=local/lib/perl5 ./server.pl prefork --listen 'http://*:3002' \
  --accepts 1 \
  --clients 1 \
  --requests 1 \
  --workers 10

and the client from another terminal.

PERL5LIB=local/lib/perl5 ./client.pl

It usually takes up to 15 minutes to reproduce on my Intel i5-2500K box. After the client terminates, there should be 1 to 3 core.<pid> files in the current working directory.

If a core dump was generated by e.g. a process with pid 33156, its backtrace can be inspected with the following command:

gdb -ex bt ~/.plenv/versions/5.30.1-debugging/bin/perl core.33156

To observe signal-related behaviour, I run the server via strace.
It generates a lot of traces/trace.<pid> files, which can take up some disk space.

mkdir traces
PERL5LIB=local/lib/perl5 strace -o traces/trace -ff -e signal,write -s 128 -tt -- ./server.pl prefork \
  --listen 'http://*:3002' \
  --accepts 1 \
  --clients 1 \
  --requests 1 \
  --workers 10

Dumps and traces

According to the backtraces, processes usually terminate inside the Perl_pp_exit or perl_destruct functions. In the syscall traces of dumped processes, SIGQUIT arrives some microseconds after the SIGQUIT handler is restored to SIG_DFL (the default).

I assume there's a race condition due to the time gap between Perl resetting the signal handlers and the process terminating.

Potential fix

See server-fixed.pl in the attached archive. This seems to do the trick, but a race condition still remains (AFAIK a signal may arrive between rt_sigaction setting the SIGQUIT handler to SIG_DFL and rt_sigprocmask blocking the process from receiving SIGQUIT).

app->hook(before_server_start => sub {
	my ($server, $app) = @_;

	my $sigset = POSIX::SigSet->new(SIGQUIT);
	$server->ioloop->on(finish => sub {
		my $loop = shift;

		sigprocmask(SIG_BLOCK, $sigset) // die "Could not block SIGQUIT\n";
	});
});
@jhthorsen
Member

When is this a problem in the real world?

@melhesedek
Author

When your app does something memory-intensive but not very often, so you lower the accepts value to release memory back to the OS. AFAIK this (or memory leakage) is the primary use case for the similar setting in other web servers, e.g. httpd's MaxConnectionsPerChild.

In my case, setting accepts to 10 resulted in several dozen core dumps a day at a modest 2 RPS.

@stale

stale bot commented Aug 30, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Aug 30, 2020
@kraih kraih removed the stale label Nov 5, 2020
@kraih
Member

kraih commented Feb 10, 2021

The problem seems too obscure for us to fix in Mojolicious.

@kraih kraih closed this as completed Feb 10, 2021
@oschwald

oschwald commented Sep 24, 2021

We see hundreds of similar core dumps a week in production with accepts set to 100 and requests set to 70. Although it isn't a significant issue, it does produce many spurious logs that must be filtered out when monitoring for real issues.

@ksmadsen

We see the same issue in our production. At this point, our conclusion is that when exit is called, Perl resets the signal handlers early in the process, before it actually exits. This appears to happen even before the END blocks are executed. So if a worker is executing an END block while shutting down and the parent process sends it the QUIT signal, the worker process will core dump.

That Perl resets the signal handlers before END is executed is illustrated by the following short script:

$SIG{QUIT} = sub {};
kill 'QUIT', $$;
print "After first quit\n";

END {
    kill 'QUIT', $$;
    print "After second quit\n";
}

Running this program will only cause the first print statement to be executed, as the program will core dump before reaching the second statement (tested on Perl 5.34.0).

It seems that only handlers are reset. If sub {} is swapped out for 'IGNORE', the print statement in the END block is also executed.

I'm not familiar with the internals of the Perl interpreter, but it seems plausible that it has to reset the signal handlers before exiting, leaving a small window open for a race between exiting and receiving a QUIT signal from the parent.

@kiwiroy
Contributor

kiwiroy commented Oct 29, 2021

Localizing the signal handler in the END block results in the second kill being handled.
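
To illustrate, here is @ksmadsen's earlier test script with the handler localized in the END block (a sketch based on this comment, not code posted in the thread):

$SIG{QUIT} = sub {};
kill 'QUIT', $$;
print "After first quit\n";

END {
    # Reinstall a handler for the duration of the END block
    local $SIG{QUIT} = sub {};
    kill 'QUIT', $$;
    print "After second quit\n";
}

With this change both print statements should run, since a handler is in place again by the time the second kill fires.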

@ksmadsen

I can't see that localizing the signal handler in the END block eliminates the race completely. We can see that Perl resets the signal handlers before the END blocks are executed, so the QUIT signal could still arrive after the handlers are reset and before the END block that reinstalls the handler is executed.

AFAICT the potential fix with the before_server_start hook in the original post eliminates the race. A "simpler" solution would be to add:

$SIG{QUIT} = 'IGNORE';

between $loop->start; and exit 0; in Mojo::Server::Prefork::_spawn (we are already exiting, so there's no need to know that we should do it gracefully; that wouldn't have any effect anyway).

I would submit a pull request for this, but I haven't yet wrapped my head around how to write a test that ensures it actually works.
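
For illustration, a sketch of where that line would sit; the surrounding code is paraphrased from the shape of Mojo::Server::Prefork::_spawn, not copied from the actual source:

# End of the worker branch in Mojo::Server::Prefork::_spawn (paraphrased)
$loop->start;

# Proposed addition: ignore SIGQUIT from here on; the worker is already
# exiting, and Perl will reset custom handlers before exit completes
$SIG{QUIT} = 'IGNORE';

exit 0;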

@jixam
Contributor

jixam commented Oct 30, 2021

@kraih Please consider reopening. We have accepts at 1000 but this issue actually caused an outage for us when Ubuntu started storing core dumps in an unexpected location after a routine apt upgrade.

Here is a perhaps less obscure repro, which dumps core for me about 50% of the time:

Server (using strace in case core dumps are disabled):

strace -e kill perl -Mojo -E 'get "/" => {inline => "%= sleep 5"}; app->start' prefork -a 1

Client:

while sleep 1; do curl http://127.0.0.1:3000/; done

The trick is doing enough work (sleep 5) that a heartbeat happens while the IO loop is shutting down after reaching its maximum number of accepted connections (i.e. max_accepts). The manager can then end up sending SIGQUIT after the worker has unregistered its handler during exit().

@jixam
Contributor

jixam commented Nov 1, 2021

It turns out the Prefork signal handlers are local, so an effective workaround is to globally ignore SIGQUIT before startup. As @ksmadsen has shown, this will also ignore the signals during exit().

My test server then becomes:

strace -e kill perl -Mojo -E '$SIG{QUIT} = "IGNORE"; get "/" => {inline => "%= sleep 5"}; app->start' prefork -a 1

which never core dumps for me.

I still think a change is warranted so that all users do not have to know about such a workaround.

@mschout

mschout commented Mar 1, 2022

This bit us hard, eating up huge amounts of disk space once we lowered accepts.

@kraih
Member

kraih commented Mar 1, 2022

There's a possible fix; unfortunately, nobody has reviewed it yet.

@jixam
Contributor

jixam commented Mar 2, 2022

FWIW, this simple workaround has completely solved the problem for us (this is also what I discussed above):

BEGIN {
    # Protect against stray signals from the process manager
    $SIG{QUIT} = 'IGNORE';
}

PR #1883 ensures that the stray signals never happen, making the workaround unnecessary.
