Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

stdout/err buffer overflow behaviour #77

Open
huntc opened this issue Aug 21, 2017 · 2 comments
Open

stdout/err buffer overflow behaviour #77

huntc opened this issue Aug 21, 2017 · 2 comments

Comments

@huntc
Copy link

huntc commented Aug 21, 2017

I've noticed that for Posix, if the stdout buffer becomes full then a RuntimeException("stdout buffer has no bytes remaining") is thrown (https://github.com/brettwooldridge/NuProcess/blob/master/src/main/java/com/zaxxer/nuprocess/internal/BasePosixProcess.java#L483). I've observed that this exception is caught at https://github.com/brettwooldridge/NuProcess/blob/master/src/main/java/com/zaxxer/nuprocess/internal/BaseEventProcessor.java#L83 whereupon the process is then declared as stopped.

My understanding from the API doc is that it is fine for a stdout handler to return the buffer that it was passed in the event that it is not in a position to consume it and that the stdout handler would then be called at a later time. My expectation then was that the upstream process would block until it could write more to stdout.

I was hoping that you could confirm the actual design. It seems that stdout handlers must always consume in order to avoid seeing that the process is represented as one that has stopped.

The same behaviour applies to stderr. Thanks.

Note that #53 is related, although that focuses more on a solution. This issue pertains to unexpected behaviour in accordance with the existing API doc.

huntc added a commit to huntc/akka-contrib-extra that referenced this issue Aug 21, 2017
I had previously thought that NuProcess handled back-pressure. As per brettwooldridge/NuProcess#77, it doesn't appear so. Hence, I'm reverting the code to use a Source.queue and managing a 64KiB buffer directly.

If NuProcess ever does handle back-pressure then the PublishIfAvailable source could make a come-back. It was working beautifully. :-)
huntc added a commit to huntc/akka-contrib-extra that referenced this issue Aug 22, 2017
I had previously thought that NuProcess handled back-pressure. As per brettwooldridge/NuProcess#77, it doesn't appear so. Hence, I'm reverting the code to use a Source.queue and managing a 64KiB buffer directly.

If NuProcess ever does handle back-pressure then the PublishIfAvailable source could make a come-back. It was working beautifully. :-)
huntc added a commit to huntc/akka-contrib-extra that referenced this issue Aug 22, 2017
I had previously thought that NuProcess handled back-pressure. As per brettwooldridge/NuProcess#77, it doesn't appear so. Hence, I'm reverting the code to use a Source.queue and managing a 64KiB buffer directly.

If NuProcess ever does handle back-pressure then the PublishIfAvailable source could make a come-back. It was working beautifully. :-)
@brettwooldridge
Copy link
Owner

@huntc Currently, the handler must consume all data. There is no back pressure. There is work to implement back pressure on the streams branch, but AFAIR it is unfinished.

@bturner
Copy link
Collaborator

bturner commented Sep 26, 2020

My understanding from the API doc is that it is fine for a stdout handler to return the buffer that it was passed in the event that it is not in a position to consume it and that the stdout handler would then be called at a later time.

I can see how one might read that into the NuProcessHandler Javadocs, but that's not how it works. You can return from the callback with some data still in the buffer, but you can't return with the buffer full. Consider a split multibyte UTF-8 character, for example. If you get the first 2 bytes of a 4 byte character, you might leave those 2 bytes in the buffer for the next callback. That sort of leftover data leaves plenty of room in the buffer for new data, though.

The problem with "would then be called at a later time" is: When would that "later time" be, and how would NuProcess know when it is? Currently it's listening for available output, and when there is some it calls back. If the handler doesn't consume that output, though, and the buffer fills up, then NuProcess:

  • Can't listen for more output, because it'd end up in a busy loop (we know there's data available already)
  • Can't buffer any more data, because there's nowhere to put it

NuProcess would need to grow something that's similar to the existing wantWrite that's used to register (or re-register) an interest in a stdin window, where, if your onStdout or onStderr returned with the buffer full, NuProcess would essentially conclude you're not interested in that stream right now and wait for you to signal you are before triggering any more callbacks.

The streams branch starts down one possible approach for this, but it requires a pretty comprehensive rewrite of NuProcess's public API (which would mean a 3.0 major release).

This may be something I, or some colleagues on Bitbucket Server, end up circling back to, though, because eventually we'd very much like to have some backpressure in NuProcess. Currently we implement it "around" NuProcess by using the NuProcessBuilder.run() introduced in 2.0.0 to pump processes directly on a calling thread, rather than the fixed background pool used by NuProcessBuilder.start(). That allows us to safely block in NuProcessHandler callbacks like onStderr and onStdout--effectively applying backpressure. That's how, for example, we've integrated NuProcess with gRPC, Apache SSHD and Tomcat.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants