unicorn.git - Rack HTTP server for Unix and fast clients

Date	Commit message (Collapse)
2009-06-25	tee_input: Don't expose the @rd object as a return value
	Pay a performance penalty and always proxy reads through our TeeInput object to ensure nobody closes our internal reader.
2009-06-10	Optimize body-less GET/HEAD requests (again)
	No point in making syscalls to deal with empty bodies. Reinstate usage of the NULL_IO object which allows us to avoid allocating new objects.
2009-06-09	Avoid duplicating the "Z" constant
	Trying not to repeat ourselves. Unfortunately, Ruby 1.9 forces us to actually care about encodings of arbitrary byte sequences.
2009-06-06	Put copyright text in new files, include GPL2 text
	Just clarifying the license terms of the new code. Other files should really have this notice in there as well.
2009-06-06	Unicorn::App::Inetd: reinventing Unix, poorly :)
	This includes an example of tunneling the git protocol inside a TE:chunked HTTP request. The example is unfortunately contrived in that it relies on the custom examples/cat-chunk-proxy.rb script in the client. My initial wish was to have a generic tool like curl(1) operate like this: cat > ~/bin/cat-chunk-proxy.sh <<EOF #!/bin/sh exec curl -sfNT- http://$1:$2/ EOF chmod +x ~/bin/cat-chunk-proxy.sh GIT_PROXY_COMMAND=cat-chunk-proxy.sh git clone git://0:8080/foo Unfortunately, curl will attempt a blocking read on stdin before reading the TCP socket; causing the git-clone consumer to starve. This does not appear to be a problem with the new server code for handling chunked requests.
2009-06-05	Transfer-Encoding: chunked streaming input support
	This adds support for handling POST/PUT request bodies sent with chunked transfer encodings ("Transfer-Encoding: chunked"). Attention has been paid to ensure that a client cannot OOM us by sending an extremely large chunk. This implementation is pure Ruby as the Ragel-based implementation in rfuzz didn't offer a streaming interface. It should be reasonably close to RFC-compliant but please test it in an attempt to break it. The more interesting part is the ability to stream data to the hosted Rack application as it is being transferred to the server. This can be done regardless if the input is chunked or not, enabling the streaming of POST/PUT bodies can allow the hosted Rack application to process input as it receives it. See examples/echo.ru for an example echo server over HTTP. Enabling streaming also allows Rack applications to support upload progress monitoring previously supported by Mongrel handlers. Since Rack specifies that the input needs to be rewindable, this input is written to a temporary file (a la tee(1)) as it is streamed to the application the first time. Subsequent rewinded reads will read from the temporary file instead of the socket. Streaming input to the application is disabled by default since applications may not necessarily read the entire input body before returning. Since this is a completely new feature we've never seen in any Ruby HTTP application server before, we're taking the safe route by leaving it disabled by default. Enabling this can only be done globally by changing the Unicorn HttpRequest::DEFAULTS hash: Unicorn::HttpRequest::DEFAULTS["unicorn.stream_input"] = true Similarly, a Rack application can check if streaming input is enabled by checking the value of the "unicorn.stream_input" key in the environment hashed passed to it. All of this code has only been lightly tested and test coverage is lacking at the moment. [1] - http://tools.ietf.org/html/rfc2616#section-3.6.1
2009-06-05	http_request: fix typo for 1.9

2009-05-31	http_request: StringIO is binary for empty bodies (1.9)

2009-05-30	http_request: no need to reset the request
	That method no longer exists, but Ruby would never know until it tried to run it. Yes, I miss my compiled languages.
2009-05-28	unicorn 0.8.1 v0.8.1

2009-05-28	Consistent logger assignment for multiple objects
	Bah, it's so much busy work to deal with this as configuration option. Maybe I should say we allow any logger the user wants, as long as it's $stderr :P
2009-05-28	Avoid instance variables lookups in a critical path
	Make us look even better in "Hello World" benchmarks! Passing a third parameter to avoid the constant lookup for the HttpRequest object doesn't seem to have a measurable effect.
2009-05-28	Make our HttpRequest object a global constant
	This should be faster/cheaper than using an instance variable since it's accessed in a critical code path. Unicorn was never designed to be reentrant or thread-safe at all, either.
2009-05-28	SIGHUP reloads app even if preload_app is true
	This makes SIGHUP handling more consistent across different configurations, and allows togging preload_app to take effect when SIGHUP is issued.
2009-05-28	Fix potential race condition in timeout handling
	There is a potential race condition in closing the tempfile immediately after SIGKILL-ing the child. So instead just close it when we reap the dead child. Some some versions of Ruby may also not like having the WORKERS hash being changed while it is iterating through each_pair, so dup the hash to be on the safe side (should be cheap, since it's not a deep copy) since kill_worker() can modify the WORKERS hash. This is somewhat of a shotgun fix as I can't reproduce the problem consistently, but oh well.
2009-05-26	unicorn 0.8.0 v0.8.0

2009-05-25	Switch to autoload to defer requires
	This should prevent Rack from being required too early on so "-I" being passed through the unicorn command-line can modify $LOAD_PATH for Rack
2009-05-25	Only refresh the gem list when building the app
	No point in refreshing the list of gems unless the app can actually be reloaded.
2009-05-25	Refresh Gem list when building the app
	On application reloads, Gems installed by the Ruby instance may be different than when Unicorn started. So when preload_app is false, HUP-ing Unicorn will always refresh the list of Gems before loading the application code.
2009-05-22	Merge commit 'v0.7.1'
	* commit 'v0.7.1': unicorn 0.7.1 Conflicts: lib/unicorn/const.rb
2009-05-22	unicorn 0.7.1 v0.7.1

2009-05-22	http_response: allow string status codes
	Rack::Lint says they just have to work when to_i is called on the status, so that's what we'll do.
2009-05-22	Enforce minimum timeout at 3 seconds
	2 seconds is still prone to race conditions under high load. We're intentionally less accurate than we could be in order to reduce syscall and method dispatch overhead.
2009-05-22	configurator: fix rdoc formatting

2009-05-22	Preserve 1.9 IO encodings in reopen_logs
	Ensure we preserve both internal and external encodings when reopening logs.
2009-05-22	Fix a warning about @pid being uninitialized

2009-05-22	Ignore unhandled master signals in the workers
	This makes it easier to use "killall -$SIGNAL unicorn" without having to lookup the correct PID.
2009-05-22	Safer timeout handling and test case
	Timeouts of less than 2 seconds are unsafe due to the lack of subsecond resolution in most POSIX filesystems. This is the trade-off for using a low-complexity solution for timeouts. Since this type of timeout is a last resort; 2 seconds is not entirely unreasonable IMNSHO. Additionally, timing out too aggressively can put us in a fork loop and slow down the system. Of course, the default is 60 seconds and most people do not bother to change it.
2009-05-22	app/old_rails: correctly log errors in output
	"out" was an invalid variable in that context...
2009-05-22	app/exec_cgi: GC prevention
	Don't allow newly created IO objects to get GC'ed and subsequently close(2)-ed. We're not reopening the {$std,STD}{in,out,err} variables since those can't be trusted to have fileno 1, 2 and 3 respectively.
2009-05-13	privatize constants only used by old_rails/static
	Unicorn proper no longer needs these constants, so don't bother with them.
2009-05-13	http_response: allow string status codes
	Rack::Lint says they just have to work when to_i is called on the status, so that's what we'll do.
2009-05-13	Require Rack for HTTP Status codes
	Preventing needless duplication since Rack already has these codes for us. Also, put the status codes in HttpResponse since nothing else needs (or should need) them.
2009-05-12	Reopen master logs on SIGHUP, too
	There may be other logs opened if preload_app is true besides stderr/stdout paths. So err on the safe side and reopen everything.
2009-05-11	exec_cgi: don't assume the body#each consumer is a socket
	If we're using middleware that pushes the body into an array, bad things will happen if we're clobbering the string for each iteration of body#each.
2009-05-11	HttpRequest::DEF_PARAMS => HttpRequest::DEFAULTS
	Give this a more palatable name and unfreeze it, allowing users to modify it more easily.
2009-05-10	Avoid killing sleeping workers
	This used to happen on machines that were coming out of suspend/hibernation.
2009-05-10	Enforce minimum timeout at 3 seconds
	2 seconds is still prone to race conditions under high load. We're intentionally less accurate than we could be in order to reduce syscall and method dispatch overhead.
2009-05-10	app/exec_cgi: use explicit buffers for read/sysread
	This reduces garbage generation to improve performance. Rack 1.0 allows InputWrapper to read with an explicit buffer.
2009-05-10	http_request: use Rack::InputWrapper-compatible methods
	This allows alternative I/O implementations to be easier to use with Unicorn...
2009-05-10	configurator: fix rdoc formatting

2009-05-04	Preserve 1.9 IO encodings in reopen_logs
	Ensure we preserve both internal and external encodings when reopening logs.
2009-05-04	Inline and remove the HttpRequest#reset method
	These potentially leaves an open file handle around until the next request hits the process, but this makes the common case faster.
2009-05-04	Fix a warning about @pid being uninitialized

2009-05-03	Speed up the worker accept loop
	First, reduce no-op fchmod syscalls under heavy traffic. gettimeofday(2) is a cheaper syscall than fchmod(2). Since ctime resolution is only in seconds on most filesystems (and Ruby can only get to seconds AFAIK), we can avoid fchmod(2) happening within the same second. This allows us to cheat on synthetic benchmarks where performance is measured in requests-per-second and not seconds-per-request :) Secondly, cleanup the acceptor loop and avoid nested begins/loops as much as possible. If we got ECONNABORTED, then there's no way the client variable would've been set correctly, either. If there was something there, then it is at the mercy of the garbage collector because a method can't both return a value and raise an exception.
2009-05-03	Instant shutdown signals really mean instant shutdown
	Use SIGQUIT if you're going to be nice and do graceful shutdowns. Sometimes people run real applications on this server and SIGINT/SIGTERM get lost/trapped when Object is rescued and that is not good. Also make sure we break out of the loop properly when the master is dead. Testcases added for both SIGINT and dead master handling.
2009-05-04	Remove redundant socket closing/checking
	If our response succeeds, we've already closed the socket. Otherwise, we would've raised an exception at some point hit one of the rescue clauses.
2009-05-03	Ignore unhandled master signals in the workers
	This makes it easier to use "killall -$SIGNAL unicorn" without having to lookup the correct PID.
2009-05-03	Safer timeout handling and test case
	Timeouts of less than 2 seconds are unsafe due to the lack of subsecond resolution in most POSIX filesystems. This is the trade-off for using a low-complexity solution for timeouts. Since this type of timeout is a last resort; 2 seconds is not entirely unreasonable IMNSHO. Additionally, timing out too aggressively can put us in a fork loop and slow down the system. Of course, the default is 60 seconds and most people do not bother to change it.
2009-05-03	No point in unsetting the O_NONBLOCK flag
	Since we've switched to readpartial, we'll already be protected from any unpleasant errors that might get thrown at us. There's no easy way to prevent MRI from calling a select() internally to check for readiness, so speculative+blocking read() calls are out already. Additionally, most requests come in the form of GETs which are fully-buffered in the kernel before we even accept() the socket; so a single readpartial call will be enough to fully consume it.