unicorn.git - Rack HTTP server for Unix and fast clients

Date	Commit message (Collapse)
2009-06-29	test_upload: add tests for chunked encoding
	Additionally, provide verifications for sizes after-the-fact to avoid slamming all of our input into the server.
2009-06-10	Optimize body-less GET/HEAD requests (again)
	No point in making syscalls to deal with empty bodies. Reinstate usage of the NULL_IO object which allows us to avoid allocating new objects.
2009-06-05	Transfer-Encoding: chunked streaming input support
	This adds support for handling POST/PUT request bodies sent with chunked transfer encodings ("Transfer-Encoding: chunked"). Attention has been paid to ensure that a client cannot OOM us by sending an extremely large chunk. This implementation is pure Ruby as the Ragel-based implementation in rfuzz didn't offer a streaming interface. It should be reasonably close to RFC-compliant but please test it in an attempt to break it. The more interesting part is the ability to stream data to the hosted Rack application as it is being transferred to the server. This can be done regardless if the input is chunked or not, enabling the streaming of POST/PUT bodies can allow the hosted Rack application to process input as it receives it. See examples/echo.ru for an example echo server over HTTP. Enabling streaming also allows Rack applications to support upload progress monitoring previously supported by Mongrel handlers. Since Rack specifies that the input needs to be rewindable, this input is written to a temporary file (a la tee(1)) as it is streamed to the application the first time. Subsequent rewinded reads will read from the temporary file instead of the socket. Streaming input to the application is disabled by default since applications may not necessarily read the entire input body before returning. Since this is a completely new feature we've never seen in any Ruby HTTP application server before, we're taking the safe route by leaving it disabled by default. Enabling this can only be done globally by changing the Unicorn HttpRequest::DEFAULTS hash: Unicorn::HttpRequest::DEFAULTS["unicorn.stream_input"] = true Similarly, a Rack application can check if streaming input is enabled by checking the value of the "unicorn.stream_input" key in the environment hashed passed to it. All of this code has only been lightly tested and test coverage is lacking at the moment. [1] - http://tools.ietf.org/html/rfc2616#section-3.6.1
2009-05-13	test_response: correct OFS test
	Must have multiple headers to test this effectively
2009-05-13	http_response: allow string status codes
	Rack::Lint says they just have to work when to_i is called on the status, so that's what we'll do.
2009-05-10	Enforce minimum timeout at 3 seconds
	2 seconds is still prone to race conditions under high load. We're intentionally less accurate than we could be in order to reduce syscall and method dispatch overhead.
2009-05-04	Preserve 1.9 IO encodings in reopen_logs
	Ensure we preserve both internal and external encodings when reopening logs.
2009-05-04	Inline and remove the HttpRequest#reset method
	These potentially leaves an open file handle around until the next request hits the process, but this makes the common case faster.
2009-05-04	test_signals: ready workers before connecting
	Otherwise there's a chance a child won't have a socket bound by the time we're trying to connect.
2009-05-03	Instant shutdown signals really mean instant shutdown
	Use SIGQUIT if you're going to be nice and do graceful shutdowns. Sometimes people run real applications on this server and SIGINT/SIGTERM get lost/trapped when Object is rescued and that is not good. Also make sure we break out of the loop properly when the master is dead. Testcases added for both SIGINT and dead master handling.
2009-05-03	Safer timeout handling and test case
	Timeouts of less than 2 seconds are unsafe due to the lack of subsecond resolution in most POSIX filesystems. This is the trade-off for using a low-complexity solution for timeouts. Since this type of timeout is a last resort; 2 seconds is not entirely unreasonable IMNSHO. Additionally, timing out too aggressively can put us in a fork loop and slow down the system. Of course, the default is 60 seconds and most people do not bother to change it.
2009-05-03	http_request: switch to readpartial over sysread
	readpartial is actually as low-level as sysread is, except it's less likely to throw exceptions and won't change the blocking/non-blocking status of a file descriptor (we explicitly enable blocking I/O)
2009-04-27	test_upload: still uncomfortable with 1.9 IO encoding...
	It seems most applications use buffered IO#read instead of IO#sysread. So make sure our encoding is set correctly for buffered IO#read applications, too.
2009-04-25	test_request: enable with Ruby 1.9 now Rack 1.0.0 is out

2009-04-24	configurator: "listen" directive more nginx-like
	The following specifications to bind port 8080 on all interfaces are now accepted in the configuration file: listen "8080" # (with quotes) listen 8080 # (without quotes)
2009-04-23	http_response: minor performance gains
	Avoid creating garbage every time we lookup the status code along with the message. Also, we can use global const arrays for a little extra performance because we only write one-at-a time Looking at MRI 1.8, Array#join with an empty string argument is slightly better because it skips an append for every iteration.
2009-04-23	test_socket_helper: disable GC for this test
	Otherwise the GC will unlink sockets. A better solution (purgatory?) may be needed...
2009-04-23	Fix data corruption with small uploads via browsers
	StringIO.new(partial_body) does not update the offset for new writes. So instead create the StringIO object and then syswrite to it and try to follow the same code path used by large uploads which use Tempfiles.
2009-04-21	test: empty port test for absolute URIs

2009-04-21	http11: support underscores in URI hostnames
	They aren't common, but apparently there exist URLs with them, so we'll support them.
2009-04-21	Stop extending core classes
	This removes the #unicorn_peeraddr methods from TCPSocket and UNIXSocket core classes. Instead, just move that logic into the only place it needs to be used in HttpRequest.
2009-04-21	http11: rfc2616 handling of absolute URIs
	We now parse the scheme, host and port from Absolute URIs and ignore them if the equivalents are specified in the other headers.
2009-04-21	http11: make parser obey HTTP_HOST with empty port
	This means "Host: foo-bar:" (trailing colon) will assume server_port is 80, not a blank string.
2009-04-21	HttpParser: set QUERY_STRING for Rack-compliance

2009-04-21	Move absolute URI parsing into HTTP parser
	It's part of the HTTP/1.1 (rfc2616), so we might as well handle it in there and set PATH_INFO while we're at it. Also, make "OPTIONS *" test not fail Rack::Lint
2009-04-16	test_upload: ensure StringIO objects are binary
	Sockets always return binary encoded data, so when StringIO.new(string) is called, that StringIO object inherits the encoding of the initial string it was created with. And yes, Ruby 1.9 still makes me seriously uncomfortable with I/O manipulation since the encoding layer does things behind my back. UNIX is (and should always be) just a bag of bytes! Signed-off-by: Eric Wong <normalperson@yhbt.net>
2009-04-16	Small garbage reduction in HttpResponse
	Avoid creating new string objects and then discarding them right away by stuffing non-constant but always-present headers into the initial output.
2009-04-16	ensure responses always have the "Status:" header
	There are weird (and possibly broken) clients out there that require it despite being present in the first line of the response. So be nice and accomodate them. Keep in mind that the Rack SPEC explicitly forbids this header from being in the headers returned by the Rack-based application; so we have to always inject it ourselves and ignore it if the application sets it.
2009-04-13	Expose worker to {before,after}_fork hooks
	Instead of just worker.nr. This is a configuration file/API change and will break existing configurations. This allows worker.tempfile to be exposed to the hooks so ownership changes can still happen on it. On the other hand, I don't know of many people actually using this feature (or Unicorn).
2009-04-12	test_http_parser: fix broken URL in comment
	This was back when I did s/mongrel/Unicorn/g on the sources.
2009-04-12	http11: cleanup+safer rack.url_scheme handling
	Avoid using strcmp() since it could break badly if Ruby ever stopped null-terminating strings C-style. We're also freezing "http" as a global. Rack does not explicitly permit nor deny this, and Mongrel has always used frozen strings as hash values in other places.
2009-04-10	listen backlog, sndbuf, rcvbuf are always changeable
	Apparently I was smoking crack and thought they weren't changeable. Additionally, SO_REUSEADDR is set by TCPServer.new, so there's no need to set it ourselves; so avoid putting extra items in the purgatory. This allows SIGHUP to change listen options.
2009-04-10	close listeners when removing them from our array
	This fixes a long-standing bug where listeners would be removed from the known listener set during a reload but never correctly shut down (until reexec). Additionally, test_server was working around this bug (my fault, subconciously) as teardown did not unbind the socket, requiring the tests to grab a new port.
2009-04-08	http11: handle "X-Forwarded-Proto: https"
	Pass "https" to "rack.url_scheme" if the X-Forwarded-Proto header matches "https". X-Forwarded-Proto is a semi-standard header that Ruby frameworks seem to respect; so we use that. We won't support ENV['HTTPS'] since that can only be set at start time and some app servers supporting https also support http. Currently, "rack.url_scheme" only allows "http" and "https", so we won't set anything else to avoid breaking Rack::Lint.
2009-04-08	test_request: tests esoteric/rare REQUEST_URIs
	* Test for '' in "OPTIONS HTTP/1.1" for now (even though Rack doesn't like it). * Some clients can send absolute URIs, too
2009-04-05	test_configurator: rename test name that never ran
	Run tests with warnings so we detect stupid things like this.
2009-04-05	Add test for :preload_app config option

2009-04-05	Enforce umask 0000 with UNIX domain sockets
	I can't think of a good reason to ever use restrictive permissions with UNIX domain sockets for an HTTP server. Since some folks run their nginx on port 80 and then have it drop permissions, we need to ensure our socket is readable and writable across the board. The reason I'm respecting the existing umask at all (instead of using 0000 across the board like most daemonizers) is because the admin may want to restrict access (especially write access) to log files.
2009-04-03	Add a test for signal recovery
	I/O on slow descriptors can be interrupted so make sure we (and Ruby itself) are handling EINTR correctly.
2009-04-03	configurator: allow hooks to be passed callable objects
	Premade lambda/proc/Proc objects may all be passed, to the hooks, not just anonymous blocks.
2009-04-01	test_upload: fix a race condition in unlink test
	We need to ensure the next request has started processing before we can guarantee a temp file has been unlinked.
2009-03-31	Better canonicalization of listener paths + tests
	* Expand addresses like "1:8080" to "127.0.0.1:8080" beforehand so sock_name() in SocketHelper will always return consistent results. * Add support for "unix:/path/to/foo" paths for easier synchronization with nginx config files.
2009-03-29	http11: use :http_body instead of "HTTP_BODY"
	"HTTP_BODY" could conflict with a "Body:" HTTP header if there ever is one. Also, try to hide this body from the Rack environment before @app is called since it is only used by Unicorn internally.
2009-03-29	configurator: per-listener backlog, {rcv,snd}buf config
	Instead of having global options for all listeners, make all socket options per-listener. This allows reverse-proxies to pick different listeners to get different options on different sockets. Given a cluster of machines (10.0.0.1, 10.0.0.2, 10.0.0.3) running Unicorn with the following config: ------------------ 8< ---------------- listen "/tmp/local.sock", :backlog => 1 listen "*:8080" # use the backlog=1024 default ------------------ 8< ---------------- It is possible to configure a reverse proxy to try to use "/tmp/local.sock" first and then fall back to using the TCP listener on port 8080 in a failover configuration. Thus the nginx upstream configuration on 10.0.0.1 to compliment this would be: ------------------ 8< ---------------- upstream unicorn_cluster { # reject connections ASAP if we are overloaded server unix:/tmp/local.sock; # fall back to other machines in the cluster via "backup" # listeners which have a large backlog queue. server 10.0.0.2:8080 backup; server 10.0.0.3:8080 backup; } ------------------ 8< ---------------- This removes the global "backlog" config option which was inflexible with multiple machines in a cluster and exposes the ability to change SO_SNDBUF/SO_RCVBUF via setsockopt(2) for the first time.
2009-03-29	test_response: ensure response body is closed
	This is in the Rack specification and a good idea. Remind ourselves to prevent file descriptor or other resource leaks in case the body is not an Array.
2009-03-29	test_response: ensure closed socket after write
	We always close the socket immediately after a successful write for two reasons: 1) To prevent error responses from being rewritten. If we throw an exception in our request/app/response chain, we'll attempt to write an HTTP 400/500 response out if the socket is open. No way to write to an open socket. 2) To uncork the socket if TCP_CORK is enabled (Linux) ASAP. This should be a tick faster than waiting to go back up the stack and close it there.
2009-03-27	Always try to send a valid HTTP response back
	This reworks error handling throughout the entire stack to be more Ruby-ish. Exceptions are raised instead of forcing the us to check return values. If a client is sending us a bad request, we send a 400. If unicorn or app breaks in an unexpected way, we'll send a 500. Both of these last-resort error responses are sent using IO#write_nonblock to avoid tying Unicorn up longer than necessary and all exceptions raised are ignored. Sending a valid HTTP response back should reduce the chance of us from being marked as down or broken by a load balancer. Previously, some load balancers would mark us as down if we close a socket without sending back a valid response; so make a best effort to send one. If for some reason we cannot write a valid response, we're still susceptible to being marked as down. A successful HttpResponse.write() call will now close the socket immediately (instead of doing it higher up the stack). This ensures the errors will never get written to the socket on a successful response.
2009-03-27	test_server: quieter tests

2009-03-25	Merge commit 'v0.2.3'
	* commit 'v0.2.3': unicorn 0.2.3 Ensure Tempfiles are unlinked after every request Don't bother unlinking UNIX sockets Conflicts: lib/unicorn/socket.rb
2009-03-25	Ensure Tempfiles are unlinked after every request
	Otherwise we bloat TMPDIR and run the host out of space, oops!