-
Notifications
You must be signed in to change notification settings - Fork 86
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better error messages for ridiculous deadlines #632
Comments
Can we define "clearly unreasonable"? A 10ms deadline might be unreasonable for some services, but a hard requirement for others. We should probably just add the ability to distinguish between client timeouts and server timeouts (and maybe add information about how much time a request was given). We could also short circuit requests that have already passed their deadline and provide a more specific error in those cases. |
Looks like there's already a test for this in |
@willhug, @akshayjshah, and I have discussed (in the context of YARPC retries) the ability to configure a "min" and "max" threshold for the timeout on a request. The "max" would be used to truncate the deadline to now+max, and drop a request if now+min was past the context deadline. The idea would be to shed work off the wire for requests that are clearly impossible, as an optimization. Would this "minimum necessary time to respond" suffice to convey the meaning of "clearly unreasonable"? |
…if that is the case, TChannel would respond with use a Busy error instead of sending or handling a request, with a message indicating that there was not enough time on the deadline. |
@kriskowal so in that case, we'd want to make the change at the YARPC level? |
I don't know if we should create a new error code (since that would require all other libraries to be updated to recognize the error, and I'm not aware of how older clients would handle this error code). However, I think it makes sense to add a minimum threshold directly in TChannel, so we can do the least amount of work as soon as we get the frame, avoid creating a context, and avoid timing out while reading the method name. I'd imagine a new option that's defined as part of channel creation which we'd then compare against in |
@prashantv So if I'm understanding correctly, you'd like to see something that:
Does that sound correct? Can we have a default minimum threshold? |
I don't think TChannel itself should have a default minimum threshold, we can set one in our internal wrapper library, although there are services that can respond within a few milliseconds, so the only reasonable value we can probably set is 1ms. Services that need a lower value should be able to customize it. |
Alright, I'll take a stab at this soon. |
About once a week, someone will report that our service (which uses tchannel-go) is giving them timeouts and they'll show us an error that looks something like " tchannel error ErrCodeTimeout". We investigate, and 9 times out of 10 the actually problem is that they made an RPC request with an extrodinarly short deadline (sub-10ms) or, in some cases, they try to make a request where the deadline is already expired.
I propose that we return a better error message from tchannel when a request is made with a deadline that is clearly unreasonable.
The text was updated successfully, but these errors were encountered: