Timeout for incoming requests in io queue. #673

shaitan · 2015-11-26T12:06:24Z

Problem

Currently, after whole request is read from client's socket it is put to io queue where it will wait until some io thread will pop it. The time that request is in io queue is not limited, so server node can start handling request when the client is desperate to wait a response, so it has already handled timeout error and probably resent the request or gone. But server node will spend time and resources on handling request and sending response. Situation becomes worse when server node is overloaded: it handles timeouted requests and delays handling still alive requests which will be timeouted when server node will reach them.

What can we do

We can introduce timeout that will limit:

time it takes to read the requests from socket - it very depends on a client
time that the request spent in the queue - it completely depends on the current server node load

In the first version timeout can be set via server node config, but a client should be able to disable this timeout via command flags.

If that will not be enough, in the second version we can allow a client to redefine this timeout with request, so request will contain field that will be used as timeout for this request.

bioothod · 2015-11-26T22:50:47Z

Sending timeout with request could fix this issue, but do we really have an empty data in the header for this?

If server doesn't know client timeout it is dangerous to rely on what server has in its config. It might be the case that client deliberately set really long timeout and he does want to get its data, while server may have just a tiny timeout which will destroy request even without considering doing it.

toshic · 2015-11-27T12:33:52Z

At least we have some space in dnet_io_attr header for IO transactions.

shaitan · 2015-11-27T13:45:42Z

I know that only server's timeout from config does not cover all cases, but it can be made as a first step without breaking anything and it covers really huge part of cases. Clients that want to get their data in any time can use command flag that will disable server's timeout.

After the first step we will find (or not) clients and cases in which it is better to set timeout by a client. As the next step, if there will be such clients, we can use dnet_io_attr::reserved2 and allow clients to set timeout only for io commands.

And finally, we can break our protocol and add timeout to dnet_cmd.

shaitan · 2015-12-02T11:58:29Z

I'll implement first step next week and send a PR with changes. Those who interested in such deadline will can build from the PR and check coverage of their cases. In the PR I will collect feedback of usage and uncovered cases. If there will be a reason I will update the PR to the second step and continue collecting feedback.

It is important that both version will not break common use-case of those who not interested in deadline.

shaitan added the enhancement label Nov 26, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Timeout for incoming requests in io queue. #673

Timeout for incoming requests in io queue. #673

shaitan commented Nov 26, 2015

bioothod commented Nov 26, 2015

toshic commented Nov 27, 2015

shaitan commented Nov 27, 2015

shaitan commented Dec 2, 2015

Timeout for incoming requests in io queue. #673

Timeout for incoming requests in io queue. #673

Comments

shaitan commented Nov 26, 2015

Problem

What can we do

bioothod commented Nov 26, 2015

toshic commented Nov 27, 2015

shaitan commented Nov 27, 2015

shaitan commented Dec 2, 2015