Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HDDS-12135. Set RM default deadline to 12 minutes and the datanode offset to 6 minutes #7747

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

sodonnel
Copy link
Contributor

What changes were proposed in this pull request?

We recently found that delete commands can run for a long time once picked off the queue, and the default of a 10 minute deadline on SCM and 30 seconds less deadline on the datanodes can result in currently running commands being seen as expired in SCM.

This PR is to make the defaults less aggressive - giving a SCM / RM timeout of 12 minutes and a datanode timeout of 6 minutes. That way, there is longer for commands to be processed before RM will resend them.

With the throttling that RM employs, there should not be a large number of commands on the queue anyway, as the goal of RM is to schedule only the number of commands which can be processed in a heartbeat or two.

Other related Jiras to this one are: HDDS-12127, HDDS-12115, HDDS-12114

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-12135

How was this patch tested?

Simple config change. No new tests added or modified.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant