-
Notifications
You must be signed in to change notification settings - Fork 86
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failed hot spare drive is not detected / HPE ssacli #229
Comments
With the help of ChatGPT, I developed another Bash script that checks for failed drives. So if anyone faces the same problem, here is the script that I use:
I call the script from the SNMP process, so sudo needs to be instructed:
|
@crocodileneptune pull requests are accepted. you should send your code as pull request, see CONTRIBUTING.md for how. to start. nowadays everything is virtualized, haven't had need for check_raid on metal for very long already. |
Hello Glen,
first of all thanks so much for your work!
I noticed that your check_raid.pl plugin doesn't seem to trigger the warning or critical state in the case of a failed spare drive. In my case, the server used to run on two harddisks in a RAID1 configuration, with another harddisk configured as a hot spare device. When I looked at ILO logs last night, I saw that the harddisk in bay 1 failed and the hot spare in bay 3 was activated some time ago.
I would have expected that the check_raid.pl plugin would trigger some sort of warning if any harddisk fails which is why I now created this bug report. I don't mind the exact state (warning or critical), but a failed device needs to trigger an action which is why I am using the plugin. I read CONTRIBUTING.md and I hope that all relevant details are included in this bug report.
Output of
check_raid -d
:Output of each command from
check_raid -d
/sbin/ssacli controller all show status
/sbin/ssacli controller slot=0 logicaldrive all show
However, the failed hot spare drive is not detected, even though ssacli notices it:
/sbin/ssacli ctrl slot=0 pd all show status
Additional environment details:
Thanks and best wishes!
The text was updated successfully, but these errors were encountered: