Skip to content

Commit

Permalink
Linux MegaRAID SAS fixes (#15566)
Browse files Browse the repository at this point in the history
* Linux MegaRAID SAS fixes
 - Updated disk error threshold from 1 to 0.1 because we need to know when
   it reaches 1, not when it exceeds 1.  It would ideally be set to 0, but
   it needs to evaluate to true in order for the threshold to be set.
 - Fixed the OID for the virtual disk failure check, which also removes the
   duplicate check on the predictive failure OID

* Updated the LSI test output

* Another update to the SNMP results

* Update all thresholds to 0

The warning and error thresholds work correctly when set to 0, so I have updated these values

* Change back to the original sensor OID, with the correct MIB OID and description

* Further re-classification of degraded LSI disk check
  • Loading branch information
eskyuu committed May 8, 2024
1 parent d3e5b0d commit d7119fb
Show file tree
Hide file tree
Showing 2 changed files with 123 additions and 123 deletions.
14 changes: 7 additions & 7 deletions includes/definitions/discovery/linux.yaml
Expand Up @@ -35,27 +35,27 @@ modules:
descr: 'Virtual disks'
-
oid: adapterPropertiesTable
value: pdDiskPredFailureCount
value: vdDegradedCount
num_oid: '.1.3.6.1.4.1.3582.4.1.4.1.2.1.19.{{ $index }}'
index: 'vdfail.{{ $index }}'
index: 'vddegraded.{{ $index }}'
group: 'RAID Controller {{ $adapterID-APT }}'
warn_limit: 1
descr: 'Virtual disk failure'
high_limit: 0
descr: 'Virtual disk degraded'
-
oid: adapterPropertiesTable
value: pdDiskPredFailureCount
num_oid: '.1.3.6.1.4.1.3582.4.1.4.1.2.1.23.{{ $index }}'
index: 'criticaldisks.{{ $index }}'
group: 'RAID Controller {{ $adapterID-APT }}'
warn_limit: 1
high_limit: 0
descr: 'Critical disks'
-
oid: adapterPropertiesTable
value: pdDiskFailedCount
num_oid: '.1.3.6.1.4.1.3582.4.1.4.1.2.1.24.{{ $index }}'
index: 'faileddisks.{{ $index }}'
group: 'RAID Controller {{ $adapterID-APT }}'
warn_limit: 1
high_limit: 0
descr: 'Failed disks'
-
oid: physicalDriveTable
Expand All @@ -79,7 +79,7 @@ modules:
num_oid: '.1.3.6.1.4.1.3582.4.1.4.2.1.2.1.9.{{ $index }}'
index: 'predfail.slot.{{ $index }}'
group: 'RAID Controller {{ $adpID-PDT }} Enclosure {{ enclDeviceId }}'
warn_limit: 1
warn_limit: 0
descr: '/c{{ $adpID-PDT }}/e{{ $enclDeviceId }}/s{{ $slotNumber }} Predictive Failures'
state:
data:
Expand Down

0 comments on commit d7119fb

Please sign in to comment.