Get metrics using the command line utility `smartctl` for S.M.A.R.T. (Self-Monitoring, Analysis and Reporting Technology) storage devices. SMART is a monitoring system included in computer hard disk drives (HDDs) and solid-state drives (SSDs)[1] that detects and reports on various indicators of drive reliability, with the intent of enabling the anticipation of hardware failures.
Get metrics using the command line utility `smartctl` for S.M.A.R.T. (Self-Monitoring, Analysis and Reporting Technology) storage devices. SMART is a monitoring system included in computer hard disk drives (HDDs) and solid-state drives (SSDs)[1] that detects and reports on various indicators of drive reliability, with the intent of enabling the anticipation of hardware failures.
See smartmontools (https://www.smartmontools.org/).
See smartmontools (https://www.smartmontools.org/).
...
@@ -24,68 +24,7 @@ To enable SMART on a storage device run:
...
@@ -24,68 +24,7 @@ To enable SMART on a storage device run:
smartctl -s on <device>
smartctl -s on <device>
```
```
## Measurements
### Configuration:
- smart_device:
* Tags:
- `capacity`
- `device`
- `device_model`
- `enabled`
- `health`
- `serial_no`
- `wwn`
* Fields:
- `exit_status`
- `health_ok`
- `read_error_rate`
- `seek_error`
- `temp_c`
- `udma_crc_errors`
- smart_attribute:
* Tags:
- `device`
- `fail`
- `flags`
- `id`
- `name`
- `serial_no`
- `wwn`
* Fields:
- `exit_status`
- `raw_value`
- `threshold`
- `value`
- `worst`
### Flags
The interpretation of the tag `flags` is:
-*K* auto-keep
-*C* event count
-*R* error rate
-*S* speed/performance
-*O* updated online
-*P* prefailure warning
### Exit Status
The `exit_status` field captures the exit status of the smartctl command which
is defined by a bitmask. For the interpretation of the bitmask see the man page for
smartctl.
### Device Names
Device names, e.g., `/dev/sda`, are *not persistent*, and may be
subject to change across reboots or system changes. Instead, you can the
*World Wide Name* (WWN) or serial number to identify devices. On Linux block
devices can be referenced by the WWN in the following location:
`/dev/disk/by-id/`.
## Configuration
```toml
```toml
# Read metrics from storage devices supporting S.M.A.R.T.
# Read metrics from storage devices supporting S.M.A.R.T.
...
@@ -122,14 +61,72 @@ devices can be referenced by the WWN in the following location:
...
@@ -122,14 +61,72 @@ devices can be referenced by the WWN in the following location:
# devices = [ "/dev/ada0 -d atacam" ]
# devices = [ "/dev/ada0 -d atacam" ]
```
```
### Metrics:
- smart_device:
- tags:
- capacity
- device
- device_model
- enabled
- health
- serial_no
- wwn
- fields:
- exit_status
- health_ok
- read_error_rate
- seek_error
- temp_c
- udma_crc_errors
- smart_attribute:
- tags:
- device
- fail
- flags
- id
- name
- serial_no
- wwn
- fields:
- exit_status
- raw_value
- threshold
- value
- worst
#### Flags
The interpretation of the tag `flags` is:
-`K` auto-keep
-`C` event count
-`R` error rate
-`S` speed/performance
-`O` updated online
-`P` prefailure warning
#### Exit Status
The `exit_status` field captures the exit status of the smartctl command which
is defined by a bitmask. For the interpretation of the bitmask see the man page for
smartctl.
#### Device Names
Device names, e.g., `/dev/sda`, are *not persistent*, and may be
subject to change across reboots or system changes. Instead, you can the
*World Wide Name* (WWN) or serial number to identify devices. On Linux block
devices can be referenced by the WWN in the following location:
`/dev/disk/by-id/`.
To run `smartctl` with `sudo` create a wrapper script and use `path` in
To run `smartctl` with `sudo` create a wrapper script and use `path` in