BTRFS-SCRUB(8) | BTRFS | BTRFS-SCRUB(8) |
btrfs-scrub - scrub btrfs filesystem, verify block checksums
btrfs scrub <subcommand> <args>
Scrub is a validation pass over all filesystem data and metadata that detects data checksum errors, basic super block errors, basic metadata block header errors, and disk read errors.
Scrub is done on a per-device base, if a device is specified to btrfs scrub start, then only that device will be scrubbed. Although btrfs will also try to read other device to find a good copy, if the mirror on that specified device failed to be read or pass verification.
If a path of btrfs is specified to btrfs scrub start, btrfs will scrub all devices in parallel.
On filesystems that use replicated block group profiles (e.g. RAID1), read-write scrub will also automatically repair any damage by copying verified good data from one of the other replicas.
Such automatic repair is also carried out when reading metadata or data from a read-write mounted filesystem.
WARNING:
Furthermore, btrfs does not currently mark missing or failed disks as unreliable, so will continue to load-balance reads to potentially damaged replicas. This is not a problem normally because damage is detected by checksum validation, but because NOCOW files are not protected by checksums, btrfs has no idea which mirror is good thus it can return the bad contents to the user space tool.
Detecting and recovering from such failure requires manual intervention.
Notably, systemd sets +C on journals by default, and libvirt ≥ 6.6 sets +C on storage pool directories by default. Other applications or distributions may also set +C to try to improve performance.
NOTE:
btrfs-check(8) performs more exhaustive checking and can sometimes be used, with expert guidance, to rebuild certain corrupted filesystem structures in the absence of any good replica.
The user is supposed to run it manually or via a periodic system service. The recommended period is a month but it could be less. The estimated device bandwidth utilization is about 80% on an idle filesystem.
The scrubbing status is recorded in /var/lib/btrfs/ in textual files named scrub.status.UUID for a filesystem identified by the given UUID. (Progress state is communicated through a named pipe in file scrub.progress.UUID in the same directory.) The status file is updated every 5 seconds. A resumed scrub will continue from the last saved position.
Scrub can be started only on a mounted filesystem, though it's possible to scrub only a selected device. See btrfs scrub start for more.
NOTE:
In the past when the CFQ IO scheduler was generally used the ionice(1) syscalls set the priority to idle so the IO would not interfere with regular IO. Since the kernel 5.0 the CFQ is not available.
The IO scheduler known to support that is BFQ, but first read the documentation before using it!
For other commonly used schedulers like mq-deadline it's recommended to use cgroup2 IO controller which could be managed by e.g. systemd (documented in systemd.resource-control). However, starting scrub like that is not yet completely straightforward. The IO controller must know the physical device of the filesystem and create a slice so all processes started from that belong to the same accounting group.
$ systemd-run -p "IOReadBandwidthMax=/dev/sdx 10M" btrfs scrub start -B /
Since linux 5.14 it's possible to set the per-device bandwidth limits in a BTRFS-specific way using files /sys/fs/btrfs/FSID/devinfo/DEVID/scrub_speed_max. This setting is not persistent, lasts until the filesystem is unmounted. Currently set limits can be displayed by command btrfs scrub limit.
$ echo 100m > /sys/fs/btrfs/9b5fd16e-1b64-4f9b-904a-74e74c0bbadc/devinfo/1/scrub_speed_max $ btrfs scrub limit / UUID: 9b5fd16e-1b64-4f9b-904a-74e74c0bbadc Id Limit Path -- --------- -------- 1 100.00MiB /dev/sdx
If a device is specified, the corresponding filesystem is found and btrfs scrub cancel behaves as if it was called on that filesystem. The progress is saved in the status file so btrfs scrub resume can continue from the last position.
Options
This does not start a new scrub if the last scrub finished successfully.
Options
see scrub start.
Without options, scrub is started as a background process. The automatic repairs of damaged copies are performed by default for block group profiles with redundancy. No-repair can be enabled by option -r.
Options
Deprecated options
If no scrub is running, show statistics of the last finished or cancelled scrub for that filesystem or device.
Options
A status on a filesystem without any error looks like the following:
# btrfs scrub start / # btrfs scrub status / UUID: 76fac721-2294-4f89-a1af-620cde7a1980 Scrub started: Wed Apr 10 12:34:56 2023 Status: running Duration: 0:00:05 Time left: 0:00:05 ETA: Wed Apr 10 12:35:01 2023 Total to scrub: 28.32GiB Bytes scrubbed: 13.76GiB (48.59%) Rate: 2.75GiB/s Error summary: no errors found
With some errors found:
Error summary: csum=72 Corrected: 2 Uncorrectable: 72 Unverified: 0
It's possible to set a per-device limit via file sysfs/fs/btrfs/FSID/devinfo/scrub_speed_max. In that case the limit is printed on the Rate: line if option -d is specified, or without it on a single-device filesystem. Read more about tat in section about scrub IO limiting.
Rate: 989.0MiB/s (limit 1.0G/s)
On a multi-device filesystem with at least one device limit the overall stats cannot print the limit without -d so there's a not that some limits are set:
Rate: 36.37MiB/s (some device limits set)
btrfs scrub returns a zero exit status if it succeeds. Non zero is returned in case of failure:
btrfs is part of btrfs-progs. Please refer to the documentation at https://btrfs.readthedocs.io.
mkfs.btrfs(8)
January 7, 2025 | 6.12 |