Improve disk IO for backups
-
Hello,
Is there anything that can be done to reduce the amount of data Updraft writes to disk when doing batched backups? When BinZip is used, every time Updraft “adds” files to a given zip, the entire zip is rewritten to a temporary file and the new batch is appended to the end of it. The process then starts over for the next batch of files.
2577.809 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (19.9 MB, 173597 files batched, 1001 (148946) added so far); re-opening (prior size: 0.7 KB)
2578.606 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (18.8 MB, 173597 files batched, 1001 (149947) added so far); re-opening (prior size: 20604.2 KB)
2580.990 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (22.5 MB, 173597 files batched, 1001 (150948) added so far); re-opening (prior size: 40013.2 KB)
2584.437 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (24.1 MB, 173597 files batched, 1001 (151949) added so far); re-opening (prior size: 63221.6 KB)
2590.148 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (19.3 MB, 173597 files batched, 1001 (152950) added so far); re-opening (prior size: 88060.9 KB)
2594.383 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (18.7 MB, 173597 files batched, 1001 (153951) added so far); re-opening (prior size: 107994.8 KB)
2600.179 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (23.6 MB, 173597 files batched, 1001 (154952) added so far); re-opening (prior size: 127341.8 KB)
2607.114 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (20.1 MB, 173597 files batched, 1001 (155953) added so far); re-opening (prior size: 151697.1 KB)
2614.763 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (32.9 MB, 173597 files batched, 1001 (156954) added so far); re-opening (prior size: 172478.1 KB)
2624.164 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (54.7 MB, 173597 files batched, 1001 (157955) added so far); re-opening (prior size: 206395.2 KB)
2636.662 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (50.7 MB, 173597 files batched, 1001 (158956) added so far); re-opening (prior size: 262615.3 KB)
2650.977 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (39.6 MB, 173597 files batched, 1001 (159957) added so far); re-opening (prior size: 314734.6 KB)
2666.409 (0) Adding batch to zip file (UpdraftPlus_BinZip): over 1000 files added on this batch (39.5 MB, 173597 files batched, 1001 (160958) added so far); re-opening (prior size: 355394.4 KB)
2683.314 (0) Adding batch to zip file (UpdraftPlus_BinZip): possibly approaching split limit (12.5 MB, 205 (161163) files added so far); last ratio: 1; re-opening (prior size: 395955.6 KB)
In the above example, the final zip ends up at ~400 MB (as configured in the plugin), but creating this 400 MB file takes roughly 3.4 GB of writes, meaning a write amplification of 8.5x in this case.
For another zip being created, a total of 4.15 GB is written to disk for a 400 MB file, meaning a write amplification of roughly 10.3x.
Overall, it seems to average out at around 8.9x for a website of 30 GB, resulting in roughly 267 GB of data being written over the course of the backup.
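For anyone who wants to check their own logs, here is a minimal Python sketch that adds up the "prior size" values from lines like the ones above. It assumes each re-open copies the whole existing zip once, so the sum only counts re-copied bytes, not the newly added data or any filesystem overhead, and should be read as a lower bound. The function name and the shortened sample lines are only for illustration.

```python
import re

# Matches the "prior size" field as it appears in the log lines above.
PRIOR_SIZE = re.compile(r"prior size: ([\d.]+) KB")


def recopied_kb(log_text):
    """Sum every 'prior size' value (in KB) found in the log text.

    Assumption: each re-open copies the existing zip in full exactly once.
    """
    return sum(float(value) for value in PRIOR_SIZE.findall(log_text))


# Shortened sample: only the tail of three log lines is kept.
sample = """\
... re-opening (prior size: 0.7 KB)
... re-opening (prior size: 20604.2 KB)
... re-opening (prior size: 40013.2 KB)
"""

total_kb = recopied_kb(sample)
print(f"{total_kb / 1024:.1f} MB re-copied across {len(PRIOR_SIZE.findall(sample))} re-opens")
```

Feeding a full backup log into `recopied_kb` gives a quick idea of how much of the write volume is just the same data being copied again.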
I know I can “improve” this by reducing the batch/split size from 400 MB to, for example, 50 MB. This greatly reduces the write amplification, but it shouldn’t be needed in the first place.
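To illustrate why the split size matters so much, here is a rough Python model. It assumes every batch triggers a full copy of the existing zip plus the new data, and that each batch adds a fixed amount of compressed data. The 25 MB per batch figure and the function name are illustrative assumptions of mine, not values taken from the plugin.

```python
def write_amplification(split_mb, batch_mb):
    """Estimate (total MB written) / (final zip size) for one zip.

    Simplified copy-then-append model: after each batch the whole zip,
    including the newly added batch, is written out again.
    """
    if split_mb <= 0 or batch_mb <= 0:
        raise ValueError("split_mb and batch_mb must be positive")
    written = 0
    size = 0
    while size < split_mb:
        size += min(batch_mb, split_mb - size)  # zip grows by one batch
        written += size                         # whole zip is rewritten
    return written / split_mb


# Hypothetical 25 MB of compressed data per batch.
for split in (400, 50):
    print(f"split {split} MB: {write_amplification(split, 25):.1f}x")
```

In this model the amplification grows roughly linearly with the number of batches per zip, which is why a smaller split size helps and why appending in place would avoid the problem altogether.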
This has been a “problem” for many years, but as more and more people use Updraft, it has a significant impact on systems, which end up writing multiple terabytes of data each day due to write amplification caused by a backup plugin.