btrfs deduplication and rsync to other file system...

Hello,

I'm looking at the btrfs deduplication system, because I have *data* (not system) in archives with many duplicate texts. The total is now around 1 TB and growing, and I would like to keep it on 1 TB disks (I have several copies). It is mostly old text work or mail archives.

I tried fdupes or a similar deduplication application, but then I got a scattered archive. I still have the data, but files that belong together are now spread over various folders. I had to stop doing that.

If I understand correctly, btrfs deduplication works with links (the removed files are replaced by a symlink). duperemove: https://btrfs.readthedocs.io/en/latest/Deduplication.html
This needs a btrfs file system (or a similar high-end one; I will stay with btrfs).

Question: what happens if I rsync the data to an ext4 file system (on another disk)? Is deduplication lost or kept? Same question if I sync this to the cloud.

Why? Because I wonder whether my archives will be readable by another (Linux) system in the future; not everyone supports btrfs, while ext4 is everywhere.

By the way, I can't find up-to-date documentation about btrfs backup (the wiki is flagged "obsolete").

If I explained my problem correctly, one can see that I want to *archive data*. No incremental backup needed. I keep 4 TB of data in total (but the other 3 TB are photos, videos and ISO files, kept unique and already compressed, so no chance to gain space there).

And finally, obviously, if anyone has a better solution for what I'm trying to do, I'll take it :-)

thanks
jdd
--
https://artdagio.fr
Hello,

In the Message;
Subject : btrfs deduplication and rsync
Message-ID : <58db269b-392a-4f40-8eb9-10578cb94a49@dodin.org>
Date & Time: Fri, 3 Nov 2023 09:52:21 +0100

[jdd] == "jdd@dodin.org" <jdd@dodin.org> has written:

jdd> btrfs deduplication and rsync to other file system...
jdd> Hello,
jdd> I'm looking on btrfs deduplication system, because I have *data*
jdd> (not system) in archives with many duplicates texts. The total
jdd> is now around 1Tb large and growing and I would like to keep it
jdd> on 1Tb disks (I have several copies). Often old text work or
jdd> mail archives.

[...]

jdd> and in fine, obviously, if one have any better solution to what I try to do,
jdd> I'll take it :-)

As I mentioned before, I migrated from an ext4 file system to a btrfs file system. I read a lot of information on the net saying that a clean install was the only way to migrate, but after installing it on my laptop I concluded that the file tree on ext4 and on btrfs is structurally the same, so I migrated using a backup taken on the ext4 system. After a clean install of Tumbleweed with btrfs (with the same kernel as the backed-up system), I restored the backup and it is working fine.

I often install and test new software, and I usually restore the backup I took with rsync beforehand once the test is finished; this has never caused any problems.

I hope this helps.

Regards & A little early, but Good Night.

---
┏━━┓彡 野宮 賢 mail-to: nomiya @ lake.dti.ne.jp
┃\/彡
┗━━┛
"Maddox hopes that empowering users to pick their own algorithms will get them to think more about what's involved in making them."
-- Bluesky's Custom Algorithms Could Be the Future of Social Media
On Fri, Nov 3, 2023 at 11:52 AM jdd@dodin.org <jdd@dodin.org> wrote:
btrfs deduplication and rsync to other file system...
Hello,
I'm looking at the btrfs deduplication system, because I have *data* (not system) in archives with many duplicate texts. The total is now around 1 TB and growing, and I would like to keep it on 1 TB disks (I have several copies). It is mostly old text work or mail archives.
I tried fdupes or a similar deduplication application, but then I got a scattered archive. I still have the data, but files that belong together are now spread over various folders. I had to stop doing that.
If I understand correctly, btrfs deduplication works with links (the removed
You are mistaken. It works on data extent level and has nothing to do with links. You are probably confused by the "link" in "reflink".
files are replaced by a symlink). duperemove:
https://btrfs.readthedocs.io/en/latest/Deduplication.html
this needs a btrfs file system (or a similar high-end one; I will stay with btrfs)
question: what happens if I rsync the data to an ext4 file system (on another disk)? Is deduplication lost or kept?
btrfs deduplication cannot be kept on ext4 by definition, because ext4 does not support reflinks. And even if it did, rsync does not support them (it works in units of files, not extents), so deduplication would be lost even if you rsync from btrfs to btrfs.
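To illustrate the difference, here is a minimal sketch; the file names and mount points are made up, and cp --reflink=always only succeeds on filesystems with reflink support, such as btrfs or XFS:

```shell
# On btrfs: a reflink copy shares data extents with the original, so it
# takes almost no extra space until one of the copies is modified.
cp --reflink=always archive.tar archive-copy.tar

# rsync works file by file: on the ext4 side every file gets its own
# full copy of the data, so any extent sharing is lost in transit.
rsync -a /mnt/btrfs/archive/ /mnt/ext4/archive/
```

This is why the 1 TB of deduplicated data on btrfs can need more than 1 TB once rsynced to ext4.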
same question if I sync this to the cloud.
Ask your cloud provider what they do with data they receive. If you are concerned with the amount of data you upload and its cost - it is the question to the tool you use to "sync this to the cloud". Your tool may perform client side deduplication. But whatever happens, it is completely independent of and unaware of what btrfs does.
Why? Because I wonder whether my archives will be readable by another (Linux) system in the future; not everyone supports btrfs, while ext4 is everywhere.
By the way, I can't find up-to-date documentation about btrfs backup (the wiki is flagged "obsolete").
btrfs is just a filesystem which can be backed up as any other filesystem.
If I explained my problem correctly, one can see that I want to *archive data*. No incremental backup needed. I keep 4 TB of data in total (but the other 3 TB are photos, videos and ISO files, kept unique and already compressed, so no chance to gain space there).
And finally, obviously, if anyone has a better solution for what I'm trying to do, I'll take it :-)
thanks jdd -- https://artdagio.fr
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
btrfs is just a filesystem which can be backed up as any other filesystem.
Obviously not, if deduplication is applied, as you explained just before. But I know there are btrfs-to-btrfs ways to do it (a sort of copy); I just don't know the details.

thanks
jdd
--
https://artdagio.fr
On 03.11.2023 15:13, jdd@dodin.org wrote:
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
btrfs is just a filesystem which can be backed up as any other filesystem.
obviously not, if deduplication is applied, as you explained just before.
You have a rather strange idea about what backup is. Of course you can back up the content of a btrfs filesystem like any other filesystem. That has absolutely nothing to do with it being deduplicated or not. If you want to *also* preserve the space savings, that is a rather different topic, only tangentially related to "backup".
but I know there are btrfs to btrfs way to do it (sort of copy), but I don't know the details
There is "btrfs send/receive" which should preserve space savings (but does not guarantee it).
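A minimal sketch of that btrfs-to-btrfs copy, assuming /data is a subvolume on btrfs and /mnt/backup is a second btrfs disk (both paths are made up; the commands need root):

```shell
# btrfs send requires a read-only source, so snapshot first.
btrfs subvolume snapshot -r /data /data/.snap-2023-11-03

# Stream the snapshot to the other btrfs disk; receive recreates it
# as a subvolume and can carry over shared (reflinked) extents.
btrfs send /data/.snap-2023-11-03 | btrfs receive /mnt/backup
```

Because the target must also be btrfs, this preserves space savings in a way rsync cannot.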
On 03/11/2023 at 18:03, Andrei Borzenkov wrote:
On 03.11.2023 15:13, jdd@dodin.org wrote:
but I know there are btrfs to btrfs way to do it (sort of copy), but I don't know the details
There is "btrfs send/receive" which should preserve space savings (but does not guarantee it).
Yes, that's it, thanks.

jdd
--
https://artdagio.fr
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
On Fri, Nov 3, 2023 at 11:52 AM jdd@dodin.org <jdd@dodin.org> wrote:
If I understand well, btrfs deduplication works with links (the removed
You are mistaken. It works on data extent level and has nothing to do with links. You are probably confused by the "link" in "reflink".
My problem is this part: https://btrfs.readthedocs.io/en/latest/Deduplication.html#file-based-dedupli... where reflink is only part of the use. I may have to experiment.

The tool I need would replace duplicate files with symlinks (not necessarily on btrfs), so space is saved but files are still where they belong :-)

thanks
jdd
--
https://artdagio.fr
On 03.11.2023 15:23, jdd@dodin.org wrote:
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
On Fri, Nov 3, 2023 at 11:52 AM jdd@dodin.org <jdd@dodin.org> wrote:
If I understand well, btrfs deduplication works with links (the removed
You are mistaken. It works on data extent level and has nothing to do with links. You are probably confused by the "link" in "reflink".
my problem is this part
https://btrfs.readthedocs.io/en/latest/Deduplication.html#file-based-dedupli...
It describes how the data to be deduplicated is selected. btrfs does not have any built-in means to deduplicate anything (except as a side effect of snapshots), so deduplication works by selecting some data, analyzing it, and then replacing identical data with a pointer to the same data in some other place. You may choose to deduplicate only between some files or across the whole filesystem content.
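As a hedged example of that select-and-deduplicate step with duperemove (the path is made up; the target must be a btrfs filesystem):

```shell
# -r scans the tree recursively, -d actually submits the dedupe
# requests to the kernel (without it duperemove only reports), and
# the hashfile caches block checksums so repeated runs are faster.
duperemove -r -d --hashfile=/var/tmp/archive.hash /data/archive
```

Afterwards the files are unchanged from the user's point of view; only the underlying extents are shared.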
where reflink is only part of the use. I may have to experiment
the tool I need would replace duplicate files with symlinks (not necessarily on btrfs), so space is saved but files are still where they belong :-)
thanks jdd
On 2023-11-03 13:23, jdd@dodin.org wrote:
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
On Fri, Nov 3, 2023 at 11:52 AM jdd@dodin.org <jdd@dodin.org> wrote:
If I understand well, btrfs deduplication works with links (the removed
You are mistaken. It works on data extent level and has nothing to do with links. You are probably confused by the "link" in "reflink".
my problem is this part
https://btrfs.readthedocs.io/en/latest/Deduplication.html#file-based-dedupli...
where reflink is only part of the use. I may have to experiment
the tool I need would replace duplicate files with symlinks (not necessarily on btrfs), so space is saved but files are still where they belong :-)
"Traditional" filesystem-independent deduplication replaces identical files with hardlinks (not softlinks). These can be rsynced, IF you tell rsync to keep links:

OPTIONS="--archive --acls --xattrs --hard-links --sparse --stats --human-readable"

Windows filesystems do not support them. Maybe recent NTFS does. Some compressed archive formats can preserve them.

--
Cheers / Saludos,
Carlos E. R.
(from openSUSE 15.5 (Laicolasse))
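The hardlink approach described above can be sketched in plain shell. The paths below are made-up demo paths; real tools such as rdfind (with -makehardlinks true) automate the same scan over a whole tree:

```shell
#!/bin/sh
# Sketch of hardlink-based deduplication: the duplicate keeps its path,
# but both names point at the same inode, so the data is stored once.
set -e
mkdir -p /tmp/dedup_demo/a /tmp/dedup_demo/b
printf 'same content\n' > /tmp/dedup_demo/a/one.txt
printf 'same content\n' > /tmp/dedup_demo/b/two.txt

# Only link when the contents are byte-identical.
if cmp -s /tmp/dedup_demo/a/one.txt /tmp/dedup_demo/b/two.txt; then
    # Replace the duplicate with a hardlink to the first copy.
    ln -f /tmp/dedup_demo/a/one.txt /tmp/dedup_demo/b/two.txt
fi

# Both paths still exist, share one inode, and have link count 2.
stat -c '%h %i' /tmp/dedup_demo/b/two.txt
```

Note that rsync with --hard-links preserves this structure only among files transferred in the same run.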
On 03/11/2023 at 19:35, Carlos E. R. wrote:
On 2023-11-03 13:23, jdd@dodin.org wrote:
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
On Fri, Nov 3, 2023 at 11:52 AM jdd@dodin.org <jdd@dodin.org> wrote:
If I understand well, btrfs deduplication works with links (the removed
You are mistaken. It works on data extent level and has nothing to do with links. You are probably confused by the "link" in "reflink".
my problem is this part
https://btrfs.readthedocs.io/en/latest/Deduplication.html#file-based-dedupli...
where reflink is only part of the use. I may have to experiment
the tool I need would replace duplicate files with symlinks (not necessarily on btrfs), so space is saved but files are still where they belong :-)
"traditional" filesystem independent deduplication replaces identical files with hardlinks (not softlinks).
what application?? thanks jdd
These can be rsynced, IF you tell rsync to keep links.
OPTIONS="--archive --acls --xattrs --hard-links --sparse --stats --human-readable "
Windows filesystem do not support them. Maybe recent ntfs.
Some compressed archives can respect them.
On 2023-11-03 19:39, jdd@dodin.org wrote:
On 03/11/2023 at 19:35, Carlos E. R. wrote:
On 2023-11-03 13:23, jdd@dodin.org wrote:
On 03/11/2023 at 12:30, Andrei Borzenkov wrote:
On Fri, Nov 3, 2023 at 11:52 AM jdd@dodin.org <jdd@dodin.org> wrote:
If I understand well, btrfs deduplication works with links (the removed
You are mistaken. It works on data extent level and has nothing to do with links. You are probably confused by the "link" in "reflink".
my problem is this part
https://btrfs.readthedocs.io/en/latest/Deduplication.html#file-based-dedupli...
where reflink is only part of the use. I may have to experiment
the tool I need would replace duplicate files with symlinks (not necessarily on btrfs), so space is saved but files are still where they belong :-)
"traditional" filesystem independent deduplication replaces identical files with hardlinks (not softlinks).
what application??
I don't have one yet. This is the question I asked last week, and I'm still studying it. I have managed to create those links with rsync itself when the filenames match.

--
Cheers / Saludos,
Carlos E. R.
(from openSUSE 15.5 (Laicolasse))
On 03/11/2023 at 20:18, Carlos E. R. wrote:
I don't have one yet. This is the question I asked the last week and I'm still studying.
I follow the thread but don't understand it all :-(
I have managed to create those links with rsync itself when the filenames match.
Is that the command you wrote in your answer?

OPTIONS="--archive --acls --xattrs --hard-links --sparse --stats --human-readable"

As I understand it, that copies the hard links as they are.

I can't find rdfind for Tumbleweed, and fdupes -L doesn't look OK according to the man page.

Too late for today :-)

thanks
jdd
--
https://artdagio.fr
On 2023-11-03 22:59, jdd@dodin.org wrote:
On 03/11/2023 at 20:18, Carlos E. R. wrote:
I don't have one yet. This is the question I asked the last week and I'm still studying.
I follow the thread but don't understand it all :-(
I have managed to create those links with rsync itself when the filenames match.
is that the command you wrote in your answer?
No. It is complicated, I don't have it in my notes. Similar to:

rsync $OPTIONS --link-dest=$PREVIO --exclude=/lost+found \
  $QUE/ $DESTINO$QUE

I have my phone backup:

phone/date1/*
phone/date2/*

So I do:

OPTIONS="--archive --acls --xattrs --hard-links --sparse --stats --human-readable --checksum"

rsync $OPTIONS phone/date1/ NewPhoneBackup/date1
rsync $OPTIONS --link-dest=NewPhoneBackup/date1 \
  phone/date2/ NewPhoneBackup/date2

The new backup will have hardlinks created for identical files in phone/date1/* and phone/date2/*. Then I delete both phone/date1/* and phone/date2/*.
OPTIONS="--archive --acls --xattrs --hard-links --sparse --stats --human-readable "
These are the options I normally pass to rsync in my scripts.
as I understand it copies the hard links as they are.
I can't find rdfind for Tumbleweed, and fdupes -L doesn't look OK according to the man page.
too late for today :-)
thanks jdd
-- Cheers / Saludos, Carlos E. R. (from openSUSE 15.5 (Laicolasse))
On 03/11/2023 at 09:52, jdd@dodin.org wrote:
btrfs deduplication and rsync to other file system...
I did some tests on compression/deduplication on my data, after copying it to another disk. The results are not that good: on 683 GB of files, mostly text (but obviously not only, and probably not the main part), I got only a 50 GB gain from compression :-(

A night-long duperemove run freed another 20 GB. Not worth the work.

Of course, this is only for *my data*, not a general value.

thanks
jdd
--
https://artdagio.fr
Hello,

In the Message;
Subject : Re: btrfs deduplication and rsync [actual tests]
Message-ID : <8b87a846-b877-4796-92f5-53c76b52ecb8@dodin.org>
Date & Time: Sun, 5 Nov 2023 09:24:15 +0100

[JDD] == "jdd@dodin.org" <jdd@dodin.org> has written:

JDD> Le 03/11/2023 à 09:52, jdd@dodin.org a écrit :
JDD> > btrfs deduplication and rsync to other file system...
JDD> >
JDD> I did some tests on compression/deduplication on my data, after
JDD> copying it to an other disk.
JDD> The results are not that good
JDD> on a 683Gb file content with mostly text (but obviously not only
JDD> and probably not the main part), I got only 50g gain by
JDD> compression :-(

It is important to note that the effectiveness of compression and deduplication techniques depends on the type of data being compressed or deduplicated. Text files are already compressed and may not benefit much from further compression.

Regards.

---
┏━━┓彡 野宮 賢 mail-to: nomiya @ lake.dti.ne.jp
┃\/彡
┗━━┛
"Companies have come to view generative AI as a kind of monster that must be fed at all costs—even if it isn't always clear what exactly that data is needed for or what those future AI systems might end up doing."
-- Generative AI Is Making Companies Even More Thirsty for Your Data
On 05/11/2023 at 09:44, Masaru Nomiya wrote:
Text files are already compressed and may not benefit much from further compression.
Certainly not. mp4 or jpg files are already compressed, but raw txt or html files are not (LibreOffice text documents are probably compressed, though).

jdd
--
https://artdagio.fr
Hello,

In the Message;
Subject : Re: btrfs deduplication and rsync [actual tests]
Message-ID : <ebbf3b37-23d0-4979-8801-ecdb83459e92@dodin.org>
Date & Time: Sun, 5 Nov 2023 09:51:18 +0100

[jdd] == "jdd@dodin.org" <jdd@dodin.org> has written:

jdd> Le 05/11/2023 à 09:44, Masaru Nomiya a écrit :
MN> > Text files are already compressed and may not benefit much from
MN> > further compression.
jdd> certainly not. mp4 ot jpg files are already compressed, not raw
jdd> txt or html files (libreoffice text are probably compressed)

Sorry, I didn't phrase that well. Text files don't have much redundancy, and in your case there weren't many duplicate files. That is what I meant.

For another file system, there is this approach:
https://www.jeskell.com/blog/jeskell-overview-of-data-deduplication-compress...

Regards.

---
┏━━┓彡 野宮 賢 mail-to: nomiya @ lake.dti.ne.jp
┃\/彡
┗━━┛
"Maddox hopes that empowering users to pick their own algorithms will get them to think more about what's involved in making them."
-- Bluesky's Custom Algorithms Could Be the Future of Social Media
On 05/11/2023 at 10:22, Masaru Nomiya wrote:
Text files don't have much redundancy, and in your case, there weren't many duplicate files. I guess that's what I meant.
There are, but maybe not as large as I was thinking. I have a lot of html files in archives from a course I made some years ago, with many backups and few file changes between them. Lots of files. Also mail archives, but I guess on modern TB-size disks it's not that large; my Thunderbird folder was 12 GB recently, not much for 1 TB disks.

thanks
jdd
--
https://artdagio.fr
participants (4)
- Andrei Borzenkov
- Carlos E. R.
- jdd@dodin.org
- Masaru Nomiya