Hello community, here is the log from the commit of package duperemove for openSUSE:Factory checked in at 2015-10-14 16:45:23 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Comparing /work/SRC/openSUSE:Factory/duperemove (Old) and /work/SRC/openSUSE:Factory/.duperemove.new (New) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Package is "duperemove" Changes: -------- --- /work/SRC/openSUSE:Factory/duperemove/duperemove.changes 2015-07-19 11:45:47.000000000 +0200 +++ /work/SRC/openSUSE:Factory/.duperemove.new/duperemove.changes 2015-10-14 16:45:24.000000000 +0200 @@ -1,0 +2,9 @@ +Fri Oct 9 22:37:04 UTC 2015 - mfasheh@suse.com + +- Update to duperemove v0.10 + - Documentation updates for latest stable version + - document --fdupes mode in manpage + - update man page headers + - update FAQ + +------------------------------------------------------------------- Old: ---- v0.10.beta4.tar.gz New: ---- v0.10.tar.gz ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Other differences: ------------------ ++++++ duperemove.spec ++++++ --- /var/tmp/diff_new_pack.DXPmeB/_old 2015-10-14 16:45:25.000000000 +0200 +++ /var/tmp/diff_new_pack.DXPmeB/_new 2015-10-14 16:45:25.000000000 +0200 @@ -16,10 +16,10 @@ # -%define tar_version 0.10.beta4 +%define tar_version 0.10 %define samename btrfs-extent-same Name: duperemove -Version: 0.10.beta4 +Version: 0.10 Release: 0 Summary: Software to find duplicate extents in files and remove them License: GPL-2.0 ++++++ v0.10.beta4.tar.gz -> v0.10.tar.gz ++++++ diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/duperemove-0.10.beta4/FAQ.md new/duperemove-0.10/FAQ.md --- old/duperemove-0.10.beta4/FAQ.md 2015-07-16 22:52:51.000000000 +0200 +++ new/duperemove-0.10/FAQ.md 2015-10-10 00:13:34.000000000 +0200 @@ -2,12 +2,15 @@ ### Is there an upper limit to the amount of data duperemove can process? -v0.08 of duperemove has been tested on small numbers of VMS or iso -files (5-10) it can probably scale up to 50 or so. +Duperemove v0.10 is fast at reading and cataloging data. Dedupe runs +will be memory limited unless the '--hashfile' option is used. '--hashfile' +allows duperemove to temporarily store duplicated hashes to disk, thus removing +the large memory overhead and allowing for a far larger amount of data to be +scanned and deduped. Realistically though you will be limited by the speed of +your disks and cpu. -v0.09 is much faster at hashing and cataloging extents and therefore -can handle a larger data set. My own testing is typically with a -filesystem of about 750 gigabytes and millions of files. +Actual performance numbers are dependent on hardware - up to date +testing information is kept [on the wiki](https://github.com/markfasheh/duperemove/wiki/Performance-Numbers) ### Why does it not print out all duplicate extents? @@ -20,11 +23,14 @@ ### How can I find out my space savings after a dedupe? Duperemove will print out an estimate of the saved space after a -dedupe operation for you. You can also do a df before the dedupe -operation, then a df about 60 seconds after the operation. It is -common for btrfs space reporting to be 'behind' while delayed updates -get processed, so an immediate df after deduping might not show any -savings. +dedupe operation for you. + +You can get a more accurate picture by running 'btrfs fi df' before +and after each duperemove run. + +Be careful about using the 'df' tool on btrfs - it is common for space +reporting to be 'behind' while delayed updates get processed, so an +immediate df after deduping might not show any savings. ### Why is the total deduped data report an estimate? diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/duperemove-0.10.beta4/Makefile new/duperemove-0.10/Makefile --- old/duperemove-0.10.beta4/Makefile 2015-07-16 22:52:51.000000000 +0200 +++ new/duperemove-0.10/Makefile 2015-10-10 00:13:34.000000000 +0200 @@ -1,4 +1,4 @@ -VER=0.10.beta4 +VER=0.10 RELEASE=v$(VER) CC = gcc diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/duperemove-0.10.beta4/README.md new/duperemove-0.10/README.md --- old/duperemove-0.10.beta4/README.md 2015-07-16 22:52:51.000000000 +0200 +++ new/duperemove-0.10/README.md 2015-10-10 00:13:34.000000000 +0200 @@ -1,7 +1,3 @@ -This README is for the development branch of duperemove. If you're looking -for a stable version which is continually updated with fixes, please see -[v0.09 branch](https://github.com/markfasheh/duperemove/tree/v0.09-branch). - # Duperemove Duperemove is a simple tool for finding duplicated extents and @@ -50,7 +46,7 @@ # Requirements -The latest stable code can be found in [v0.09-branch](https://github.com/markfasheh/duperemove/tree/v0.09-branch). +The latest stable code can be found in [v0.10-branch](https://github.com/markfasheh/duperemove/tree/v0.10-branch). Kernel: Duperemove needs a kernel version equal to or greater than 3.13 diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/duperemove-0.10.beta4/duperemove.8 new/duperemove-0.10/duperemove.8 --- old/duperemove-0.10.beta4/duperemove.8 2015-07-16 22:52:51.000000000 +0200 +++ new/duperemove-0.10/duperemove.8 2015-10-10 00:13:34.000000000 +0200 @@ -1,4 +1,4 @@ -.TH "duperemove" "8" "March 2014" "Version 0.04" +.TH "duperemove" "8" "October 2015" "Version 0.10" .SH "NAME" duperemove \- Find duplicate extents and print them to stdout .SH "SYNOPSIS" @@ -79,6 +79,12 @@ hashfiles to optimize the file scan by skipping unchanged files. .TP +\fB\--fdupes\fR +Run in \fBfdupes\fR mode. With this option you can pipe the output of +\fBfdupes\fR to duperemove to dedupe any duplicate files found. When +recieving a file list in this manner, duperemove will skip the hashing phase. + +.TP \fB\-x\fR Don't cross filesystem boundaries. @@ -157,3 +163,4 @@ .SH "SEE ALSO" .BR filesystems(5) .BR btrfs(8) +.BR fdupes(1) diff -urN '--exclude=CVS' '--exclude=.cvsignore' '--exclude=.svn' '--exclude=.svnignore' old/duperemove-0.10.beta4/hashstats.8 new/duperemove-0.10/hashstats.8 --- old/duperemove-0.10.beta4/hashstats.8 2015-07-16 22:52:51.000000000 +0200 +++ new/duperemove-0.10/hashstats.8 2015-10-10 00:13:34.000000000 +0200 @@ -1,4 +1,4 @@ -.TH "hashstats" "8" "March 2014" "Version 0.09" +.TH "hashstats" "8" "October 2015" "Version 0.10" .SH "NAME" hashstats \- Print information about a duperemove hashfile .SH "SYNOPSIS" @@ -6,7 +6,7 @@ .SH "DESCRIPTION" .PP \fIhashfile\fR should be a file generated by running duperemove with -the --write-hashes option. +the --hashfile or --write-hashes option. .SH "OPTIONS"