    DedupeCopy 0.3.4

    Developer: Erik Schweller
    License / Price: BSD License / FREE
    Last Updated: August 29th, 2012, 12:58 GMT
    Category: ROOT / Desktop Environment / File managers


    DedupeCopy description

    A command-line tool for finding duplicate files and for copying files into a restructured layout

    DedupeCopy is a simple multi-threaded file copy tool designed for consolidating and restructuring sprawling file systems.

    The most common use case is backing up data into a new layout while skipping duplicated files.

    Other uses include:
      1. Getting a .csv file describing all duplicated files
      2. Comparing different file systems
      3. Restructuring existing sets of files into different layouts (such as sorted by extension or last modification time)
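
    The uses above all depend on recognising files with identical content. The sketch below shows one common way to do that, by grouping files on a SHA-256 digest of their bytes. It is an illustration only: the function names file_digest and find_duplicates are hypothetical, and nothing here is taken from DedupeCopy's own code, which may detect duplicates differently.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Map each content digest to the sorted paths under root that share it.

    Hypothetical helper: only digests seen more than once are returned.
    """
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full_path = os.path.join(dirpath, name)
            by_digest[file_digest(full_path)].append(full_path)
    # Keep only the groups that contain at least two files.
    return {d: sorted(paths) for d, paths in by_digest.items() if len(paths) > 1}
```

    Calling find_duplicates("/some/path") returns one entry per group of identical files. Reading in chunks keeps memory use flat even for large files.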

    This tool is *NOT* a Robocopy or rsync replacement and does not try to fill the role those play.

    As with all code that walks a file tree, please use with caution and expect absolutely no warranty!

    Command examples:

      Generate a duplicate file report for a path:

          dedupe_copy.py -p /Users/ -r dupes.csv -m manifest

      Copy all *.jpg files from multiple paths to a /YYYY_MM/*.jpg structure:

          dedupe_copy.py -p C:\pics -p D:\pics -e jpg -R jpg:mtime -c X:\pics

      Copy all files from two drives to a single target, preserving the path for
      all extensions:

          dedupe_copy.py -p C:\ -p D:\ -c X:\ -m X:\manifest -R *:no_change

      Resume an interrupted run (assuming "-m manifest" used in prior run):

          dedupe_copy.py -p /Users/ -r dupes_2.csv -i manifest -m manifest

      Sequentially copy different sources into the same target, not copying duplicate files (2 sources and 1 target):

        1.) First record manifests for all devices
            dedupe_copy.py -p \\target\share -m target_manifest
            dedupe_copy.py -p \\source1\share -m source1_manifest
            dedupe_copy.py -p \\source2\share -m source2_manifest

        2.) Copy each source to the target (specifying --compare so manifests from
            other sources are loaded but not used as part of the set to copy and
            --no-walk to skip re-scan of the source):
            dedupe_copy.py -p \\source1\share -c \\target\share -i source1_manifest
                --compare source2_manifest --compare target_manifest  --no-walk
            dedupe_copy.py -p \\source2\share -c \\target\share -i source2_manifest
                --compare source1_manifest --compare target_manifest --no-walk

    Complete example:
       
        Assuming you start with a set of files laid out as follows:

        C:\
            pics
                some_photos
                    photo1.jpg
                    photo2.jpg
                photo3.jpg
        D:\
            pics
                copied_photo
                    photo3.jpg
                photo4.jpg


        And you run the command:

        > dedupe_copy.py -p C:\pics -p D:\pics -e jpg -R jpg:mtime -c X:\photos

        You would end up with the following (photo3.jpg was a true duplicate, so only one copy is kept; each YYYY_MM folder comes from the file's last modification time):

        X:\
            photos
                2012_08
                    photo1.jpg
                    photo2.jpg
                    photo3.jpg
                2012_09
                    photo4.jpg
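
    The outcome above can be reproduced with a short planning sketch: keep the first file seen for each content digest, then place it under a YYYY_MM folder derived from its modification time. This is a hedged illustration, not the tool's implementation. The function plan_copies and its (path, digest, mtime) input are hypothetical, the digests are assumed to be precomputed, and ntpath is used only so the Windows-style paths from the example behave the same on any platform.

```python
import ntpath
import time


def plan_copies(files, target_root):
    """Return {source: destination}, keeping the first file seen per digest.

    files is an iterable of (source_path, content_digest, mtime_seconds).
    Hypothetical helper: it only plans the copies and touches no files.
    """
    seen_digests = set()
    plan = {}
    for source, digest, mtime in files:
        if digest in seen_digests:
            continue  # same content already planned, so skip the duplicate
        seen_digests.add(digest)
        # Folder name such as 2012_08, based on local time of the mtime.
        folder = time.strftime("%Y_%m", time.localtime(mtime))
        plan[source] = ntpath.join(target_root, folder, ntpath.basename(source))
    return plan
```

    Given the four distinct photos plus the copied photo3.jpg, with modification times in August and September 2012, the plan holds four destinations and leaves out the second photo3.jpg. Which of two duplicates is kept depends on input order in this sketch; the tool's own choice is not documented here.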


    This project is hosted on Bitbucket: http://www.bitbucket.org/othererik/dedupe_copy



    Requirements:

    · Python

      

