Is Dedup process working properly??

Discussion in 'HD/HDR-FOX T2 Customised Firmware' started by rodp, Nov 1, 2017.

  1. rodp

    rodp Member

    Hi All,

    I used the de-duplicate feature for the first time on a folder of 57 files (The A Team). The results said that 45 were duplicates. However, when I copied the URLs from the browser into Excel, extracted the hyperlink screen tips and checked those for duplicates, I only found 23. Before I put my faith in the dedup process, please could someone confirm that it isn't going to delete all 45 files, but only about half of them (which is closer to the 23 I found manually)?

    One thing I noticed is that the dedup results only show 40 characters of the description. The description is the key thing that determines whether programs are duplicates, and it often contains the Series and Episode number. Does dedup only look at the first 40 characters of the program description, or does it look at the entire description?



    Pic pasted below - hopefully it comes through OK and shows the cutoff at 40 characters.
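
    To illustrate why the 40-character question matters, here is a minimal Python sketch. It is not the dedup package's actual code, and it assumes (hypothetically) that duplicates are found by comparing description text. The sample descriptions are made up, and `count_duplicates` is an illustrative helper name.

```python
from collections import Counter

def count_duplicates(descriptions, limit=None):
    """Count entries whose key repeats an earlier one.

    The key is the description cut to `limit` characters
    (the whole description when `limit` is None).
    """
    keys = [d[:limit] for d in descriptions]
    counts = Counter(keys)
    # Every occurrence beyond the first of each key is a duplicate.
    return sum(n - 1 for n in counts.values())

# Made-up descriptions, not taken from the recordings in this thread.
descriptions = [
    "Action series about four ex-soldiers for hire. S1 Ep1",
    "Action series about four ex-soldiers for hire. S1 Ep2",
    "Action series about four ex-soldiers for hire. S1 Ep2",
    "Action series about four ex-soldiers for hire. S2 Ep5",
]

full = count_duplicates(descriptions)                 # compare whole text
truncated = count_duplicates(descriptions, limit=40)  # compare first 40 chars
print(full, truncated)
```

    If the comparison really did use only the first 40 characters, episodes that differ only at the end of the description would all be counted as duplicates, which would inflate the count in the way described above.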

  2. Ezra Pound

    Ezra Pound Well-Known Member

    I'm guessing that there is a 40-character limit for a file name, so the 'proposed file name' has been truncated. It's just a question of which 40 it picks.
  3. rodp

    rodp Member

    I see. Is there a way to make sure it picks out 'Sx Ep xx' style text? A bit of regex?

    I'm not sure it's finding the correct duplicates at the moment, as it's not looking at the whole text. I've attached the full info in a spreadsheet showing the difference between what dedup finds and the manual way. I've put an x by the ones that are duplicates.
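
    As a rough idea of the kind of regex meant here, a sketch in Python (not something the dedup package provides, as far as this thread shows). The pattern and the `episode_key` helper are hypothetical, and the pattern assumes the description contains text such as "S1 Ep2" or "S1 Ep 12".

```python
import re

# Hypothetical pattern: matches "S1 Ep2", "S1 Ep 12", "s01 ep03" and "S1, Ep2".
EPISODE_RE = re.compile(r"\bS(\d+)[\s,]*Ep\.?\s*(\d+)\b", re.IGNORECASE)

def episode_key(description):
    """Return (series, episode) as integers, or None if no match is found."""
    m = EPISODE_RE.search(description)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2))

# Made-up example descriptions.
print(episode_key("The team help a farmer. S1 Ep 12"))
print(episode_key("No episode information here"))
```

    Two recordings could then be treated as duplicates when they share the same (series, episode) pair, wherever that text sits in the description.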



  4. af123

    af123 Administrator Staff Member

    If you use theTVDB integration, it will do a much better job. Just tell it that it's The A-Team.

  5. rodp

    rodp Member

    Thanks af123 - that defo helps. Is there a way to still keep the label 'The A-Team' but add the other stuff when you do a dedup?