A python Script which syncs file names between


Recommended Posts

File Renamer 3.5

This is an app that chatGPT helped me create. I modified some of the code. It lets sync the file names from a source directory to a destination directory if both files are identical and all that was changed was the file's name. This way you don't have to recopy the file all over again.

  

image.png.1f2385aac718e66e8cea9ee662086188.png

image.png.333ff5cea13b2773f063816e4642f0b7.png

 

image.thumb.png.85d106e823e85362ddea018f6e334252.png

Version 3.5.1

  • Fixed a bug, where if a file had a specifically formatted file name it would get incorrectly renamed and shortened.

Change log

Version 3.5

Change log

  • Added the "Save folder paths" to save the file paths for both A and B
  • Added the "Load Folder Paths" to load the folder paths for A and B.

Version 3.1

Change log

  • Brand new UI
  • A save button to save the output of the scan
  • a progress bar
  • If a collision, all the files have been renamed, or there are no files found to be renamed you can't click Continue Renaming, the button is dead.
  • The scan results are now displayed in a table format
  • resizeable window
  • The words "collision found" appears in red at the top
  • All dialog boxes removed
  • The list of files is now scrollable

Version 2.3

Change log

  • Fixed a bug causing the app to hang while scanning a directory with 1000+ files
  • in the corruption detection, the text file now tells you the file (s)  in both A and B that caused the collision.

Version 2.0

Change log

  • After selecting A and B it checks both A and B for collisions and if any are found the program stops. If it finds collisions in either directory it saves the results of both directories to a folder on the desktop called collisions.txt. Once The collisions are fixed or if it doesn't find any the program proceeds Normally.
  • Changed the collision detection to look check the file name as well as the first word of the file name minus the extension. Plus the modified date and time from before.

 

import os
import re
import tkinter as tk
from tkinter import filedialog
from tkinter import messagebox
from tkinter import ttk

IGNORED_EXTENSIONS = ['.srt', '.nfo', '.sub']

def get_folder_path(entry):
    folder_path = filedialog.askdirectory()
    entry.delete(0, tk.END)
    entry.insert(0, folder_path)

def compare_folders():
    folder_a = folder_a_entry.get()
    folder_b = folder_b_entry.get()

    tree.delete(*tree.get_children())

    file_sizes_a = {}
    for filename in os.listdir(folder_a):
        if os.path.isfile(os.path.join(folder_a, filename)) and not filename.endswith(('.srt', '.nfo', '.sub')):
            file_sizes_a[filename] = os.path.getsize(os.path.join(folder_a, filename))

    file_names_b = []
    file_sizes_b = {}
    for filename in os.listdir(folder_b):
        if os.path.isfile(os.path.join(folder_b, filename)) and not filename.endswith(('.srt', '.nfo', '.sub')):
            file_names_b.append(filename)
            file_sizes_b[filename] = os.path.getsize(os.path.join(folder_b, filename))

    differences = []

    for filename_a, size_a in file_sizes_a.items():
        matched = False

        for filename_b, size_b in file_sizes_b.items():
            if size_a == size_b:
                name_b, ext_b = os.path.splitext(filename_b)

                if not re.search(r'\(\d{4}\)$', name_b):
                    differences.append((filename_a, filename_b))
                    matched = True
                    break

        
    for diff in differences:
        tree.insert('', 'end', values=diff)

    check_collisions(folder_b)  # Call check_collisions function after comparing folders

def check_collisions(folder_b):
    file_sizes = {}
    collisions = {}

    for filename in os.listdir(folder_b):
        file_extension = os.path.splitext(filename)[1].lower()

        if os.path.isfile(os.path.join(folder_b, filename)) and file_extension not in IGNORED_EXTENSIONS:
            size = os.path.getsize(os.path.join(folder_b, filename))

            if size in file_sizes:
                if size not in collisions:
                    collisions[size] = []
                collisions[size].extend([file_sizes[size], filename])
            else:
                file_sizes[size] = filename

    if collisions:
        report = "Collision Detected:\n\n"
        for size, filenames in collisions.items():
            report += f"Files with size {size} bytes:\n"
            report += " - ".join(filenames)
            report += "\n\n"

        messagebox.showwarning("Collision Detected", report)
    else:
        messagebox.showinfo("No Collisions", "No collisions found in the directory.")

def rename_files():
    folder_b = folder_b_entry.get()
    skipped_count = 0
    renamed_count = 0
    
    for child in tree.get_children():
        filename_a, filename_b = tree.item(child)['values']
        new_path = os.path.join(folder_b, filename_a)
        
        if not os.path.exists(new_path):
            os.rename(os.path.join(folder_b, filename_b), new_path)
            renamed_count += 1
        else:
            skipped_count += 1
    
    summary_message = f"Renamed files: {renamed_count}\nSkipped files: {skipped_count}"
    messagebox.showinfo('Renaming Summary', summary_message)

def save_folder_paths():
    folder_a = folder_a_entry.get()
    folder_b = folder_b_entry.get()

    save_path = filedialog.asksaveasfilename(defaultextension=".txt", filetypes=(("Text Files", "*.txt"), ("All Files", "*.*")))

    if save_path:
        with open(save_path, 'w') as file:
            file.write(f"Folderfolder_a}\n")
            file.write(f"Folderfolder_b}")

        messagebox.showinfo('Save', 'Folder paths saved successfully.')

def load_folder_paths():
    load_path = filedialog.askopenfilename(filetypes=(("Text Files", "*.txt"), ("All Files", "*.*")))

    if load_path:
        with open(load_path, 'r') as file:
            content = file.read()

        folder_a_match = re.search(r"Folder A: (.+)", content)
        folder_b_match = re.search(r"Folder B: (.+)", content)

        if folder_a_match and folder_b_match:
            folder_a = folder_a_match.group(1).strip()
            folder_b = folder_b_match.group(1).strip()

            folder_a_entry.delete(0, tk.END)
            folder_a_entry.insert(0, folder_a)
            folder_b_entry.delete(0, tk.END)
            folder_b_entry.insert(0, folder_b)

            messagebox.showinfo('Load', 'Folder paths loaded successfully.')
        else:
            messagebox.showwarning('Load', 'Invalid file format. Please select a valid text file.')

root = tk.Tk()

folder_a_entry = tk.Entry(root)
folder_a_entry.pack()
tk.Button(root, text="Use these names in Folder A", command=lambda: get_folder_path(folder_a_entry)).pack()

folder_b_entry = tk.Entry(root)
folder_b_entry.pack()
tk.Button(root, text="To rename files in this folder, Folder B", command=lambda: get_folder_path(folder_b_entry)).pack()

tk.Button(root, text="Compare Folders", command=compare_folders).pack()

tree = ttk.Treeview(root, columns=('Folder A', 'Folder B'), show='headings')
tree.heading('Folder A', text='Folder A')
tree.heading('Folder B', text='Folder B')
tree.pack(side='topill='both', expand=True, padx=10, pady=10)

tk.Button(root, text="Rename Files", command=rename_files).pack()

tk.Button(root, text="Save Folder Paths", command=save_folder_paths).pack()
tk.Button(root, text="Load Folder Paths", command=load_folder_paths).pack()

root.mainloop()

It's not difficult to understand, you've renamed the file so backup software is going to assume it's a new file. The contents of the file may be the same, but backup software isn't going to make assumptions like that. If you want incremental\checksum comparisons etc, the file is going to need to be the same name. Sure it may do some deduping in the background to compress the media set, but it is going to show the file with the amended file name.

If I have a media set, and files have been renamed, I want to be able to go back through the backup sets and see the differences, regardless if the file contents is the same.

You could write some PowerShell to compare the checksum of files in two locations; have it assume that if a file's checksum in Location A (source) matches that of a file in Location B (dest), rename file in Location B to match Location A, else copy file to location B.

 

You can try https://freefilesync.org/ with "content" method when comparing, haven't tried it. The other option is to go to you backup and rename them there as well manually before sync.

Update: Probably won't work if you are not using it already (before the rename) https://freefilesync.org/manual.php?topic=synchronization-settings - Detect Moved Files

  On 24/05/2023 at 05:23, binaryzero said:

You could write some PowerShell to compare the checksum of files in two locations; have it assume that if a file's checksum in Location A (source) matches that of a file in Location B (dest), rename file in Location B to match Location A, else copy file to location B.

Expand  

Already on it! I was Enlisting the help of chat GPT to write me some code.

So nailed out all the bugs and it works great

I wanted to do it properly by creating and comparing the hashes of the files but that brought my underpowered file server to its knees and it took forever given the file size. Plus I wanted to be able to do this over the network

I ended up having ChatGPT create a Python script that compares the first name of each file and the modification date in folder A (Source) and folder B (Destination). If it gets a match, it renames the file in folder B to the name of the file in folder A. This is because the file names will be all the same except for the year at the end of them

It then displays a box, showing me a before and after list of all the files it will be renaming and I have to confirm it. It will also skip files that are already named that name in folder B.

I on the first run it doubled up the file extension, so I told it to not do that and then it stopped doing that.

image.thumb.png.8bc8bdb60b48cb9584b2b01603c982a8.png

 

Once I click yes

image.thumb.png.28e45e47d004972b8a85c0cd4ce312f4.png

 

 

So I was trying to tell the program if it finds a file name with the same first word and the same date and time to skip it and move on but record it in a text file. But that made the code overly complicated and it didn't work.

so I thought let's have it make me a small collision detector .. the same first word same date AND same time. and ...

 

import os
import tkinter as tk
from tkinter import filedialog, messagebox
from collections import defaultdict

def get_first_word(file_name):
    return file_name.split('.')[0].split(' ')[0]

def find_file_collisions():
    folder_path = filedialog.askdirectory(title='Select Folder')

    if not folder_path:
        return

    file_info = defaultdict(list)
    collisions = []

    for file_name in os.listdir(folder_path):
        file_path = os.path.join(folder_path, file_name)

        if os.path.isfile(file_path):
            first_word = get_first_word(file_name)
            modified_time = os.path.getmtime(file_path)

            file_info[(first_word, modified_time)].append(file_path)

    for paths in file_info.values():
        if len(paths) > 1:
            collisions.extend(paths)

    if collisions:
        messagebox.showinfo("Collisions Found", "Collisions found in the directory.")

        desktop_path = os.path.join(os.path.expanduser("~"), "Desktop")
        file_path = os.path.join(desktop_path, "collisions.txt")
        with open(file_path, 'w') as file:
            file.write("Duplicates:\n")
            for path in collisions:
                file.write(f"{os.path.basename(path{path}\n")

        messagebox.showinfo("Results Saved", f"Duplicate file names and paths recorded in {file_path}.")
    else:
        messagebox.showinfo("No Collisions", "No collisions found in the directory.")

# Create the main window
window = tk.Tk()
window.title('Filelision Detector')
window.geometry('400x200')

# Create the "Find Collisions" button
find_button = tk.Button(window, text='Find Collisions', command=find_file_collisions)
find_button.pack(pady=20)

# Run the main event loop
window.mainloop()

it worked great

I had it scan 1 of my directories it found 2 collisions.

2 files had the same first word in their name and both were modified on 10/27/2017 2:56 pm

So cool! Then I got a free program to change the modified date.

 

  On 25/05/2023 at 01:46, binaryzero said:

Cool.

Hopefully one day you learn how to code instead of relying on LLMs...

Expand  

Coding is not for me. I've attempted it many times. But I'm very happy an LLM wrote me some code to solve a problem I was having. The LLM was actually my last resort. I scoured around Google for a program to do what I wanted, but didn't find anything.

  • Like 3
  On 25/05/2023 at 07:19, Jose_49 said:

For massively renaming files I strongly recommend Bulk Rename Utility

It's daunting at first.

 

But after you get to know it. It will do wonders with your files!

Expand  

Yep, i've used it before. Great tool! Just not for right for This scenario.

  On 25/05/2023 at 20:18, Brandon H said:

neat. that's a surprisingly simple code base to achieve what you need there.

Expand  

Here is how the new version works.

First I run a small Collision finder app I had it make for me. This is because I was testing the first app by giving two files the exact same first name and modification date and time. To see what it would do, of course, the program didn't know how to handle it which I expected. I tried to tell it to add code to check for that into the program, but in the end, it was much easier for it to create a separate program I call Collision detector.

I actually ran the collision detector on the actual files I made this program for. It found some collisions. Super cool! in 2 of the cases they were duplicate files that could be removed. Gained back 9GB of data finding those.

image.thumb.png.264c19623c8afe513142214baeeff94a.pngimage.png.13b139b271a7ed56e1d93681043cb26e.png

image.png.020040de0f7b58f4d7dc5a1425c206fa.png

Or if it finds no duplicates it will say

image.png.b856cb549d8797dcf1f253fa9f9b1298.png

Now you can run the primary app

After you choose the source and destination it asks you if you want to save it to a text file. if you choose yes, it saves a text file called differences and then exits. Then go look at the text file first to see what it will be renaming.

image.thumb.png.da8a369e76be8048a81b9ce5a0bed785.png

image.thumb.png.b91de26e6c2d6df46e088e4c55570bb1.png

image.thumb.png.bff68285088f135639f0ab8a6a539fe9.png

Now once we are satisfied with what it will be renaming, we run the app again and choose no since we already examined the text file we can click yes to rename the files.

image.thumb.png.5f8ef5dba35cda5e1bbe9ad6d7211cfc.png

That's all there is to it!

 

image.png

  • +Warwagon changed the title to A python Script which syncs file names between
  • 2 weeks later...

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Windows 10 KB5063159 fixes bug that wouldn't let some Microsoft Surface devices boot by Sayan Sen Microsoft released Windows 10 Patch Tuesday updates for the month last week. The one for Windows 10 under KB5060533 / KB5060531 / KB5061010 / KB5060998 introduced a bug that would not let Surface Hub v1 devices start due to a Secure Boot validation issue. As such, Microsoft had paused the update similar to the compatibility blocks or safeguard holds it applies for major feature updates as well. This bug was uncovered after the update went live, as Microsoft later added it to the list of known issues for that update and it also put up a big notice in bold. It wrote: Earlier today, the company released an out-of-band (OOB) update to address the issue. It has been published under KB5063159 and is only being offered to Surface Hub v1 devices instead of the buggy KB5060533 Patch Tuesday one. In the description of the new OOB update, Microsoft writes: You can find the support article for KB5063159 here on Microsoft's website. It is downloaded and installed automatically but users can also manually download it from the Microsoft Update Catalog website.
    • I thought I saw that one, and yeah, it was awhile ago, too..
    • Jumping unicorns says that I forgot you. I never grunt.
    • Microsoft can't help break Windows Outlook as even opening an email now sends it crashing by Sayan Sen It is no surprise that right now, Microsoft's priority is to move users onto the New Outlook for Windows if they haven't done so already. The company began default-deployment of the new app on fresh Microsoft 365 systems and also made switching between the New app and the classic app easier. At the same time, the company is also trying to improve the new app as it recently added yet another feature that will allow people to use the app without an internet connection, and it also blocked more file types to improve security. Perhaps as a consequence of this relentless pursuit, the classic Outlook app has become quite prone to bugs and issues in recent times as Microsoft may be losing its focus. For example, in April, the downloads for Classic Outlook broke and more recently, this month, the tech giant confirmed that a major Calendar feature upgrade has broken the app in several different ways. Microsoft has provided workarounds for several of the issues so check this article out if you are having problems. And the bugs just keep on coming as Microsoft earlier today confirmed that opening emails or starting a new email will now crash the application, and it affects all channels. The company explains: Opening a mail on an email client is probably the most common and basic functionality a user is expected to do, so it is annoying to see such a bug not getting caught during testing. Again, this perhaps indicates that Microsoft's focus maybe elsewhere at the moment. The company has shared the Windows Event Viewer Log for the crash and the problematic OLMAPI32 DLL file is related to Microsoft Messaging Application Programming Interface (MAPI). The issue reminds us of the similar OLEAUT32 dll issue fix in our recent ReactOS article. The event viewer log notes: Faulting application name: OUTLOOK.EXE, version: 16.0.14334.20090, time stamp: 0x683fe030 Faulting module name: OLMAPI32.DLL, version: 16.0.14334.20090, time stamp: 0x683ff910 Exception code: 0xc0000409 Fault offset: 0x00196e1c Faulting process id: 0x9100 Faulting application start time: 0x01dbdbe04fe54514 Faulting application path: C:\Program Files (x86)\Microsoft Office\root\Office16\OUTLOOK.EXE Faulting module path: C:\Program Files (x86)\Microsoft Office\root\Office16\OLMAPI32.DLL Thankfully, Microsoft has issued a workaround for the problem as it currently investigates the bug by manually creating a FORMS2 folder which should exist at the following address: C:\Users\\AppData\Local\Microsoft\FORMS2. Microsoft explains: You can find the support article for the issue here on Microsoft's official website.
    • I have no issue with a service not being offered free. That's perfectly understandable. I do have issue with seeing ads literally everywhere these days, though, plastered in layers over every website and in every app. Especially with how annoying they tend to be (hell, many ads these days even promote literal scams). That's why people use ad blockers. Let me choose. Offer an ad-free experience for a couple of bucks (but be reasonable, don't get too greedy) or ads for free access. Don't make it ads or nothing.
  • Recent Achievements

    • Week One Done
      korostelev earned a badge
      Week One Done
    • Week One Done
      rozermack875 earned a badge
      Week One Done
    • Week One Done
      oneworldtechnologies earned a badge
      Week One Done
    • Veteran
      matthiew went up a rank
      Veteran
    • Enthusiast
      Motoman26 went up a rank
      Enthusiast
  • Popular Contributors

    1. 1
      +primortal
      688
    2. 2
      ATLien_0
      268
    3. 3
      Michael Scrip
      184
    4. 4
      +FloatingFatMan
      177
    5. 5
      Steven P.
      140
  • Tell a friend

    Love Neowin? Tell a friend!