A python Script which syncs file names between


Recommended Posts

File Renamer 3.5

This is an app that chatGPT helped me create. I modified some of the code. It lets sync the file names from a source directory to a destination directory if both files are identical and all that was changed was the file's name. This way you don't have to recopy the file all over again.

  

image.png.1f2385aac718e66e8cea9ee662086188.png

image.png.333ff5cea13b2773f063816e4642f0b7.png

 

image.thumb.png.85d106e823e85362ddea018f6e334252.png

Version 3.5.1

  • Fixed a bug, where if a file had a specifically formatted file name it would get incorrectly renamed and shortened.

Change log

Version 3.5

Change log

  • Added the "Save folder paths" to save the file paths for both A and B
  • Added the "Load Folder Paths" to load the folder paths for A and B.

Version 3.1

Change log

  • Brand new UI
  • A save button to save the output of the scan
  • a progress bar
  • If a collision, all the files have been renamed, or there are no files found to be renamed you can't click Continue Renaming, the button is dead.
  • The scan results are now displayed in a table format
  • resizeable window
  • The words "collision found" appears in red at the top
  • All dialog boxes removed
  • The list of files is now scrollable

Version 2.3

Change log

  • Fixed a bug causing the app to hang while scanning a directory with 1000+ files
  • in the corruption detection, the text file now tells you the file (s)  in both A and B that caused the collision.

Version 2.0

Change log

  • After selecting A and B it checks both A and B for collisions and if any are found the program stops. If it finds collisions in either directory it saves the results of both directories to a folder on the desktop called collisions.txt. Once The collisions are fixed or if it doesn't find any the program proceeds Normally.
  • Changed the collision detection to look check the file name as well as the first word of the file name minus the extension. Plus the modified date and time from before.

 

import os
import re
import tkinter as tk
from tkinter import filedialog
from tkinter import messagebox
from tkinter import ttk

IGNORED_EXTENSIONS = ['.srt', '.nfo', '.sub']

def get_folder_path(entry):
    folder_path = filedialog.askdirectory()
    entry.delete(0, tk.END)
    entry.insert(0, folder_path)

def compare_folders():
    folder_a = folder_a_entry.get()
    folder_b = folder_b_entry.get()

    tree.delete(*tree.get_children())

    file_sizes_a = {}
    for filename in os.listdir(folder_a):
        if os.path.isfile(os.path.join(folder_a, filename)) and not filename.endswith(('.srt', '.nfo', '.sub')):
            file_sizes_a[filename] = os.path.getsize(os.path.join(folder_a, filename))

    file_names_b = []
    file_sizes_b = {}
    for filename in os.listdir(folder_b):
        if os.path.isfile(os.path.join(folder_b, filename)) and not filename.endswith(('.srt', '.nfo', '.sub')):
            file_names_b.append(filename)
            file_sizes_b[filename] = os.path.getsize(os.path.join(folder_b, filename))

    differences = []

    for filename_a, size_a in file_sizes_a.items():
        matched = False

        for filename_b, size_b in file_sizes_b.items():
            if size_a == size_b:
                name_b, ext_b = os.path.splitext(filename_b)

                if not re.search(r'\(\d{4}\)$', name_b):
                    differences.append((filename_a, filename_b))
                    matched = True
                    break

        
    for diff in differences:
        tree.insert('', 'end', values=diff)

    check_collisions(folder_b)  # Call check_collisions function after comparing folders

def check_collisions(folder_b):
    file_sizes = {}
    collisions = {}

    for filename in os.listdir(folder_b):
        file_extension = os.path.splitext(filename)[1].lower()

        if os.path.isfile(os.path.join(folder_b, filename)) and file_extension not in IGNORED_EXTENSIONS:
            size = os.path.getsize(os.path.join(folder_b, filename))

            if size in file_sizes:
                if size not in collisions:
                    collisions[size] = []
                collisions[size].extend([file_sizes[size], filename])
            else:
                file_sizes[size] = filename

    if collisions:
        report = "Collision Detected:\n\n"
        for size, filenames in collisions.items():
            report += f"Files with size {size} bytes:\n"
            report += " - ".join(filenames)
            report += "\n\n"

        messagebox.showwarning("Collision Detected", report)
    else:
        messagebox.showinfo("No Collisions", "No collisions found in the directory.")

def rename_files():
    folder_b = folder_b_entry.get()
    skipped_count = 0
    renamed_count = 0
    
    for child in tree.get_children():
        filename_a, filename_b = tree.item(child)['values']
        new_path = os.path.join(folder_b, filename_a)
        
        if not os.path.exists(new_path):
            os.rename(os.path.join(folder_b, filename_b), new_path)
            renamed_count += 1
        else:
            skipped_count += 1
    
    summary_message = f"Renamed files: {renamed_count}\nSkipped files: {skipped_count}"
    messagebox.showinfo('Renaming Summary', summary_message)

def save_folder_paths():
    folder_a = folder_a_entry.get()
    folder_b = folder_b_entry.get()

    save_path = filedialog.asksaveasfilename(defaultextension=".txt", filetypes=(("Text Files", "*.txt"), ("All Files", "*.*")))

    if save_path:
        with open(save_path, 'w') as file:
            file.write(f"Folderfolder_a}\n")
            file.write(f"Folderfolder_b}")

        messagebox.showinfo('Save', 'Folder paths saved successfully.')

def load_folder_paths():
    load_path = filedialog.askopenfilename(filetypes=(("Text Files", "*.txt"), ("All Files", "*.*")))

    if load_path:
        with open(load_path, 'r') as file:
            content = file.read()

        folder_a_match = re.search(r"Folder A: (.+)", content)
        folder_b_match = re.search(r"Folder B: (.+)", content)

        if folder_a_match and folder_b_match:
            folder_a = folder_a_match.group(1).strip()
            folder_b = folder_b_match.group(1).strip()

            folder_a_entry.delete(0, tk.END)
            folder_a_entry.insert(0, folder_a)
            folder_b_entry.delete(0, tk.END)
            folder_b_entry.insert(0, folder_b)

            messagebox.showinfo('Load', 'Folder paths loaded successfully.')
        else:
            messagebox.showwarning('Load', 'Invalid file format. Please select a valid text file.')

root = tk.Tk()

folder_a_entry = tk.Entry(root)
folder_a_entry.pack()
tk.Button(root, text="Use these names in Folder A", command=lambda: get_folder_path(folder_a_entry)).pack()

folder_b_entry = tk.Entry(root)
folder_b_entry.pack()
tk.Button(root, text="To rename files in this folder, Folder B", command=lambda: get_folder_path(folder_b_entry)).pack()

tk.Button(root, text="Compare Folders", command=compare_folders).pack()

tree = ttk.Treeview(root, columns=('Folder A', 'Folder B'), show='headings')
tree.heading('Folder A', text='Folder A')
tree.heading('Folder B', text='Folder B')
tree.pack(side='topill='both', expand=True, padx=10, pady=10)

tk.Button(root, text="Rename Files", command=rename_files).pack()

tk.Button(root, text="Save Folder Paths", command=save_folder_paths).pack()
tk.Button(root, text="Load Folder Paths", command=load_folder_paths).pack()

root.mainloop()

It's not difficult to understand, you've renamed the file so backup software is going to assume it's a new file. The contents of the file may be the same, but backup software isn't going to make assumptions like that. If you want incremental\checksum comparisons etc, the file is going to need to be the same name. Sure it may do some deduping in the background to compress the media set, but it is going to show the file with the amended file name.

If I have a media set, and files have been renamed, I want to be able to go back through the backup sets and see the differences, regardless if the file contents is the same.

You could write some PowerShell to compare the checksum of files in two locations; have it assume that if a file's checksum in Location A (source) matches that of a file in Location B (dest), rename file in Location B to match Location A, else copy file to location B.

 

You can try https://freefilesync.org/ with "content" method when comparing, haven't tried it. The other option is to go to you backup and rename them there as well manually before sync.

Update: Probably won't work if you are not using it already (before the rename) https://freefilesync.org/manual.php?topic=synchronization-settings - Detect Moved Files

  On 24/05/2023 at 05:23, binaryzero said:

You could write some PowerShell to compare the checksum of files in two locations; have it assume that if a file's checksum in Location A (source) matches that of a file in Location B (dest), rename file in Location B to match Location A, else copy file to location B.

Expand  

Already on it! I was Enlisting the help of chat GPT to write me some code.

So nailed out all the bugs and it works great

I wanted to do it properly by creating and comparing the hashes of the files but that brought my underpowered file server to its knees and it took forever given the file size. Plus I wanted to be able to do this over the network

I ended up having ChatGPT create a Python script that compares the first name of each file and the modification date in folder A (Source) and folder B (Destination). If it gets a match, it renames the file in folder B to the name of the file in folder A. This is because the file names will be all the same except for the year at the end of them

It then displays a box, showing me a before and after list of all the files it will be renaming and I have to confirm it. It will also skip files that are already named that name in folder B.

I on the first run it doubled up the file extension, so I told it to not do that and then it stopped doing that.

image.thumb.png.8bc8bdb60b48cb9584b2b01603c982a8.png

 

Once I click yes

image.thumb.png.28e45e47d004972b8a85c0cd4ce312f4.png

 

 

So I was trying to tell the program if it finds a file name with the same first word and the same date and time to skip it and move on but record it in a text file. But that made the code overly complicated and it didn't work.

so I thought let's have it make me a small collision detector .. the same first word same date AND same time. and ...

 

import os
import tkinter as tk
from tkinter import filedialog, messagebox
from collections import defaultdict

def get_first_word(file_name):
    return file_name.split('.')[0].split(' ')[0]

def find_file_collisions():
    folder_path = filedialog.askdirectory(title='Select Folder')

    if not folder_path:
        return

    file_info = defaultdict(list)
    collisions = []

    for file_name in os.listdir(folder_path):
        file_path = os.path.join(folder_path, file_name)

        if os.path.isfile(file_path):
            first_word = get_first_word(file_name)
            modified_time = os.path.getmtime(file_path)

            file_info[(first_word, modified_time)].append(file_path)

    for paths in file_info.values():
        if len(paths) > 1:
            collisions.extend(paths)

    if collisions:
        messagebox.showinfo("Collisions Found", "Collisions found in the directory.")

        desktop_path = os.path.join(os.path.expanduser("~"), "Desktop")
        file_path = os.path.join(desktop_path, "collisions.txt")
        with open(file_path, 'w') as file:
            file.write("Duplicates:\n")
            for path in collisions:
                file.write(f"{os.path.basename(path{path}\n")

        messagebox.showinfo("Results Saved", f"Duplicate file names and paths recorded in {file_path}.")
    else:
        messagebox.showinfo("No Collisions", "No collisions found in the directory.")

# Create the main window
window = tk.Tk()
window.title('Filelision Detector')
window.geometry('400x200')

# Create the "Find Collisions" button
find_button = tk.Button(window, text='Find Collisions', command=find_file_collisions)
find_button.pack(pady=20)

# Run the main event loop
window.mainloop()

it worked great

I had it scan 1 of my directories it found 2 collisions.

2 files had the same first word in their name and both were modified on 10/27/2017 2:56 pm

So cool! Then I got a free program to change the modified date.

 

  On 25/05/2023 at 01:46, binaryzero said:

Cool.

Hopefully one day you learn how to code instead of relying on LLMs...

Expand  

Coding is not for me. I've attempted it many times. But I'm very happy an LLM wrote me some code to solve a problem I was having. The LLM was actually my last resort. I scoured around Google for a program to do what I wanted, but didn't find anything.

  • Like 3
  On 25/05/2023 at 07:19, Jose_49 said:

For massively renaming files I strongly recommend Bulk Rename Utility

It's daunting at first.

 

But after you get to know it. It will do wonders with your files!

Expand  

Yep, i've used it before. Great tool! Just not for right for This scenario.

  On 25/05/2023 at 20:18, Brandon H said:

neat. that's a surprisingly simple code base to achieve what you need there.

Expand  

Here is how the new version works.

First I run a small Collision finder app I had it make for me. This is because I was testing the first app by giving two files the exact same first name and modification date and time. To see what it would do, of course, the program didn't know how to handle it which I expected. I tried to tell it to add code to check for that into the program, but in the end, it was much easier for it to create a separate program I call Collision detector.

I actually ran the collision detector on the actual files I made this program for. It found some collisions. Super cool! in 2 of the cases they were duplicate files that could be removed. Gained back 9GB of data finding those.

image.thumb.png.264c19623c8afe513142214baeeff94a.pngimage.png.13b139b271a7ed56e1d93681043cb26e.png

image.png.020040de0f7b58f4d7dc5a1425c206fa.png

Or if it finds no duplicates it will say

image.png.b856cb549d8797dcf1f253fa9f9b1298.png

Now you can run the primary app

After you choose the source and destination it asks you if you want to save it to a text file. if you choose yes, it saves a text file called differences and then exits. Then go look at the text file first to see what it will be renaming.

image.thumb.png.da8a369e76be8048a81b9ce5a0bed785.png

image.thumb.png.b91de26e6c2d6df46e088e4c55570bb1.png

image.thumb.png.bff68285088f135639f0ab8a6a539fe9.png

Now once we are satisfied with what it will be renaming, we run the app again and choose no since we already examined the text file we can click yes to rename the files.

image.thumb.png.5f8ef5dba35cda5e1bbe9ad6d7211cfc.png

That's all there is to it!

 

image.png

  • +Warwagon changed the title to A python Script which syncs file names between
  • 2 weeks later...

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • I have the Pixel 9 Pro XL...Unless this thing is "leaps and bounds" faster than the 9, I'll pass. And by leaps and bounds, I don't mean on benchmarks. "Real world" faster. Most people don't even come close to topping out the performance of their phones. Tensor G5 is Google's most powerful chip to date, boasting a staggering 36 percent performance leap over G4.
    • MIT's stunning 'bubble wrap' device squeezes water out from thin air even in deserts by Sayan Sen Image by Matteo Roman via Pexels Massachusetts Institute of Technology (MIT) engineers have built a new kind of device that can pull clean drinking water straight out of the air—no electricity needed. It’s designed for areas where water is scarce and traditional sources like rivers or lakes aren’t reliable. Right now, more than 2.2 billion people globally don’t have access to safe drinking water. In the United States alone, 46 million face water insecurity, with either no running water or water that’s not safe to drink. This new device, called an Atmospheric Water Harvesting Window (AWHW), uses a unique hydrogel panel that looks like black bubble wrap. These dome-shaped bubbles soak up water vapor from the air, especially at night when humidity is higher. During the day, sunlight makes the vapor inside evaporate. That vapor then condenses on a glass surface and drips down through a tube, turning into drinkable water. The AWHW doesn’t rely on power sources like batteries or solar panels. It’s completely passive, meaning it works on its own. The team tested a meter-sized panel in Death Valley, California, one of the driest places in North America, and got between 57.0 and 161.5 milliliters of water per day even with humidity as low as 21 percent. That’s more than what other similar passive devices have managed. “We have built a meter-scale device that we hope to deploy in resource-limited regions, where even a solar cell is not very accessible,” said Xuanhe Zhao, a professor at MIT. “It’s a test of feasibility in scaling up this water harvesting technology. Now people can build it even larger, or make it into parallel panels, to supply drinking water to people and achieve real impact.” Another cool part of the design is how they kept the water safe to drink. Usually, these kinds of hydrogels use salts like lithium chloride to absorb more vapor but that can lead to salt leaking into the water, which isn’t ideal. To solve this, MIT’s team mixed in glycerol, a compound that helps keep salt locked inside the gel. In testing, the lithium ion concentration in the harvested water stayed below 0.06 ppm (parts per million), which is way below the safe limit. The hydrogel domes also give the material more surface area, letting it collect more vapor. The outer glass panel is coated with a special polymer film that helps cool the glass, making it easier for vapor to condense. “This is just a proof-of-concept design, and there are a lot of things we can optimize,” said lead author Chang Liu, now a professor at the National University of Singapore. “For instance, we could have a multipanel design. And we’re working on a next generation of the material to further improve its intrinsic properties.” Published in Nature Water, the study says the AWHW could last at least a year and shows promise for making safe, sustainable water in places with harsh climates. The researchers believe an array of vertical panels could one day supply water to individual households, especially in remote or off-grid locations. Source: MIT News, Nature This article was generated with some help from AI and reviewed by an editor. Under Section 107 of the Copyright Act 1976, this material is used for the purpose of news reporting. Fair use is a use permitted by copyright statute that might otherwise be infringing.
    • Clear Linux is open source, indeed, so its source code is available for anyone. They're just shutting down its support from them, they're not forbidding anyone else from taking over.
    • Linux Mint is also my favorite distro, but I fear what will happen with it if Clem were to disappear tomorrow, to be honest.
    • Yeah, I totally get your point, which is possible it could happen. I just hope there is a few people around him who are similar to where if they took over things would run pretty much the same. if not, then yeah, it could start to decline rapidly etc. but I figure something that's been around for a longer period of time with a decent backing, and probably more users than most Linux distro's (which I would 'imagine' Mint is one of the more used Linux desktop distro's by volume of people who use it), is less likely to just disappear. but like you said, nothing is guaranteed. but I do think you are probably right in that Clem is probably the core of what keeps Mint, Mint. I like how it tends to stay pretty much the same with some slight tweaks here and there (but is largely the same) instead of that crap some people go for with change for the sake of change trying to create a overly fancy interface and other unnecessary stuff etc. I also feel Mint keeps a nice balance of things out-of-the-box where it's not too bloated, nor too striped down. p.s. but I see Mint as a better Ubuntu basically. but I get your point like if it was more of a really serious choice of needing a 'safe bet' to use long term, then yeah something like official Ubuntu would be one of the better choices for sure given what you said with it being backed by an actual company which makes it a safer bet than Mint which is smaller and 'could' potentially be more fragile.
  • Recent Achievements

    • First Post
      leoniDAM earned a badge
      First Post
    • Reacting Well
      Ian_ earned a badge
      Reacting Well
    • One Month Later
      Ian_ earned a badge
      One Month Later
    • Dedicated
      MacDaddyAz earned a badge
      Dedicated
    • Explorer
      cekicen went up a rank
      Explorer
  • Popular Contributors

    1. 1
      +primortal
      506
    2. 2
      ATLien_0
      209
    3. 3
      Michael Scrip
      202
    4. 4
      Xenon
      145
    5. 5
      +FloatingFatMan
      121
  • Tell a friend

    Love Neowin? Tell a friend!