• 0

How would you sort 20000 unrelated pictures?


Question

I've accrued around 20000 images of internet randomness, and their disparate file names have finally triggered my OCD. I'm at a complete loss at a practical structure and method for batch sorting and renaming them. My lazy inclination is to rename by creation date, but after meeting people who sort their porn by color and emotion (don't ask) and others with too much time sorting their image collection by subject (i.e. img \bears \ bear tongue \beartonguelong.jpg ), I feel like I should... do better.

If no exiting program can automate this, I think I'm going to have to write or try to convince my developer friend to write a script to determine the RGB value of each picture and rename them as RGB scales so I can search them by rough color ranges. i.e. R24G10B01.jpg for all images with R240-249, G100-109, B 10-19. Maybe too much work for something like this. Either way, I'm just polling to see if there's any organization structures for random pictures I'm not thinking of.

Recommended Posts

  • 0

Is there some sort of metadata available that would tell you where it was downloaded from?

I'm sure something like this exists somewhere, but I can't remember on which operating system and which details were available.

If that is the case, you would be able to arrange it to some degree based on where you got the image from.

  • 0

Obviously you need to put your vacation time in with work, then start looking at the photos one at a time, and moving them into catagories(such as funny, gross, sexy, etc), then naming them based on the catagory.

That or just delete them, because really, are you going to look at them again?

  • 0

Ask yourself if you really need them or are unnecessarily hoarding them.

The only way to sort them if you really need them is the painful way - sort by subject, manually.

If not select all and delete - probably what I would do - and problem solved.

  • 0

It's a good question; what do you even need them for?

If they're just sat there and you have no way of finding anything, surely you have no need for them?

Whenever I've downloaded images that I want to keep (mainly 'good' design and stuff) I've simply sorted them by the source and date. I've done it like this because I thought I'd actually go back to them at some point, and I do, and doing so is a pleasure :)

  • 0

20,000?

Do 100 a day, and 5 1/2 years from now you'll be done!

Seriously though, if I had that many pictures I would just accept the order that it was in rather than trying to change it.

Where did you learn to do maths lol

100 a day for 20k would be 200 days lol

  • 0

Just use something like Picasa, go through and batch tag all the important pictures. Like Holiday, Florida, Dog, Cat, Sister, Party etc. That way all the ones you will want to find can be done easily. I wouldn't worry about actual image path locations as images can be categorised many times. Like Funny, memes, rage.

  • 0

It's a good question; what do you even need them for?

If they're just sat there and you have no way of finding anything, surely you have no need for them?

Whenever I've downloaded images that I want to keep (mainly 'good' design and stuff) I've simply sorted them by the source and date. I've done it like this because I thought I'd actually go back to them at some point, and I do, and doing so is a pleasure :)

They're 80% design or photography stuff from various genres and the rest probably an assortment of cats and gifs. The images are saved automatically from starred google reader items using IFTTT with site title in the file name, but that's pretty redundant with reverse image search which is why I think sorting by color would do the trick. I go through them pretty often too, maybe I have to con some interns into sorting them, but there's way too much questionable stuff mixed in.

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Anthropic accuses Alibaba of using 25,000 fake accounts to copy Claude's capabilities by Karthik Mudaliar Anthropic has accused Alibaba of using nearly 25,000 fraudulent accounts to extract capabilities from Claude on a huge scale. According to a report from Reuters, Anthropic told US lawmakers that operators linked to Alibaba and the company’s Qwen AI team generated 28.8 million exchanges with Claude between April 22 and June 5, 2026. That is a lot of Claude conversations, but Anthropic says this was not ordinary chatbot use. The company believes the accounts were part of a coordinated effort to collect answers that could help train or improve rival AI systems. The alleged campaign reportedly focused on some of Claude’s most valuable skills, including software development, multi-step reasoning, and agentic tasks. In practical terms, that means getting an AI model to plan and complete work across several stages rather than simply answering a single question. This is called 'distillation,' where AI companies use outputs from a larger model to train a smaller and cheaper one. The smaller model learns to imitate useful parts of the more capable system without needing the same amount of computing power. The distillation process isn't automatically suspicious, but the problem comes when one company gathers another provider's outputs without permission and at an industrial scale. Also, this does not mean Alibaba obtained Claude’s source code, model weights, or original training data. Instead, Anthropic claims the accounts repeatedly asked Claude carefully designed questions and collected the answers. Those answers could then be used as training material for another model. Anthropic has made similar accusations against DeepSeek, Moonshot AI, and MiniMax earlier this year. As Neowin previously reported, Anthropic said those three companies collectively generated more than 16 million Claude exchanges through roughly 24,000 accounts. Anthropic says the new campaign produced almost twice as many exchanges in a matter of weeks. Anthropic reportedly told lawmakers that the campaign could help Chinese AI developers approach the capabilities of its Mythos Preview model. Mythos is focused on advanced cybersecurity work, including finding and exploiting complex software vulnerabilities. via Reuters | Photo via DepositPhotos.com
    • An Indian manufacturer that assembles roughly one-third of Apple's iPhones and supplies semiconductor components to Tesla confirmed Monday that attackers had stolen and publicly published a 630-gigabyte cache of confidential files — including engineering blueprints stamped "TRADE SECRET," a 52-page quality inspection document for iPhone circuit board components, and cryptographic certificates that security experts say could be weaponized in follow-on attacks. https://www.techtimes.com/articles/319019/20260624/apple-tesla-supplier-tata-electronics-confirms-630-gb-data-theft-iphone-specs-dark-web.htm
    • I don't think it was ever a big question. In fact, I don't think anyone ever asked about how clocks work on Mars.
    • I don't know what the price difference is between a 5GbE and a 10GbE part, but it seems that putting a 10GbE port in might be a bit more 'standard'.
  • Recent Achievements

    • Rookie
      krychek57 went up a rank
      Rookie
    • Grand Master
      Jaybonaut went up a rank
      Grand Master
    • One Year In
      Philsl earned a badge
      One Year In
    • Dedicated
      Scoobystu earned a badge
      Dedicated
    • First Post
      Tom Schmidt earned a badge
      First Post
  • Popular Contributors

    1. 1
      +primortal
      441
    2. 2
      +Edouard
      175
    3. 3
      PsYcHoKiLLa
      133
    4. 4
      Michael Scrip
      79
    5. 5
      Xenon
      77
  • Tell a friend

    Love Neowin? Tell a friend!