• 0

How would you sort 20000 unrelated pictures?


Question

I've accrued around 20000 images of internet randomness, and their disparate file names have finally triggered my OCD. I'm at a complete loss at a practical structure and method for batch sorting and renaming them. My lazy inclination is to rename by creation date, but after meeting people who sort their porn by color and emotion (don't ask) and others with too much time sorting their image collection by subject (i.e. img \bears \ bear tongue \beartonguelong.jpg ), I feel like I should... do better.

If no exiting program can automate this, I think I'm going to have to write or try to convince my developer friend to write a script to determine the RGB value of each picture and rename them as RGB scales so I can search them by rough color ranges. i.e. R24G10B01.jpg for all images with R240-249, G100-109, B 10-19. Maybe too much work for something like this. Either way, I'm just polling to see if there's any organization structures for random pictures I'm not thinking of.

Recommended Posts

  • 0

Is there some sort of metadata available that would tell you where it was downloaded from?

I'm sure something like this exists somewhere, but I can't remember on which operating system and which details were available.

If that is the case, you would be able to arrange it to some degree based on where you got the image from.

  • 0

Obviously you need to put your vacation time in with work, then start looking at the photos one at a time, and moving them into catagories(such as funny, gross, sexy, etc), then naming them based on the catagory.

That or just delete them, because really, are you going to look at them again?

  • 0

Ask yourself if you really need them or are unnecessarily hoarding them.

The only way to sort them if you really need them is the painful way - sort by subject, manually.

If not select all and delete - probably what I would do - and problem solved.

  • 0

It's a good question; what do you even need them for?

If they're just sat there and you have no way of finding anything, surely you have no need for them?

Whenever I've downloaded images that I want to keep (mainly 'good' design and stuff) I've simply sorted them by the source and date. I've done it like this because I thought I'd actually go back to them at some point, and I do, and doing so is a pleasure :)

  • 0

20,000?

Do 100 a day, and 5 1/2 years from now you'll be done!

Seriously though, if I had that many pictures I would just accept the order that it was in rather than trying to change it.

Where did you learn to do maths lol

100 a day for 20k would be 200 days lol

  • 0

Just use something like Picasa, go through and batch tag all the important pictures. Like Holiday, Florida, Dog, Cat, Sister, Party etc. That way all the ones you will want to find can be done easily. I wouldn't worry about actual image path locations as images can be categorised many times. Like Funny, memes, rage.

  • 0

It's a good question; what do you even need them for?

If they're just sat there and you have no way of finding anything, surely you have no need for them?

Whenever I've downloaded images that I want to keep (mainly 'good' design and stuff) I've simply sorted them by the source and date. I've done it like this because I thought I'd actually go back to them at some point, and I do, and doing so is a pleasure :)

They're 80% design or photography stuff from various genres and the rest probably an assortment of cats and gifs. The images are saved automatically from starred google reader items using IFTTT with site title in the file name, but that's pretty redundant with reverse image search which is why I think sorting by color would do the trick. I go through them pretty often too, maybe I have to con some interns into sorting them, but there's way too much questionable stuff mixed in.

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • The laptop in the bedroom is an Acer with i7-10510U CPU. Acer's website states they will not be upgrading it so I had little choice other than disable secure boot. I know next to nothing on these matters so hopefully it will be fine.
    • GitHub removes manual model selection from Copilot free and student plans by Karthik Mudaliar GitHub is removing the ability to manually select an AI model from its Copilot Free and Student plans, making its automatic routing system the default and only way to choose a model. This means users on these tiers will no longer be able to deliberately select a particular OpenAI, Anthropic, Google, or Microsoft model for a task. In its announcement, GitHub said Copilot Auto will dynamically choose what it considers the best model for each request. Free and Student accounts will retain access to models from multiple families, although the available selection will continue to depend on the restrictions attached to each plan. GitHub did not identify a fixed pool of models that Auto will always use, and its documentation warns that model availability can change over time. GitHub describes Auto as more than a random fallback system. On supported surfaces, its task-optimization technology evaluates the complexity of a request alongside real-time information about model health and availability. Straightforward prompts can be routed to faster and less expensive models, while more demanding coding tasks may be sent to higher-cost reasoning models. The company says this approach should reduce rate limiting, latency, and failed requests. Auto generally selects one model along natural prompt-caching boundaries rather than repeatedly switching models during a session, as GitHub found that mid-session changes increased costs without producing sufficient improvements in output quality. Users can still check which model generated a response. In Copilot Chat, the information appears when hovering over an answer, while Copilot CLI and the Copilot cloud agent display the selected model alongside their output. Auto is available in Copilot Chat, Copilot CLI, and the cloud agent, with the exact implementation and release status varying between supported development environments. The latest restriction follows several months of adjustments to Copilot’s individual plans. GitHub temporarily halted new Pro, Pro+, and Student subscriptions in April as it sought to manage demand and service reliability. It later introduced token-based billing and began gradually reopening individual-plan registrations on June 17. Alongside the picker change, GitHub is retiring the “Preview” label from Microsoft-developed models. It argues that the label is no longer necessary because Auto handles model routing and models are continuously updated behind the scenes.
    • Look up 'inflation' kid. Ask an AI for the numbers between both games.
    • Google reportedly set to lose two key Gemini and DeepMind researchers to Anthropic by Karthik Mudaliar Google is reportedly preparing to lose two more prominent artificial intelligence researchers, with Gemini contributors Jonas Adler and Alexander Pritzel planning to join rival AI developer Anthropic. According to a report from Bloomberg, both researchers are viewed internally as important contributors to Google’s flagship Gemini model family. Adler worked on Google’s AI coding efforts, while Pritzel was involved in the process used to train AI systems. Neither company has publicly confirmed the moves. The report also does not say when the researchers will formally leave Google or what positions they will hold at Anthropic. Training a large AI model requires decisions covering its architecture, data preparation, distributed computing infrastructure, and post-training methods that shape how the finished system behaves. Researchers with experience operating at the scale of Gemini are consequently difficult to replace quickly. Both Adler and Pritzel have previously contributed to Google DeepMind’s scientific research as well. They are listed among the authors of the company’s work on expanding AlphaFold protein-structure predictions across entire proteomes, alongside AlphaFold researchers including John Jumper. The reported departures arrive shortly after another important change within Google’s Gemini organization. Gemini co-lead Noam Shazeer is leaving Google for OpenAI, after returning to the search company in 2024 through its deal with Character.AI. Shazeer is particularly well known as one of the authors of the Transformer paper, whose architecture became the foundation for most modern large language models. Anthropic, meanwhile, has been recruiting recognizable figures from other leading laboratories. OpenAI co-founder and former Tesla AI director Andrej Karpathy joined Anthropic’s pre-training team in May. His move, followed by the reported recruitment of several Google researchers, suggests Anthropic is strengthening the research teams responsible for the core capabilities of future Claude models rather than concentrating solely on product and enterprise sales. The competition is complicated by the companies’ extensive commercial relationships. Anthropic competes directly with Google’s Gemini models, but it also relies on Google as an infrastructure partner. In April, Anthropic announced an expanded agreement with Google and Broadcom covering multiple gigawatts of next-generation Tensor Processing Unit capacity. TPUs are Google-designed accelerators used to train and run large AI models. via Bloomberg
  • Recent Achievements

    • One Year In
      Philsl earned a badge
      One Year In
    • Dedicated
      Scoobystu earned a badge
      Dedicated
    • First Post
      Tom Schmidt earned a badge
      First Post
    • One Month Later
      D0nn13 earned a badge
      One Month Later
    • Rookie
      +ChiefOfNeo went up a rank
      Rookie
  • Popular Contributors

    1. 1
      +primortal
      461
    2. 2
      +Edouard
      177
    3. 3
      PsYcHoKiLLa
      124
    4. 4
      Michael Scrip
      79
    5. 5
      Xenon
      76
  • Tell a friend

    Love Neowin? Tell a friend!