Need help with Imgur program


Question

Hey guys,

I wanted to work on a new project just for fun, so I thought a program that downloads imgur galleries would be cool. I'm working on it, but after about 51 images the program usually starts running through duplicates.

I've looked at everything I can and can't find the problem. My guess is that it's the server resetting me. I wanted a challenge, so I tried to get past it. I tried creating a new URL connection each time, and I also created a separate class that does the downloading and created a new object of that class every time, but it didn't work.

Thoughts?

Source

http://shayconcepts.com/programming/ImgurGalleryDownloader/imgurDL.zip

https://www.neowin.net/forum/topic/1110175-need-with-with-imgur-program/

19 answers to this question

  • 0

You are parsing the HTML page. Don't do that. Use imgur's API instead - from the documentation, you can just append ".xml" to any gallery page and it returns the gallery as an XML file.

In general, don't parse HTML pages unless there is no other way. It's not easy and it can change any day, making your program useless.
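As a rough illustration of the XML route, here is a minimal Java sketch that reads image entries out of a gallery XML document with the JDK's built-in DOM parser and drops repeated entries along the way. The element names `item`, `hash` and `ext`, the `http://i.imgur.com/<hash><ext>` image URL pattern, and the class and method names are all assumptions made for the example; check them against the XML that imgur actually returns before relying on them.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper: the XML layout below is an assumption, not imgur's documented schema.
// Assumed layout: <data><item><hash>abc</hash><ext>.jpg</ext></item>...</data>
class GalleryXml {

    // Returns the image URLs found in the XML, in document order, without duplicates.
    static List<String> imageUrls(String xml) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(
                    new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            // LinkedHashSet keeps insertion order and silently ignores repeats.
            Set<String> urls = new LinkedHashSet<>();
            NodeList items = doc.getElementsByTagName("item");
            for (int i = 0; i < items.getLength(); i++) {
                Element item = (Element) items.item(i);
                String hash = childText(item, "hash");
                String ext = childText(item, "ext");
                if (hash != null && ext != null) {
                    // Assumed direct-image URL pattern.
                    urls.add("http://i.imgur.com/" + hash + ext);
                }
            }
            return new ArrayList<>(urls);
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse gallery XML", e);
        }
    }

    // Text of the first descendant element with the given tag, or null if there is none.
    private static String childText(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0 ? null : nodes.item(0).getTextContent().trim();
    }
}
```

Calling `GalleryXml.imageUrls(xml)` on the downloaded XML text gives a list you can loop over to fetch each file. Collecting the URLs into a set first also means a repeated entry is only downloaded once, whatever the cause of the repeats.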

  • 0
  On 04/10/2012 at 20:30, Aethec said:

You are parsing the HTML page. Don't do that. Use imgur's API instead - from the documentation, you can just append ".xml" to any gallery page and it returns the gallery as an XML file.

In general, don't parse HTML pages unless there is no other way. It's not easy and it can change any day, making your program useless.

Thanks! I'll take a look.

EDIT: it seems it only returns 56 images though :/

  On 05/10/2012 at 01:51, Dr_Asik said:

I learned about regex and it has worked really well for me! I'll take a look at your link and find a better way to parse HTML. Thanks!

  • 0
  On 05/10/2012 at 04:14, thatguyandrew1992 said:

I learned about regex and it has worked really well for me! I'll take a look at your link and find a better way to parse HTML. Thanks!

If you really need to parse HTML, you should use a proper HTML library; for Java there is HtmlUnit and Jsoup at least. Better would be to not parse any HTML and use imgur's API as Aethec suggested.

  • 0
  On 05/10/2012 at 06:05, Dr_Asik said:

If you really need to parse HTML, you should use a proper HTML library; for Java there is HtmlUnit and Jsoup at least. Better would be to not parse any HTML and use imgur's API as Aethec suggested.

But when I use the API and append .xml, it doesn't show ALL the images in the gallery.

  • 0
  On 12/10/2012 at 04:27, thatguyandrew1992 said:

Ok guys, I need some more help. How do I get the xml of an album? They are apparently different than galleries! The xml page isn't coming up. Or if there isn't an xml, what is recommended to get the images?

Example:

http://imgur.com/a/HKtAR/

It's all there in the API documentation: http://api.imgur.com/resources_anon#album

Example: http://api.imgur.com/2/album/HKtAR
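To make the mapping between the two URLs above concrete, here is a small Java sketch that takes an album page address such as http://imgur.com/a/HKtAR/ and builds the matching API address. It assumes every album page path looks like `/a/<id>` with an alphanumeric ID, which is only inferred from this one example; the class and method names are made up for illustration.

```java
import java.net.URI;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: converts an album page URL into the API URL shown above.
class AlbumApi {

    // Assumed album path shape: /a/<alphanumeric id>, with an optional trailing slash.
    private static final Pattern ALBUM_PATH = Pattern.compile("^/a/([A-Za-z0-9]+)/?$");

    static String albumApiUrl(String albumPageUrl) {
        String path = URI.create(albumPageUrl).getPath();
        Matcher m = ALBUM_PATH.matcher(path == null ? "" : path);
        if (!m.matches()) {
            throw new IllegalArgumentException("Not an album URL: " + albumPageUrl);
        }
        // Same base address as the example in this post.
        return "http://api.imgur.com/2/album/" + m.group(1);
    }
}
```

For the album in the question, `AlbumApi.albumApiUrl("http://imgur.com/a/HKtAR/")` produces the example address given above. Gallery URLs without the `/a/` prefix are rejected rather than guessed at, since albums and galleries are handled differently.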

  • 0
  On 12/10/2012 at 14:05, ZakO said:

It's all there in the API documentation: http://api.imgur.com...rces_anon#album

Example: http://api.imgur.com/2/album/HKtAR

Thanks! I'm still learning how to use APIs, so I know this is all obvious. I thought I had looked through this. I'll be sure to look extra hard! >.<

This topic is now closed to further replies.