Need help making an "offline" copy of a website I'm a member of


Question

Hi, everyone.

I'm a member of a private group, one that costs many thousands of dollars to join. The person who started and runs the group, whom I'll call "Bob", isn't very technical at all. He hired someone to create a website for members. Each member has their own username and password. As far as I know, Microsoft SharePoint Server has been used for at least the last few years. I do not know any more specifics about what the website is running on.

"Bob" has always encouraged people to download things from the website here and there since content is always changing and in case anything ever happened to the server. Well, this website has many areas (called "Sites"), thousands of forum/blog type postings, and hundreds or thousands of links, files, and folders scattered all over the place. Yes, I could manually make local folders and download things, but that would be a nightmare and quite difficult.

So, over the last year or so, I've thought about just making an "offline" copy and figured I would use WinHTTrack, which I have known about for many years. I went to do it about a week ago, with the intention of backing up the site to a new Western Digital Caviar Black 2TB drive. I think the website is only maybe 100GB. Anyway, I am getting "Access Denied" and "Unauthorized" errors (if I look at the logs), which don't make any sense when I know I am using the correct username and password that I use to log in. I am even copying and pasting from my password manager.

Here is the log file generated when telling WinHTTrack to copy http://www.WEBSITE.com :

HTTrack3.46+htsswf+htsjava launched on Wed, 17 Oct 2012 14:12:40 at http://USERNAME:PASS...www.WEBSITE.com +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar

(winhttrack -WC2%Pns2u1%s%uN0%I0p7DaK0c2R3H0%kf2A25000%c1%f#f -F "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by HTTrack Website Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *" http://USERNAME:PASS...www.WEBSITE.com -O1 I:\WEBSITE_BACKUP_FOLDER +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar )

Information, Warnings and Errors reported for this mirror:

note: the hts-log.txt file, and hts-cache folder, may contain sensitive information,

such as username/password authentication for websites mirrored in this project

do not share these files/folders if you want these information to remain private

14:12:42 Error: "Access denied" (401) at link USERNAME:[email protected]/ (from primary/primary)

14:12:42 Info: No data seems to have been transfered during this session! : restoring previous one!

Here is the log file generated when telling WinHTTrack to copy http://www.WEBSITE.com/default.aspx :

HTTrack3.46+htsswf+htsjava launched on Wed, 17 Oct 2012 14:16:53 at http://USERNAME:PASS...om/default.aspx +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar

(winhttrack -WC2%Pns2u1%s%uN0%I0p7DaK0c2R3H0%kf2A25000%c1%f#f -F "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by HTTrack Website Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *" http://USERNAME:PASS...om/default.aspx -O1 I:\WEBSITE_BACKUP_FOLDER +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar )

Information, Warnings and Errors reported for this mirror:

note: the hts-log.txt file, and hts-cache folder, may contain sensitive information,

such as username/password authentication for websites mirrored in this project

do not share these files/folders if you want these information to remain private

14:16:56 Error: "Unauthorized" (401) at link USERNAME:[email protected]/default.aspx (from primary/primary)

14:16:56 Info: No data seems to have been transfered during this session! : restoring previous one!

Attached you will find a screenshot of the error I get via a pop-up window (for WEBSITE.com or WEBSITE.com/default.aspx), as well as screenshots showing the various tabs of the Preferences.

So, I was wondering if there is a problem in general with using WinHTTrack with SharePoint websites? Does anyone have any experience making offline copies of SharePoint websites? Any suggestions? Other software that someone has used for SharePoint websites?

Thanks in advance for any feedback and/or help...

-JayZJay

10 answers to this question

  • 0

Why not just do a server-level backup? I never had any issues with the standard options in HTTrack when copying websites. Or just FTP into the server and download the entire website from there. If it uses PHP, I believe you will still need a local server to run the website.

  • 0

The server is not physically here, nor do I, as only a member, have any type of admin access to do a server-level backup. As far as I know or can tell, there is no FTP access available to anyone. If there is, I do not know the URL, and "Bob", being a control freak, is not going to provide that information to any of us members.

  • 0

Well, from this

"14:12:42 Error: "Access denied" (401) at link USERNAME:[email protected]/ (from primary/primary)"

it looks to me like you cannot use that form of URL to authenticate to the site. If it's SharePoint, it's probably using NTLM auth, and they probably have basic auth turned off, so you cannot authenticate with that sort of URL.

You're never going to download anything if the site requires auth that HTTrack can't pass. I would first verify that the URL you're using works in your browser's address bar to access the site. If not, then do a bit of googling for how to use NTLM auth with HTTrack. The old-school way was to run a proxy on the same machine that would do the NTLM auth for you, and have all the connections from HTTrack go through that local proxy. I have not played with HTTrack in years and years; maybe it supports NTLM auth directly now?
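
If you want to confirm which auth schemes the server is actually offering before fiddling with HTTrack, here is a rough Python sketch. It requests a page without credentials and reads the WWW-Authenticate header(s) from the 401 response. The function names (auth_schemes, probe) are just made up for illustration, the URL is the same placeholder used in the thread, and the parsing is simplified: it assumes each header value carries a single challenge, which I believe is how IIS usually sends them, but treat that as an assumption.

```python
import urllib.error
import urllib.request


def auth_schemes(header_values):
    """Return lowercase scheme names from WWW-Authenticate header values.

    Simplification (assumption): each value holds one challenge, such as
    'NTLM' or 'Basic realm="x"'. Several comma-separated challenges in a
    single header value are not split apart here.
    """
    schemes = []
    for value in header_values:
        parts = value.strip().split(None, 1)  # the scheme is the first token
        if parts:
            schemes.append(parts[0].lower())
    return schemes


def probe(url):
    """Request url with no credentials and list the auth schemes offered."""
    try:
        urllib.request.urlopen(url, timeout=10)
    except urllib.error.HTTPError as err:
        if err.code == 401:
            values = err.headers.get_all("WWW-Authenticate") or []
            return auth_schemes(values)
        raise
    return []  # the server did not challenge for authentication


if __name__ == "__main__":
    # Placeholder host from the thread; substitute the real one.
    print(probe("http://www.WEBSITE.com/default.aspx"))
```

If the list comes back with only "ntlm" and/or "negotiate" and no "basic", that would fit the theory that credentials embedded in the URL will not work against this server.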

  • 0

"Bob" runs a website where he encourages people to download, yes sounds very legit...

Try not assuming. It is a research group. We study alternative health, banking, law, trusts, estates, and other topics. The only thing on the site is PDFs, Word docs, discussion on various topics, private presentations recorded from group meetings, webinars, etc. Those are the things on the site and there is a ton of it going back years. "Bob" encourages us to download and keep copies of stuff because 1) the site constantly changes, 2) there are times when you are in a law library or other place without Internet access and cannot access the site, and 3) "Bob" isn't very technical and he loses things, deletes stuff, etc -- so having our own backup copies is certainly a good idea just in case a problem arises.

  • 0

I don't know how to help, but this sounds really cool. Why can't we know who Bob is, or what this website is?

  • 0

All I have gotten from this whole thread is that Bob is a complete and utter idiot who runs a website that people seem to pay a lot of money to join. If you are paying him money to join this secret website and he has no clue what he's doing, then he should hire someone to take care of the tech side of things, including backups to another hard drive/server (after all, you say you're paying a lot of money, so why can't he have more than one server?). If there are as many files as you claim, then the best thing would be to get limited FTP access to the downloads directory (able to access and download that directory, and that's it).

In all honesty, you would be better off talking to this Bob and telling him to hire someone to handle backups. As for the offline content, maybe he does not want that, because the information could end up on the internet for free instead of people paying him for access. As you said, things change and people leave things lying about, which soon end up on the internet.

Anyway, I'm out of this thread. I personally think what you are doing is wrong; you are a member of a site and nothing more.

This topic is now closed to further replies.