• 0

Need help making an "offline" copy of a website I'm a member of


Question

Hi, everyone.

I'm a member of a private group, one that costs many thousands of dollars to join. The person that started the group and runs it, I'll call "Bob", isn't very technical at all. He hired someone to create a website for members. Each member has their own username and password. As far as I know, for at least the last few years Microsoft SharePoint Server has been used. I do not know any more specifics about what the website is running on.

"Bob" has always encouraged people to download things from the website here and there since content is always changing and in case anything ever happened to the server. Well, this website has many areas (called "Sites"), thousands of forum/blog type postings, and hundreds or thousands of links, files, and folders scattered all over the place. Yes, I could manually make local folders and download things, but that would be a nightmare and quite difficult.

So, all along over the last year or so, I've thought about just making an "offline" copy and figured I would use WinHTTrack, which I have known about for many years. I went to do it about a week ago, with the intention of backing up the site to a new Western Digital Caviar Black 2TB drive. I think the website is only maybe 100GB. Anyway, I am getting "Access Denied" and "Unauthorized" errors (if I look at the logs), which don't make any sense when I know I am using the correct username and password that I use to login. I am even copying and pasting from my password manager.

Here is the log file generated when telling WinHTTrack to copy http://www.WEBSITE.com :

HTTrack3.46+htsswf+htsjava launched on Wed, 17 Oct 2012 14:12:40 at http://USERNAME:PASS...www.WEBSITE.com +*.png +*.gif +*.jpg +*.css +*.js -

ad.doubleclick.net/* -mime:application/foobar

(winhttrack -WC2%Pns2u1%s%uN0%I0p7DaK0c2R3H0%kf2A25000%c1%f#f -F "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by HTTrack Website

Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *" http://USERNAME:PASS...www.WEBSITE.com -O1 I:\WEBSITE_BACKUP_FOLDER +*.png +*.gif +*.jpg

+*.css +*.js -ad.doubleclick.net/* -mime:application/foobar )

Information, Warnings and Errors reported for this mirror:

note: the hts-log.txt file, and hts-cache folder, may contain sensitive information,

such as username/password authentication for websites mirrored in this project

do not share these files/folders if you want these information to remain private

14:12:42 Error: "Access denied" (401) at link USERNAME:[email protected]/ (from primary/primary)

14:12:42 Info: No data seems to have been transfered during this session! : restoring previous one!

Here is the log file generated when telling WinHTTrack to copy http://www.WEBSITE.com/default.aspx :

HTTrack3.46+htsswf+htsjava launched on Wed, 17 Oct 2012 14:16:53 at http://USERNAME:PASS...om/default.aspx +*.png +*.gif +*.jpg +*.css +*.js

-ad.doubleclick.net/* -mime:application/foobar

(winhttrack -WC2%Pns2u1%s%uN0%I0p7DaK0c2R3H0%kf2A25000%c1%f#f -F "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by HTTrack Website

Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *" http://USERNAME:PASS...om/default.aspx -O1 I:\WEBSITE_BACKUP_FOLDER +*.png

+*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar )

Information, Warnings and Errors reported for this mirror:

note: the hts-log.txt file, and hts-cache folder, may contain sensitive information,

such as username/password authentication for websites mirrored in this project

do not share these files/folders if you want these information to remain private

14:16:56 Error: "Unauthorized" (401) at link USERNAME:[email protected]/default.aspx (from primary/primary)

14:16:56 Info: No data seems to have been transfered during this session! : restoring previous one!

Attached you will find an a screen-shot of the error I get via a pop-up window (for WEBSITE.com or WEBSITE.com/default.aspx) as well as screen-shots showing the various tabs of the Preferences.

So, I was wondering if there is a problem in general with using WinHTTrack with SharePoint websites? Does anyone have any experience making offline copies of SharePoint websites? Any suggestions? Other software that someone has used for SharePoint websites?

Thanks in advance for any feedback and/or help...

-JayZJay

post-157395-0-82868200-1350509533.jpgpost-157395-0-21688800-1350509544.jpgpost-157395-0-71445100-1350509545.jpgpost-157395-0-52917400-1350509547.jpgpost-157395-0-78965000-1350509548.jpgpost-157395-0-17525600-1350509550.jpgpost-157395-0-44106700-1350509551.jpgpost-157395-0-70143500-1350509552.jpgpost-157395-0-07401200-1350509554.jpgpost-157395-0-25737800-1350509555.jpgpost-157395-0-60818300-1350509556.jpgpost-157395-0-82880800-1350509557.jpg

10 answers to this question

Recommended Posts

  • 0

Why not just do it at a server level backup. I never had any issues with the standard options in httrack in getting websites. Or just FTP into the server and DL the entire website from there. If it uses PHP I believe you still will need a local server to run the website.

  • 0

The server is not physically here, nor do I as only a member have any type of admin access to do a server level backup. As far as I know or can tell, there is not any type of ftp access available to anyone. If there is, I do not know the URL and "Bob", being a control freak, is not going to provide that information to any of us members.

  • 0

well from this

"14:12:42 Error: "Access denied" (401) at link USERNAME:[email protected]/ (from primary/primary)"

Looks to me that you can not use that form of url to auth to the site? If sharepoint its prob using ntml auth, and they prob have basic auth off so you can not do that sort of url to auth.

your never going to download anything if site requires auth to access. I would verify that you can use that url your using just in your browser address bar to access the site. If not then do a bit of google for how to use ntml auth with httrack, the old school was was to use a proxy with httrack that would do the ntml auth for you and then all the connections from httrack would use the local proxy you were running on the same machine. Have not play with httrack in years and years - maybe they support direct ntml auth now?

  • 0

"Bob" runs a website where he encourages people to download, yes sounds very legit...

Try not assuming. It is a research group. We study alternative health, banking, law, trusts, estates, and other topics. The only thing on the site is PDFs, Word docs, discussion on various topics, private presentations recorded from group meetings, webinars, etc. Those are the things on the site and there is a ton of it going back years. "Bob" encourages us to download and keep copies of stuff because 1) the site constantly changes, 2) there are times when you are in a law library or other place without Internet access and cannot access the site, and 3) "Bob" isn't very technical and he loses things, deletes stuff, etc -- so having our own backup copies is certainly a good idea just in case a problem arises.

  • 0

Try not assuming. It is a research group. We study alternative health, banking, law, trusts, estates, and other topics. The only thing on the site is PDFs, Word docs, discussion on various topics, private presentations recorded from group meetings, webinars, etc. Those are the things on the site and there is a ton of it going back years. "Bob" encourages us to download and keep copies of stuff because 1) the site constantly changes, 2) there are times when you are in a law library or other place without Internet access and cannot access the site, and 3) "Bob" isn't very technical and he loses things, deletes stuff, etc -- so having our own backup copies is certainly a good idea just in case a problem arises.

i dont know how to help, but this sounds really cool. Why can't we know who bob is, or this website?

  • 0

Try not assuming. It is a research group. We study alternative health, banking, law, trusts, estates, and other topics. The only thing on the site is PDFs, Word docs, discussion on various topics, private presentations recorded from group meetings, webinars, etc. Those are the things on the site and there is a ton of it going back years. "Bob" encourages us to download and keep copies of stuff because 1) the site constantly changes, 2) there are times when you are in a law library or other place without Internet access and cannot access the site, and 3) "Bob" isn't very technical and he loses things, deletes stuff, etc -- so having our own backup copies is certainly a good idea just in case a problem arises.

All i have gotten from this whole thread is bob is a complete and utter idiot that runs a website that people seem to pay alot of money to join. If you are paying him money to join this secret website and he has no clue on what he's doing then he should hire someone to take care of the tech side of things including backups to another hard drive/server (after all you say your paying alot of money why can he not have more than one server?). If there are so many files like you claim then the best thing would be to get limited ftp access to the downloads directory (can access and download that directory and thats it).

In all honesty you would be better talking to this bob and telling him to hire someone for back up reasons, as for the offline content maybe he does not want that because the information could end up being on the internet for free instead of people paying him to gain access. As you said things change and people leave things laying about which soon gets onto the internet,.

Anyway im out of this thread, i personally think what you are doing is wrong you are a member of a site and nothing more.

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Flameshot 14.0 Final by Razvan Serea Flameshot is a free and open-source, cross-platform tool to take screenshots with many built-in features to save you time. Using Flameshot is as simple as launching, dragging the selection box to cover the area you want to capture, making annotations as needed in on-screen and saving the shot to your computer, all with a very simple and straightforward interface. Flameshot allows users to simply upload their screenshots directly to the cloud in order to easily share it with others. You can upload your image directly to Imgur with a single click and share the URL with others. In-app screenshot editing - You can choose to add an arrow mark, highlight text, blur a section (blur or pixelate an area), add a text, draw something, add a rectangular/circular shaped border, add an incrementing counter number, and add a solid color box with Flameshot's built-in editing tools. Command-line interface (CLI) - Flameshot has several commands you can use in the terminal without launching the GUI via a command line interface. The command line interface lets you script Flameshot and use it as the subject of key binds. Flameshot 14.0 release notes: This release brings major improvements to multi-monitor support, fractional scaling support, new capture workflows, and a long list of bug fixes across all platforms. Changelog: New Multi-Monitor Capture Workflow New monitor selection screen before capture for better multi-monitor and mixed-scaling support. Option to auto-capture the monitor under the cursor (X11 & Windows). Tray menu can directly select a monitor. Linux Improvements XDG Desktop Portal is now the primary screenshot method. Added legacy X11 fallback option for minimal window managers. New D-Bus capture API for scripting and automation. Windows Enhancements Global screenshot hotkeys now supported (not limited to Print Screen). New portable mode stores settings next to the executable. Clipboard now always uses PNG format for better compatibility. CLI & Platform Updates Redesigned flameshot screen command with per-monitor capture support. Added native Nix Flake support. More compact launcher UI and improved update notifications. Major Fixes Multiple Wayland stability fixes, including KDE Plasma crash fixes. Clipboard compatibility improvements for GNOME, Wayland, X11, Windows, and macOS. Fixed D-Bus hangs, capture crashes, and HiDPI region issues. Other Changes Dropped Ubuntu 20.04 (Focal) support. Updated translations and build infrastructure. Intel macOS builds are no longer provided. [full release notes] Download: Flameshot 14.0 | 18.1 MB (Open Source) Download: Flameshot Portable | 53.0 MB Links: Flameshot Home Page | Screenshot Get alerted to all of our Software updates on Twitter at @NeowinSoftware
    • Helium Browser 0.13.4.1 by Razvan Serea Helium is a private, fast, and honest Chromium-based web browser — built for people, with love. It offers the best privacy by default, unbiased ad-blocking, and a clean experience free from bloat and noise. Proudly based on Ungoogled-Chromium, Helium removes Google’s clutter while keeping a fast, efficient development pipeline. With thoughtful touches like native !bangs and split view, Helium is a people-first, fully open-source browser that puts control back in your hands. Privacy, security, and control come first. Ads, trackers, and third-party cookies are blocked automatically, HTTPS is enforced everywhere, and all Chromium extensions work seamlessly — while Google can’t track your activity. Helium’s 13,000+ offline-ready !bangs let you jump straight to sites or AI tools like ChatGPT instantly. Open-source, people-first, and unbiased, Helium delivers a browsing experience that’s fast, secure, and free from noise, ads, and compromises. Helium Browser key features: Performance Fast, efficient, and lightweight — built on Chromium’s optimized engine. Energy-saving and consistent — stays fast over time without slowing down. No bloat — stripped of unnecessary components for maximum speed. Minimalist interface — compact, clean, and distraction-free. Customizable toolbar — hide elements you don’t need. Smooth and stable — no flicker, lag, or animation glitches. Comfort-focused experience — intuitive and unobtrusive. Privacy & Security Best privacy by default — blocks ads, trackers, phishing, and third-party cookies. Unbiased ad-blocking — powered by community filters and uBlock Origin. No telemetry or analytics — zero background web requests on first launch. Strict HTTPS enforcement — warns for insecure sites. Passkeys supported — modern authentication made simple. No built-in password manager or cloud sync — your data stays yours. Extension Compatibility Full Chromium extension support — including MV2 extensions. Anonymized Chrome Web Store requests — Google can’t track extension installs. Extended MV2 support — maintained for as long as possible. Smart Features Native !bangs — browse faster using 13,000+ offline-ready shortcuts. AI integration — use !chatgpt and others directly from the address bar. Offline functionality — bangs work without an Internet connection. Philosophy People-first design — open source, transparent, and community-driven. No ads, no noise, no bias — privacy and honesty over profit. Helium Browser 0.13.4.1 changelog: 0a4f1149 revision: bump to 4 (#1969) 4848de1f helium/core: enable the chromium screenshot feature (#1968) e0dec3f5 onboarding: integrate strings to i18n system (#1948) 417fa5bc i18n: fix newline parsing for onboarding 7a339b39 i18n: add foraged translations for onboarding 4f090cff i18n/generate: add handling for onboarding strings bfe48d58 i18n_apply: manually override parent grd logic for onboarding strings ab214e3c onboarding: bump in deps, wire up grdp afa6a059 helium/core: disable pdf infobar feature (#1965) eba585e7 helium/ui/vertical: fix new tab button alignment and icon size (#1964) 6ecfc9e0 helium/ui/tabs: fix horizontal tab hover background color (#1963) 3db87dc0 helium/ui/tabs: fix new tab button hover/press colors (#1962) 6bbdcc3e helium/ui: improve tab group UI in all layouts (#1961) 53deb314 helium/ui/tabs: enable tab group hover cards e93aece7 helium/ui/vertical: fix tab group appearance, prevent line overlap 629f5495 helium/ui/tabs: restore solid group header colors, enable new colors 961c962e helium/ui/tabs: move horiz tab group underline to bottom, make it thick c96deab6 merge: update to chromium 149.0.7827.155 (#1959) 36db56b4 i18n: update source.gen.json 5ce006ae patches: refresh for chromium 149.0.7827.155 b4c1ea62 merge: update ungoogled-chromium to 149.0.7827.155 4e5e8671 Update to Chromium 149.0.7827.155 08a3e7da helium/ui/layout: disable mute on collapsed vertical tabs (#1778) a0a5bbaf helium/core: simplify context menu and prevent huge widths (#1951) c4732aac devutils/i18n: add forage command (#1944) 11d16986 devutils/i18n: add an option to translate using local CLI tools (#1942) d820c3a2 i18n/prompt: tighten translation rules to prevent common errors (#1940) cf827007 Update to Chromium 149.0.7827.114 6e3d5164 Update to Chromium 149.0.7827.102 Download: Helium 64-bit | Portable 64-bit |~100.0 MB (Open Source) Download: Helium ARM64 | Portable ARM64 Links: Helium Home Page | macOS | Linux | Screenshot Get alerted to all of our Software updates on Twitter at @NeowinSoftware
  • Recent Achievements

    • Reacting Well
      BizSAR earned a badge
      Reacting Well
    • First Post
      AndreaB earned a badge
      First Post
    • Week One Done
      Huge Trailer earned a badge
      Week One Done
    • Week One Done
      Classifyskilleducation earned a badge
      Week One Done
    • One Month Later
      eurospharma62 earned a badge
      One Month Later
  • Popular Contributors

    1. 1
      +primortal
      579
    2. 2
      +Edouard
      183
    3. 3
      PsYcHoKiLLa
      75
    4. 4
      Michael Scrip
      74
    5. 5
      neufuse
      64
  • Tell a friend

    Love Neowin? Tell a friend!