ESXI VM's inaccessible with black screen after a couple of hours


Recommended Posts

I have a really strange issue with my home ESXi server (ESXi 5.5 2403361), this has run 24/7 for almost three years now without any issues. I've not made any hardware / software changes to it for a long time, or even had to re-boot since I last upgraded to a new ESXi build last January.

 

Essentially the problem I now have is all the VM's running are now becoming inaccessible after a couple of hours. I can't ping any VM's from another device on the network, when i try to view the VM's console from vSphere I just get a black screen and the vSphere client locks up. Likewise I can SSH in to my ESXi server, however if i type anything more than a basic command such as uptime the SSH session will lock up. I can re connect to the ESXi server using a new SSH session / vSphere session, however I cant even re-boot the box as the SSH session just locks up when i try to.

 

It's not like the ESXi server has totally crashed as i can keep re connecting over and over, however I cant really do much other than physically re boot the ESXi server. If I re-boot the server all the VM's will start up ok (Server 2012 R2, Server 2003 and Ubuntu), all these VM's work for a couple of hours, however what I have described above will simply happen again on all the VM's, this is regardless of what OS the VM is running.

 

This first happened a few weeks ago and I simply rebooted the ESXi server, everything was fine for a week. The same then happened again, I rebooted the server and everything was ok for a few days.

 

Now each day when i arrive home from work the same has happened again, it's at the point where it happens every 5-6 hours now.

 

All I can think of is either the SSD the VM's are on is perhaps failing, or the USB drive ESXi is booting from is failing, but if that was the case I cant see why everything would work normally for a couple of hours.

 

Just wondering if anyone has experienced anything like this before?

 

When home tonight I might setup a new USB drive to boot from and start ruling things out, if the same happens again with a new USB boot drive move the VM's off the SSD, then finally test the ram.

 

2rqgc5t.jpg

Go to the server and go to the live error log (think it might be ctrl + f8? Try ctrl+f1-f12) and see if there are any errors. Don't think the SSD would be failing, you could yank it out mid-operating and the VMs would be screwed but ESXi should still function and be connectable...

As far as I'm aware, ESXi loads most of itself into RAM on bootup so it shouldn't be using your USB much after that unless you're doing exotic things

 

My first suspicion would be RAM, then USB, then another hardware issue.

Thanks for the input, I have upgraded to 3116895. I guess time will tell if that's changed anything, if not ill leave memtest running tomorrow night and go from there.

 

Not sure what you mean Budman? My ESXi box is called "virtualserver", I have the standard free licence you can get.

Are you on a mobile Budman? it's unlike you to use abbreviations  such as "u".

 

The two network interfaces on the servers motherboard are Realtek r8168's, these had drivers out the box on ESXi 5.0 and 5.1, however the drivers were removed from ESXi 5.5.

 

With ESXi 6.0 the drivers are blacklisted, apparently a work around exist but its not something I have looked in to yet.  With that in mind and the fact users of free licences are limited to the vSphere Client, which cant configure any of the newer features I never really felt the need to update to 6.0.

 

Maybe things have now changed for free users? I must be honest i've not really been following ESXi news for a while. ESXi 5.5 has always been good enough for my home usage scenario at the moment.

Indeed I agree with hindsight only using hardware on the HCL would have been the better choice, everything did work out the box with the standard ESXi ISO on 5.0 / 5.1 and I could upgrade to 5.5 without loosing the existing network drivers. The motherboard is a Mini-ITX and the 1x PCIe port is been used for the raid controller, so no chance of changing anything there. I don't think network drivers are the issue though as ESXI 5.5 has worked perfectly since release until now, and I can still access ESXi over the network when the VM's are not responding.

 

My VM's became inaccessible again last night unfortunately.

 

I'm thinking it could be the SSD as when viewing the logs on screen this morning I was seeing seeing "below MEDIA WEAROUT threshold (0)" for the SSD, although it's strange I cant power up less important VM's which are not on the SSD.

 

I didn't have much time to do much before work this morning, however i've left memtest running which will hopefully rule out any memory problems when i'm home tonight.

Just to update - memtest found no problems with the ram, so i moved the VM's on the SSD across to a different datastore. Since doing that ESXi has been up for around 18 hours with no issues so far (fingers crossed).

 

Also for anyone reading this using a free VMware licence, VmWare Labs has released a free tool which now allows to manage ESXi host via web interface (using HTML 5 and javascript) without the need of vCenter: Free ESXi web interface.

yeah Im am traveling this week for work.  sorry about the U, its a bad habit when texting.. and using my phone keyboard.

 

Yeah that is a fling that was put out https://labs.vmware.com/flings/esxi-embedded-host-client

 

While agreed some of the stuff in 6 is meant for enterprise solutions..  And you don't have access to these using vclient, etc..  There are plenty of other things that make it worth being on current version.  NVMe for example NFS 4.1 as another..

 

I like to stay current..  You can still manage your esxi with vclient,  And sure you could use the fling to not even need the vclient..  As to the limits to hardware 8 stuff vs 11, the stuff is there and available you can just not easy edit with the client..  But there are other ways to get that stuff going, its not like it restricted you just can not use the vclient to point and click to set that stuff up,.

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Linux 7.1 stable launch looms as Linus Torvalds releases the final release candidate by Paul Hill Linus Torvalds has just released what’s expected to be the final release candidate of Linux 7.1, rc7. The Linux founder said that this RC is not small, but smaller than recent releases, which is a good sign because he expects the stable version to drop next week if things continue on this trajectory. Linux kernels see a merge window for the first two weeks of their life, where developers add new features, then there are about seven or eight weeks of release candidates before the stable version. Typically, there are seven release candidates, but if more time is needed, then an eighth release candidate is released too. This week’s RC’s biggest area of fixes was for GPUs, with networking just behind. Torvalds said that the rest of the release was “pretty random and spread out” with some architecture fixes, driver fixes, filesystem improvements, and build fixes for more unusual configs. In terms of specific pieces of hardware receiving improvements in this update, we had more AMD Zen6 models supported and fixes for AMD SDMA 7.1 and GFX11. Hardware that got improvements includes Lenovo laptops, HONOR laptops, and MSI laptops. Here are the changelogs for those: ASoC: amd: acp: Add DMI quirk for Lenovo Yoga Pro 7 15ASH11 Input: atkbd - add DMI quirk for Lenovo Yoga Air 14 (83QK) Input: atkbd - skip deactivate for HONOR BCC-N's internal keyboard ASoC: amd: yc: Add MSI Raider A18 HX A9WJG to quirk table ASoC: amd: yc: Enable internal mic on MSI Bravo 17 C7VF When the stable Linux 7.1 is released, it will be up to distribution maintainers, such as Canonical and Red Hat, to release the update to their users via the update manager. Some versions of Linux will get it before others, and some will never get it at all. Fedora and Arch-based distros will be among the first to get it, though. If you don’t get it, the security fixes will be backported to your system’s kernel, so you won’t be at risk, but you won’t get newer hardware support, which is fine if your computer works now.
    • Ideally, the algorithm is smart enough to see the real sender ID and non-spoofed address to block it. Ideally.
    • Helium Browser 0.13.1.1 by Razvan Serea Helium is a private, fast, and honest Chromium-based web browser — built for people, with love. It offers the best privacy by default, unbiased ad-blocking, and a clean experience free from bloat and noise. Proudly based on Ungoogled-Chromium, Helium removes Google’s clutter while keeping a fast, efficient development pipeline. With thoughtful touches like native !bangs and split view, Helium is a people-first, fully open-source browser that puts control back in your hands. Privacy, security, and control come first. Ads, trackers, and third-party cookies are blocked automatically, HTTPS is enforced everywhere, and all Chromium extensions work seamlessly — while Google can’t track your activity. Helium’s 13,000+ offline-ready !bangs let you jump straight to sites or AI tools like ChatGPT instantly. Open-source, people-first, and unbiased, Helium delivers a browsing experience that’s fast, secure, and free from noise, ads, and compromises. Helium Browser key features: Performance Fast, efficient, and lightweight — built on Chromium’s optimized engine. Energy-saving and consistent — stays fast over time without slowing down. No bloat — stripped of unnecessary components for maximum speed. Minimalist interface — compact, clean, and distraction-free. Customizable toolbar — hide elements you don’t need. Smooth and stable — no flicker, lag, or animation glitches. Comfort-focused experience — intuitive and unobtrusive. Privacy & Security Best privacy by default — blocks ads, trackers, phishing, and third-party cookies. Unbiased ad-blocking — powered by community filters and uBlock Origin. No telemetry or analytics — zero background web requests on first launch. Strict HTTPS enforcement — warns for insecure sites. Passkeys supported — modern authentication made simple. No built-in password manager or cloud sync — your data stays yours. Extension Compatibility Full Chromium extension support — including MV2 extensions. Anonymized Chrome Web Store requests — Google can’t track extension installs. Extended MV2 support — maintained for as long as possible. Smart Features Native !bangs — browse faster using 13,000+ offline-ready shortcuts. AI integration — use !chatgpt and others directly from the address bar. Offline functionality — bangs work without an Internet connection. Philosophy People-first design — open source, transparent, and community-driven. No ads, no noise, no bias — privacy and honesty over profit. Helium Browser 0.13.1.1 changelog: Helium Windows e13ddd4 update: helium 0.13.1.1 (#285) 77ee94b helium/windows: winsparkle updater (built-in auto-update support on Windows) e501e98 helium/windows: refresh updater patches for m149 008faee helium/win: move versioning patches to separate file e114701 helium/updater: implement system install update notifications c9e3cda helium/winsparkle: verboser errors b8f787c helium/windows: more version migrations cd376ed ci: sign update helper with different description 5d2383a build: wire up winsparkle envs 3ff28ff helium/updater: init ca58f6f helium/change-branding: use helium version in more internal places 75b2625 .gitignore: add resources/generated abe78f2 downloads: add winsparkle dep 4ed4429 build: build mini_installer again fabf8e9 update: helium 0.13.1.1 Helium-Chromium 5bf45fed merge: update to chromium 149.0.7827.53 (#1857) 3bbe6a3c revision: reset to 1 e3525bb6 helium/zen: reorder patch hunks d48bc496 deps: update ublock to 1.71.0 (#1875) d4e58802 helium/ui: redesign infobar, optimize and rename zen (#1868) c20175cf helium/ui/infobar: redesign, draw proper borders, fix webview relations e8a1bfc1 helium/zen: rename Zen to Frameless in UI, remove feature gates bf560c6a helium/layout: optimize zen top reveal, move out of experiment dir 34ef5f26 patches/brave/chrome-importer: remove os_crypt-related dead code 99cef46c helium/ui: clean up patches, fix accidental string OS gates 5b7dd06c devutils/i18n: add clean command (#1869) 7a32267e helium/updater: add win scaffolding, tighten arch conditions (#1866) d856d800 ci: complete cirrus port to github actions (#1867) c362740d patches: update for chromium 149.0.7827.53 b8a66095 merge: update ungoogled-chromium to 149.0.7827.53 a3a5471d Update to Chromium 149.0.7827.53 c6a41202 helium/updates: decouple update url from helium services + reformat (#1827) 94344c47 Update to Chromium 148.0.7778.215 95f6fe1c Port CirrusCI jobs to github actions (#3776) Download: Helium 64-bit | Portable 64-bit |~100.0 MB (Open Source) Download: Helium ARM64 | Portable ARM64 Links: Helium Home Page | macOS | Linux | Screenshot Get alerted to all of our Software updates on Twitter at @NeowinSoftware
  • Recent Achievements

    • One Month Later
      DJC50PLUS earned a badge
      One Month Later
    • Week One Done
      DJC50PLUS earned a badge
      Week One Done
    • Proficient
      Eric Biran went up a rank
      Proficient
    • Dedicated
      Conjor earned a badge
      Dedicated
    • Week One Done
      Windows Guy earned a badge
      Week One Done
  • Popular Contributors

    1. 1
      +primortal
      493
    2. 2
      PsYcHoKiLLa
      248
    3. 3
      Steven P.
      71
    4. 4
      ATLien_0
      68
    5. 5
      +Edouard
      68
  • Tell a friend

    Love Neowin? Tell a friend!