Recommended Posts

Having weird issues and have a ticket open for a month with vmware.   Basically it goes like this:

environment: vmware esxi 5.5 u2 or 6.0 (happens in both environments)

guest os: windows 2012 r2

e1000/e1000e issue: nic card cannot be found according to event logs and randomly has a 0.0.0.0 conflict and everything no longer can communicate.  The fix is to reboot the guest os.  Vmware recognizes this issue and recommends to not use these drivers with windows 2012.  http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2109922  According to this there is no workaround and to use the vmnetx 3 driver.

vmnetx 3 issue:  users experience slow connection to services taking minutes to get to different pages.  it is so bad that it is unacceptable, it didn't just add seconds between pages... it added minutes.

 

The issue I am at, instability vs completely unusable.  Has anyone seen anything like this or know of a fix. 

 

VMs current stand with e1000/e1000e: not our problem, it is a microsoft problem.  nothing shows up in the logs.  call microsoft and complain

vms current stand with vmnetx:  We have performance tuned that driver before packaging it into vmware, you should not be having those issues. 

 

I have a support person contacting me tomorrow to go through the vmnetx issue to see if he can figure out the issue with that driver.  My expectations are coming out with another day with this issue being unresolved.

 

I am using vmx3net on esxi 6 with 2k12r2

"vmnetx 3 issue:  users experience slow connection to services taking minutes to get to different pages."

Web services, file sharing services?  Can you give me details of the problem so can try and duplicate.

Do you have the vmware tools installed?  Are you on the current 6 build? 2809209  5.5 u2 is quite old, current build is 2718055

 

Don't use e1000 NICs with Server 2008+ guests.  Use VMXNET3.  The e1000 NIC on Server 2008+ can cause all kinds of weird problems like dropped packets, VLAN tags being incorrectly applied, even a PSOD on the ESXi host.  Before you switch over to VMXNET3, make sure you take the IP address off of the e1000 NIC.  Also make sure you remove the e1000 device through Device Manager.

  On 13/08/2015 at 18:01, BudMan said:

I am using vmx3net on esxi 6 with 2k12r2

"vmnetx 3 issue:  users experience slow connection to services taking minutes to get to different pages."

Web services, file sharing services?  Can you give me details of the problem so can try and duplicate.

Do you have the vmware tools installed?  Are you on the current 6 build? 2809209  5.5 u2 is quite old, current build is 2718055

 

Well it is a cluster, it is our erp system and it is in beta.  So while it is an inconvenience now, it is just an inconvenience.  1 html server, 1 sql db, 1 logic and batch server, 1 file server, 1 reporting server, 5 servers altogether. 

Current build 5.5.0 2718055

vmtools version on the guest oses 9.4.12 build 2627939

 

  On 13/08/2015 at 18:06, njeske said:

Don't use e1000 NICs with Server 2008+ guests.  Use VMXNET3.  The e1000 NIC on Server 2008+ can cause all kinds of weird problems like dropped packets, VLAN tags being incorrectly applied, even a PSOD on the ESXi host.  Before you switch over to VMXNET3, make sure you take the IP address off of the e1000 NIC.  Also make sure you remove the e1000 device through Device Manager.

yes but using the vmxnet3 driver makes the system unusable.  I am faced with unsuable or crashing, neither of which are acceptable solutions.  Do you really think if it weren't a big deal that I would get vmware involved or that I would ask a question here after a month of getting no where with vmware?

I will put it to you like this.  e1000 provides instant or near instant logon and instant or near instant database queries and page loads.  With no other change other than changing to the vmnet3 driver (using the same vswitch and statically assigning the same IP) it slows down to the point of watching elmers glue solidify and turn clear before allowing to logon...meaning about 10 minutes before you move on past the logon screen and subsequent pages.  It is really unusable.  It isn't another 10 seconds, it is another 10 minutes...keyword is minutes not seconds.  if you would rather me to display it in seconds, it adds an additional 600 (that is two zeros) seconds to the 1 or less than 1 second with the e1000e driver.

Edited by sc302

so your cluster has 5.5 host and 6 hosts?  or 2 different setups?

What exactly is slow?  so guessing the web server talks to the db server, etc..  So you have a lot of interaction going on..  Are they all on the same vswitch and network?  Are you using standard vswitches or distributed?

What is the physical nics connected to the vswitches? Do you have them in any sort of load balance or failover?  Are you doing offloading of checksums, etc. etc.

 

VMxnet3 is used on millions (probably billions) of VM's around the world, so what needs to be determined is why in your environment is this an issue. I would start with your vSphere networking and go from there.

no 2 different setups.

I have 4 physical hosts the db is on one host, the other servers are spread across the other 3 hosts.  All standard vswitches, each guest has its own dedicated physical nic, so it is one to one.  the vswitch and vmware do not see drops, the 2012 server sees the nic uninstall itself.

  On 13/08/2015 at 19:34, Stokkolm said:

VMxnet3 is used on millions (probably billions) of VM's around the world, so what needs to be determined is why in your environment is this an issue. I would start with your vSphere networking and go from there.

why does the e1000e driver work normally as far as speed goes and the vmxnet3 does not.  something is different between these two, I don't think it is an issue with the vswitch (could be) but why would the vswitch care?  And what would I change to optimize speed on the vswitch.  this is all out of the box configs, nothing really changed outside of naming the vswitch.  I have even switched it down to access vs trunk to eliminate any issues that vmware could possibly have with trunking....it is a flat vlan anyway. 

Edited by sc302

as to uninstalling itself..  Does the vnic get removed from the vm settings, or does the vm just think there is no nic?

I thought the only problem with the vmx3 was performance?  But what part about performance you have lots of balls in the air here, you have webserver I take it and then that talks to your db and what your other servers in this system do talking to each other I don't know.  What does your performance and utilization look like on your vms? Are you seeing drops or retrans across your physical network between your hosts?

What is the performance like if you put all the vms on the same host on the same vswitch?

There can be issues with offloading, is it enabled on your hardware of the host? Is it enabled in the OS driver for vmx3 nic?

Lets see what the tech says about your setup I guess.

This doesn't spell out 2k12 - but have you looked at this http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2008925

 

the setup is very vanilla so whatever the defaults are.  Offloading is not enabled.  Utilization is on the very low side.

 

with the e1000/e1000e eventlog:

event id 27 + 32,

followed by 4201 7043 7036 then the eventual event id 4199

to me it seems like the nic or a component of the nic is not installed, or has the proper software support to run correctly. and eventually commits suicide killing all communications.

 

------------------------------------------------------

the vmxnet 3 only has slowness with no event logs.  never had an issue like this in other setups with using either driver.   It is a complex setup, but it isn't an uncommon setup.

Edited by sc302

you get 32 this error?

event ID: 32 - Source: disk - Description: The driver detected that the device \Device\Harddisk0\DR0 has its write cache enabled. Data corruption may occur.

that could be a problem if your seeing corruption in the file system for sure!!!

so how many of those 4201 are you seeing? do you get the mac address in your 4199 event?

Edited by BudMan

Event id 32:

the description for event id 32 from source e1iexpress cannot be found.  either the component that raises this event is not installed on your local computer or the installation is corrupted.  you can install or repair the component on the local computer.  the following information was included with the event:

Intel(R) 82574L Gigabit Network Connection.

------------------------------------

 

4201 ~ 25 a day

 

-------------------------------

 

Mac address is that of my switch. 

  On 13/08/2015 at 18:06, njeske said:

Don't use e1000 NICs with Server 2008+ guests.  Use VMXNET3.  The e1000 NIC on Server 2008+ can cause all kinds of weird problems like dropped packets, VLAN tags being incorrectly applied, even a PSOD on the ESXi host.  Before you switch over to VMXNET3, make sure you take the IP address off of the e1000 NIC.  Also make sure you remove the e1000 device through Device Manager.

Serious question, Have you got any Evidence to back this up? I know some people that might be interested in this. 

  On 13/08/2015 at 20:56, John Teacake said:

Serious question, Have you got any Evidence to back this up? I know some people that might be interested in this. 

Here are some kbs for you and you can make a decision based on what you read or send them to your people:

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2109922

http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&docType=kc&externalId=2058692&sliceId=1&docTypeID=DT_KB_1_1&dialogID=124387093&stateId=0 0 124391023

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1001805

 

 

 

  On 13/08/2015 at 20:56, John Teacake said:

Serious question, Have you got any Evidence to back this up? I know some people that might be interested in this. 

I got the Windows version wrong, as it looks like it's 2012+, not 2008+.  But that's still relevant to this post.  Also looks like VMware fixed at least some of the e1000 and e1000e issues in subsequent 5.x and 6 releases.  However after personal horrible experiences with e1000 NICs I always use VMXNET3 and have never had any issues.
http://community.spiceworks.com/topic/640996-esxi5-e1000-server-2012-and-the-purple-screen-of-death-psod
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2059053

One more...  Experienced this problem on ESXi 5.5, though the article only specifies 5.0 and 5.1
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2058692

Edited by njeske
clarify OS version.
  On 13/08/2015 at 19:41, sc302 said:

why does the e1000e driver work normally as far as speed goes and the vmxnet3 does not.  something is different between these two, I don't think it is an issue with the vswitch (could be) but why would the vswitch care?  And what would I change to optimize speed on the vswitch.  this is all out of the box configs, nothing really changed outside of naming the vswitch.  I have even switched it down to access vs trunk to eliminate any issues that vmware could possibly have with trunking....it is a flat vlan anyway. 

Did you go through the process of removing the E1000 NIC from Device Manager? If you didn't then the problems you're seeing with VMxnet3 might be related to a ghost NIC.

Follow these instructions to remove the ghost NIC: http://blogs.technet.com/b/danstolts/archive/2010/09/25/how-to-find-a-lost-missing-hidden-or-removed-network-card-nic-or-other-device-and-even-remove-it.aspx

fully aware on how to remove a ghost nic, thanks though.

this is a bit easier to follow along though, looking at that page you linked to gives me a headache.  I would remove that from your favorites and never refer back to that ever.

http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1179

 

very preliminary, but I think we have a fix for the vmxnet driver.  2 servers are in a processor over commitment state.  Not really an issue for the e1000 driver but, according to support, the vmxnet driver uses processor cores to take some of the load off the nic utilizing receive-side scaling.   Upon disabling receive-side scaling, it seems to have resolved (I cannot tell if this is temporary or not at this point) the slowness issue.  Normally I do not run over committed vms, however this isn't a normal install.  Right now I am faced with 3 options

1. disable receive side scaling

2. drop the cores down on the guest oses

3. purchase new processors and install.

 

This is a beta environment (total current concurrent user count is 5 and they were experiencing extreme slowness with that few user count) so I am not overly concerned with disabling receive side scaling, however if this is a fix my choice will be to either drop the cores or purchase new processors.   If you guys are interested I will keep you updated, otherwise I will go on with those options.

  • Like 2
  On 14/08/2015 at 14:54, sc302 said:

This is a beta environment (total current concurrent user count is 5 and they were experiencing extreme slowness with that few user count) so I am not overly concerned with disabling receive side scaling, however if this is a fix my choice will be to either drop the cores or purchase new processors.   If you guys are interested I will keep you updated, otherwise I will go on with those options.

Very interested, thank you!

Right now we have chose to leave the receive side scaling in the default setting which is enabled.  I have dropped the vcores on the guest oses to be equal or less than the total physical cores on the boxes.  With some light user testing, meaning 1-2 users, everything seems to be running ok with the vmxnet driver.  Monday-Thursday will be the test.  I will post back sometime next week to let you know the outcome, but if you don't hear anything until Friday you can safely assume that this has indeed fixed the issue. 

So related to this - here is a good article on troubleshooting network issues on esxi with vsish

http://www.v-front.de/2015/08/troubleshooting-vm-network-performance.html

So did I read that right you were running more cores in your VMs than you actually had on your boxes?  So for example host has say 4 cores, you were telling your vms they had say 8?  I don't think that has ever been a recommended setup??

 

That is correct.  

 

Yes you shouldn't allow a single guest to use more cores than what is available by the host. I was going by the recommended configuration based on what the vendor recommended,  they had our physical server configurations before they sent us their recommended configuration.  Vendor using a cookie cutter document without really looking at what is available at the individual client site...not the first time something like that has happened. 

 

Regardless, that is what the vmware tech was using when he was looking at the nic/vnic.  No drops,  no ring buffering,  what he saw on the card was perfect.  When he looked further at the setup that is when he looked at the guest configuration and saw the core issue.  I didn't know that or would have that drastic of effect on communications...thought it would treat it like other servers on the hardware where you can have an over commitment of cores through multiple guests (8 physical cores, 10 guests utilizing 4 cores each)

so far no issues, things are moving pretty well.  we are looking to up the processors on those boxes from 2.0 6-core to 3.2 10-core.  Looked at new servers and they are double to triple the price of buying processors (we have 412GB of mem in each host). 

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • Privazer 4.0.107 by Razvan Serea PrivaZer is a PC cleaner that helps you master your security and freedom at home and at work. PrivaZer permanently and irretrievably erases unwanted traces of your past activity on your computer and on your storage devices (USB keys, external drive, and so on) which prevents others from retrieving what you have done, watched, streamed, visited on internet, freeing up valuable hard disk space, and keeping your PC running secure. PrivaZer key features: Deep Cleaning: PrivaZer thoroughly cleans your PC by removing unnecessary files, traces of activity, and potential privacy risks. Advanced Scan Modes: With multiple scan modes, including Quick and Deep scans, PrivaZer ensures comprehensive cleaning tailored to your needs. Customizable Cleaning: PrivaZer allows you to customize cleaning settings, so you can choose exactly what to clean and what to keep. Privacy Protection: PrivaZer safeguards your privacy by securely erasing traces of your online and offline activities, including browsing history and temporary files. Secure File Deletion: PrivaZer securely deletes sensitive files beyond recovery, ensuring your confidential data remains private. Startup Manager: PrivaZer helps you control which programs launch at startup, improving boot times and overall system performance. Automatic Updates: PrivaZer regularly updates its cleaning algorithms to adapt to new threats and ensure effective protection. Scheduled Cleanups: PrivaZer offers the convenience of scheduling automated cleanups, so your PC stays optimized without manual intervention. Portable Version: PrivaZer offers a portable version, allowing you to carry it on a USB drive and clean any PC without installation. Detailed Reports: PrivaZer provides detailed reports after each cleanup, giving you insights into the space reclaimed and the areas cleaned. File Shredder: PrivaZer includes a file shredder feature to securely delete files, making data recovery impossible even with specialized tools. Context Menu Integration: PrivaZer integrates with the context menu, enabling quick and easy access to cleaning functions from any file or folder. Multi-Language Support: PrivaZer supports multiple languages, making it accessible to users worldwide. Automatic Traces Detection: PrivaZer automatically detects traces of activity on your PC, ensuring thorough cleaning without manual intervention. System Restore Point Creation: PrivaZer creates system restore points before cleaning, allowing you to revert changes if needed. Disk Health Analysis: PrivaZer analyzes disk health and alerts you to potential issues, helping you prevent data loss and maintain system stability. Browser Extensions Cleanup: PrivaZer cleans up browser extensions and add-ons, improving browser performance and security. File Association Management: PrivaZer helps you manage file associations, ensuring files open with the correct programs for optimal usability. Intuitive User Interface: PrivaZer features an intuitive user interface, making it easy for both novice and advanced users to optimize their PCs for better performance and privacy. Privazer 4.0.107 changelog: Improved cleanup : Quick Access Improved cleanup : Recent files Minor bug fixes Download: Privazer 4.0.107 | Portable PrivaZer ~30.0 MB (Freeware, paid upgrade available) View: PrivaZer Home Page | Screenshot Get alerted to all of our Software updates on Twitter at @NeowinSoftware
    • Denmark ditching Windows and Office for Linux as it may not want to rely on Microsoft, Trump by Sayan Sen Back in November 2021, the German state of Schleswig-Holstein announced that it was planning to move away from closed-source Microsoft products like Office and Windows to LibreOffice. About two and a half years later, we received an update on the matter, as state officials confirmed that plans for the migration were still intact. As such, about 30,000 government sector computers would be switching by 2027 or so. Following Germany, Denmark seems to be looking to make such a move. This move should resonate pretty loudly now, given that Linux developers, including The Document Foundation, are pushing to encourage users to pick GNU instead of upgrading to Windows 11 from Windows 10. Caroline Stage Olsen, the current Minister for Digital Affairs of Denmark, told the Danish media outlet Politiken that there are plans to phase out Microsoft products in her own ministry next month, and more precisely, moving away from Office 365 to LibreOffice. When discussing the potential challenges of implementing such deployments, Olsen shared a clear backup plan that her ministry has: if the new system proves too challenging at first, they will temporarily revert to the previous setup while they explore other solutions, and overall she is pretty firm on her decision as she stated, "We won't get any closer to our goal if we don't start." The report adds that, thus far, the response from her team has been positive. The minister also added that this initiative isn’t solely about Microsoft. Instead, it’s a step toward reducing an over-reliance on just a handful of providers. The report mentions that the decision could be related to recent events surrounding the International Criminal Court (ICC). For context, the ICC's chief prosecutor, Karim Khan, lost access to his Microsoft Account following Trump's sanctions (via Associated Press). There is also the issue of Trump's keen interest in Greenland, which happens to be a semi-autonomous region of Denmark.
    • Arlo Essential 2K Indoor Camera: Unpacking the features and value of this home security cam by Paul Hill Are you in the UK and looking for internal cameras to keep an eye on your pets or property? If so, the Arlo Essential 2K indoor security camera (2-pack) is now discounted by 26% from its £159.99 RRP to just £119.00. As usual, the product is available with free delivery and free returns, which is helpful if the product turns out to be defective. In addition to the discounted product, the listing also notes there’s a £10-off voucher available until Monday and a £10 Morrisons on Amazon voucher. Arlo is a reputable brand for home security cameras so this deal marks a great opportunity if you’ve been looking for this type of device. Do note that it is a wired camera so it’ll have to be plugged in somewhere. Deep dive into camera features and capabilities The Arlo Essential 2K indoor security camera comes with a very good 2K (up to 2,560x1,440) resolution that provides you with clear, detailed video, great if you want to keep an eye on smaller pets such as kittens. Not only is the camera high-quality, but the camera is equipped with black and white night vision (it can see up to 7 metres), so you can see any events that occur at night. This Arlo security cam features two-way audio with noise reduction and echo cancellation allowing you to chat with anyone coming to feed your pets. There’s also an automatic privacy lens cover that physically blocks the lens when disarmed, providing you with more privacy when at home. There is also passive infrared motion detection that has a range of 7 metres. You can use motion detection in combination with the 80 dB smart siren to scare away intruders. The siren can also be activated manually. The Arlo Essential 2K features a 130-degree wide-angle diagonal view, which is sufficient for most rooms, to capture more of what’s going on in the room and there is 12x digital zoom to take a closer look at objects. It’s compatible with pretty much all Wi-Fi devices with its 2.4GHz Wi-Fi support and it integrates with your smart home via Amazon Alexa, Google Assistant, and IFTTT. Leveraging the Arlo Secure subscription for enhanced security When you buy the Arlo Essential 2K, you get a 30-day free trial of the Arlo Secure subscription, and if you want to continue it, it costs from £11.99 per month or £119.90 per year. This subscription isn’t necessary for basic functionality, but it does unlock the full potential of the camera. When you subscribe you get secure cloud storage for video history (30 to 60 days depending on plan); AI-powered identification of people, animals, vehicles, and packages, reducing false alerts; custom activity zones that let you define areas for motion detection, minimising unwanted notifications; and interactive notifications that can be interacted with from the lock screen like view animated previews, activate siren, and call emergency services. My biggest issue with this camera is that there is no local storage for recordings, necessitating the need to buy the subscription if you want to save any footage. If you’re thinking of using this camera to protect your home from theft and want footage to give to the police, you’ll need a subscription. An alternative to a subscription is buying the Arlo SmartHub (VMB5000) which is compatible with the Arlo Essential 2K indoor camera, according to Arlo’s website. The savings on this camera twin-pack are significant and it’s the lowest price they’ve been at on Amazon UK so they’re definitely worth considering for your home. If you don’t mind the subscription or have the Arlo SmartHub already, then this camera makes sense. If not, then you may be better off with a camera that comes with an SD card slot and recording capabilities. Arlo Essential 2K Indoor Pet Security Camera (2-pack): £119 + £10-off voucher + £10 for Morrisons on Amazon (Amazon UK) / MSRP £159.99 This Amazon deal is U.K. specific, and not available in other regions unless specified. If you don't like it or want to look at more options, check out the Amazon UK deals page here. Get Prime, Prime Video, Music Unlimited, Audible or Kindle Unlimited, free for the first 30 days As an Amazon Associate we earn from qualifying purchases.
    • The Nokia Lumias? Or the third-party HTC One8's? I had HTC's hardware cuz it was slick and reliable... but, yeah, the software left me wanting more and I just couldn't allocate personal time to develop all of the software I would have wanted to see (overworked in other capacities @ MSFT at the time, heh).
    • Microsoft's mobile strategy had great future vision and UX research, but mediocre engineering and inadequate support (third-party and internal business alike). The death knell for WinMo was Google's (mostly YouTube's) incessant API blocking and purposeful release of buggy WinMo builds to force consumers to stay away -- and this was conducted via sabotage of whatever partnerships they were supposed to play nice in. I still yearn for that UI on a modern smartphone...
  • Recent Achievements

    • Week One Done
      evershinefacilityservice earned a badge
      Week One Done
    • One Month Later
      evershinefacilityservice earned a badge
      One Month Later
    • One Month Later
      POR2GAL4EVER earned a badge
      One Month Later
    • One Year In
      Orpheus13 earned a badge
      One Year In
    • One Month Later
      Orpheus13 earned a badge
      One Month Later
  • Popular Contributors

    1. 1
      +primortal
      563
    2. 2
      ATLien_0
      256
    3. 3
      +Edouard
      163
    4. 4
      +FloatingFatMan
      157
    5. 5
      Michael Scrip
      109
  • Tell a friend

    Love Neowin? Tell a friend!