Megaraid Storage Manager / LSI card... What does "PD" really mean?


Recommended Posts

I take it PD = Physical Disk/Drive, but if so, then I'm stumped.

 

I've had for a number of weeks, errors every 20-30 seconds show up in my controller logs:

Controller ID: 0 Unexpected sense: PD = 14-Invalid field in CDB.....

As well as 

Controller ID: 0 Transient error detected while communicating with PD : 14

But here's the kicker....   I just replaced Physical Disk 14, with the drive in 13, and then pulled Disk 14 out.   I'm still getting these errors.  

 

 

The only other thing I can think of, is that it possibly refers to the Enclosure ID which also happens to say "14"  See screenshot:

post-26332-0-76514500-1373434806.png

 

So how the hell am I supposed to find out which cable is possibly bad, or could it be the cable between my Raid Controller and the expander?  I've already tried replacing all the cables I originally bought (from Monoprice) with "approved" LSI cables (not cheap), and that didn't change anything, and by moving to disk 13, that ruled out the slot in my cage...so I'm at a loss on how to troubleshoot this.

 

This is likely a power management firmware bug that I have seen with some drives on these controllers. In my case when I saw a similar error what was happening is the system was unexpectedly awakening the hot spare drive from sleep, sending it to sleep waking it up and this continuously happened all the time. Apparently it was a bug with the firmware of the I believe Seagate drives I had on the controller and the controller itself. It was fixed in a later firmware update for the controller which in effect disabled all power management functionality of the controller and caused the controller to not attempt to send in-active drives to sleep.

 

In my case it also resulted in the controller reporting immature failure of the drives when they were in reality perfectly fine.

 

I would try updating to the latest firmware for your controller card and see if the errors go away. In my case it wasn't anything wrong with the drives or the controller per-se but just a miscommunication between the (2) that caused some of the drives to change state unexpectedly and confuse the controller. In the worst scenarios it would result in a crash of the controller and the OS would freeze in a manner where it would still appear to be running but the connection to the drives were all missing. The firmware update fixed the error and random halting/crashing issue that the powering up and down of the drives caused.

This is likely a power management firmware bug that I have seen with some drives on these controllers. In my case when I saw a similar error what was happening is the system was unexpectedly awakening the hot spare drive from sleep, sending it to sleep waking it up and this continuously happened all the time. Apparently it was a bug with the firmware of the I believe Seagate drives I had on the controller and the controller itself. It was fixed in a later firmware update for the controller which in effect disabled all power management functionality of the controller and caused the controller to not attempt to send in-active drives to sleep.

 

In my case it also resulted in the controller reporting immature failure of the drives when they were in reality perfectly fine.

 

I would try updating to the latest firmware for your controller card and see if the errors go away. In my case it wasn't anything wrong with the drives or the controller per-se but just a miscommunication between the (2) that caused some of the drives to change state unexpectedly and confuse the controller. In the worst scenarios it would result in a crash of the controller and the OS would freeze in a manner where it would still appear to be running but the connection to the drives were all missing. The firmware update fixed the error and random halting/crashing issue that the powering up and down of the drives caused.

I've already got the latest firmware on the LSI 9260-8i, as well as latest MSM, I update when new firmware comes out almost immediately.  There was a "unexpected sense" bug in the intel firmware for the expander, but that was fixed.

I'm wondering if it might be the Samsung or Seagate drives, like you mentioned, though possibly the samsung drives as it's done it for a while, and i only got WD and seagate drives recently.   I just ordered for approved SAS 15k drives, so those will replace the four Samsung drives, but if the errors still show up, then maybe I'll power down the entire set of Seagate drives to see if that solves it.

 

I'm not super worried, as they're just "information" warnings, but they spam my logs, and I'd like to try and figure out the cause of it.  the "PD=14" is confusing since I've pulled PD 14 and it still  shows this every 30-60 seconds.  Perhaps i can log a ticket with LSI, although I'm sure I'll get a "non-approved drives" message back from them

The disks are offset by 1. So PD 14 is likely physical disk 15 in your chassie.

If you're going by the image(12, 13, 15), don't, as I've removed 14 (thinking that was PD 14) so there really is no 14 listed in the photo (even though alerts for 14 still show up)

 

If you're going by LSI numbering them from 0-14, instead of 1-15 ( PD = Slot -1)...then you might be onto something. I'll have to put the old #14 back in, rebuild it, then pull 15 to test. in the meantime I've created a support package and emailed it to LSI to see what they have to say.

This topic is now closed to further replies.
  • Recently Browsing   0 members

    • No registered users viewing this page.
  • Posts

    • We had no idea as kids how much time and energy it took to be an adult 😅
    • The Trump administration doesn't want you to use OpenAI's GPT-5.6 without its approval by David Uzondu Image via @realDonalTrump (X) As OpenAI prepares the release of its next model, GPT 5.6, the White House has instructed the company to limit the distribution of the software to a small group of government-approved partners instead of the general public, as it has done with previous releases. According to The Information, OpenAI Chief Executive Officer Sam Altman sent an internal memo to staff on Thursday explaining that the federal government will approve access "customer by customer" during an initial preview phase. Altman noted in the communication that this restrictive rollout is "not [their] long-term model" for software deployment, and the company plans to work toward a "more sustainable" distribution method later. CNN said that both OpenAI and the Trump administration view the capabilities of GPT 5.6 on the same level as Anthropic's Mythos and that government officials intend to "collaborate with frontier AI labs to develop shared approaches for addressing the challenges of scaling this technology." The latest restriction comes just weeks after the US Commerce Department decided to restrict Fable, a version of Mythos with extra safety "guardrails" to prevent users from exploiting software vulnerabilities. Not long after the release, though, researchers at Amazon found a way to bypass these restrictions, prompting an aggressive response from federal authorities. The government ordered Anthropic to cut off access for non-US citizens located outside the US, non-US citizens living inside the US, and incredibly, even Anthropic's own foreign-born employees. Anthropic now appears to be building a workaround to resolve this compliance block with an update to its Privacy Policy that introduces a category called "Verification Data" to handle KYC and Digital IDs. This setup could mandate digital identity checks to filter users by nationality, requiring a government-issued ID and facial biometric data. Who knows? Maybe in the future, you would have to scan your US Passport or State ID to prove your citizenship before you are allowed to chat with Fable 5 (or any other model).
    • When Windows 7 was released I created an AutoHotkey script that uses Alt+` as a keyboard shortcut to move a window across monitors. I have been using that script for over 15 years and this is the first time I have come across another app that uses the same shortcut!
    • I called it last year that they wouldn't end support when they said there would. There are too many people still on Windows 10 waiting for something better to upgrade to and 11 ain't it! The recent promises of fixing Windows 11's many problems is nice, but unless they deliver on those promises in a big way then I expect customers will still want to stick with 10.
  • Recent Achievements

    • Week One Done
      xvvxcvv earned a badge
      Week One Done
    • One Month Later
      xvvxcvv earned a badge
      One Month Later
    • Enthusiast
      Xonos went up a rank
      Enthusiast
    • Conversation Starter
      Admir earned a badge
      Conversation Starter
    • First Post
      The_Focal_Point earned a badge
      First Post
  • Popular Contributors

    1. 1
      +primortal
      409
    2. 2
      +Edouard
      172
    3. 3
      PsYcHoKiLLa
      127
    4. 4
      neufuse
      69
    5. 5
      Steven P.
      67
  • Tell a friend

    Love Neowin? Tell a friend!