PERC 6/I "Puncturing bad block" ?


Recommended Posts

Got an alert yesterday afternoon in my email that the 16TB RAID-5 Array in my file server at home had marked a drive offline and put itself in a degraded state. Looking at the physical disk list, Drive 2 was missing, and the logs contained an error that looked like there might be a cable problem. I shut the box down, and when I got home, checked all the cables to each drive. I powered the box back up, disk was there, and the array started rebuilding. Array finished rebuilding, Megaraid storage manager however produced a new error, something about "Puncturing BAD BLOCK"

During this time, I was able to access files fine on the array, so it's not gone. Regardless, I shut the box down again, swapped drive 2 out for a new 2TB drive, set the rebuild rate to 100%, and let it go this morning. It's still rebuilding, but I saw the following again in the log:

post-26332-0-62111700-1356111862.png

It's still rebuilding, and I'm going to let it finish. What does this "puncturing bad block" message mean? I've tried googling it, but from what I can find, it just looks like the controller found a bad block, and is either marking it as bad, or skipping it? I see that it shows ":5" meaning disk 5, as well as 2...does that mean theres a bad block on disk 5 that it copied to the new spare? I'm about to go out and buy a few more 2TB drives and replace possibly disk 5 as well... but wanted to see what you all thought.

Oh, and I did try and produce a backup immediately, but the backup tool failed due to a media error... Which is strange because I am able to copy files manually off the array. After it finishes rebuilding with the new drive I'm going to do a manual copy off to get a recent backup.

In case anyone is interested in drive information and system info, here are my system specs and history:

Drives: Samsung HD204UI drives running 24x7 for ~2 years

RAID controller: Dell PERC 6/I

Motherboard: Asrock Z75 Pro 3

Processor: Intel Xeon E3-1230 V2

While running 24x7 for ~2 years without issue, it looks as if the drives are reaching their limit.

Thanks!

Link to comment
https://www.neowin.net/forum/topic/1127146-perc-6i-puncturing-bad-block/
Share on other sites

hmm stil getting errors with the 2nd and 5th disks. trying to perform a backup again now..if this doesn't work I'm going to have to do a teracopy copy of everything off the array and blow it away.

Run a consistency check on it and see if it goes away.

If it does not, here is what dell says about it (We've been hearing about punctured arrays this year when we replace customer's drives that keep dropping and coming back online) and assumes you have Dell Openmanage server administrator installed at the step with C:\Program Files\Dell\SysMgt\oma\bin\omconfig.exe.

Here is the procedure to try and recover from the punctured stripe:

The best way to fix this is to delete the array (at the controller level), recreate it, perform a full initialization, and then restore the data. This will fix the problem 100% of the time.

The alternate procedure is to:

  1. Turn off Patrol Read
  2. Perform a full system backup with states
  3. Once the backup is completed, look in the backup job log for missing or corrupted files.
  4. Got to the path where it is annotated in the job log (ie, c:\windows\...) and either delete the file or restore from the backup. If the file is essential, then you will need to restore it.

Once this is done, run a consistency check:

  1. Click Start, point to Programs, Accessories, System Tools, and choose Scheduled Tasks.
  2. Double-click Add Scheduled Task.
  3. Click Next.
  4. Click Browse... and locate the file omconfig.exe. The default location of this file is
    C:\Program Files\Dell\SysMgt\oma\bin\omconfig.exe
  5. Click Next.
  6. Enter a task name. This is cosmetic and serves as a reminder to you of the scheduled task's purpose.
  7. Select how often you want the task to run, and then click Next.
  8. Choose the date, time, and month options, and then click Next.
  9. Supply the user name and password of the account that will run the scheduled task.
  10. Click Next, and then click Finish
  11. Run a CHKDSK
  12. In about a week turn on Patrol Read.

NOTE: if this procedure fails, you are still looking at wiping out the drive and rebuilding it.

You'll probably have to blow the array away at the controller level.

Run a consistency check on it and see if it goes away.

If it does not, here is what dell says about it (We've been hearing about punctured arrays this year when we replace customer's drives that keep dropping and coming back online) and assumes you have Dell Openmanage server administrator installed at the step with C:\Program Files\Dell\SysMgt\oma\bin\omconfig.exe.

You'll probably have to blow the array away at the controller level.

interesting find. Thanks for that. I had an extra drive laying around, so i tossed it in for Drive 2, booted into the raid controller tool, and let it rebuild. once done, booted into windows and ran a consistancy check. consistany check failed, same message but on drive 5...same block. I tried to run a backup, but the backup fails due to a media error (MSM shows an error as well). I've pulled drive 5, and am in the process of running samsungs extended drive test on it, but from what I can see from smart and the quick test, the drive looks fine, so I'm wondering if the controller might have an issue.

More info:

Controller has a good battery backup on it, and server is plugged into a 1 hour UPS.

I wiped a drive and placed it in drive 5's spot, array is rebuilding right now in dos. Once it's 100%, I'm going to boot again and try a backup. If the backup fails, I'm going to resort to using teracopy or something to just do a mass copy to my backup array, then blow the array away.

FWIW, I recently removed full drive encrpytion from the array, and while I was in the process, changed the stripe size from 512KB to 64KB (for faster access to smaller files). Before doing this, the array was just fine for over 19000 hours (800 days), and in the span of about 7 days, I've now got errors...perhaps Consumer drives aren't liking 64KB stripe sizes (more reading/writing to the drives I assume?)

Thanks. I'll give that a shot tomorrow

This topic is now closed to further replies.
  • Posts

    • Stop asking people that. It's a "No True Scotsman" argument in that you are attempting to discredit the opinions of a person by Attacking the Messenger. The reason that these are logical fallacies is the TRUTH is based on facts as supported by evidence. Nothing else. So, always debate the facts with evidence to reach the truth. Once you learn to do this, you'll be able to recognize when people are fearmongering and lying to you for their own selfish ends.
    • It doesn't matter if you didn't directly hear it from person X or Y. Every one of your statements comes straight from the racist, skinhead, anti-immigrant, be afraid of everything, "they are all taking our jobs", etc. etc. mouthpieces. That's where Farag and Putin heard it from too...and used it against the UK. So, while you keep disavowing the people who publicly peddled that position, you keep proving over and over again that those lies influenced you into being tricked when the Brexit vote came around too. In fact, your final sentence makes it crystal clear that it was the racist/anti-immigrant lies you fell and voted for, since you stated that you didn't have an issue with the economic trade issues with the EU. Ahem. To be clear, all of these LIES are EONS old, mate. They are the same fearmongering lies peddled to the same ignorant, gullible cowards by the same charlatans, snake-oil salesmen, and would be demagogues who've been doing this since caveman Ugh lied about his slightly different neighbor in order to steal his land. And, finally, you answered your own previous question. The reason that the EU isn't clamoring to bring the UK back is that they have had enough of people who would rather shoot themselves in the foot than get over their "insecurity issues". It's the same reason the entire world is moving away from the USA as fast as it can...
    • Onkyo Dolby Atmos AV receivers are really solid deals by Sayan Sen Recently we covered great deals on several soundbar models from the likes of Sony, JBL, Samsung and others for really good prices (the lowest in several months). Aside from that we also reported on the Edifier S3000MKII, a hi-fi two-way bookshelf monitor that's available for only $800. Today we bring a list of AV receivers from Onkyo that are available at great prices including the Onkyo NR7100, RZ30, and 8470 (purchase links under the specs table down below). The Onkyo TX-NR7100 and Onkyo TX-RZ30 are both 9.2-channel AV receivers designed for immersive home theater setups but they occupy slightly different tiers within Onkyo’s lineup with the RZ30 positioned as the more advanced model. The TX-NR7100 is a THX Certified 9.2-channel receiver offering up to 100 W per channel (8 ohms, 2 channels driven). It supports Dolby Atmos, DTS:X, and IMAX Enhanced formats, with flexible configurations such as 5.1.4 or 7.1.2 speaker layouts. A key highlight is its built-in Dirac Live Room Correction which should help optimize sound based on your room and its acoustics. In comparison, both models share several core capabilities though the RZ30 is geared toward enthusiasts seeking more precise calibration and system flexibility, while the NR7100 is positioned as a slightly more accessible, value-focused option with strong all-round performance. The technical specs of the RZ30 and NR7100 9.2 AVRs are given in the table below: Specification Onkyo TX-RZ30 Onkyo TX-NR7100 Power Output (FTC, 2ch driven) ~100 W/ch (8Ω, 20Hz–20kHz, 0.08% THD) 100 W/ch (8Ω, 20Hz–20kHz, 0.08% THD) Dynamic / Peak Power 9 × 170 W (6Ω, 1kHz, 1% THD, 1ch driven) 220 W/ch (6Ω, 1kHz, 10% THD, 1ch driven) Frequency Response 5 Hz – 100 kHz (+1/-3 dB) 10 Hz – 100 kHz (+1/-3 dB) THD 0.08% 0.08% Room Correction Dirac Live (full bandwidth) Dirac Live (with AccuReflex support) Immersive Audio Dolby Atmos, DTS:X, IMAX Enhanced Dolby Atmos, DTS:X, IMAX Enhanced Speaker Layout Support Up to 7.2.2 / 5.2.4 / 9.2 processing Up to 7.2.4 / 5.2.4 / 9.2 processing HDMI Inputs / Outputs 6 inputs / 2 outputs (eARC) 6 inputs / 2 outputs (Main + Sub/Zone 2) HDMI 2.1 Support 8K/60, 4K/120, VRR, ALLM, QFT, DSC, eARC 8K/60, 4K/120, VRR, ALLM, QFT, DSC, eARC Video Formats HDR10+, Dolby Vision, HDCP 2.3 HDR10+, Dolby Vision, HDCP 2.3 Streaming / Network Wi-Fi, AirPlay 2, Chromecast, Bluetooth, DTS Play-Fi Wi-Fi, AirPlay 2, Chromecast, Bluetooth, DTS Play-Fi Get them at the links below: Onkyo TX-RZ30 9.2-Channel AV Receiver: $797.00 (Sold and shipped by Electronic Expo) Onkyo TX-NR7100 9.2-Channel AV Receiver: $699.00 (Sold and shipped by Adorma) Onkyo TX-8470 2 Ch Stereo Receiver: $449.00 (Sold and Shipped by Adorma) Good to know This Amazon deal is U.S. specific, and not available in other regions unless specified. We only use first-party seller links or authorized dealer links (at the time of article publishing); ensure that you purchase from such links only. Check out Today's Deals on Amazon | or our recent tech deals. Become a Prime member (for Students or SNAP) via Neowin Get Prime Access - Prime for half price (for qualifying Medicaid, EBT, SNAP) Subscribe to Prime Video, Audible Plus, Music Unlimited or Kindle Unlimited via Neowin As an Amazon Associate, we earn from qualifying purchases.
  • Recent Achievements

    • Week One Done
      flexorcist earned a badge
      Week One Done
    • One Month Later
      Woland13 earned a badge
      One Month Later
    • Week One Done
      Woland13 earned a badge
      Week One Done
    • One Year In
      bernmeister earned a badge
      One Year In
    • Week One Done
      Scoobystu earned a badge
      Week One Done
  • Popular Contributors

    1. 1
      +primortal
      498
    2. 2
      +Edouard
      224
    3. 3
      PsYcHoKiLLa
      148
    4. 4
      Steven P.
      74
    5. 5
      FloatingFatMan
      69
  • Tell a friend

    Love Neowin? Tell a friend!