Forums before death by AOL, social media and spammers... "We can't have nice things"
|    alt.os.linux.mint    |    Looks pretty on the outside, thats it!    |    30,566 messages    |
[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]
|    Message 30,050 of 30,566    |
|    Paul to All    |
|    Re: Hard Drive techology    |
|    28 Dec 25 04:41:43    |
      XPost: aus.computers       From: nospam@needed.invalid              On Sat, 12/27/2025 6:50 PM, keithr0 wrote:       > On 25/12/2025 4:57 am, Lawrence D’Oliveiro wrote:       >> On Wed, 24 Dec 2025 21:18:19 +1000, keithr0 wrote:       >>       >>> On 24/12/2025 3:33 pm, Lawrence D’Oliveiro wrote:       >>>       >>>> I got a new 12TB drive for my backup machine just a couple months       >>>> ago.       >>>       >>> That's a lot of eggs in one basket ...       >>       >> That’s why I have a backup machine.       >>       >>> ... personally I'd prefer a RAID 5 or 6 setup with smaller drives.       >>       >> RAID is about high availability, not about backup.       >       > RAID is about failure resilience, if you have a single drive a failure loses       everything. Make a RAID group, and the loss of a single drive results in no       data loss. Done properly, it also improves performance.              RAID is about delaying maintenance until after the work day is done.              It's nothing about long term resilience.              If you have one fault and you do not meet the MTTR, the second       fault might knock you out, or depending on RAID class, the third       fault might knock you out.              *******              You really really have not thought it through, if you believe that resilience       thing.              At 2PM in the afternoon at work, the main CAD server (serves copies of CAD       software), I can't remember if it was a PERC or what it was.              The *controller* wrote to all drives at once. It was       not a commanded write. It was a firmware issue of some sort.       And not a capacity-rollover type flaw.              It corrupted some area low in the disk storage.              Causing *all volumes to be wiped out instantly*.              The restore time for the server, ran past 5PM and the       five hundred engineers depending on the server for their       software, had long since gone home when the server came back up.       Somebody in management, did the calc to see what the       "lost time" had cost us. That's what was the first step in       figuring out what the response should be to the event.              And this failure happened *twice*, just on a different server       which was not as critical. Because the first incident was not       as critical, it wasn't taken as seriously and nobody at that       time, had figured what had happened to the server. It was only       when a major infrastructure incident was raised, that the event       was analyzed to its source.              This is why RAID is worth *nothing* to you.              There is a class of "common mode faults" you should consider.              An example of a common mode fault, is when the PSU +12V shoots       up to +15V and burns all disk drives. When I mentioned the possibility       of this happening, a poster wrote in and said this had happened       to him. Loss of all storage on a PSU fault.              This is *why we do backups of our RAID array* :-/              A RAID array is NOT a backup.              A mirror, is just so much bullshit for the people doing it.       Because they haven't thought about the common mode faults.              I can give an example from a USENET posting about a RAID1 two disk mirror.              A user is running RAID 1. One of the drives drops. The user thinks       this is fine and dandy. Plug in replacement drive, and oh... whats this ?       The files on the second drive, stopped updating three months ago.       The array had some kind of fault, where the controller was       not writing to the second (surviving) drive. Which means, sure,       you can do a RAID rebuilt of your mirror, but three months       worth of changes are missing. The controller in this case was       a SIL3112, so it wasn't an actual quality controller. It's just       the deception "you have RAID", that counts.              DO YOUR BACKUPS. And run a Verify on your backup, to ensure it is a good one.       I have stories about that, too :-)               Paul              --- SoupGate-Win32 v1.05        * Origin: you cannot sedate... all the things you hate (1:229/2)    |
[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]
(c) 1994, bbs@darkrealms.ca