I'm going to go out on a limb here....
Some more background - I work with enterprise software - in an environment supporting between 2,000 and 150,000 users. I KNOW things go wrong.
I've seen SAN - which isn't supposed to fail - ever - fail.
I've seen a tiny desktop app take down over 100 servers, databases, and backups.
I've seen a lightning strike destroy a power substation and take down an entire building with a generator because the local grid got fried.
Every host - every system - has some single point of failure. Even co-located data centers with multiple separate hooks into the backbone can - rarely - go down.
That said - I actually got a call from the owner of the company today. We talked and he explained what happened. My biggest instinct in these situation is "How will it be prevented in the future?" and before I even asked that he had talked through that point.
Shit happens.
Some more background - I work with enterprise software - in an environment supporting between 2,000 and 150,000 users. I KNOW things go wrong.
I've seen SAN - which isn't supposed to fail - ever - fail.
I've seen a tiny desktop app take down over 100 servers, databases, and backups.
I've seen a lightning strike destroy a power substation and take down an entire building with a generator because the local grid got fried.
Every host - every system - has some single point of failure. Even co-located data centers with multiple separate hooks into the backbone can - rarely - go down.
That said - I actually got a call from the owner of the company today. We talked and he explained what happened. My biggest instinct in these situation is "How will it be prevented in the future?" and before I even asked that he had talked through that point.
Shit happens.