Forum
Welcome, Guest
Username: Password: Remember me
This is the optional category header for the Suggestion Box.

TOPIC: Host Failure scenario

Host Failure scenario 2 months 1 week ago #1555

Hello all,

Currently i am reviewing HA-Lizard for my HA virtualization needs. At first I had quite a bit of trouble getting this all to work, but with some perseverance I was able to test most scenarios I could think of and recover from them with no hassle.

Right now I am on my last test scenario, Full host failure. I am able to shut down my master server have the slave take over. The HA works great. I have reformatted my old master server and managed to connect the now new slave to my pool. I have recreated my bonds and ran through the no-san installer. However as far as i can tell DRBD is not setup properly anymore.

My question is: What is the best way to recover from a failed host?
Last Edit: 2 months 1 week ago by Ethan Scott.
The administrator has disabled public write access.

Host Failure scenario 2 months 1 week ago #1556

The steps you described in your recovery scenario should work with the exception of one final step which instructs the master to overwrite the storage of the newly introduced slave. It is simple to do, but dangerous as you could wipe your data if you sync in the wrong direction by accident.

Below is the full command that should be run on the host that contains the data, the master in your case.

drbdadm -- --overwrite-data-of-peer primary iscsi1
The administrator has disabled public write access.

Host Failure scenario 2 months 1 week ago #1557

Salvatore,

Your advice worked like a charm. That command was the missing piece of the puzzle for me.

Thank You
The administrator has disabled public write access.

Host Failure scenario 1 month 4 weeks ago #1569

Hi Ethan,

I come here because of my post ( Link ).

I just want to ask you if after attaching the new formatted node to the pool, did you follow the "halizard_nosan_installer" script till the end? Or did you setup the new node by hand?

Thanks.
The administrator has disabled public write access.

Host Failure scenario 1 month 4 weeks ago #1570

Daniel,

I followed the installation script all the way through. Sometimes the script will error out trying to download components are whatever, I don't remember the specific issues were, but the best solution I have found if problems occur when going through the setup script is to just start over. And the command that Was provided by Salvatore was truly the missing piece.
The administrator has disabled public write access.
Time to create page: 0.099 seconds