Event Details

Please note: All times listed below are in Central Time Zone


A Better VM HA Solution: Split-brain Solving & Host Network Fault Awareness

Although more and more applications are moving to the cloud, some users' legacy services have not been re-architected and still need HA capabilities to ensure their reliability. So how do you build a reliable, flexible VM HA solution on OpenStack?

Two main issues need to be addressed in VM HA:

  • How to prevent the split-brain problem?
  • How to detect which specific network plane has failed on each host, so that a more appropriate recovery operation can be performed?

Based on the above concerns, we've developed a complete VM HA solution:

A Sanlock-based distributed lock manager has been developed to solve the first problem. For the second, Etcd is used to implement failure detection on three physical networks (management, storage, service) for each host. Once any network is interrupted, the HA-manager detects the failure and triggers a recovery operation based on the configured policy.
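The detection-then-policy flow described above can be sketched roughly as follows. This is only an illustrative sketch, not the presenters' implementation: it assumes each host records a per-plane heartbeat timestamp (a plain dict stands in for Etcd here), and the policy table, action names, and function names are all hypothetical.

```python
from typing import Dict, FrozenSet

# The three physical network planes named in the abstract.
PLANES = ("management", "storage", "service")

# Hypothetical policy table: recovery action per set of failed planes.
# The action names are assumptions, not taken from the session.
POLICY = {
    frozenset(): "none",
    frozenset({"management"}): "alert_only",
    frozenset({"service"}): "migrate",
    frozenset({"storage"}): "fence_and_evacuate",
}
# Assumed fallback when several planes fail at once.
DEFAULT_ACTION = "fence_and_evacuate"


def failed_planes(last_seen: Dict[str, float], now: float,
                  timeout: float) -> FrozenSet[str]:
    """Return the planes whose last heartbeat is older than `timeout` seconds.

    A plane with no recorded heartbeat is treated as failed.
    """
    return frozenset(
        plane for plane in PLANES
        if now - last_seen.get(plane, float("-inf")) > timeout
    )


def choose_action(failed: FrozenSet[str]) -> str:
    """Look up the configured recovery action for a set of failed planes."""
    return POLICY.get(failed, DEFAULT_ACTION)
```

For example, a host whose service-plane heartbeat is stale while the other two planes are fresh would map to the "migrate" action under this assumed policy.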

In addition, the solution includes some HA-related functions, such as queuing, retrying, and so on.
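As a rough illustration of what queuing and retrying could look like, here is a minimal sketch. The abstract does not describe the actual mechanism, so the function name, the `recover` callback, and the `max_attempts` limit are all assumptions made for this example.

```python
from collections import deque
from typing import Callable, Iterable, List, Tuple


def run_recovery_queue(vm_ids: Iterable[str],
                       recover: Callable[[str], bool],
                       max_attempts: int = 3) -> Tuple[List[str], List[str]]:
    """Process VM recovery tasks in FIFO order with bounded retries.

    `recover` is a hypothetical callback that returns True on success.
    A failed task is re-queued at the tail until it has been tried
    `max_attempts` times, after which it is reported as failed.
    """
    queue = deque((vm, 1) for vm in vm_ids)
    recovered: List[str] = []
    failed: List[str] = []
    while queue:
        vm, attempt = queue.popleft()
        if recover(vm):
            recovered.append(vm)
        elif attempt < max_attempts:
            # Retry later, behind the tasks already waiting.
            queue.append((vm, attempt + 1))
        else:
            failed.append(vm)
    return recovered, failed
```

Re-queuing at the tail keeps one repeatedly failing VM from blocking the recovery of the others; a production system would likely also add a delay between attempts.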


What can I expect to learn?

You can learn more about this solution during the session. This includes:

  • How to implement a lock manager based on Sanlock;
  • How to implement host network fault awareness;
  • How to implement HA queuing, the HA retry mechanism, etc.;
  • Problem-solving cases from development;
  • Performance tuning of the entire solution.
Wednesday, November 14, 4:20pm-5:00pm (3:20pm - 4:00pm UTC)
Difficulty Level: Advanced
Fiberhome Telecommunication Technologies Co.,LTD
Wu Jiang is a Cloud Architect at FiberHome Integration Technologies Co., Ltd. He is part of the OpenStack development team at FiberHome. He has been involved in OpenStack since the Diablo release, and has continued to be active in the OpenStack community as a contributor to the Nova project.
Senior Software Engineer
Xiao Ding is a Senior Software Engineer at FiberHome Integration Technologies Co., Ltd. He is part of the OS Platform development team at FiberHome. He has been involved in the Libvirt and QEMU projects since 2013.