Barcelona, Spain
October 25-28, 2016

Event Details

On Building an Auto-healing Resource Cluster using Senlin

Building an auto-healing cluster in a cloud environment is never an easy job. How do we accurately detect failures that happen in different layers? How do we fence promptly to prevent further damage? How do we make recovery proceed automatically and efficiently? All of these headaches need to be addressed before we can call our systems and applications auto-healing. In this presentation, we will take a deep dive into Senlin's health management design to show how we address these issues and fill the gap.

What can I expect to learn?

Attendees will learn how to build and deploy an auto-healing cluster for some typical applications in an OpenStack cloud:

(1) Choose proper metrics and events for failure detection

(2) Choose proper recovery actions and their sequence for the target cluster

(3) Build and customize your own health management policy to enable auto-healing for the target cluster

(4) Extend the auto-healing loop by cooperating with other telemetry, workflow, and event services

Thursday, October 27, 2:40pm-3:20pm
Difficulty Level: Intermediate
Xinhui Li is a senior architect at VMware and leads the team working on NFV product development and integration with the open ecosystem. She serves as an ONAP TSC (Technical Steering Committee) member representing VMware and previously served as the OPEN-O TSC representative. She led the Multi-Cloud project team as PTL (Project Technical Lead) and delivered the ONAP Amsterdam and Beijing releases. She is also...
VMware, Software Engineer
Ethan Lynn is a software engineer at VMware whose OpenStack journey started in 2013. He developed the first IBM private cloud manager based on OpenStack and contributed many features back to the OpenStack community. He also designed and developed high-availability solutions for OpenStack products. He is now a core reviewer for the Orchestration (Heat) and Clustering (Senlin) projects. He is also a...