For fault management in cloud, traditional and current telemetry/monitoring architectures are good, but you should enhance the telemetry/monitoring architecture for NFV/Edge/5G/IoT. Because these new services have several system architectures, SLAs that are far from cloud's one, huge amount of targets you have to monitoring.
In Sydney Summit, there were some great sessions for monitoring/fault management.[1][2][3][4][5][6][7] So We can discuss next generation fault management.
In this forum, we are sharing architecture, use cases, gap analysis, OSS for collecting/analyzing/storing/notifying, related projects. OPNFV and ONAP has Doctor[8], Barometer[9] and DCAE[10] that focus on monitoring and fault management. We can also discuss how to adapt these projects to your environments. And the goal of this forum is sharing of real use cases requiring such fault management and those solutions currently used or to be used in future, in order to start discussion of good architecture.
[1] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/20085/monitoring-the-nectar-research-cloud
[2] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19736/monitoring-performance-of-your-openstack-environment
[3] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19748/the-experience-of-telemetry-under-large-scale-openstack-cluster
[4] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19470/proactive-monitoring-for-openstack
[5] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19448/use-case-help-me-from-tons-of-alarms
[6] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19178/dmadistributed-monitoring-and-analysis-monitoring-practice-and-lifecycle-management-for-telecom
[7] https://www.openstack.org/summit/sydney-2017/summit-schedule/events/19650/monitoring-openstack-deployments
[8] https://wiki.opnfv.org/display/doctor/Doctor+Home
[9] https://wiki.opnfv.org/display/fastpath/Barometer+Home
[10] https://wiki.onap.org/pages/viewpage.action?pageId=1015831
