May 8-11, 2017

Event Details

Please note: All times listed below are in Central Time Zone

Gnocchi and Collectd for Faster Fault Detection and Maintenance

Collectd is a system statistics collection daemon that reads system metrics and writes them to various formats. Much work has been done to expose more platform statistics as well as statistics from software applications (enabled by OPNFV barometer). These stats were previously made available to OpenStack via Ceilometer, and now via Gnocchi.

Gnocchi provides a times series metric storage service which enables users to capture OpenStack resources and the metrics associated with them. Aodh provides alarms and notification based on metrics.

The telemetry data available to OpenStack is mostly limited to virtual resource metering. In order to ensure continuity of service, it is vital to monitor the physical infrastructure as well. Collectd provides visibility to what’s happening in the system and this data can be consumed by Aodh to notify and act when faults are detected. All this is the result of collaboration within several open source projects (DPDK, collectd, OpenStack, OPNFV).

What can I expect to learn?

What should attendees expect to learn? (1000 chars)

  • How collectd’s statistics can be used in OpenStack for better integrated monitoring capabilities.
  • How collectd can be used in conjunction with OpenStack services to provide fault monitoring and detection capabilities.
  • How gnocchi deals with large datasets and can be used to provide insights into the health of your system.
  • How aodh can be used to react to system faults. 
Thursday, May 11, 11:50am-12:30pm (3:50pm - 4:30pm UTC)
Difficulty Level: Beginner
Intel Corporation
Emma is a Senior Software Engineer at Red Hat working on metrics collection. She has been involved in OpenStack since 2014. when she first tried deploying a cloud. She started working on service assurance at Intel, enabling collectd stats and events to be used in OpenStack as well as maintaining testing, build and  validation tools in OPNFV Barometer. She is an active contributor and current... FULL PROFILE
Principal Software Engineer at Red Hat
Julien is a Free Software hacker since 1998. He works as a Principal Software Engineer at Red Hat, daily improving OpenStack, a project he has been working on since 2011. He leads the OpenStack Telemetry project as its PTL and contribute to common OpenStack code in Oslo. He seated at the OpenStack Technical Committee in 2013. FULL PROFILE