Boston
May 8-11, 2017

Event Details


Gnocchi and Collectd for Faster Fault Detection and Maintenance

Collectd is a system statistics collection daemon that reads system metrics and writes them to various formats. Much work has been done to expose more platform statistics as well as statistics from software applications (enabled by OPNFV barometer). These stats were previously made available to OpenStack via Ceilometer, and now via Gnocchi.

Gnocchi provides a times series metric storage service which enables users to capture OpenStack resources and the metrics associated with them. Aodh provides alarms and notification based on metrics.

The telemetry data available to OpenStack is mostly limited to virtual resource metering. In order to ensure continuity of service, it is vital to monitor the physical infrastructure as well. Collectd provides visibility to what’s happening in the system and this data can be consumed by Aodh to notify and act when faults are detected. All this is the result of collaboration within several open source projects (DPDK, collectd, OpenStack, OPNFV).


What can I expect to learn?

What should attendees expect to learn? (1000 chars)

  • How collectd’s statistics can be used in OpenStack for better integrated monitoring capabilities.
  • How collectd can be used in conjunction with OpenStack services to provide fault monitoring and detection capabilities.
  • How gnocchi deals with large datasets and can be used to provide insights into the health of your system.
  • How aodh can be used to react to system faults. 
Thursday, May 11, 11:50am-12:30pm
Level: Beginner
Intel Corporation
Emma is a Software Engineer in the Network Platforms Group in Intel. Emma has worked on Service Assurance, making more statistics available for the OpenStack cloud, by enabling collectd stats and events to be used in OpenStack. She is committer to the OPNFV Barometer project, and helps to maintain their testing, build and validation tools. Her latest adventures have brought her into testing,... FULL PROFILE
Red Hat
Julien is a Free Software hacker since 1998. He works as a Principal Software Engineer at Red Hat, daily improving OpenStack, a project he has been working on since 2011. He leads the OpenStack Telemetry project as its PTL and contribute to common OpenStack code in Oslo. He seated at the OpenStack Technical Committee in 2013. FULL PROFILE