Collectd is a system statistics collection daemon that reads system metrics and writes them to various formats. Much work has been done to expose more platform statistics as well as statistics from software applications (enabled by OPNFV barometer). These stats were previously made available to OpenStack via Ceilometer, and now via Gnocchi.
Gnocchi provides a times series metric storage service which enables users to capture OpenStack resources and the metrics associated with them. Aodh provides alarms and notification based on metrics.
The telemetry data available to OpenStack is mostly limited to virtual resource metering. In order to ensure continuity of service, it is vital to monitor the physical infrastructure as well. Collectd provides visibility to what’s happening in the system and this data can be consumed by Aodh to notify and act when faults are detected. All this is the result of collaboration within several open source projects (DPDK, collectd, OpenStack, OPNFV).
What should attendees expect to learn? (1000 chars)
- How collectd’s statistics can be used in OpenStack for better integrated monitoring capabilities.
- How collectd can be used in conjunction with OpenStack services to provide fault monitoring and detection capabilities.
- How gnocchi deals with large datasets and can be used to provide insights into the health of your system.
- How aodh can be used to react to system faults.