Event Details

Please note: All times listed below are in Central Time Zone

<< Go back

Optimized HPC/AI cloud with OpenStack acceleration service and composable hardware

HPC / GPU / AI

Today data scientist is turning to cloud for AI and HPC workloads. However, AI/HPC applications require high computational throughput where generic cloud resources would not suffice. There is a strong demand for OpenStack to support hardware accelerated devices in a dynamic model.

In this session, we will introduce OpenStack Acceleration Service – Cyborg, which provides a management framework for accelerator devices (e.g. FPGA, GPU, NVMe SSD). We will also discuss Rack Scale Design (RSD) technology and explain how physical hardware resources can be dynamically aggregated to meet the AI/HPC requirements. The ability to “compose on the fly” with workload-optimized hardware and accelerator devices through an API allow data center managers to manage these resources in an efficient automated manner.

We will also introduce an enhanced telemetry solution with Gnnochi, bandwidth discovery and smart scheduling, by leveraging RSD technology, for efficient workloads management in HPC/AI cloud.

What can I expect to learn?

The attendees will learn

- The background and current state of OpenStack Cyborg project and RSD

- How OpenStack Cyborg manages accelerator devices (e.g. FPGA, GPU, NVMe SSD) with RSD?

- How Cyborg and RSD meet the requirement of HPC/AI?

- Enhanced telemetry solution with Gnnochi for HPC/AI cloud by leveraging RSD technology

- Bandwidth discovery and smart scheduling with RSD API

Thursday, May 24, 1:50pm-2:30pm (8:50pm - 9:30pm UTC)

Vancouver Convention Centre West - Level Three - Room 301

Slides: Optimized HPC/AI cloud with OpenStack acceleration service and composable hardware

View video

Difficulty Level: Advanced

Tags: Technical OpenStack Ironic Arch / Ops Kolla

Shuquan Huang

99cloud

Shuquan Huang is a Technical Director at 99Cloud, where he built the whole engineering teams from the ground up, established agile development methods to design and implement solutions for cloud computing and edge computing. Shuquan has been working with open source software for over 10 years. He started his OpenStack journey at its early days in 2011 and actively involved as technical... FULL PROFILE

Jianfeng JF Ding

Intel Corporation

Jianfeng(JF) is leading his team to work on cloud software community development in Intel, focusing on orchestration projects such as OpenStack and Kubernetes. Meanwhile he and his team have many interests in cloud automation solutions, to leverage mature cloud techniques in several important areas, like AI/ML, NFV and Edge. His team has many experts in different areas, meanwhile... FULL PROFILE