Remuneration: A competitive remuneration package will apply
Be inspired, every day
Enjoy the freedom to discover something new
Take your career in exciting, rewarding directions
There’s a certain feeling you get from working at Monash University. It’s the feeling that you’re a part of something special. Something significant. So if you’re looking for the next chapter in your career, it’s here. You’ll be given the opportunity to challenge yourself, build on your skills, and make a significant contribution to a workplace that’s filled with energetic and inspiring people. Talent thrives here – and so do truly satisfying careers.
We are currently seeking to appoint a highly experienced Research DevOps, Lead Cloud and HPC Architect to lead the architecture and development of the research cloud, high performance computing (HPC) and scientific instrument data and interconnecting fabric, as well as the deployment and continuous improvement of a range of infrastructure systems.
This role will lead the technical oversight of a longstanding Monash University differentiator: a co-designed/ co-built/ co-operated approach for computing infrastructure for researchers.
This role will suit an individual with extensive experience in programming, operating and supporting large scale systems in support of research as well as demonstrated high level project and relationship management skills. A proven ability to manage a highly trained, motivated and efficient technical team will also be essential.
A postgraduate qualification in science, engineering or IT (or equivalent experience) and significant experience in research computing or large-scale computing systems will be critical to your success in this role. We expect candidates will have experience in many of the following: software defined infrastructure, associated hypervisors, software defined networking, software defined storage, HPC/HTC job management, parallel file systems, low latency interconnects, high performance data centre protocols and Linux administration.
This is a brilliant opportunity to join a high performing team working within a rewarding and challenging environment with genuine opportunity to make the role your own.
If this sounds like a position that suits your current career focus we look forward to hearing from you.
This role is a full-time position; however, flexible working arrangements may be negotiated.
At Monash University, we are committed to being a Child Safe organisation. Some positions at the University will require the incumbent to hold a valid Working with Children Check.
The National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL) is seeking highly-motivated individuals to join and contribute to ORNL’s Compute and Data Environment for Science (CADES). CADES is comprised of a user-facing self-provisioning private cloud running on OpenStack, several thousands of cores of high performance parallel compute, and an array of specialized computational appliances providing rare capabilities in data, such as graph analytics and deep learning vital to a variety of scientific domains. These computational services are backed by high-speed interconnects, parallel filesystems, and a distributed object store. In this role, you will work on the Operational Team within the Advanced Data & Workflows Group, both maintaining existing services and contributing to the design and rollout of new services for the research community.
• Design, build and maintain Linux systems for cloud and parallel clustered system architectures
• Work with the team to define and implement methodologies and standards within the organization
• Automate systems administration tasks utilizing open source configuration management tools
• Tune operating systems and applications to increase performance and reliability of services
• Diagnose system operational problems quickly and effectively
• Document system administration procedures for routine and complex tasks
• Participate in scheduled support and off-hours maintenance windows
• Coordinate with vendors to resolve hardware and software problems
• Bachelor’s degree in Computer Science, Computer Engineering or related field.
• 5+ year of professional Linux experience
• Strong scripting skills in shell and/or python
• Experience using configuration management tools (Puppet, Ansible, or similar technologies). Preferred Qualifications:
• Ability to use, and build ways to use, automation frameworks.
• For cloud computing professionals: Openstack experience with Icehouse or Higher; Experience with core services (Nova, Horizon, Neutron, Keystone, Cinder)
• For HPC professionals: expertise with parallel file systems and cluster operations.
• Experience with any container orchestration framework.
• Experience in production environment using Ansible.
• Experience with SDN in production environments.
• Previous work with HPC workflows
• Experience with Cloudforms integration and customization.
• Experience with RHEV, VMWare or other (non OpenStack) virtualization stacks.
• DNS Server administration (BIND).
• Using service / product APIs (Gitlab, Jenkins, ServiceNow, Google, EC2...).
ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply.
We are looking for someone to join our technical team which builds and operates our public cloud and many private clouds for customers all over the world.
We are big on open source. We do all of our work upstream and we do not maintain any local forks which means that you will be working heavily with the upstream teams for any issues that come up.
Deliver the latest technology
Our customers rely on us to bring them the latest technology as soon as possible while making sure that it is stable and reliable. We’ve been one of the first companies to hit Pike and are already on Queens. You will enjoy working with non-legacy environments which can easily have fixes landed for them upstream.
Work on exciting projects
We go beyond the big projects such as Nova and co. We have Magnum deployed, alongside with Sahara and Octavia. Those are projects that very little clouds operate so you will have the experience of working with upstream teams to address any issues.
Variety of exciting opportunities
One a day, you might find yourself troubleshooting a Nova bug upstream, speccing hardware for a cloud, working on deployment tools (Ansible), digging into getting a project working at scale (Gnocchi), managing and automating monitoring (Sensu) or building and operating storage clusters (Ceph).
If you enjoy OpenStack and the world around it, you will enjoy your work here by being able to work on OpenStack and seeing all sides of the story, operator, developer and more. Some of the skills that you might have to use which would be very useful to have:
Experience in deploying and operating OpenStack (DevOps, Hardware, monitoring, day 2)
Understanding of OpenStack components and infrastructure (RabbitMQ, Galera)
Upstream OpenStack development (Gerrit workflows, etc)
Networking knowledge (Spine/leaf and scale-out architecture)