Lustre 101

Overview

The Lustre 101 web-based course series is focused on administration and monitoring of large-scale deployments of the Lustre parallel file system. Course content is drawn from nearly a decade of experience in deploying and operating leadership-class Lustre file systems at the Oak Ridge Leadership Computing Facility (OLCF) at Oak Ridge National Laboratory (ORNL), as well as contributions from other sites with large-scale Lustre experience.

A primary concern in deploying a large system such as Lustre is building the operational experience and insight to triage and resolve intermittent service problems. Experience has no substitute, yet there is also no adequate training material for becoming a Lustre administration expert. The overall goal of the Lustre 101 course series is to distill the working knowledge of those with significant experience in administering large-scale Lustre deployments and to share it with the Lustre community, in the hope that others can avoid the trials and tribulations of Lustre administration and monitoring at scale.

Available Courses

1. Lustre Administration Essentials

This course is aimed at experienced system administrators who are relatively new to Lustre but may have prior experience with other distributed and parallel file systems. Topics in this course include an introduction to Lustre, hardware selection and benchmarking strategies, Lustre software installation and basic configuration, Lustre tuning and LNet configuration, basic file system administration and monitoring, and methods for problem diagnosis and analysis.

2. Lustre Administration Tutorials

This course includes a series of tutorials on specific aspects of Lustre administration and monitoring. Lessons in this course are intended for system administrators who are relatively new to Lustre, as well as more experienced admins seeking to learn about new ways of using Lustre or improving its reliability or performance.

Feedback

We welcome all feedback and suggestions for improving course content. Please send them to:

lustre101-feedback @ ornl.gov

Acknowledgments

The Lustre 101 course series is developed by the Computational Research and Development Programs at Oak Ridge National Laboratory (ORNL), with support from the U.S. Department of Defense and the Oak Ridge Leadership Computing Facility (OLCF). OLCF is supported by the Office of Science of the U.S. Department of Energy.

 
