The Scalable Checkpoint/Restart (SCR) library enables MPI applications to utilize distributed storage on Linux clusters to attain high file I/O bandwidth for checkpointing, restarting, and writing large datasets. The 2.0 release marks a milestone in SCR’s long history of bringing dependable, scalable, file set management to multiple HPC platforms.
Some highlights include:
- Support for multiple platform specific hardware technologies, including Cray DataWarp
- Portability across many HPC centers via scheduler integration
- Scalable checkpoint resilience and restart capabilities
Learn more: