Abstract
As primary provider for research computing services at the University of Minnesota, the Minnesota Supercomputing Institute (MSI) has long been responsible for serving the needs of a user-base numbering in the thousands. In recent years, MSI-like many other HPC centers-has observed a growing need for self-service, on-demand, data-intensive research, as well as the emergence of many new controlled-access datasets for research purposes. In light of this, MSI constructed a new on-premise cloud service, named Stratus, which is architected from the ground up to easily satisfy data-use agreements and fill four gaps left by traditional HPC. The resulting OpenStack cloud, constructed from HPC-specific compute nodes and backed by Ceph storage, is designed to fully comply with controls set forth by the NIH Genomic Data Sharing Policy. Herein, we present twelve lessons learned during the ambitious sprint to take Stratus from inception and into production in less than 18 months. Important, and often overlooked, components of this timeline included the development of new leadership roles, staff and user training, and user support documentation. Along the way, the lessons learned extended well beyond the technical challenges often associated with acquiring, configuring, and maintaining large-scale systems.
Original language | English (US) |
---|---|
Title of host publication | Practice and Experience in Advanced Research Computing 2018 |
Subtitle of host publication | Seamless Creativity, PEARC 2018 |
Publisher | Association for Computing Machinery |
ISBN (Print) | 9781450364461 |
DOIs | |
State | Published - Jul 22 2018 |
Event | 2018 Practice and Experience in Advanced Research Computing Conference: Seamless Creativity, PEARC 2018 - Pittsburgh, United States Duration: Jul 22 2017 → Jul 26 2017 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Other
Other | 2018 Practice and Experience in Advanced Research Computing Conference: Seamless Creativity, PEARC 2018 |
---|---|
Country/Territory | United States |
City | Pittsburgh |
Period | 7/22/17 → 7/26/17 |
Bibliographical note
Publisher Copyright:© 2018 Copyright held by the owner/author(s).
Keywords
- ACM proceedings
- Cloud Infrastructure
- Controlled-Access Data
- Cost-Recovery
- Lessons Learned
- OpenStack