Can Magic Castle beat our HPC facility? #151
rsignell-usgs
started this conversation in
General
Replies: 1 comment 1 reply
-
I would agree that this looks promising but there are is still some way to go before you get there
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm excited to find out about Magic Castle because I'm in a group at USGS that does a lot of coastal ocean modeling and we've been looking for a way to do end-to-end workflows on the Cloud. We've been using AWS No-Tears-Cluster (which layers on AWS Parallel Cluster), but we were hoping to use a more cloud-agnostic simulation approach to pair with the cloud-agnostic analysis and visualization approach Qhub (Terraform/Ansible deployment of JupyterHub with Dask Gateway on Kubernetes) we are using.
I know that Magic Castle was created for HPC training but the framework can also deploy infrastructure to support real-world applications. We were inspired to try real-world HPC on AWS after a great talk by Danny Arevalo where he demonstrated beating the NAVY HPC using the NAVGEMS global weather model using c5.18xlarge instances with EFA (If curious, watch one minute starting here: https://www.youtube.com/watch?v=GTHWf0OVGrw&t=715s).
For our own applications on AWS, we have found that EFA and FSx for Lustre are both necessary to beat our on-premise HPC facilities, and to compete on price we need to use Spot pricing.
Looks like the ability to create faster networking, faster disk and preemptible nodes are already in the works so this should all be possible eventually with Magic Castle, right?
Beta Was this translation helpful? Give feedback.
All reactions