Building on Data Best Practices: Basics, this one-week course covers advanced topics in reproducible, portable, and scalable scientific computing. Students will containerize software and environments with Podman/Docker; parallelize analyses locally with GNU Parallel and orchestrate cloud-based, asynchronous parallelization with Google Cloud Pub/Sub; and implement CI/CD in Stanford GitLab, using runners to automate scientific data preprocessing and analysis pipelines. It is recommended that students take Data Best Practices: Basics prior to enrolling in this course. Students with equivalent experience may also enroll.
- Dates: 05/27/2026, 05/29/2026, 06/01/2026
- Time: Wed, Fri, Mon 1:30 PM - 4:20 PM
- Units: 1
- Link: ExploreCourses
- Dates: 03/10/2026, 03/12/2026, 03/13/2026
- Time: Tu, Thu, Fri 1:30 PM - 4:20 PM
- Units: 1
- Link: ExploreCourses