HPC & Data Science Support at CBS¶
This is the GitHub repository for HPC & Data Science Support at CBS. The team is dedicated to providing assistance and support to CBS researchers and students in their research utilizing the different HPC systems available at CBS. The repository contains various resources and information related to our services and activities.
Activities¶
The HPC & Da Science Support team provides tutorials and user utilities to assist and support CBS researchers and students in their research utilizing the different HPC systems available at CBS. The tutorials and utilities covering various topics such as:
- Use cases for different HPC systems
- Efficient and secure data tranfer
- parallel computing
- environment management (e.g Conda)
These tutorials offer step-by-step guidance, empowering users to effectively utilize HPC resources for their research.
We conducts teaching activities through researcher and student webinars. Titles include "High Performance Computing", "HPC & Parallel Programming in R," and "HPC & Parallel Programming in Python" with more in the pipeline.
See "Events" section for more information.
Upon receiving requests from course coordinators, we are also available to participate in teaching activities for courses at CBS.
We are committed to the continuous development of HPC resources at CBS. This is both by ensuring that researchers have access to the right facilities, both short- and long-term, but also by providing a clear learning strategy for research to develop their HPC & data science skillset.
As Deic Front Office at CBS are we in charge off all communications with HPC system adminstrators (Back Office) and DeiC.
Ideally, all user requests and troubleshooting should be send to the CBS Front Office(rdm@cbs.dk) as a Single Point of Contact (SPOC) where resulting tickets will be directed accordingly.
This setup provides a better service to users and saves valuable time for Back Office technicians who can concentrate on highly technical issues.
We provide consulting services to researchers and research projects, assisting them with their HPC requirements. Our support includes, but is not limited to, the following examples:
- HPC grant application guidance
- Assessing user needs for HPC resources
- Workflow and code optimization assistance
By offering expert consultation, we help researchers identify and address their specific HPC needs, ensuring they can effectively utilize the available resources and optimize their workflows and code for maximum performance and efficiency.
Database Tools¶
HPC/UCloud Tutorials¶
STATA
SAS
Matlab
- Which GPU to Choose?
- Access GPUs on UCloud
- GPU Libraries for Python and R
- Conda: for easy workflow deployment on AAU GPU VMs
- Run Python and R jupyter notebooks on AAU VMs
- Setting up jupyter-notebook with GPUs on AAU using Docker images (Hosted by RUC)
- Pytorch: Train your deep-learning models on UCloud GPUs
- Tensorflow: Train your deep-learning models on UCloud GPUs
- RAPIDS-cuML: Train your Scikit-learn models on UCloud GPUs
- RAPIDS-cuDF: How To Speed Up Pandas in Python By 150x
Data Science Links¶
- The Turing Way - Guide for Reproducible Research in Data Science
- Cookiecutter Data Science - Reproducible Project Structure
- The Turing Way - Introduction to version control with Git
- Coderefinery - Introduction to version control with Git
- How to Create a Conda Environment Based on a YAML File: A Guide for Data Scientists
HPC Operational Status¶
Planned Maintenance