GPU Programming with OpenACC Training Course

OpenACC is an open standard for heterogeneous programming that enables code to run across various platforms and devices, such as multicore CPUs, GPUs, FPGAs, and others.

This instructor-led live training (available online or onsite) targets beginner to intermediate-level developers looking to leverage OpenACC for programming heterogeneous devices and harnessing their parallel computing power.

By the end of this training, participants will be able to:

Set up an OpenACC development environment.
Write and execute a basic OpenACC program.
Annotate code with OpenACC directives and clauses.
Utilize OpenACC APIs and libraries.
Profile, debug, and optimize OpenACC programs.

Format of the Course

Interactive lecture and discussion.
Plenty of exercises and practice sessions.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request a customized training for this course, please contact us to arrange it.

This course is available as onsite live training in Netherlands or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

Introduction

What is OpenACC?
Comparison of OpenACC vs OpenCL vs CUDA vs SYCL
Overview of OpenACC features and architecture
Setting up the development environment

Getting Started

Creating an OpenACC project in Visual Studio Code
Exploring project structure and files
Compiling and running the program
Displaying output with printf and fprintf

OpenACC Directives and Clauses

Understanding OpenACC directives and clauses
Using parallel directives to create parallel regions
Using kernels directives for compiler-managed parallelism
Using loop directives to parallelize loops
Managing data movement with data directives
Synchronizing data with update directives
Improving data reuse with cache directives
Creating device functions with routine directives
Synchronizing events with wait directives

OpenACC API

Understanding the role of the OpenACC API
Querying device information and capabilities
Setting device number and type
Handling errors and exceptions
Creating and synchronizing events

OpenACC Libraries and Interoperability

Understanding OpenACC libraries and interoperability
Using math, random, and complex libraries
Integrating with other models (CUDA, OpenMP, MPI)
Integrating with GPU libraries (cuBLAS, cuFFT)

OpenACC Tools

Understanding OpenACC tools in development
Profiling and debugging OpenACC programs
Performance analysis with PGI Compiler, NVIDIA Nsight Systems, Allinea Forge

Optimization

Factors affecting OpenACC program performance
Optimizing data locality and reducing transfers
Optimizing loop parallelism and fusion
Optimizing kernel parallelism and fusion
Optimizing vectorization and auto-tuning

Summary and Next Steps

Requirements

A solid understanding of C/C++ or Fortran programming languages and parallel computing concepts.
Basic knowledge of computer architecture and memory hierarchy.
Experience with command-line tools and code editors.

Audience

Developers who want to learn how to use OpenACC for programming heterogeneous devices and leveraging their parallelism.
Developers seeking to write portable and scalable code that runs on different platforms and devices.
Programmers interested in exploring the high-level aspects of heterogeneous programming and enhancing their coding productivity.

28 Hours

Custom Corporate Training

Training solutions designed exclusively for businesses.

Customized Content: We adapt the syllabus and practical exercises to the real goals and needs of your project.
Flexible Schedule: Dates and times adapted to your team's agenda.
Format: Online (live), In-company (at your offices), or Hybrid.

Investment

Price per private group, online live training, starting from 6400 € + VAT*

(*The final price may vary depending on the technical specialization of the course, the level of customization, the method of delivery and the number of learners)

Need help picking the right course?
opleidingen@nobleprog.com or +31 208 080 666

GPU Programming with OpenACC Training Course

Course Outline

Requirements

Custom Corporate Training

Provisional Upcoming Courses (Contact Us For More Information)

GPU Programming with OpenACC

GPU Programming with OpenACC

GPU Programming with OpenACC

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

GPU Programming with OpenACC Training Course

Course Outline

Requirements

Custom Corporate Training

Provisional Upcoming Courses (Contact Us For More Information)

GPU Programming with OpenACC

GPU Programming with OpenACC

GPU Programming with OpenACC

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites