Upload

Loading icon Loading...

This video is unavailable.

An introduction to OpenACC (Part 3); Alistair Hart (Cray)

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to like cscsch's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to dislike cscsch's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to add cscsch's video to your playlist.

Published on Dec 18, 2013

I first describe some extra OpenACC features (the "update" directive, array sectioning and "present" clauses). I also discuss race conditions and how to avoid them, as well as a few other "gotchas" that can lead to incorrect results. I then cover performance tuning using the "gang", "worker" and "vector" clauses as well as "collapse" and "cache". I've got a few examples to show how much performance we might gain through tuning, first with simple kernels and then with the Himeno benchmark. Finally, I discuss asynchronicity and how to handle streams of tasks and dependency trees with OpenACC.

Programming for GPUs Course: Introduction to OpenACC 2.0 & CUDA 5.5 - December 4-6, 2013

Loading icon Loading...

Loading icon Loading...

Loading icon Loading...

The interactive transcript could not be loaded.

Loading icon Loading...

Loading icon Loading...

Ratings have been disabled for this video.
Rating is available when the video has been rented.
This feature is not available right now. Please try again later.

Loading icon Loading...

Loading...
Working...
to add this to Watch Later

Add to