Talk about GPU computing with CUDA and OpenCL
In March 2017 I gave a talk about GPU computing with CUDA and OpenCL at the para//el 2017 conference in Heidelberg, Germany.
All posts tagged with #c++ in chronological order with newest first.
CUDA 8.0 on Mac OS X 10.12 gave me the following error:
In CUDA 8.0, C++11 lambdas can be used on the device. See the following code.
Every multiplayer gamer knows it: “Lag”. When you see an opponent too late or you are certain you hit them a hundred percent, but the game decides otherwise: the culprit...
The implementation of the heat diffusion equation on multiple GPUs is explained in the talk (in German).
During my talk at the parallel 2015 conference I was asked how one can measure traffic on the PCI Express bus. For multi-GPU computing it is very important to...
“C++ AMP: Accelerated Massive Parallelism with Microsoft Visual C++” is a good introduction to programming with C++ AMP. Unfortunately, at the time of this review, it is the only book...
I tested the parallel ray tracer on all NVIDIA cards I bought. This page contains the results.
I wanted to learn some frameworks for parallel computing, so I decided to implement a simple ray tracer as a "visual benchmark".
What makes “OpenCL in Action: How to Accelerate Graphics and Computation” special is that the author, Matthew Scarpino, truly tries to explain the programs. You learn the reasons behind his...
While browsing through “CUDA Fortran for Scientists and Engineers”, I noticed that the authors provide valuable optimization tips. That piqued my curiosity.
The book “GPU Computing Gems (Emerald Edition)” is a collection of 50 scientific articles about experiences using GPU computing in various subject areas.
I had already read many articles from the series “CUDA, Supercomputing for the Masses” by Rob Farber on the Dr. Dobb’s website and was quite impressed. Back then in 2009,...
With CUDA, it’s easy to speed up a calculation by a factor of 2-5. How to achieve higher speedups is explained in “CUDA Programming” by Shane Cook.
In “The CUDA Handbook”, Nicholas Wilt proves himself to be very competent and keen to experiment with kernel optimization. Advanced readers will find this book very beneficial.
“Using OpenMP” is a good introduction and contains a few good performance tips.
If you want to take a more systematic approach to parallel programming, “Structured Parallel Programming: Patterns for Efficient Computation” is the book to start with.
I had already read the second edition back in 1992. Since then, a lot has changed for the better, both with C++ and this “classic.”
In this small book, the creator of C++ provides an overview of the new features in C++11 and the STL.
"Massiv parallel - Grundlagen der GPU Programmierung"
"GPU-Programmierung in der Praxis mit CUDA"
“API Design for C++” is an excellent introduction to the implementation of APIs with C++.
What distinguishes “Programming Massively Parallel Processors: A Hands-On Approach” from other books is the precise description of how the hardware works.
The OpenCL standard is not always easy, and the technical proof was a challenge because the code had to run on different operating systems and platforms.
With the title “The Art of Concurrency”, high expectations are set that are not met. The subtitle “A Thread Monkey’s Guide to Writing Parallel Applications” is more fitting for the...
“Ray Tracing from the Ground Up” is an excellent book for beginners and advanced readers. You can tell that the author has spent many years working on the subject and...
I installed Snow Leopard on my laptop yesterday. I was very curious about OpenCL and installed the drivers and the GPU Computing SDK from NVIDIA.
I have a new computer. As always, I built it myself. How much faster is it?
I tested my ray tracer on an Intel Core i7 920 @ 2.67 GHz.
I tested my ray tracer on my MacBook Pro.
I wanted to learn about the Cell Broadband Engine in the PlayStation 3 and wrote a parallel ray tracer for it.