Colorado State University Cuda Exercise one: add vector and coalescing

The goal of the exercise is to build on your last touch vector program (the one with a 2D grid of 2D thread blocks) to add two vectors element wise: (forall i in 0 to N-1 C[i]=A[i]+B[i]), in two ways

and the difference in performance. Let's discuss

Here is a Makefile and an incomplete cuda program to get started with.