CS475 Lab 6: Introduction to GPU and CUDA

This lab session is intended to learn about the various GPUs installed in the CS lab machines and to use NVIDIA tool to find out the various features

1. Details of the GPUs on the CS machines machines list

Log into 3 different CS machine: 1 of the capital machines, and 2 in the hpc-lab. (get the list from the and do one of each color on this slide).
1. ssh to a hpc-lab machine and run the command /sbin/lshw
2. This will list the details of all the hardware details
3. Check the GPUs listed and record their details. When you are done with this you should have a list of 3 different GPUs. Note that each machine has more than one GPU and you have to choose which one is CUDA capable.

Check the features

Use the NVIDIA built in tool to check the features of the installed GPUs
Copy the directory /usr/local/cuda-7.5/samples/1_Utilities/deviceQuery into your own work space
Type the command "make" which will compile the .cpp file
The make command is going to fail

You need to fix the Makefile by changing 3 parts of the file:

INCLUDES  := -I../../common/inc
to
INCLUDES  := -I/usr/local/cuda-7.5/samples/common/inc

$(EXEC) mkdir -p ../../bin/$(TARGET_ARCH)/$(TARGET_OS)/$(BUILD_TYPE)
$(EXEC) cp $@ ../../bin/$(TARGET_ARCH)/$(TARGET_OS)/$(BUILD_TYPE)
to
$(EXEC) mkdir -p bin/$(TARGET_ARCH)/$(TARGET_OS)/$(BUILD_TYPE)
$(EXEC) cp $@ bin/$(TARGET_ARCH)/$(TARGET_OS)/$(BUILD_TYPE)

rm -rf ../../bin/$(TARGET_ARCH)/$(TARGET_OS)/$(BUILD_TYPE)/deviceQuery
to
rm -rf bin/$(TARGET_ARCH)/$(TARGET_OS)/$(BUILD_TYPE)/deviceQuery

Run deviceQuery

Machine Description
Prepare the machine description report(for future reference) consisting of the below fields for all the different GPUs installed on the machines along with a brief description of each host machine.
1. Cuda Cores
2. Clock Speed
3. Memory Clock Speed
4. Total amount of shared memory per block
5. L2 Cache Size
6. CUDA Capability level

2. Run Hello World in Cuda!

Run the given HelloWorld.cu program in Cuda
Modify the number of Threads and Blocks and observe the differences
Uncomment the if statemnet in the kernel code and run the program again

Instructions to run the program

Make sure you have cuda in your path and library load path.

echo $path
 ... /usr/local/cuda-7.5/bin

echo $LD_LIBRARY_PATH
 ... /usr/local/cuda-7.5/lib64

To set the environment variable in Bash terminals, use the following commands

export PATH=/usr/local/cuda-7.5/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda-7.5/lib64:$LD_LIBRARY_PATH

You can compile using the below command:
nvcc HelloWorld.cu -o hello
Run it with the command: ./hello to get the output
You can alternatively use a makefile to compile it