So here is my video:
I am satisfied with what I accomplished, and I got a lot more familiar with choosing the number of threads and debugging CUDA.
There are a few things I would like to do once I get a better computer:
- Move the velocity advection onto the GPU
- Run the whole simulation on the GPU so there is no data going back and forth between the CPU and GPU
- Make it look good (i.e. a better renderer)
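
For the first two items, here is a rough sketch of what it could look like. This is not code from my project: it assumes a 2D `n` x `n` grid stored row-major, a semi-Lagrangian advection step, and clamped boundaries. All the names (`advectVelocity`, `sampleBilinear`, `u0`/`u1`, and so on) are made up for illustration. The idea is that the buffers are allocated once on the device, each step just swaps pointers, and nothing is copied back to the CPU until the end.

```cuda
#include <cstdio>
#include <utility>
#include <vector>
#include <cuda_runtime.h>

// Bilinear lookup into an n x n row-major field, clamped to the grid.
// (Hypothetical helper; the real boundary handling may differ.)
__device__ float sampleBilinear(const float* f, int n, float x, float y) {
    x = fminf(fmaxf(x, 0.0f), n - 1.0f);
    y = fminf(fmaxf(y, 0.0f), n - 1.0f);
    int x0 = (int)x, y0 = (int)y;
    int x1 = min(x0 + 1, n - 1), y1 = min(y0 + 1, n - 1);
    float tx = x - x0, ty = y - y0;
    float a = f[y0 * n + x0] * (1.0f - tx) + f[y0 * n + x1] * tx;
    float b = f[y1 * n + x0] * (1.0f - tx) + f[y1 * n + x1] * tx;
    return a * (1.0f - ty) + b * ty;
}

// One thread per cell: trace back along the velocity and sample the old field.
__global__ void advectVelocity(const float* uIn, const float* vIn,
                               float* uOut, float* vOut, int n, float dt) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    int j = blockIdx.y * blockDim.y + threadIdx.y;
    if (i >= n || j >= n) return;  // guard threads that fall outside the grid
    int idx = j * n + i;
    float px = i - dt * uIn[idx];
    float py = j - dt * vIn[idx];
    uOut[idx] = sampleBilinear(uIn, n, px, py);
    vOut[idx] = sampleBilinear(vIn, n, px, py);
}

int main() {
    const int n = 256, steps = 100;   // assumed sizes, for illustration only
    const float dt = 0.1f;
    const size_t bytes = (size_t)n * n * sizeof(float);
    std::vector<float> hostU(n * n, 1.0f), hostV(n * n, 0.0f);

    // Allocate both ping-pong buffers once; they stay on the GPU.
    float *u0, *v0, *u1, *v1;
    cudaMalloc(&u0, bytes); cudaMalloc(&v0, bytes);
    cudaMalloc(&u1, bytes); cudaMalloc(&v1, bytes);
    cudaMemcpy(u0, hostU.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(v0, hostV.data(), bytes, cudaMemcpyHostToDevice);

    // 16 x 16 = 256 threads per block; round the grid up to cover every cell.
    dim3 block(16, 16);
    dim3 grid((n + block.x - 1) / block.x, (n + block.y - 1) / block.y);

    for (int s = 0; s < steps; ++s) {
        advectVelocity<<<grid, block>>>(u0, v0, u1, v1, n, dt);
        std::swap(u0, u1);  // swap device pointers, no host copy
        std::swap(v0, v1);
    }

    // Single copy back at the end; this also surfaces any kernel error.
    cudaError_t err = cudaMemcpy(hostU.data(), u0, bytes, cudaMemcpyDeviceToHost);
    printf("status: %s, u[0] = %f\n", cudaGetErrorString(err), hostU[0]);

    cudaFree(u0); cudaFree(v0); cudaFree(u1); cudaFree(v1);
    return err == cudaSuccess ? 0 : 1;
}
```

The 16 x 16 block size is just a common starting point, not something I tuned. In a real version the other simulation steps would be kernels in the same loop, and the renderer would ideally read the device buffer directly instead of going through `cudaMemcpy`.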