Category: Parallel Computing

N body simulation in CUDA

Hey there!

I found this code in GitHub which solves N-body problem using traditional Newtonian gravitational equations. The repository owner, pchapin, has already tried various parallelizing methods like – pthreads, OpenMP, MPI, and CUDA.

While going through the whole programs and running it for different inputs. I discovered that there few of spots for improvement for the CUDA code. So I compiled the CUDA code with nvcc and ran it on nvprof.

Continue reading “N body simulation in CUDA”

Tensorflow – A Parallel perspective

Hey there!

A few days ago, I started learning about tensorflow, it is the google’s new open source library for doing various machine learning.

One of the reasons I wanted to try tensorflow was – I heard that tensorflow is heavily parallelized, so I just became curious to know how people do it. Tensorflow can also run on GPU which made me more curious.

Continue reading “Tensorflow – A Parallel perspective”