You can use the GPU profiling capabilities when working with NVIDIA CUDA programs.
See also Accelerator metrics.