Abstract: We present performance results for dense linear algebra using recent NVIDIA GPUs. Our matrix-matrix multiply routine (GEMM) runs up to 60% faster than the vendor's implementation and ...