General Matrix Multiplication (GEMM) kernels play a central role in high-performance
computing and machine learning. Recent NVIDIA GPUs include GEMM accelerators, such as
NVIDIA’s Tensor Cores. In this paper we show how it is possible to program these
acce...
I'm switching to Julia after running this comparison of Julia against NumPy and Fortran for performance and code simplicity.
Improving the user experience of calling C libraries from Julia by using CBinding.jl to avoid the pain points.
"dplyr equivalent in Julia" might be one of the first things R users google when they try to learn Julia. However, not all of {tidyverse} is available in Julia, b...