Flexible Performant GEMM Kernels on GPUs in Native Julia

General Matrix Multiplication or GEMM kernels take center place in high performance
computing and machine learning. Recent NVIDIA GPUs include GEMM accelerators, such as
NVIDIA’s Tensor Cores. In this paper we show how it is possible to program these
acce… Read more

Read full article

Similar

AlgebraicJulia: Applied Category Theory in Julia

Applied Category Theory is a new paradigm of applied mathematics that incorporates the advances in type theory to analyze scientific and engineering systems.... (more…)

Julia Parser

The Julia Language: A fresh approach to technical computing. - JuliaLang/julia... (more…)

A mixed experience with Julia and saving data frames

A mixed experience with Julia and saving data frames... (more…)

JuMPing at Gcd, with Julia

Recently, I was teaching my kids how to compute gcd(Greatest Common Divisor). Instead of just teaching the mechanics of calculation, I wanted to show them some interesting properties of gcd. (more…)

Some CUDA programming fun with Julia

Suppose we want to draw a batch of images, where each image is made up of randomly positioned and colored triangles, that are blending. It will look like this: (more…)

Latest News

Booming AI demand threatens global electricity supply

APU: Agent Processing Unit-Democratizing High-Speed AI with an Open Source Chip

Python trading bot using the Solana blockchain