Recent
From Sequential to Parallel: Your Journey into GPU Programming with Triton
11 mins
GPU-Programming
Triton
Basics
The Transformer's Anatomy: A Deep Dive into the Architecture that Revolutionized Machine Learning
17 mins
Transformers
DL
Move Fast or Die Slow
6 mins
Strategy
Business
The Machine Learning Surgeon's Guide to Quantization: Precision Cuts for Smarter Models
26 mins
Quantization
Inference
Optimization
The Operating Room Setup
9 mins
Setup
CUDA
Cpp
LibTorch
Dissecting torch.compile: Surgical Precision in PyTorch Optimization
23 mins
Torch-Compile
Compiler