Skip to main content
Dario Salvati

Dario Salvati

A surgeon for Machine Learning models.

Recent

Dissecting torch.compile: Surgical Precision in PyTorch Optimization
23 mins
Torch-Compile Compiler
A quick incision: ten minutes to RAG
8 mins
Rag Llm Vector-Db
Performing Kernel Surgery: Profiling CUDA Kernels with NVIDIA Nsight Compute
9 mins
Cuda Profiling Optimization
A Machine Learning Surgeon’s Toolkit: Advanced Matrix Multiplication in CUDA
16 mins
Cuda Gpu Optimization Matrix-Multiplication
Cerebral Cortex and Hippocampus: Understanding the Computational and Memory Design of GPUs
13 mins
Gpu Architecture
Hello CUDA: A Surgical Dissection
13 mins
Gpu-Programming Cuda Basics