Embarrassingly Parallel Reduction in CUDA19 February 2025·10 minsA step-by-step guide on turning simple math into a flex.
'Notes' of a Launch21 December 2024·Updated: 24 May 2025·3 minsHunting down music from SpaceX’s Starship Flight Test recaps.
PyTorch: I’m Fast, JAX: You Call That Fast?16 August 2024·7 minsA recipe to train Object Detection Transformers (really) fast.
Data Parallelism using standard Ethernet5 August 2024·8 minsBag-of-tricks for multi-node training for the GPU Poor.
Aster DM, Intel, CARPL collaborate for 'Secure Federated Learning Platform' ↗ ↖1 July 2022Redirects to an external URL.