Chaim Rand · AI Model Optimization on AWS Inferentia and Trainium · Tips for Accelerating ML with AWS Neuron SDK · 1d ago
Chaim Rand · Implementing Sequential Algorithms on TPU · Accelerating AI/ML Model Training with Custom Operators — Part 3.A · Oct 7
Chaim Rand in Towards Data Science · The Rise of Pallas: Unlocking TPU Potential with Custom Kernels · Accelerating AI/ML Model Training with Custom Operators — Part 3 · Oct 6
Chaim Rand in Towards Data Science · Unleashing the Power of Triton: Mastering GPU Kernel Optimization in Python · Accelerating AI/ML Model Training with Custom Operators — Part 2 · Aug 13
Chaim Rand in Towards Data Science · Accelerating AI/ML Model Training with Custom Operators · On the potential benefits of creating model-specific GPU kernels and their application to optimizing the use of dynamically shaped tensors · Aug 11
Chaim Rand in Towards Data Science · PyTorch Native FP8 · Accelerating PyTorch Training Workloads with FP8 — Part 2 · May 21
Chaim Rand · A Priority Based Scheduler for Amazon SageMaker Training Jobs · Optimizing the use of limited AI training accelerators — Part 2 · Mar 8