Just released a version which works on GPUs too!
Getting some nice speedups: Training a Transformer (CPU and GPU) Tutorial - Nabla
Just released a version which works on GPUs too!
Getting some nice speedups: Training a Transformer (CPU and GPU) Tutorial - Nabla