https://github.com/pytorch/pytorch/issues/140229
Factor 3-4 speedup when compiled vs not compiled.
Mismatch stride issue on torch 2.7.0, downgrade to v2.5.1 and apply patch manually
https://github.com/pytorch/pytorch/issues/140229
Factor 3-4 speedup when compiled vs not compiled.
Mismatch stride issue on torch 2.7.0, downgrade to v2.5.1 and apply patch manually