Created by: stephenroller
Patch Description There's a known bug where training on bf16 and then requeuing will result in the model temporarily being converted to fp16, causing potential noise issues. This avoids this conversion.
This likely explains why some users are seeing some loss explosions after they requeue.
Testing steps Launched the API and manually checked the code path taken.