Created by: lilisierrayu
Patch Description Make the inference task's tokenizer match the one used by cm3_streaming. Note that the inference task is largely redundant with cm3_streaming. The Zetta team is working on unifying the two, and we will adopt their change once it is done.
Testing steps Describe how you tested your changes