Created by: ArmenAg
Patch Description
Implements the multi-modal causal masked language modeling objective. This implementation supports image/speech tokens.
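As a rough illustration of what a causal masked objective does to a token sequence, here is a minimal sketch. It assumes the usual formulation, in which selected spans are replaced by sentinel tokens and moved to the end of the sequence so that a left-to-right model can still condition on context from both sides. The function name `causal_mask`, the `<mask:i>` sentinel format, and the `<img:N>` token strings are hypothetical and are not taken from this patch.

```python
from typing import List, Sequence, Tuple


def causal_mask(
    tokens: Sequence[str],
    spans: Sequence[Tuple[int, int]],
    mask_fmt: str = "<mask:{}>",  # hypothetical sentinel format
) -> List[str]:
    """Replace each [start, end) span with a sentinel and append the
    sentinel plus the original span contents at the end of the sequence."""
    body: List[str] = []  # sequence with spans replaced by sentinels
    tail: List[str] = []  # sentinels followed by the spans they replaced
    cursor = 0
    for i, (start, end) in enumerate(sorted(spans)):
        if start < cursor or end > len(tokens) or start >= end:
            raise ValueError("spans must be non-empty, in range, and non-overlapping")
        sentinel = mask_fmt.format(i)
        body.extend(tokens[cursor:start])
        body.append(sentinel)
        tail.append(sentinel)
        tail.extend(tokens[start:end])
        cursor = end
    body.extend(tokens[cursor:])
    return body + tail


# Text and (hypothetical) image tokens share one sequence; the image span is masked.
example = causal_mask(["a", "<img:17>", "<img:4>", "b", "c"], [(1, 3)])
print(example)
# ['a', '<mask:0>', 'b', 'c', '<mask:0>', '<img:17>', '<img:4>']
```

The training loss would then be ordinary next-token prediction over the rearranged sequence. Because the transform only moves discrete tokens around, it treats text, image, and speech tokens the same way.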