Convert Vocabulary
Before starting the conversion, make sure Python 3.8 or later is installed, then install the required packages.
Follow these steps to convert a vocabulary to the mllm vocabulary format. Two tokenizer types are currently supported: Unigram and BPE.
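
Before running the conversion, it can help to confirm both prerequisites: the Python version and the tokenizer type. The sketch below is not part of the mllm tooling. It assumes the source vocabulary is a Hugging Face style `tokenizer.json` whose `model.type` field names the tokenizer, and the helper names `python_is_supported` and `detect_tokenizer_type` are hypothetical.

```python
import json
import sys
from pathlib import Path

# The two tokenizer types this document lists as supported.
SUPPORTED_TYPES = {"Unigram", "BPE"}


def python_is_supported(version_info=sys.version_info):
    """Return True when the interpreter meets the 3.8+ requirement."""
    return tuple(version_info[:2]) >= (3, 8)


def detect_tokenizer_type(tokenizer_json_path):
    """Return the tokenizer type recorded in a tokenizer.json file.

    Assumption: the file follows the Hugging Face layout, where the
    type is stored under the "model" -> "type" key.
    """
    data = json.loads(Path(tokenizer_json_path).read_text(encoding="utf-8"))
    model_type = (data.get("model") or {}).get("type")
    if model_type not in SUPPORTED_TYPES:
        raise ValueError(
            f"Unsupported tokenizer type {model_type!r}; "
            f"expected one of {sorted(SUPPORTED_TYPES)}"
        )
    return model_type


if __name__ == "__main__":
    if not python_is_supported():
        sys.exit("Python 3.8 or later is required.")
    if len(sys.argv) != 2:
        sys.exit("usage: python check_vocab.py path/to/tokenizer.json")
    print(detect_tokenizer_type(sys.argv[1]))
```

Saved under a name such as `check_vocab.py` (also hypothetical), the script prints `Unigram` or `BPE` for a supported file and raises an error for anything else, which shows which tokenizer type applies to the conversion.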