VLM
Qwen2.5-VL 7B
Alibaba · 7B · Image · video · OCR
The everyday VLM. Tunes fast, serves cheaply, and handles screenshots, forms, and chart QA out of the box.
Quickstart
Launch a Qwen2.5-VL 7B fine-tune
~ fourbit
$ fourbit train \
--base qwen2-5-vl-7b \
--data s3://acme/dataset.jsonl \
--recipe lora