LLM

Qwen3 8B

Alibaba · 8B · 256k ctx

Uses the same recipe surface as Llama 3.1 8B, but with a 256k context window out of the box. A good fit for fine-tunes that replace retrieval or answer questions over long documents.

Quickstart

Launch a Qwen3 8B fine-tune

~ fourbit

$ fourbit train \
  --base qwen3-8b \
  --data s3://acme/dataset.jsonl \
  --recipe lora
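
The quickstart passes a `.jsonl` file to `--data`, but this page does not say what each record should contain. As a rough sketch, assuming one JSON object per line with hypothetical `prompt` and `completion` fields (check the recipe for the actual schema), a dataset file could be written and sanity-checked locally like this before uploading:

```python
import json
from pathlib import Path


def write_jsonl(records, path):
    """Write one JSON object per line; return the number of records written."""
    count = 0
    with Path(path).open("w", encoding="utf-8") as f:
        for record in records:
            # json.dumps escapes embedded newlines, so each record stays on one line.
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
            count += 1
    return count


def read_jsonl(path):
    """Parse the file back, raising ValueError with the line number on bad JSON."""
    rows = []
    with Path(path).open(encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if not line.strip():
                continue  # tolerate blank lines
            try:
                rows.append(json.loads(line))
            except json.JSONDecodeError as exc:
                raise ValueError(f"line {lineno}: {exc}") from exc
    return rows


if __name__ == "__main__":
    # Field names below are assumptions for illustration, not a documented schema.
    examples = [
        {"prompt": "Summarize the attached contract.", "completion": "The contract covers ..."},
        {"prompt": "What is the notice period?", "completion": "Thirty days."},
    ]
    n = write_jsonl(examples, "dataset.jsonl")
    assert len(read_jsonl("dataset.jsonl")) == n
```

Reading the file back before upload catches truncated or malformed lines early, which is cheaper than discovering them after a training job has started.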

More LLMs

Llama 3.3 70B · 70B
Frontier open-weights generalist. Best-in-class for instruction-tuning at the 70B tier.

Qwen3 72B · 72B
Strong multilingual generalist with long context. Excellent for non-English and code tasks.

Llama 3.1 8B · 8B
Workhorse 8B. Cheap to tune, fast to serve, ships on one GPU.