Verify critical details (pricing, licensing, availability) with the model's source before making business decisions.

BAGEL-7B-MoT

Model family: bagel

An open-weight "unified" multimodal model: a single model that understands text, images, and video and generates text and images, released under the permissive Apache 2.0 license. Useful as a single self-hostable building block for mixed-media tasks.

Identity

Creator: ByteDance
Model family: bagel
Release date: 2025-05-19

Technical specs

Parameter count

7B active (14B total)

Context window

33K tokens

Modalities

Image Input
Image Output
Text
Video Input

Primary capabilities

Chat
Image Generation
Instruction Following
Reasoning
Vision

License

License

Apache License 2.0

Commercial use

Allowed

Terms

Modification ✓
Redistribution ✓
Attribution ✓

Access

Openness

Open Weight

Access methods

Local runtime (vLLM)
Weights download (Hugging Face)
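
For the weights-download route listed above, a minimal sketch of fetching the model for self-hosting might look like the following. It uses the `snapshot_download` function from the `huggingface_hub` package. The repository id `ByteDance-Seed/BAGEL-7B-MoT` and the target directory are assumptions not stated on this page, so confirm the id on Hugging Face before relying on it.

```python
"""Sketch: fetch BAGEL-7B-MoT weights from Hugging Face for self-hosting."""
from pathlib import Path

# Assumption: this repo id is not stated on this page; confirm it on Hugging Face.
REPO_ID = "ByteDance-Seed/BAGEL-7B-MoT"


def download_bagel_weights(target_dir: str = "models/BAGEL-7B-MoT") -> Path:
    """Download a full snapshot of the model repo and return its local path."""
    # Imported lazily so this module loads even if huggingface_hub is absent.
    from huggingface_hub import snapshot_download

    # target_dir is a hypothetical location; any writable directory works.
    local_path = snapshot_download(repo_id=REPO_ID, local_dir=target_dir)
    return Path(local_path)


if __name__ == "__main__":
    print(f"Weights saved to {download_bagel_weights()}")
```

Running the script requires `pip install huggingface_hub` and enough free disk space for the full checkpoint. Serving the downloaded weights through a local runtime is a separate step and is not covered by this sketch.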

Cost tier

Self-hosted only

open-weight
multimodal
image-generation
vision
small
china-based
apache-2-0