Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
Qwen3.7-Plus
Model family: qwen3-7
Qwen3.7-Plus — the closed, multimodalA model that can handle more than one type of input or output — typically text plus images, sometimes plus audio or video. "GPT-4 Vision" and "Llama 3.2 11B Vision" are multimodal models that accept both text and images. A text-only model is called "unimodal" but nobody uses that term; text-only is the assumed default. sibling of the Max flagship; vision input, 1M context, API-only, proprietary.
Identity
- Creator
- Qwen
- Model family
- qwen3-7
- Release date
- 2026-05-19
Technical specs
- Parameter count
- The closed, multimodal sibling of Qwen3.7-Max — adds image and video input. API-only on DashScope; no downloadable weights.
- Context window
- 1M tokens
- Modalities
- Image Input
- Text
- Video Input
- Primary capabilities
- Chat
- Function Calling
- Instruction Following
- Long Context
- Reasoning
- Tool Use
- Vision
License
- License
- Qwen Proprietary (Alibaba Cloud)
- Commercial use
- Allowed
- Terms
- Modification ✗
- Redistribution ✗
- Attribution ✗
Access
- Openness
- Closed Api
- Access methods
- Api First Party
- Api Third Party
- Cost tier
- Paid Api
- llm
- closed-api
- frontier
- multimodal
- long-context
- china-based
- proprietary