BAGEL AI icon
BAGEL AI icon

BAGEL AI

We present BAGEL, an open-source multimodal foundation model with 7B active parameters (14B total) trained on large-scale interleaved multimodal data. BAGEL outperforms the current top-tier open-source VLMs like Qwen2.5-VL and InternVL-2.

BAGEL AI screenshot 1

Cost / License

  • Free
  • Open Source

Platforms

  • Self-Hosted
  • Python
-
No reviews
0likes
0comments
0news articles

Features

Suggest and vote on features
  1.  Text to Image Generation
  2.  AI Writing
  3.  AI-Powered

 Tags

  • multimodal
  • ai-model

BAGEL AI News & Activities

Highlights All activities

Recent activities

Show all activities

BAGEL AI information

  • Developed by

    CN flagByteDance
  • Licensing

    Open Source (Apache-2.0) and Free product.
  • Written in

  • Alternatives

    53 alternatives listed
  • Supported Languages

    • English

AlternativeTo Category

AI Tools & Services

GitHub repository

  •  5,500 Stars
  •  481 Forks
  •  142 Open Issues
  •   Updated  
View on GitHub
BAGEL AI was added to AlternativeTo by Paul on and this page was last updated .
No comments or reviews, maybe you want to be first?
Post comment/review

What is BAGEL AI?

We present BAGEL, an open-source multimodal foundation model with 7B active parameters (14B total) trained on large-scale interleaved multimodal data. BAGEL outperforms the current top-tier open-source VLMs like Qwen2.5-VL and InternVL-2.5 on standard multimodal understanding leaderboards, and delivers text-to-image quality that is competitive with strong specialist generators such as SD3. Moreover, BAGEL demonstrates superior qualitative results in classical image-editing scenarios than the leading open-source models. More importantly, it extends to free-form visual manipulation, multiview synthesis, and world navigation, capabilities that constitute "world-modeling" tasks beyond the scope of previous image-editing models. The figure below showcases BAGEL's qualitative performance.

Official Links