Skip to main content

Playground

An ultra-fast Multimodal Thinking model with a 256K context. It is optimized for real-time visual and text understanding in high-throughput applications.

Key Features

  • Ultra-Fast Latency: Optimized for speed, making it suitable for real-time interactive applications and chat.
  • 256K Context: Retains the large context window while significantly increasing inference speed.