Mar 13, 2025 at 8:21 PM

Cohere AI launches Command A: high-performance, cost-efficient enterprise AI model

Cohere (AI) has released Command A, its latest enterprise AI model, designed to deliver high performance with minimal compute requirements. Cohere positions Command A against models such as GPT-4o and DeepSeek-V3, with particular strength in business, STEM, and coding tasks and greater efficiency. It runs on just two GPUs (A100 or H100), compared with the 32 GPUs that some competitors require, which significantly reduces infrastructure costs.

The model generates 156 tokens per second, which Cohere says makes it 1.75 times faster than GPT-4o and 2.4 times faster than DeepSeek-V3, with a time-to-first-token latency of 6,500 ms. It supports a context length of 256,000 tokens, double that of its predecessor, Command R, allowing for larger inputs. Command A is also multilingual, supporting 23 global languages with improved Arabic dialect matching.
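The speed figures above can be turned into rough numbers with a little arithmetic. The sketch below is illustrative only: it assumes "N times faster" means N times the token throughput, and it models response time as time-to-first-token plus decoding at a constant rate, which is a simplification. The function names are hypothetical and the constants are the vendor-reported figures quoted in this article, not measurements.

```python
# Figures quoted in the article (vendor-reported, not measured here).
COMMAND_A_TOKENS_PER_SEC = 156.0
COMMAND_A_TTFT_MS = 6500.0
SPEEDUP_VS_GPT4O = 1.75
SPEEDUP_VS_DEEPSEEK_V3 = 2.4


def implied_throughput(reference_tps: float, speedup: float) -> float:
    """Throughput of the slower model implied by an 'N times faster' claim.

    Assumption: 'N times faster' means N times the tokens per second.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return reference_tps / speedup


def estimated_response_seconds(
    output_tokens: int, tokens_per_sec: float, ttft_ms: float
) -> float:
    """Rough end-to-end time: time-to-first-token plus constant-rate decoding.

    Assumption: decoding speed is constant, which real deployments only
    approximate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_sec


if __name__ == "__main__":
    gpt4o_tps = implied_throughput(COMMAND_A_TOKENS_PER_SEC, SPEEDUP_VS_GPT4O)
    deepseek_tps = implied_throughput(
        COMMAND_A_TOKENS_PER_SEC, SPEEDUP_VS_DEEPSEEK_V3
    )
    print(f"Implied GPT-4o throughput: {gpt4o_tps:.1f} tokens/s")
    print(f"Implied DeepSeek-V3 throughput: {deepseek_tps:.1f} tokens/s")
    seconds = estimated_response_seconds(
        780, COMMAND_A_TOKENS_PER_SEC, COMMAND_A_TTFT_MS
    )
    print(f"Estimated time for a 780-token reply: {seconds:.1f} s")
```

Under these assumptions, the quoted ratios imply roughly 89 tokens per second for GPT-4o and 65 for DeepSeek-V3. The comparison depends entirely on the benchmark conditions behind the published figures.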

Command A is integrated with Cohere’s North AI platform and supports retrieval-augmented generation (RAG) and agentic tool use, in line with the company's enterprise-first strategy. Private deployments are said to be up to 50% cheaper than API-based access. The model is available on Cohere’s platform and, for research use, on Hugging Face under a CC-BY-NC 4.0 license. Pricing is set at $2.50 per million input tokens and $10.00 per million output tokens.
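To make the per-token pricing concrete, here is a minimal sketch of the cost arithmetic using only the two rates quoted above. The function name and the example token counts are hypothetical. The "up to 50% cheaper" private deployment figure is not modeled, since the article gives it only as an upper bound.

```python
# Rates quoted in the article, in US dollars per million tokens.
INPUT_USD_PER_MILLION = 2.50
OUTPUT_USD_PER_MILLION = 10.00


def api_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of a workload at the quoted per-million-token rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    ) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
    print(f"${api_cost_usd(200_000, 50_000):.2f}")
```

Because output tokens cost four times as much as input tokens at these rates, a workload with long prompts and short answers is considerably cheaper than the reverse for the same total token count.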

Mar 13, 2025 by Mauricio B. Holguin

Cohere (AI)

Cohere (AI) is an API platform for integrating large language model (LLM) functionality, such as chat and text classification, into applications. It lets developers add advanced natural language processing capabilities to their software.

External links

Introducing Command A: Max performance, minimal compute
Cohere • Official source
Cohere targets global enterprises with new highly multilingual Command A model requiring only 2 GPUs
VentureBeat
Cohere says Command A model edges out LLM competition in speed and energy efficiency
BetaKit • Official source
