Spokes.wiki Search Graph Growth About

llm-providers-wiki

Software Application source ↗ source url updated Mon Jun 15 2026 00:00:00 GMT+0000 (Coordinated Universal Time)

North Mini Code (Cohere)

Cohere’s first agentic coding model and the inaugural member of its next-generation North family, released 9 June 2026 under Apache 2.0 — an open-weight model, not a closed sovereign-only product. (cohere‘s own announcement + the Cohere Labs model card on Hugging Face.)

Architecture

A Mixture-of-Experts model: 30B total parameters, 3B active (128 experts, 8 activated per token). Interleaved sliding-window and full self-attention in a 3:1 ratio; SwiGLU FFN blocks with a sigmoid router. 256K context, 64K max generation. Minimum hardware is a single H100 at FP8 — small enough to run locally, which is how it serves the sovereignty pitch: own the weights and run them on-premise rather than route data to a shared cloud.

Benchmarks (vendor-reported)

Trained on 70%/61% code tokens (two SFT stages) over 70k+ verifiable tasks across ~5k repositories — positioned for code generation, agentic software engineering, and terminal tasks (agent orchestration, architecture mapping, code reviews).

Availability

Hugging Face (BF16 + FP8 quantized weights), the Cohere API, Model Vault (managed inference), OpenRouter, and OpenCode.

Why it matters

This revises the read on cohere: the lab is no longer only a closed-deployment sovereign player. North Mini Code is Apache-2.0 open-weight, so Cohere now achieves sovereignty through open weights and local deployment — putting it on the open-weight axis beside qwen, deepseek, and mistral-ai, not opposite it. The enterprise/sovereign positioning is the go-to-market, not the licensing.

Tier note (T3 → upgraded from a T4 stub): earlier ingested as a JS-gated The New Stack headline (trade press, body unrecoverable); now rebuilt from Cohere’s own blog + the Cohere Labs Hugging Face model card. Vendor-primary (self-interested on benchmark selection), but the specs are inspectable in the open weights and the Artificial Analysis score is independently defined.

Related: cohere · llm-provider · open-weight-models · artificial-analysis · qwen