Text Generation
gemma-4-26B-A4B-it
Gemma 4 26B is Google DeepMind's latest open-source MoE model, built on a 26B-parameter Mixture of Experts architecture that activates only 3.8B parameters during inference for exceptionally fast token throughput. Purpose-built for advanced reasoning and agentic workflows, it ranks #6 among all open models on the Arena AI leaderboard — outperforming models up to 20x its size — with native function-calling, 256K context, and full Apache 2.0 licensing....
Total Context:
262K
Max output:
262K
Input:
$
0.12
/ M Tokens
Input:
$
text
/ M Tokens
Output:
$
0.4
/ M Tokens
Text Generation
gemma-4-31B-it
Gemma 4 31B is Google DeepMind's latest open-source model, built on a 31B dense architecture from the same research foundation as Gemini 3. Purpose-built for advanced reasoning and agentic workflows, it ranks #3 among all open models on the Arena AI leaderboard — outperforming models up to 20x its size — with native function-calling, 256K context, and full Apache 2.0 licensing....
Total Context:
262K
Max output:
262K
Input:
$
0.13
/ M Tokens
Input:
$
text
/ M Tokens
Output:
$
0.4
/ M Tokens

