🎉 LongCat-2.0 tersedia di SiliconFlow. Coba SEKARANG.

Model-model

Produk

Harga

Dokumen

Blog

Tentang

Kontak

State-of-the-Art

AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

State-of-the-Art

AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

State-of-the-Art

AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

All

Featured

LLM

Vision

Image

Video

Audio

Serverless

Google

Text Generation

gemma-4-12B-it

Dirilis pada: 9 Jun 2026

Gemma 4 26B is Google DeepMind's latest open-source MoE model, built on a 26B-parameter Mixture of Experts architecture that activates only 3.8B parameters during inference for exceptionally fast token throughput. Purpose-built for advanced reasoning and agentic workflows, it ranks #6 among all open models on the Arena AI leaderboard — outperforming models up to 20x its size — with native function-calling, 256K context, and full Apache 2.0 licensing....

Total Context:

262K

Max output:

262K

Input:

0.1

/ M Tokens

Input:

text

/ M Tokens

Output:

0.3

/ M Tokens

Google

Text Generation

gemma-4-26B-A4B-it

Dirilis pada: 7 Apr 2026

Total Context:

262K

Max output:

262K

Input:

0.12

/ M Tokens

Input:

text

/ M Tokens

Output:

0.4

/ M Tokens

Google

Text Generation

gemma-4-31B-it

Dirilis pada: 7 Apr 2026

Gemma 4 31B is Google DeepMind's latest open-source model, built on a 31B dense architecture from the same research foundation as Gemini 3. Purpose-built for advanced reasoning and agentic workflows, it ranks #3 among all open models on the Arena AI leaderboard — outperforming models up to 20x its size — with native function-calling, 256K context, and full Apache 2.0 licensing....

Total Context:

262K

Max output:

262K

Input:

0.13

/ M Tokens

Input:

text

/ M Tokens

Output:

0.4