🎉 gemma-4-12B-it is now available on SiliconFlow. Try it now.

State-of-the-Art AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

MiniMaxAI

Text Generation

MiniMax-M3

Released on: 1 Jun 2026

MiniMax-M3 is MiniMax’s frontier multimodal coding and agentic model, built on the MiniMax Sparse Attention (MSA) architecture. It supports up to a 1M-token context window and accepts image and video inputs. The model is designed for code generation, agentic workflows, tool use, long-context understanding, and multi-step reasoning, showing strong performance on benchmarks such as SWE-Bench Pro, Terminal-Bench 2.1, and MCP Atlas....

Total Context: 1049K
Max output: 131K
Input: 0.3 / M Tokens
Output: 1.2 / M Tokens

DeepSeek

Text Generation

DeepSeek-V4-Pro

Released on: 24 Apr 2026

DeepSeek-V4-Pro is DeepSeek's flagship open-source MoE model with 1.6T total parameters and 49B activated, purpose-built for frontier-level reasoning, coding, and agentic tasks. Supporting a 1M-token context window and three reasoning effort modes up to Think Max, it achieves top-tier performance on coding benchmarks such as LiveCodeBench and Codeforces — rivaling leading closed-source models — and is released under the MIT License....

Total Context: 1049K
Max output: 393K
Input: 1.6 / M Tokens
Output: 3.135 / M Tokens

DeepSeek

Text Generation

DeepSeek-V4-Flash

Released on: 24 Apr 2026

DeepSeek-V4-Flash is DeepSeek's latest open-source MoE model featuring 284B total parameters with only 13B activated during inference, delivering high-speed generation without sacrificing capability. With native support for a 1M-token context window and three switchable reasoning modes — Non-Think, Think High, and Think Max — it offers flexible intelligence scaling from everyday tasks to complex reasoning, all under the MIT License....

Total Context: 1049K
Max output: 393K
Input: 0.13 / M Tokens
Output: 0.28 / M Tokens
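Because the three listings above quote separate Input and Output prices per million tokens, the cost of a single request can be estimated with simple arithmetic. The following is a minimal sketch, not an official SiliconFlow tool: the `Pricing` class, the `PRICES` table, and the `estimate_cost` function are hypothetical names introduced for illustration, and the page does not state the currency, so results are in whatever unit the listed prices use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Listed prices per one million tokens (currency not stated on the page)."""
    input_per_m: float
    output_per_m: float


# Values copied from the three model listings above.
# The dictionary itself is an illustrative assumption, not a SiliconFlow API.
PRICES = {
    "MiniMax-M3": Pricing(input_per_m=0.3, output_per_m=1.2),
    "DeepSeek-V4-Pro": Pricing(input_per_m=1.6, output_per_m=3.135),
    "DeepSeek-V4-Flash": Pricing(input_per_m=0.13, output_per_m=0.28),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its input and output token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    price = PRICES[model]  # raises KeyError for a model not in the table
    total = input_tokens * price.input_per_m + output_tokens * price.output_per_m
    return total / 1_000_000


if __name__ == "__main__":
    # Compare the same hypothetical workload across the three models.
    for name in PRICES:
        cost = estimate_cost(name, input_tokens=200_000, output_tokens=10_000)
        print(f"{name}: {cost:.4f}")
```

The sketch ignores any cached-input discounts or tiered pricing a provider may apply; it only multiplies the listed rates by token counts.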

Moonshot AI

Text Generation

Kimi-K2.6

Released on: 21 Apr 2026

Kimi K2.6 is an open-source, native multimodal agentic model by Moonshot AI, achieving open-source state-of-the-art on benchmarks including HLE with tools, SWE-Bench Pro, and BrowseComp. Built on a MoE architecture with 1T total parameters and 32B activated, the model supports a 256K-token context window and multimodal inputs (image and video) via its MoonViT vision encoder. K2.6 is optimized for agentic workloads: it sustains 4,000+ tool calls over 12+ hours of continuous execution, scales to 300 parallel sub-agents × 4,000 steps per run to produce 100+ files from a single prompt, and supports both Thinking and Instant inference modes with function calling and multi-turn Preserve Thinking...

Total Context: 262K
Max output: 262K
Input: 0.77 / M Tokens
Output: 4.0