Moonshot AI · FinOps Profile

Moonshot Ai Finops

FOCUS-aligned FinOps profile for Moonshot AI / Kimi. Billing model is prepaid recharge with pay-as-you-go consumption metered against per-million-token rates per model and per direction (input cache-hit, input cache-miss, output). Batch jobs are billed at 50% of online rates.

Moonshot Ai Finops is the FinOps profile for Moonshot AI on the APIs.io network, aligned with the FinOps Foundation Framework.

It defines 5 billable meters, billed in CNY/USD, on a on-demand (recharge) cycle, and pricing category usage based.

The profile maps 7 FOCUS columns for cost-allocation reporting.

Tagged areas include AI, LLM, Inference, Long Context, and Kimi.

Category: AI and Machine Learning Pricing: Usage Based Billing: On-Demand (Recharge) FOCUS v1.3
AILLMInferenceLong ContextKimiFinOpsCost ManagementFOCUS

Framework Alignment

Framework
Data Spec

Charge Categories

UsagePurchaseAdjustment

FOCUS Columns

BillingCurrency
CNY
ChargeCategory
Usage
InvoiceIssuerName
Moonshot AI
ProviderName
Moonshot AI
PublisherName
Moonshot AI
ServiceCategory
AI and Machine Learning
ServiceName
Moonshot AI Platform

Meters

input_tokens_cache_miss
Unit: tokens
Input tokens billed when cache miss occurs.
input_tokens_cache_hit
Unit: tokens
Discounted input tokens served from prompt cache.
output_tokens
Unit: tokens
Generated output tokens.
batch_tokens
Unit: tokens
Batch endpoint tokens (50% discount versus online).
web_search_calls
Unit: calls
Web search tool invocations.

Sources