Google Gemini · FinOps Profile

Google Gemini Finops

FOCUS-aligned FinOps profile for the Gemini API. Pricing is pay-as-you-go and token-based, set per model, with separate input and output rates and separate bands for multimodal and audio content. Long-context price tiers apply to prompts above 200K tokens. The Batch API is billed at a 50% discount, and Google Search grounding is an optional, separately metered add-on. Enterprise usage runs through Vertex AI with provisioned throughput. Spend rolls into the standard Google Cloud Billing pipeline.

Google Gemini Finops is the FinOps profile for Google Gemini on the APIs.io network, aligned with the FinOps Foundation Framework.

It defines 7 billable meters, billed in USD on a monthly cycle, under the token-based usage pricing category.

The profile maps 7 FOCUS columns for cost-allocation reporting.

Tagged areas include Generative AI, LLM, AI Infrastructure, Google, and FinOps.

Category: AI Infrastructure / LLM
Pricing: Token-Based Usage
Billing: Monthly
FOCUS v1.3
Tags: Generative AI, LLM, AI Infrastructure, Google, FinOps, FOCUS

Framework Alignment

Framework
Data Spec

Charge Categories

Usage, Purchase, Tax, Adjustment, Credit

FOCUS Columns

BillingCurrency: USD
ChargeCategory: Usage
InvoiceIssuerName: Google LLC
ProviderName: Google Cloud
PublisherName: Google LLC
ServiceCategory: AI Infrastructure
ServiceName: Gemini API
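
As an illustration of how these seven values might be used, the sketch below stamps them onto each exported cost row. It is a minimal example under stated assumptions, not part of the profile: the helper name focus_row, the plain-dict row shape, and the string encoding of the amount are hypothetical. BilledCost is a FOCUS column, but it is not among the seven columns this profile maps.

```python
from decimal import Decimal

# Static FOCUS column values copied from the profile's column mapping.
GEMINI_FOCUS_DEFAULTS = {
    "BillingCurrency": "USD",
    "ChargeCategory": "Usage",
    "InvoiceIssuerName": "Google LLC",
    "ProviderName": "Google Cloud",
    "PublisherName": "Google LLC",
    "ServiceCategory": "AI Infrastructure",
    "ServiceName": "Gemini API",
}


def focus_row(billed_cost: Decimal, **overrides: str) -> dict:
    """Build one cost row from the profile defaults (hypothetical helper).

    Keyword overrides replace a default, e.g. ChargeCategory="Credit"
    for a credit line instead of a usage line.
    """
    row = dict(GEMINI_FOCUS_DEFAULTS)  # copy so the defaults stay untouched
    row.update(overrides)
    # BilledCost is an assumption here: it is not one of the seven mapped columns.
    row["BilledCost"] = str(billed_cost)
    return row
```

A usage row would then be focus_row(Decimal("1.25")), and a credit could be expressed as focus_row(Decimal("-0.50"), ChargeCategory="Credit"), since Credit is one of the charge categories the profile lists.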

Meters

tokens_input
Unit: token
Input tokens billed per model and modality band
tokens_output
Unit: token
Output tokens billed per model and band
tokens_cached
Unit: token
Tokens served from context cache (lower rate than fresh input)
batch_tokens_input
Unit: token
Input tokens billed via the Batch API (50% discount)
batch_tokens_output
Unit: token
Output tokens billed via the Batch API (50% discount)
search_grounding_queries
Unit: request
Google Search grounding queries (first 5,000 per month free, then $14 per 1,000 queries)
provisioned_throughput
Unit: hour
Reserved Gemini capacity hours via Vertex AI Enterprise
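
The arithmetic behind the grounding and batch meters can be sketched as follows. The free allowance, the $14 per 1,000 rate, and the 50% batch discount come from the meter descriptions above. Everything else is an assumption for illustration: the function names are hypothetical, grounding is assumed to be billed pro rata per query above the allowance (rather than in whole blocks of 1,000), and the per-million token rate is a caller-supplied parameter because the profile lists no per-model rates.

```python
from decimal import Decimal

FREE_GROUNDING_QUERIES = 5_000        # free monthly allowance from the meter description
GROUNDING_USD_PER_1K = Decimal("14")  # rate once the allowance is used up
BATCH_DISCOUNT = Decimal("0.5")       # Batch API discount (50%)


def grounding_cost(queries: int) -> Decimal:
    """Monthly grounding charge in USD.

    Assumes pro-rata billing per query above the free allowance.
    """
    billable = max(0, queries - FREE_GROUNDING_QUERIES)
    return GROUNDING_USD_PER_1K * billable / 1000


def token_cost(tokens: int, usd_per_million: Decimal, batch: bool = False) -> Decimal:
    """Token charge in USD at a caller-supplied rate.

    usd_per_million is a placeholder input, not a published Gemini rate.
    """
    cost = usd_per_million * tokens / 1_000_000
    return cost * (1 - BATCH_DISCOUNT) if batch else cost
```

Under these assumptions, 8,000 grounding queries in a month leave 3,000 billable queries, which comes to $42. The same token volume sent through the Batch API costs half of what it would on the standard meters.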

Sources