Scalable Inference Serving · FinOps Profile
Scalable Inference Serving Finops
FOCUS-aligned FinOps shape for the Scalable Inference Serving stack: open-source software (KServe, BentoML, vLLM, Triton) with no software license fee; FinOps cost is the underlying GPU / Kubernetes infrastructure on the deploying cloud account.
Scalable Inference Serving Finops is the FinOps profile for Scalable Inference Serving on the APIs.io network, aligned with the FinOps Foundation Framework.
It defines 1 billable meter, billed in USD (varies by contract), on a per-contract cycle, and pricing category contract / negotiated.
The profile maps 6 FOCUS columns for cost-allocation reporting.
Tagged areas include FinOps, FOCUS, AI, Inference, and Open Source.
Category: AI Infrastructure
Pricing: Contract / Negotiated
Billing: Per-Contract
FOCUS v1.3
FinOpsFOCUSAIInferenceOpen Source
Framework Alignment
Charge Categories
PurchaseUsageAdjustment
FOCUS Columns
BillingCurrency
USD
InvoiceIssuerName
Various (open source)
ProviderName
Scalable Inference Serving
PublisherName
Various (open source)
ServiceCategory
AI Infrastructure
ServiceName
Scalable Inference Serving
Meters
contracted_consumption