Prompt Caching
⚙️ How It Works
Token Type | Description | Billing
Cost Calculation
input_cost = text_tokens × input_rate
           + cached_tokens × cache_read_rate
           + cache_write_tokens × cache_write_rate
🔍 Cache usage fields
Field | Description
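To make the cost calculation above concrete, here is a minimal sketch that applies the formula to a single request. The variable names mirror the formula; the `InputRates` class, the `input_cost` function signature, and the rate values are hypothetical placeholders for illustration, not real provider prices or part of any documented API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InputRates:
    """Per-token rates (hypothetical container; units are up to the caller)."""
    input_rate: float        # rate for regular (uncached) text tokens
    cache_read_rate: float   # rate for tokens served from the cache
    cache_write_rate: float  # rate for tokens written to the cache


def input_cost(text_tokens: int, cached_tokens: int,
               cache_write_tokens: int, rates: InputRates) -> float:
    """Apply the formula: each token type is billed at its own rate."""
    for count in (text_tokens, cached_tokens, cache_write_tokens):
        if count < 0:
            raise ValueError("token counts must be non-negative")
    return (text_tokens * rates.input_rate
            + cached_tokens * rates.cache_read_rate
            + cache_write_tokens * rates.cache_write_rate)


# Placeholder rates chosen only to keep the arithmetic easy to follow.
rates = InputRates(input_rate=2.0, cache_read_rate=0.5, cache_write_rate=2.5)

# 1000 × 2.0 + 4000 × 0.5 + 500 × 2.5
print(input_cost(1000, 4000, 500, rates))  # 5250.0
```

Keeping the three rates together in one object makes it harder to mix up the cache read and cache write rates when comparing providers.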
🏢 Supported Providers
Amazon Bedrock (Claude models)
Google Vertex (Gemini models)
Azure (OpenAI models)
Mistral
Inceptron
Other Providers
💡 Best practices for cache usage