•In early March 2026, Anthropic silently reduced the cache time-to-live (TTL) for an unspecified API from 1 hour to 5 minutes.
•This change was not officially announced, leading to unexpected 'quota and cost inflation' for users relying on previous caching behavior.
•Developers and teams using Anthropic services should review their API usage patterns and billing statements from early March 2026 onward to check for unexpected increases in quota consumption or cost.
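
One way to carry out the review suggested above is to compare how often cached input was read versus rewritten before and after the reported change. The following is a minimal sketch, not a definitive method: the record layout, the field names `cache_read_input_tokens` and `cache_creation_input_tokens`, the `CUTOFF` date, and the sample numbers are all assumptions made for illustration, since the source gives only "early March 2026" and does not name the affected API.

```python
from datetime import date

# Hypothetical cutoff: the source only says "early March 2026".
CUTOFF = date(2026, 3, 1)

# Hypothetical per-request log records with made-up numbers.
# The field names are assumptions for this sketch, not a confirmed
# billing-export format.
SAMPLE = [
    {"day": date(2026, 2, 20),
     "cache_read_input_tokens": 900,
     "cache_creation_input_tokens": 100},
    {"day": date(2026, 3, 10),
     "cache_read_input_tokens": 200,
     "cache_creation_input_tokens": 800},
]


def cache_read_ratio(records):
    """Fraction of cache-related input tokens that were reads, not writes."""
    read = sum(r["cache_read_input_tokens"] for r in records)
    written = sum(r["cache_creation_input_tokens"] for r in records)
    total = read + written
    # With no cache activity at all, report 0.0 instead of dividing by zero.
    return read / total if total else 0.0


def compare_around(records, cutoff):
    """Return (ratio before cutoff, ratio on or after cutoff)."""
    before = [r for r in records if r["day"] < cutoff]
    after = [r for r in records if r["day"] >= cutoff]
    return cache_read_ratio(before), cache_read_ratio(after)


before_ratio, after_ratio = compare_around(SAMPLE, CUTOFF)
print(f"before: {before_ratio:.2f}, after: {after_ratio:.2f}")
```

A sharp drop in the ratio after the cutoff would be consistent with a shorter TTL, because more requests would rewrite the cache instead of reading from it. It is not proof on its own: changed prompts or different traffic patterns can produce the same signal.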