
Pre-Warm Prompt Cache to Cut Time-to-First-Token
Sending your system prompt before the user prompt lets Claude write it to the cache without generating output. When the real request arrives, it hits a warm cache and responds faster.
ClaudeDevs·Thu, May 14 7:35pm ET