Check if a cached response exists for the given messages
exact, semantictrue for cache hitsexact or semanticfalse for cache misses| Threshold | Behavior | Best For |
|---|---|---|
| 0.95-1.0 | Very strict matching | High-precision applications |
| 0.85-0.94 | Balanced (recommended) | General purpose |
| 0.75-0.84 | Loose matching | Maximum cache utilization |
| 0.60-0.74 | Very loose | Experimental/testing |
400 - Bad Request
401 - Unauthorized
429 - Rate Limit Exceeded