Intelligent response caching based on semantic similarity. Reduce costs and latency by serving cached responses for semantically similar requests.
response.ExtraFields.CacheDebug
(as a JSON object)
Fields:
CacheHit
(boolean): true
when response served from cacheCacheHitType
(string): "semantic"
for similarity match, "direct"
for hash matchCacheID
(string): Unique cache entry ID for management operationsCacheThreshold
(number): Similarity threshold used for the matchCacheSimilarity
(number): Similarity score for the match