context caching

/ˈkɒntekst ˌkæʃɪŋ/

Model Optimization

storing a model's processed state for a prompt prefix so that later requests sharing that prefix can reuse it instead of recomputing it

Context caching is ideal for asking repeated questions about the same long document.
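
The idea in the definition can be illustrated with a minimal sketch. It is a toy model, not any provider's actual API: the class PrefixCache, its methods, and the tokens_processed counter are hypothetical names invented for illustration, and the "processed state" is simulated as a plain tuple of tokens rather than a real model's internal state.

```python
import hashlib


class PrefixCache:
    """Toy stand-in for a model's context cache (all names are hypothetical)."""

    def __init__(self):
        self._states = {}          # prefix hash -> processed state
        self.tokens_processed = 0  # counts simulated compute

    def _process(self, tokens, state=()):
        # Placeholder for the expensive model pass: the "state" here is
        # simply the tuple of all tokens seen so far.
        self.tokens_processed += len(tokens)
        return state + tuple(tokens)

    def run(self, prefix, suffix):
        # Key the cache on a hash of the prefix tokens.
        key = hashlib.sha256("\x00".join(prefix).encode()).hexdigest()
        state = self._states.get(key)
        if state is None:
            # Cache miss: process the prefix once and store its state.
            state = self._process(prefix)
            self._states[key] = state
        # Only the suffix is processed on top of the cached state.
        return self._process(suffix, state)


if __name__ == "__main__":
    cache = PrefixCache()
    document = ["tok"] * 1000           # assumed long shared prefix
    cache.run(document, ["question", "one"])
    print(cache.tokens_processed)       # prefix and suffix both processed
    cache.run(document, ["question", "two"])
    print(cache.tokens_processed)       # only the new suffix was added
```

In this sketch the second call skips the 1000-token prefix entirely, which is the saving the definition describes; a real system would also need to handle cache expiry and memory limits, which are omitted here.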

Origin: Latin contextus ("a joining together") + caching, from French cacher ("to hide")