As LLMs become more capable, many RAG applications can be replaced with cache-augmented generation that includes documents in ...