Return to Article Details A Survey on Hybrid Caching Techniques to Reduce Latency in Large Language Model Systems