Search Results

Found 1 results for "700d24f9ad6fad37d1a898f6d902874b" across all boards searching md5.

Anonymous /g/106163327#106165998
8/6/2025, 9:17:17 PM
is there any backend with smarter KV cache invalidation that llama.cpp? when I cut a few tokens at the end, it deletes the entire cache and needs to process the whole prompt from scratch