Mammoth KVCache sharing

Does mammoth support or plans to support sharing KV cache between LLM replicas?

Thanks for reaching out! We don’t currently support KV cache sharing between replicas, but it’s a feature we have on our roadmap and plan to add in the future.

1 Like