feat: support continuous kvcache using virtual memory. #92

XuZhang99 · 2025-09-04T13:20:02Z

No description provided.

)

liutongxuan · 2025-09-11T01:38:04Z

xllm/core/scheduler/kv_cache_manager_client.h

+
+ private:
+  // these two pointers must be one null and one non-null
+  BlockManagerPool* block_manager_pool_;


Can you build a ContinousBlockManagerPool class inherits from BlockManagerPool? In this way, this Client is not needed

yq33victor · 2025-09-11T12:28:46Z

xllm/core/distributed_runtime/remote_worker.cpp

+        options[1].num_kv_heads());
+    xtensor_options_vec.mutable_value_options()->set_head_size(
+        options[1].head_size());
+    xtensor_options_vec.mutable_value_options()->set_max_context_len(


call function allocate_continuous_kv_cache directly?

yq33victor · 2025-09-11T12:32:17Z

xllm/core/framework/batch/batch_input_builder.h

@@ -67,6 +67,10 @@ class BatchInputBuilder {
                           uint32_t n_kv_cache_tokens,
                           uint32_t seq_len,
                           uint32_t q_seq_len);
+  void setup_continuous_kv_cache_info(Sequence* sequence,


function setup_continuous_kv_cache_info is not used ?

it should be used in func process_single_sequence, but i forgot...

xllm/core/framework/batch/batch_input_builder.cpp

yq33victor · 2025-09-11T12:39:16Z

xllm/core/framework/kv_cache/kv_cache.h

  }

 private:
  torch::Tensor key_cache_;
  torch::Tensor value_cache_;
+


nit: maybe it's better create a new class like class ContinuousKVcache.

Anyway, it's ok currently. :) ignore.

yq33victor · 2025-09-11T12:53:49Z

xllm/core/framework/page/page_manager.cpp

+  // TODO: refine this
+  seq_id = multi_layer_kv_xtensor_.first->allocate_seq_id();
+  seq_id = multi_layer_kv_xtensor_.second->allocate_seq_id();
+}


no returned value ?

yq33victor · 2025-09-11T12:56:28Z

xllm/core/framework/page/page_manager_client.cpp

+
+namespace xllm {
+
+bool PageManagerClient::allocate(int32_t& seq_id, size_t num_tokens) {


PageManagerClient is just the wrapper of page_manager_, could we use page_manager_ directly ?

it's like WorkerImpl and WorkerClient, i think WorkerClient is used for single node serving, so i use PageManagerClient for single node serving.

yq33victor · 2025-09-11T13:02:22Z

xllm/core/runtime/engine.h

@@ -119,6 +125,9 @@ class Engine {
  // block manager
  std::unique_ptr<BlockManagerPool> block_manager_pool_;

+  // page manager
+  std::unique_ptr<PageManagerPool> page_manager_pool_;


can we create a base ManagerPool class, BlockManagerPool and PageManagerPool can inherit from the base class. Then here we can only define a member std::unique_ptr<ManagerPool> manager_pool_;

yq33victor · 2025-09-11T13:07:25Z

xllm/core/framework/page/page_manager_server.cpp

+#include "util/net.h"
+
+namespace xllm {
+void PageManagerServer::create_server(const page::Options& options,


nit: we'd better put these files here to their respective directories according to their functions.
like: page_manager_pool.h to framework/block/, page_manager_server.h to core/distribute_runtime, service ... etc. we can add a subdirectory under these directories like page/.

Anyway, this can be refactored later, currently is ok.

xllm/core/framework/request/sequence.h

yq33victor · 2025-09-11T13:21:28Z

xllm/core/scheduler/kv_cache_manager_client.cpp

+namespace xllm {
+
+KVCacheManagerClient::KVCacheManagerClient(BlockManagerPool* block_manager_pool)
+    : block_manager_pool_(block_manager_pool) {}


Here, after we create a base class ManagerPool, there will be no need to determine whether it is a block or a page. We can directly handle it with the base class pointer. And the KVCacheManagerClient will also become unnecessary.

XuZhang99 requested review from yq33victor and liutongxuan September 4, 2025 13:20

XuZhang99 force-pushed the feature/continuous_kvcache branch 2 times, most recently from 425339a to 62fb1c1 Compare September 4, 2025 13:28

XuZhang99 changed the title ~~feat: support continuous kvcache using virtual memory.~~ [WIP] feat: support continuous kvcache using virtual memory. Sep 4, 2025

XuZhang99 changed the title ~~[WIP] feat: support continuous kvcache using virtual memory.~~ feat: support continuous kvcache using virtual memory. Sep 4, 2025

XuZhang99 force-pushed the feature/continuous_kvcache branch 5 times, most recently from 6f37ec9 to 3796d69 Compare September 9, 2025 06:58

feat: support continuous kvcache using virtual memory.

df37855

XuZhang99 force-pushed the feature/continuous_kvcache branch from 3796d69 to df37855 Compare September 9, 2025 07:03

feat: add model layer add atb_layers support for continuous kvcache (#14

378287b

)

liutongxuan reviewed Sep 11, 2025

View reviewed changes

yq33victor reviewed Sep 11, 2025

View reviewed changes

bugfix: add setup_continuous_kv_cache_info in process_single_sequence.

c042223

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: support continuous kvcache using virtual memory. #92

feat: support continuous kvcache using virtual memory. #92

Uh oh!

XuZhang99 commented Sep 4, 2025

Uh oh!

liutongxuan Sep 11, 2025

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

XuZhang99 Sep 11, 2025

Uh oh!

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

XuZhang99 Sep 11, 2025

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

Uh oh!

yq33victor Sep 11, 2025

Uh oh!

Uh oh!


		namespace xllm {

		bool PageManagerClient::allocate(int32_t& seq_id, size_t num_tokens) {

feat: support continuous kvcache using virtual memory. #92

Are you sure you want to change the base?

feat: support continuous kvcache using virtual memory. #92

Uh oh!

Conversation

XuZhang99 commented Sep 4, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!