-
Notifications
You must be signed in to change notification settings - Fork 0
Comparing changes
Open a pull request
base repository: CyCle1024/lmdeploy
base: main
head repository: InternLM/lmdeploy
compare: main
- 19 commits
- 199 files changed
- 11 contributors
Commits on Mar 17, 2025
-
Configuration menu - View commit details
-
Copy full SHA for 1fd1f32 - Browse repository at this point
Copy the full SHA 1fd1f32View commit details -
Fix the bug for reading dict error (InternLM#3196)
* Update qwen2.py * Update mllama.py fix the bug for reading dict * Update qwen2_vl.py fix the bug for reading dict * fix qwen2_5_vl.py readdict error --------- Co-authored-by: zxy <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 9958b89 - Browse repository at this point
Copy the full SHA 9958b89View commit details
Commits on Mar 18, 2025
-
docs: update ascend docs for docker running (InternLM#3266)
* docs: update ascend docs for docker running * ci: fix mdformat linting
Configuration menu - View commit details
-
Copy full SHA for 9bff3a7 - Browse repository at this point
Copy the full SHA 9bff3a7View commit details -
Configuration menu - View commit details
-
Copy full SHA for d95ecc0 - Browse repository at this point
Copy the full SHA d95ecc0View commit details -
* fix get_ppl * remove useless code * remove debug logs
Configuration menu - View commit details
-
Copy full SHA for 028b94c - Browse repository at this point
Copy the full SHA 028b94cView commit details -
* Add gemma3 text model * Add gemma vl * update doc * add tp * fix doc * readmes
Configuration menu - View commit details
-
Copy full SHA for 7c33db5 - Browse repository at this point
Copy the full SHA 7c33db5View commit details
Commits on Mar 19, 2025
-
bump version to v0.7.2 (InternLM#3252)
* bump version to v0.7.2 * bump version to v0.7.2 * remote print
Configuration menu - View commit details
-
Copy full SHA for 6f1277e - Browse repository at this point
Copy the full SHA 6f1277eView commit details
Commits on Mar 20, 2025
-
fix activation grid oversize (InternLM#3282)
* fix activation grid oversize * optimize * fix quant fp8
Configuration menu - View commit details
-
Copy full SHA for 1e77ed2 - Browse repository at this point
Copy the full SHA 1e77ed2View commit details -
Add spaces_between_special_tokens to /v1/interactive and make compati…
…ble with empty text (InternLM#3283) * add spaces between special token to interactive endpoint * empty input
Configuration menu - View commit details
-
Copy full SHA for 9f211a8 - Browse repository at this point
Copy the full SHA 9f211a8View commit details
Commits on Mar 21, 2025
-
add env var to control timeout (InternLM#3291)
* add env var to control timeout * update * update * fix lint
Configuration menu - View commit details
-
Copy full SHA for da0bf7b - Browse repository at this point
Copy the full SHA da0bf7bView commit details -
Configuration menu - View commit details
-
Copy full SHA for a2c38da - Browse repository at this point
Copy the full SHA a2c38daView commit details -
Configuration menu - View commit details
-
Copy full SHA for 81c815e - Browse repository at this point
Copy the full SHA 81c815eView commit details -
refactor attn param (InternLM#3164)
* refactor attn param * fix lint * fix build * fix ut * use creator to create rope_param * reuse parse func * fix ut * fix comments * update name * fix dynamic * fix deepseekv2 yarn * use single dataclass * fix loading workspace model
Configuration menu - View commit details
-
Copy full SHA for 82d0a90 - Browse repository at this point
Copy the full SHA 82d0a90View commit details
Commits on Mar 22, 2025
-
Torch dp support (InternLM#3207)
* better dist context * can not exit * multinode support * better exception * refactor * fix local rank * replace group * fix dist * remove useless code * remove finish flag * refactor engine and model agent * uni executor * wip * tp * fix * less async * circle buf * event per block * fast mp * fix error handler * remove safe wait * context in model agent * fix on stop * check before init * fix tp close * ray ver0 * fix close * fix remote code * optimize ray * better checker and logger * pack tensor * auto check dist * fix mp gloo * add timer tools * better scheduler * fix mp hang * fix mp * fix chat * less output * merge main * optimize ray get output * remove nsight runtime env * dag * optimize mp & lint * optimize mp * add base workerwrapper * fix gather, update flags * better return mask * add choice * enable mp,ray with worldsize=1 * fix mp exit * fix mp vlm * chat exit * add docs * lint * doc * dp check * fix blocked fp8 moe * remove mask * support dp, async * remove debug line * fix model tp * support sync execute * fix chat stopwords * refactor chat * add warmup * disable warmup * dp support * fix ut, merge main, force eager * support qwen2/internlm2/internlm3 * support blocked fp8 all gather * add more model support * fix exit * fix merge * fix sync long context * support process group on ray * change dp master addr and master port * update log level * support moe tp * fix tp1 dp2 * fix * fix * wait handle * remove flag * ensure execute order * remove import * add serve args * force eager
Configuration menu - View commit details
-
Copy full SHA for f6e7ec7 - Browse repository at this point
Copy the full SHA f6e7ec7View commit details -
Add deep gemm with tma pre allocated (InternLM#3287)
* add deep gemm with tma pre allocated * add comment * add comment * dispatch * no use_deep_gemm arg * remove DeepGemmBlockedF8 * missed op type * latest get_best_config * add a line of debug
Configuration menu - View commit details
-
Copy full SHA for e37a76d - Browse repository at this point
Copy the full SHA e37a76dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8f68177 - Browse repository at this point
Copy the full SHA 8f68177View commit details
Commits on Mar 24, 2025
-
[ci] add think function testcase (InternLM#3299)
* update * update * update * update timeout * update * update * update * update * updaste * update * update * update * update * update * update * update * update * update * update * update * update * update --------- Co-authored-by: zhulinJulia24 <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2774f8e - Browse repository at this point
Copy the full SHA 2774f8eView commit details -
Add mixed DP + TP (InternLM#3229)
* comm abstraction * add custom * fused rms norm * refactor * push-based kernel * optimize for small hidden dims * integration * clean up * export options & fix things * allgather2d & VMM allocation * optimize allgather2d * remove obsolete comm utils * handle non-multi-gpu build * fix lint * fix lint * avoid using mscclpp repo (some deps are not needed) * fix lint * fix nccl version & clean up deps * fix lint * custom -> native * rename * fix p-lora * fix lm head * log fatal exception explicitly * initial data parallel support * sync dp + tp * mixed `d*t0 | t1` * refactor * refactor * fix ut * refactor * asymmetrical allreduce * fix * fix nccl<2.18 * fix * fix * fix * fix * fix converter * fix converter * fix tp for converted model * fix tp for converted model * assert tp size loading from workspace
Configuration menu - View commit details
-
Copy full SHA for 63b13e8 - Browse repository at this point
Copy the full SHA 63b13e8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8b12b4d - Browse repository at this point
Copy the full SHA 8b12b4dView commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff main...main