
Pull requests: InternLM/lmdeploy


Pull requests list

add recurrent_gated_delta_rule kernel
#4376 opened Feb 27, 2026 by grimoire, label: improvement
Reduce MLA kv-cache memory
#4373 opened Feb 27, 2026 by lzhangzz, label: improvement
[Ascend] support qwen3next
#4371 opened Feb 26, 2026 by wanfengcxz Draft
[WIP] Support video inputs
#4360 opened Feb 13, 2026 by CUHKSZzxy Draft
Improve proxy server
#4354 opened Feb 12, 2026 by lvhan028, label: improvement
Support glm4.7 with mtp
#4346 opened Feb 10, 2026 by RunningLeon, label: improvement
Support MiniMax-M2 in TurboMind engine
#4343 opened Feb 10, 2026 by zh-nj, label: enhancement (New feature or request)
[WIP]Support torch compile
#4336 opened Feb 8, 2026 by grimoire Draft
return BadRequest for all invalid inputs
#4291 opened Jan 26, 2026 by lvhan028, label: Bug:P2
support repetition ngram logits processor
#4288 opened Jan 23, 2026 by grimoire, label: enhancement (New feature or request)
fix dllm mask on set_step
#4278 opened Jan 18, 2026 by grimoire
[ascend] fix awq and smoothq
#4277 opened Jan 16, 2026 by wanfengcxz Draft
Update benchmark serving script for proxy_server
#4173 opened Dec 1, 2025 by lvhan028
Update installation.md
#4095 opened Nov 3, 2025 by krescent
Add step_map to track token decoding order in DLLM
#4057 opened Oct 21, 2025 by Auraithm
4 tasks done
[POC] Encoder Disaggregation
#4047 opened Oct 17, 2025 by CUHKSZzxy Draft
2 of 7 tasks
quant blocked fp8
#4018 opened Sep 29, 2025 by CUHKSZzxy, label: enhancement (New feature or request)
4 of 5 tasks
Add reasoning parser for GPT-OSS style channels.
#3998 opened Sep 21, 2025 by GY19A