Actions: InternLM/lmdeploy

publish-docker

837 workflow runs

[side-effect] bring back quantization of qwen2-vl, glm4v and etc. (#2…
publish-docker #880: Commit 4e5cc16 pushed by lvhan028
December 26, 2024 12:55 Waiting main
Torchrun launching multiple api_server (#2402)
publish-docker #879: Commit d9b8372 pushed by lvhan028
December 26, 2024 10:33 Waiting main
Fix torch_dtype in lite (#2956)
publish-docker #878: Commit 191a7dd pushed by lvhan028
December 26, 2024 06:42 Waiting main
[dlinfer] add DlinferFlashAttention to fix qwen vl (#2952)
publish-docker #877: Commit 3a98ae9 pushed by lvhan028
December 26, 2024 03:39 Waiting main
Fallback to pytorch engine when the model is quantized by smooth quan…
publish-docker #876: Commit a0a7728 pushed by lvhan028
December 26, 2024 03:16 Waiting main
Fix exception handler for proxy server (#2901)
publish-docker #875: Commit f62b544 pushed by lvhan028
December 26, 2024 02:59 Waiting main
Support torch_dtype modification and update FAQs for AWQ quantization…
publish-docker #874: Commit 9565505 pushed by lvhan028
December 25, 2024 04:21 Waiting main
fix mllama inference without image (#2947)
publish-docker #873: Commit 35a5591 pushed by lvhan028
December 25, 2024 02:29 Waiting main
support unaligned qkv heads (#2930)
publish-docker #872: Commit dfeee42 pushed by lvhan028
December 23, 2024 12:50 Waiting main
fix torch_dtype (#2933)
publish-docker #871: Commit 92475b0 pushed by lvhan028
December 23, 2024 06:06 Waiting main
[side effect] vlm quant failed (#2914)
publish-docker #870: Commit 182d1c8 pushed by lvhan028
December 22, 2024 03:56 Waiting main
[dlinfer] fix moe op for dlinfer. (#2917)
publish-docker #869: Commit 87f1783 pushed by lvhan028
December 20, 2024 11:24 Waiting main
fix lora name and rearange wqkv for internlm2 (#2912)
publish-docker #868: Commit 33f5b19 pushed by lvhan028
December 20, 2024 03:19 Waiting main
replicate kv for some models when tp is divisble by kv_head_num (#2874)
publish-docker #867: Commit e20999f pushed by lvhan028
December 18, 2024 12:48 Waiting main
support tp > n_kv_heads for pt engine (#2872)
publish-docker #866: Commit 7deb69c pushed by lvhan028
December 18, 2024 11:56 Waiting main
fix typo (#2916)
publish-docker #865: Commit 1b219e3 pushed by lvhan028
December 18, 2024 08:27 Waiting main
unfreeze torch version in dockerfile (#2906)
publish-docker #864: Commit bafa3d2 pushed by lvhan028
December 18, 2024 03:52 1h 6m 4s main
Optimize tp broadcast (#2889)
publish-docker #863: Commit 8afb84c pushed by lvhan028
December 17, 2024 13:40 Waiting main
[dlinfer] only compile language_model in vl models (#2893)
publish-docker #862: Commit 1efed79 pushed by lvhan028
December 16, 2024 07:46 1d 5h 16m 44s main
Fix llama3.1 chat template (#2862)
publish-docker #861: Commit abd90db pushed by lvhan028
December 16, 2024 04:14 Waiting main
Refactor VLM modules (#2810)
publish-docker #860: Commit 96e82eb pushed by lvhan028
December 13, 2024 11:05 Waiting main
[dlinfer] fix engine checker (#2891)
publish-docker #859: Commit 422b9f2 pushed by lvhan028
December 13, 2024 06:50 Waiting main
refine multi-backend setup.py (#2880)
publish-docker #858: Commit 0749ca5 pushed by lvhan028
December 13, 2024 03:33 Waiting main
Fix args type in docstring (#2888)
publish-docker #857: Commit 8f34eb1 pushed by lvhan028
December 13, 2024 03:26 Waiting main
refactor PyTorchEngine check env (#2870)
publish-docker #856: Commit b99a5da pushed by lvhan028
December 12, 2024 09:45 Waiting main