james.chan's picture

4

james.chan

yyg201708

AI & ML interests

None yet

Recent Activity

new activity about 6 hours ago

zai-org/GLM-4.7-Flash:Why does the KV cache occupy so much GPU memory?

new activity about 11 hours ago

zai-org/GLM-4.7-Flash:Cannot run vLLM on DGX Spark: ImportError: libcudart.so.12

new activity 15 days ago

IQuestLab/IQuest-Coder-V1-40B-Loop-Instruct:What vLLM version should I use to deploy this model?

View all activity

Organizations

None yet

New activity in zai-org/GLM-4.7-Flash about 6 hours ago

Why does the KV cache occupy so much GPU memory?

#21 opened about 6 hours ago by

New activity in zai-org/GLM-4.7-Flash about 11 hours ago

Cannot run vLLM on DGX Spark: ImportError: libcudart.so.12

#18 opened about 11 hours ago by

New activity in IQuestLab/IQuest-Coder-V1-40B-Loop-Instruct 15 days ago

What vLLM version should I use to deploy this model?

#13 opened 16 days ago by

New activity in IQuestLab/IQuest-Coder-V1-40B-Loop-Instruct 16 days ago

What vLLM version should I use to deploy this model?

#13 opened 16 days ago by