Skip to content

Lab Overview

Grading Policy (Total 100%)

ComponentWeightDescription
Lab1(个人)10%大模型qwen3 部署
Lab2(个人)20%大模型的性能瓶颈分析
Lab3(个人)20%基于vLLM的prefill/decode调度策略实现
课程大项目(2-3团队)40%一个高效大模型推理优化策略设计
课堂小测10%共2次 (5% each)

Schedule

LabTopicRelease DateDue Date
Lab 1Qwen3 Deployment待定待定
Lab 2Performance Analysis待定待定
Lab 3vLLM Scheduling待定待定
Final ProjectProject Guidelines待定待定

Submission Policy

Submission details will be announced later.

Released under CC BY-NC-SA 4.0 License.