Lab Overview
Grading Policy (Total 100%)
| Component | Weight | Description |
|---|---|---|
| Lab1(个人) | 10% | 大模型qwen3 部署 |
| Lab2(个人) | 20% | 大模型的性能瓶颈分析 |
| Lab3(个人) | 20% | 基于vLLM的prefill/decode调度策略实现 |
| 课程大项目(2-3团队) | 40% | 一个高效大模型推理优化策略设计 |
| 课堂小测 | 10% | 共2次 (5% each) |
Schedule
| Lab | Topic | Release Date | Due Date |
|---|---|---|---|
| Lab 1 | Qwen3 Deployment | 待定 | 待定 |
| Lab 2 | Performance Analysis | 待定 | 待定 |
| Lab 3 | vLLM Scheduling | 待定 | 待定 |
| Final Project | Project Guidelines | 待定 | 待定 |
Submission Policy
Submission details will be announced later.