Lab 2: Performance Bottleneck Analysis of Large Models

Introduction

In this lab, you will analyze performance bottlenecks in LLM inference.

Objectives

  • Profile memory usage and computation time.
  • Identify bottlenecks in the attention mechanism.
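
As a rough illustration of these two objectives, the sketch below times a naive single-head scaled dot-product attention and records its peak Python heap usage at a few sequence lengths. It is not the lab's official tooling: the helper names `naive_attention`, `profile`, and `random_matrix` are hypothetical, the implementation uses only the standard library on plain lists, and the sizes are arbitrary small values. A real LLM profile would more likely use a framework-level profiler on GPU tensors.

```python
import math
import random
import time
import tracemalloc


def naive_attention(q, k, v):
    """Single-head scaled dot-product attention on lists of lists (illustrative only)."""
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for qi in q:
        # Scores of one query against every key: O(n * d) work per query,
        # so O(n^2 * d) over the whole sequence.
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        # Numerically stable softmax over the scores of this query.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the value rows.
        out.append(
            [sum(w * vj[c] for w, vj in zip(weights, v)) for c in range(len(v[0]))]
        )
    return out


def profile(fn, *args):
    """Return (result, elapsed_seconds, peak_bytes) for one call of fn."""
    tracemalloc.start()
    start = time.perf_counter()
    result = fn(*args)
    elapsed = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, elapsed, peak


def random_matrix(rows, cols, rng):
    """Build a rows x cols matrix of floats drawn uniformly from [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


if __name__ == "__main__":
    rng = random.Random(0)
    d = 16  # assumed head dimension, chosen only to keep the run short
    for n in (16, 32, 64):
        q, k, v = (random_matrix(n, d, rng) for _ in range(3))
        _, elapsed, peak = profile(naive_attention, q, k, v)
        print(f"seq_len={n:4d}  time={elapsed * 1e3:8.2f} ms  peak={peak / 1024:8.1f} KiB")
```

Doubling the sequence length should roughly quadruple the measured time, which reflects the quadratic cost of attention in sequence length. Because `tracemalloc` adds overhead to every allocation, the timings here are inflated; measuring time and memory in separate runs gives cleaner numbers.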

Tasks

(Coming soon)

Released under CC BY-NC-SA 4.0 License.