CSE 234: Data Systems for Machine Learning

Name: CSE 234 Winter 2025
Author: Hao Zhang

Instructor: Hao Zhang, UC San Diego, Winter 2025

Announcements

Week 6 Announcements

Feb 11

We have just released PA2, due March 4th, 2025. Please start early!

Week 1

Jan 7

1 Introduction: Slides

Survey: Beginning of Quarter Survey (Due: End of Week 2, 1/19)

Readings (Due 1/14)

Required: 1.1 - MLSys: Intro, 1.2 - DNN
Optional: 1.3 - Petuum, 1.4 - Systems Challenges for AI

Jan 9

2 Basics: Modern DL, computational graph, frameworks: Slides • Recording • Scribe Note

Week 2

Jan 14

3 Basics: autodiff, ML system architecture overview: Slides • Recording • Scribe Note

Readings (Due 1/21)

Required: 2.1 - TensorFlow, 2.2 - PyTorch
Optional: 2.3 - AutoDiff (Chapters 1-3), 2.4 - DyNet, 2.5 - TensorFlow Control Flow Implementation, 2.6 - Dynamic Control Flow

Jan 16

4 Tensor format, matmul deep dive, accelerators: Slides • Recording • Scribe Note

Week 3

Jan 21

5 GPUs and CUDA: Slides • Recording • Scribe Note

Readings (Due 1/28)

Required: 3.1 - GPU Performance, 3.2 - MI300X vs H100
Optional: 3.3 - Moore’s Law, 3.4 - The Future of Moore’s Law

Jan 23

6 GPU matmul, operator compilation: Slides • Recording • Scribe Note

Week 4

Jan 28

7 Triton, graph optimization and compilation: Slides • Recording • Scribe Note

Readings (Due 2/4)

Required: 4.1 - TVM, 4.2 - Triton
Optional: 4.3 - TASO, 4.4 - DL Compiler, 4.5 - Tensor Comprehensions

Jan 30

8 Memory: Slides • Recording • Scribe Note

Week 5

Feb 4

9 Quantization: Slides • Recording • Scribe Note

Readings (Due 2/11)

Required: 5.1 - Deep Compression, 5.2 - Quantization Survey
Optional: 5.3 - AWQ, 5.4 - QLoRA, 5.5 - Scaling Laws for Mixed Quantization

Feb 6

10 Guest Lecture by Tianqi Chen: Slides • Recording • Scribe Note

Week 6

Feb 11

11 Parallelization: Slides • Recording • Scribe Note

Readings (Due 2/18)

Required: 6.1 - ML Parallelism Blog, 6.2 - Megatron
Optional: 6.3 - PyTorch DDP, 6.4 - Parameter Server, 6.5 - Megatron v2

Feb 13

12 Parallelization - 2, collective communication: Slides • Recording • Scribe Note

Week 7

Feb 18

13 Parallelization - 3, data, inter- and intra-op parallelism: Slides • Recording • Scribe Note

Readings (Due 2/25)

Required: 7.1 - GPipe, 7.2 - Alpa
Optional: 7.3 - Megatron v3, 7.4 - PipeDream, 7.5 - Chimera, 7.6 - GShard

Feb 20

14 Guest Lecture by Hector Liu: Slides • Recording • Scribe Note

Week 8

Feb 25

15 Parallelization - 4: Slides • Recording • Scribe Note

Readings (Due 3/4)

Required: 8.1 - GPT-3, 8.2 - Chinchilla Scaling Law
Optional: 8.3 - Attention is All You Need, 8.4 - MoE, 8.5 - GQA, 8.6 - DeepSeek-V3

Feb 27

16 LLM - 1: transformers, attention: Slides • Recording • Scribe Note

Week 9

Mar 4

17 LLM - 2: Scaling law, MoE: Slides • Recording • Scribe Note

Readings (Due 3/11)

Required: 9.1 - Paged Attention, 9.2 - FlashAttention
Optional: 9.3 - Orca, 9.4 - Speculative Decoding, 9.5 - DistServe, 9.6 - EAGLE

Mar 6

18 Guest Lecture by Hongyang Zhang: Slides • Recording • Scribe Note

Week 10

Mar 11

19 LLM - 3: Continuous batching, paged attention, disaggregated prefill and decoding: Slides • Recording • Scribe Notes

Reading:

Required: None
Optional: 10.1 - Situational Awareness AI

Mar 13

20 LLM - 4: FlashAttention, DeepSeek-V3 review, ending words: Slides • Recording • Scribe Notes