Talks

LLM/VLM gaming agents, model evaluation and training games.

Unified post-training and inference framework for accelerated video generation

Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

Efficiently Serving Reasoning Programs with Certaindex

DistServe: disaggregating prefill & decoding for LLM inference

Chatbot Arena (UCSD / LMSys)

Finetuning, Serving, and Evaluating LLMs in the Wild

I-X Seminar: How to train your Vicuna – finetuning & evaluating LLMs in the wild.

Generative AI Summit 2023 - Stage 2

THE AI QUORUM

Alpa - Simple large model training and inference on Ray

AAAI 2021 Tutorial: Simplifying and Automating Parallel ML via a Programmable and Composable System