Talk at Faster LLM Inference Seminar @ Weizmann Institute of Science 02/2025

Efficiently Serving Reasoning Programs with Certaindex

Talk at PyTorch Webinar 10/2024

DistServe: disaggregating prefill & decoding for LLM inference

Talk at NSF Open-Source Generative AI (OSGAI) Workshop 03/2024

Chatbot Arena (UCSD / LMSys)

Talk at PKU Alumni Association of Northern California (PKUAANC) 02/2024

Tutorial at ODSC West 11/2023

Finetuning, Serving, and Evaluating LLMs in the Wild

Talk at I-X Seminar Series at Imperial College London 10/2023

I-X Seminar: How to train your Vicuna – finetuning & evaluating LLMs in the wild.

Talk at Generative AI Summit, ODSC 07/2023

Generative AI Summit 2023 - Stage 2

Talk at 1st CASL Workshop, MBZUAI 10/2022

THE AI QUORUM

Talk at Ray Summit 08/2022

Alpa - Simple large model training and inference on Ray

Tutorial at AAAI 2021 01/2021

AAAI 2021 Tutorial: Simplifying and Automating Parallel ML via a Programmable and Composable System