Talks
Talk at Faster LLM Inference Seminar @ Weizmann Institute of Science 02/2025
Efficiently Serving Reasoning Programs with Certaindex
Talk at PyTorch Webinar 10/2024
DistServe: disaggregating prefill & decoding for LLM inference
Talk at NSF Open-Source Generative AI (OSGAI) Workshop 03/2024
Chatbot Arena (UCSD / LMSys)
Talk at PKU Alumni Association of Northern California (PKUAANC) 02/2024
Tutorial at ODSC West 11/2023
Finetuning, Serving, and Evaluating LLMs in the Wild
Talk at I-X Seminar Series at Imperial College London 10/2023
I-X Seminar: How to train your Vicuna – finetuning & evaluating LLMs in the wild.
Talk at Generative AI Summit, ODSC 07/2023
Generative AI Summit 2023 - Stage 2
Talk at 1st CASL Workshop, MBZUAI 10/2022
THE AI QUORUM
Talk at Ray Summit 08/2022
Alpa - Simple large model training and inference on Ray
Tutorial at AAAI 2021 01/2021
AAAI 2021 Tutorial: Simplifying and Automating Parallel ML via a Programmable and Composable System