Hao AI Lab @ UCSD

    Blogs

    Dynasor: More Efficient Chain-of-Thought Through Certainty Probing

    February 16, 2025

    Yichao Fu*, Junda Chen*, Yonghao Zhuang, Zheyu Fu, Ion Stoica, Hao Zhang

    GameArena: Evaluating LLM Reasoning through Live Computer Games

    February 10, 2025

    Game Arena Team

    Efficient LLM Scheduling by Learning to Rank

    January 13, 2025

    Yichao Fu, Siqi Zhu, Runlong Su, Aurick Qiao, Ion Stoica, Hao Zhang

    MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving

    May 20, 2024

    Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang

    Consistency Large Language Models: A Family of Efficient Parallel Decoders

    May 6, 2024

    Siqi Kou*, Lanxiang Hu*, Zhezhi He, Zhijie Deng, Hao Zhang

    Throughput is Not All You Need: Maximizing Goodput in LLM Serving using Prefill-Decode Disaggregation

    March 17, 2024

    Junda Chen, Yinmin Zhong, Shengyu Liu, Yibo Zhu, Xin Jin, Hao Zhang

    © 2025 Hao AI Lab @ UCSD. Powered by Hugo & PaperMod, adapted by Lanxiang Hu, Junda Chen & Hao Zhang