My name is Xiaoke Huang, and I am a Ph.D. student at the UC Santa Cruz, advised by Prof. Yuyin Zhou and Prof. Cihang Xie. My research focuses on multi-modal reasoning, agentic models, and AI for healthcare; previously, I received my Master’s degree from Tsinghua University (worked on vision–language learning and 3D reconstruction) and Bachelor’s degree from Beijing Normal University. I have interned at Microsoft Research and Meta.

News

[More]

Publications and Preprints

For more works please check here.

* indicates equal contribution.

Vision-Language Learning

  • Segment and Caption Anything
    Xiaoke Huang, Jianfeng Wang, Yansong Tang, Zheng Zhang, Han Hu, Jiwen Lu, Lijuan Wang, Zicheng Liu
    Conference on Computer Vision and Pattern Recognition (CVPR), 2024
    [project page] [paper] [code]

    sca-teaser
  • OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression
    Wanhua Li*, Xiaoke Huang*, Zheng Zhu, Yansong Tang, Xiu Li, Jiwen Lu, Jie Zhou
    Conference on Neural Information Processing Systems (NeurIPS), 2022
    [project page] [paper] [code] [中文解读]

    ordinalclip_framework

Human Digitization

Internship

Research Intern, Meta (MGenAI), London, UK. June-Nov., 2025.

Research Intern, Microsoft Research, Asia, Beijing, China. April-September, 2023.

Misc

I enjoy reading non-fiction.

Some recent and highly recommended selections (Dec. 2025):

  • Exercised: Why Something We Never Evolved to Do Is Healthy and Rewarding,
  • The Story of the Human Body: Evolution, Health and Disease,
  • Fooled by Randomness: The Hidden Role of Chance in Life and in the Markets,
  • The Black Swan,
  • Antifragile: Things That Gain from Disorder,
  • Skin in the Game: Hidden Asymmetries in Daily Life,
  • and The Fabric of Reality: The Science of Parallel Universes and Its Implications.