Bei Liu
Senior Researcher, Microsoft Research Asia, Beijing.
📍 Beijing, China
🏢 Microsoft Research Asia
🔬 Visual Computing Group
My research focuses on Multimodal AI, Document Understanding, and AI Agents. I also serve as a Guest Associate Professor at Nagoya University in Japan. Before joining Microsoft, I earned my Ph.D. and Master’s degrees from Kyoto University, Japan, under the guidance of Professors Katsumi Tanaka, Masatoshi Yoshikawa, and Makoto P. Kato. I hold a Bachelor’s degree from Nanjing University, China.
My current interest is in enabling agents that actively read, navigate, and reason over complex documents, combining perception, planning, and tool use.
I am open to research collaboration, academic visits, and supervising interns working on multimodal agents. Feel free to reach out!
news
| Jan 1, 2025 | One paper accepted to MMM 2025, awarded 🏆 Best Paper! |
|---|---|
| Dec 1, 2024 | One paper accepted to ACM MMAsia 2024, awarded Best Student Paper Runner-Up. 🎉 |
selected publications
- NeurIPSLong-Form Video-Language Pre-Training with Multimodal Temporal Contrastive LearningIn NeurIPS, 2022
- CVPRAdvancing High-Resolution Video-Language Representation with Large-Scale Video TranscriptionsIn CVPR, 2022
- ICLR
- NeurIPS
- MMAsiaViCo: Engaging Video Comment Generation with Human Preference RewardsIn MMAsia, 2024Best Student Paper Runner-Up