Biography
I am advised by Prof. Zhihui Zhu at The Ohio State University. My work spans various directions:
- Layerwise representation analysis in deep networks. Characterizing how representations evolve across layers to reveal what each layer learns and how it shapes generalization.
- In-context learning in large language models. Understanding the internal mechanisms of in-context learning and extending it to more general settings.
- LLM-driven scientific discovery. Designing principled algorithms that turn LLMs into search engines for scientific codes.
- Multimodal LLMs and image restoration. Improving vision–language alignment and building content- and task-aware models for unified low-level vision tasks.
Feel free to reach out to me by email—I am always happy to chat about research and collaborations!
Education
-
The Ohio State University (OSU)Doctor of Philosophy in Computer Science and Engineering
-
University of Michigan (UMich)Master of Science in Electrical & Computer Engineering
-
Beihang University (BUAA)Bachelor of Science in Electrical Engineering
Experience
-
Research Intern@ Microsoft, Redmond, WA
-
Research Intern@ Microsoft, Redmond, WA
-
Research Assistant@ The Ohio State University, Columbus, OH
Selected Publications
(★ indicates first-author work)
-
Improving Visual Discriminability of CLIP for Training-Free Open-Vocabulary Semantic Segmentation
Transactions on Machine Learning Research (TMLR), 2026
-
Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations
International Conference on Learning Representations (ICLR), 2026
-
Learning to Adapt: In-Context Learning Beyond Stationarity
International Conference on Learning Representations (ICLR), 2026
-
On the Convergence of Gradient Descent on Learning Transformers with Residual Connections
IEEE Signal Processing Letters (SPL), 2026
-
ProCrop: Learning Aesthetic Image Cropping from Professional Compositions
AAAI Conference on Artificial Intelligence (AAAI), 2026
-
Can Reasoning LLMs Eliminate Conformity in Multi-Agent Systems?
IEEE International Conference on Data Mining Workshops (ICDMW), 2025
-
Are All Layers Created Equal: A Neural Collapse Perspective
Conference on Parsimony and Learning (CPAL), 2025
-
DREAM: Diffusion Rectification and Estimation-Adaptive Models
Computer Vision and Pattern Recognition (CVPR), 2024
Selected Preprints
-
From Emergence to Control: Probing and Modulating Self-Reflection in Language Models
arXiv preprint, 2025
-
The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
arXiv preprint, 2023
Visitors