Yanchao Sun's Homepage

Yanchao Sun (孙妍超)

ycsun1531 at gmail dot com
Google Scholar
Github
LinkedIn

About me

I am curretly a machine learning research engineer at Apple AI/ML. I work on foundation models, with a focus of LLM reasoning and reinforcement learning.

Prior to Apple, I was a research scientist at JPMorgan AI Research focused on building LLM-powered autonomous agents and foundation models for finance.

I obtained my Ph.D. degree from the University of Maryland, College Park, where I am fortunate to be advised by Prof. Furong Huang. My Ph.D. research mainly focuses on reinforcement learning (RL) and trustworthy machine learning. Here is my thesis: thesis.

News and Highlights Back to Top

[2025.3]	Our paper on token-level DPO was accepted by ICLR 2025. Paper Link. Great work by our intern Aiwei Liu!
[2025.3]	I'm honored to give a talk at the Steve Jobs Theater!
[2025.1]	My first paper at Apple, which presents an easy-to-use LLM Agent Benchmark, was accepted by NAACL 2025. Try it out at: Project Page.
[2024.7]	Our paper on LLM agent with offline learning got accepted by the 1st COLM conference. Thanks to all collaborators at JPMorgan! Link.
[2024.1]	We have 4 papers accepted by ICLR 2024: 1 spotlight and 3 posters (paper 1, paper 2, paper 3).
[2023.9]	Two papers accepted by NeurIPS 2023: 1 spotlight and 1 poster.
[2023.7]	Our paper about generalist agent has been accepted by ICCV 2023: Paper Link.
[2023.7]	I am co-organizing the ICCV 2023 workshop PerDream: PERception, DEcision making, and REAsoning through Multimodal foundational modeling. Visit our website for details.
[2023.1]	We have 3 papers accepted by ICLR 2023 — 1 spotlight and 2 posters (paper 1, paper 2)!
[2022.11]	I am co-organizing the first Reincarnating RL workshop at ICLR 2023. Learn more at reincarnating-rl.github.io.
[2022.09]	Three of our papers were accepted to NeurIPS 2022. 1 spotlight and 2 posters (paper 1 paper 2)
[2022.01]	Two papers accepted to ICLR 2022! Check them out: adversarial RL and transfer RL.
[2021.12]	Our paper "Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL" received the Best Paper Award at the SafeRL 2021 Workshop!

Selected Publications Back to Top

Please see here for a full list of my publications.

NAACL 2025	MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains. Guoli Yin, Haoping Bai, Shuang Ma, Feng Nan, Yanchao Sun, et al. In Proceedings of the 2025 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2025.* Paper Code HTML
COLM 2024	O3D: Offline Data-driven Discovery and Distillation for Sequential Decision-Making with Large Language Models. Yuchen Xiao, Yanchao Sun, Mengda Xu, Udari Madhushani, Jared Vann, Deepeka Garg, and Sumitra Ganesh. In The 1st Conference on Language Modeling (COLM), 2024. *Equal Contribution Paper
ICLR 2023	SMART: Self-supervised Multi-task pretrAining with contRol Transformers. Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor Spotlight presentation In International Conference on Learning Representations, 2023 Paper Code Models HTML
NeurIPS 2022	Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning. Yongyuan Liang, Yanchao Sun, Ruijie Zheng, and Furong Huang. (* Equal Contribution) In the 36th Conference on Neural Information Processing Systems, 2022 Paper Code
ICLR 2022	Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL. Yanchao Sun, Ruijie Zheng, Yongyuan Liang, and Furong Huang. In International Conference on Learning Representations, 2022 Best Paper Award in NeurIPS 2021 Workshop of Safe and Robust Control of Uncertain Systems (SafeRL 2021). Paper Code HTML
ICLR 2022	Transfer RL across Observation Feature Spaces via Model-Based Regularization. Yanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew Cohen, and Furong Huang. In International Conference on Learning Representations, 2022 Paper Code HTML
ICLR 2021	Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics. Yanchao Sun, Da Huo, and Furong Huang. In International Conference on Learning Representations, 2021 Paper Code HTML
AAAI 2021	TempLe: Learning Template of Transitions for Sample Efficient Multi-task RL. Yanchao Sun, Xiangyu Yin, and Furong Huang. In AAAI Conference on Artificial Intelligence, 2021 Paper Code HTML

Please see here for a full list of my publications.