MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models
Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
Dohwan Ko, Jinyoung Park, Seoung Choi, Sanghyeok Lee, Seohyun Lee, and Hyunwoo J. Kim.
Accepted to CVPR 2026.
Recommended citation: Dohwan Ko, Jinyoung Park, Seoung Choi, Sanghyeok Lee, Seohyun Lee, and Hyunwoo J. Kim. (2026). "MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models." CVPR.
