MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models

Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

Dohwan Ko, Jinyoung Park, Seoung Choi, Sanghyeok Lee, Seohyun Lee, and Hyunwoo J. Kim.

Accepted to CVPR 2026.

Paper

Recommended citation: Dohwan Ko, Jinyoung Park, Seoung Choi, Sanghyeok Lee, Seohyun Lee, and Hyunwoo J. Kim. (2026). "MoE-GRPO: Optimizing Mixture-of-Experts via Reinforcement Learning in Vision-Language Models." CVPR.