TAMU SKY Lab
RM-R1: Reward Modeling as Reasoning
Xiusi Chen, Gaotang Li, Ziqi Wang, Bowen Jin, Cheng Qian, Yu Wang, Hongru Wang, Yu Zhang, Denghui Zhang, Tong Zhang, Hanghang Tong, Heng Ji
April 2026
Type: Conference paper
Publication: ICLR 2026
Yu Zhang
Assistant Professor