Publications
*Participant name in bold works at KRAFTON
Filter
RoDAC: A Robust Data-centric Anti-Cheat Framework for Fair Online Competitive Gaming
RoDAC: A Robust Data-centric Anti-Cheat Framework for Fair Online Competitive Gaming
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control
Identifiable Token Correspondence for World Models
Identifiable Token Correspondence for World Models
Convex Distance Operator Transport: Convex and Geometry-Preserving Formulation
Convex Distance Operator Transport: Convex and Geometry-Preserving Formulation
How to Correctly Report LLM-as-a-Judge Evaluations
How to Correctly Report LLM-as-a-Judge Evaluations
ReJump: A Tree-Jump Representation for Analyzing and Improving LLM Reasoning
ReJump: A Tree-Jump Representation for Analyzing and Improving LLM Reasoning
Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models
Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling
Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO
Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO
Coverage Improvement and Fast Convergence of On-policy Preference Learning