AgentVidBench: A Multi-Hop Video Question Answering Benchmark for Evaluating MLLM Agents

Junhyuck Kim, Jihun Yun, Haechan Kim, Gyeongman Kim, Joonghyun Bae, Jaewoong Cho

Language Model ICML 2026 Workshop
AsyncOPD: How Stale Can On-Policy Distillation Be?

Wonjun Kang, Kevin Galim, Seunghyuk Oh, Minjun Kang, Sanghyun Park, Donghoon Kim, Minjae Lee, Minseo Kim, Rishabh Tiwari, Yuchen Zeng, Hyung Il Koo, Kangwook Lee

Language Model ICML 2026 Workshop
Online Agent-as-a-Judge: Situation-Generating Evaluation for Interactive Agents

Hyogon Ryu, Jeonghwan Kim, Yewon Lim, Chaeun Lee, Jeongwook Kim, Donghoon Ham

Language Model ICML 2026 Workshop
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling

Seojeong Park*, Jiho Choi*, Junyong Kang, Seonho Lee, Jaeyo Shin, Hyunjung Shim

Language Model ICML 2026
Meta-Harness: End-to-End Optimization of Model Harnesses

Yoonho Lee, Roshen Nair, Qizheng Zhang, Kangwook Lee, Omar Khattab, Chelsea Finn

Language Model ICML 2026 Workshop
Uniform Spectral Growth under Factor-wise Muon Orthogonalization in Matrix Factorization and LoRA

Changmin Kang*, Jihun Yun*, Baekrok Shin, Yeseul Cho, Chulhee Yun

Theoretical ICML 2026 Workshop
Pruning and Distilling Mixture-of-Experts into Dense Language Models

Junhyuck Kim, Jihun Yun, Haechan Kim, Gyeongman Kim, Joonghyun Bae, Jaewoong Cho

Language Model ICML 2026 Workshop
AMUSE: Anytime Muon with Stable Gradient Evaluation

Jueun Kim*, Baekrok Shin*, Jihun Yun, Beomhan Baek, Minhak Song, Chulhee Yun

Theoretical ICML 2026 Workshop
RoDAC: A Robust Data-centric Anti-Cheat Framework for Fair Online Competitive Gaming

Minsu Kim, Junwoo Park, Chanho Lee, Gibum Seo, Steven Euijong Whang, Hyuck Lee

Data-centric AI KDD 2026
Identifiable Token Correspondence for World Models

Youngin Kim, Ray Sun, Inho Kim, Bumsoo Park, Hyun Oh Song

Language Model ICML 2026