Research
I am interested in reinforcement learning, generative models, sample-efficient exploration, robotics, and computer vision. Some papers are highlighted.
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson*, Qiyang Li*, Kevin Frans, Sergey Levine
ICML, 2025
project page / arXiv / bibtex
Leveraging unlabeled offline data for both skill pretraining and optimistic relabelling leads to extremely efficient online exploration.
Polynomial Regression as a Task for Understanding In-context Learning Through Finetuning and Alignment
Max Wilcoxson*, Morten Svendgård*, Ria Doshi*, Dylan Davis*, Reya Vir, Anant Sahai
ICML Workshop on In-Context Learning, 2024
arXiv / bibtex
Univariate polynomial regression is a simple task that captures in-context fine-tuning and alignment behavior, facilitating informative visualizations and deeper understanding.