Follow the quickstart below to install dependencies, download datasets, train, and merge your model. If you run into any problems, please open an issue on the repository and we will help as soon as possible.
pip install -r requirements.txt
If the installation fails, please open an issue on GitHub.
You can obtain ReasonMap-Plus (for evaluation) and ReasonMap-Train (for RewardMap
training) from Hugging Face, or run the script below; then place all data under the data/
directory.
python utils/download_dataset.py
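After the download step, it can help to confirm that something actually landed under data/ before starting training. The snippet below is a minimal sketch, not part of the repository: `summarize_data_dir` is a hypothetical helper, and because the README does not describe the layout inside data/, it only counts files per top-level entry rather than checking for specific dataset folders.

```python
from pathlib import Path


def summarize_data_dir(root="data"):
    """Return {entry_name: file_count} for each top-level entry under root.

    Hypothetical helper: the expected layout of data/ is an assumption,
    so this only reports what is present and does not validate contents.
    """
    root_path = Path(root)
    if not root_path.is_dir():
        raise FileNotFoundError(
            f"{root_path} does not exist; run the download step first"
        )
    summary = {}
    for entry in sorted(root_path.iterdir()):
        if entry.is_dir():
            # Count regular files anywhere below this subdirectory.
            summary[entry.name] = sum(1 for p in entry.rglob("*") if p.is_file())
        else:
            # A loose file directly under data/ counts as one file.
            summary[entry.name] = 1
    return summary
```

Calling `summarize_data_dir()` from the repository root returns a dictionary you can print or inspect. An empty dictionary, or an entry with zero files, suggests the download did not complete.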
Start RewardMap training with the provided script:
# RewardMap training
bash scripts/reward_map.sh
After training, merge the trained weights:
# merge trained model
bash scripts/merge_model.sh
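If you prefer to launch training and merging in one go, the two commands above can be chained so that merging only runs when training exits cleanly. This is a rough sketch under that assumption: `run_steps` and `STEPS` are illustrative names, not something the repository provides, and the script paths are taken verbatim from the commands above.

```python
import subprocess

# The two quickstart commands, in the order this README gives them.
STEPS = [
    ["bash", "scripts/reward_map.sh"],
    ["bash", "scripts/merge_model.sh"],
]


def run_steps(steps=STEPS, runner=subprocess.run):
    """Run each command in order and stop at the first non-zero exit code.

    `runner` is injectable so the control flow can be exercised without
    launching the real scripts (an illustrative design choice).
    """
    completed = []
    for cmd in steps:
        result = runner(cmd)
        if result.returncode != 0:
            raise RuntimeError(
                f"step failed ({result.returncode}): {' '.join(cmd)}"
            )
        completed.append(cmd)
    return completed
```

Run `run_steps()` from the repository root. A failed training run raises `RuntimeError` before the merge script is invoked, which avoids merging stale or partial weights.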
@article{feng2025rewardmap,
title={RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning},
author={Feng, Sicheng and Tuo, Kaiwen and Wang, Song and Kong, Lingdong and Zhu, Jianke and Wang, Huan},
journal={arXiv preprint arXiv:2510.02240},
year={2025}
}