This repository provides codes and models for two powerful multi-task transformer models for scene understanding. Please check the following pages for details:
Hanrong Ye and Dan Xu, TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding. ICLR 2023
Hanrong Ye and Dan Xu, Inverted Pyramid Multi-task Transformer for Dense Scene Understanding. ECCV 2022
BibTex:
@InProceedings{invpt2022,
title={Inverted Pyramid Multi-task Transformer for Dense Scene Understanding},
author={Ye, Hanrong and Xu, Dan},
booktitle={ECCV},
year={2022}
}
@InProceedings{taskprompter2023,
title={TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding},
author={Ye, Hanrong and Xu, Dan},
booktitle={ICLR},
year={2023}
}
@article{ye2023invpt++,
title={InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding},
author={Ye, Hanrong and Xu, Dan},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
year={2024}
}
Please do consider 🌟 star our project to share with your community if you find this repository helpful!
Please contact Hanrong Ye if any questions.
Few-show learning of multiple tasks: Visual Token Matching (ICLR 2023 Outstanding Paper Award)