Anwesa Choudhuri

I am a Senior Research Scientist at United Imaging Intelligence, Boston, MA

I completed my Ph.D. in Electrical and Computer Engineering from the University of Illinois Urbana-Champaign (UIUC) in 2024, where I was advised by Girish Chowdhary and Alex Schwing.

My research focuses on video understanding, image and video segmentation, and image and video captioning.

News

🎗️ I will be serving as an Area Chair for CVPR 2026.
🎗️ I will be serving as an Area Chair for ICLR 2026.
📰 We are hosting the First Workshop on Advanced Perception in Autonomous Healthcare in conjunction with ICCV 2025.
📚 2 papers accepted at ICCV 2025: CHROME, 7DGS.
📚 1 paper accepted at MICCAI 2025: PolypSegTrack.
📚 1 paper accepted at CVPR 2025: Seq2Time.
📚 3 papers accepted at ICLR 2025: Order-aware Interactive Segmentation, 3D Vision-Language Gaussian Splatting, 6D Gaussian Splatting.
📚 1 paper accepted at NeurIPS 2024: OW-VISCaptor.
💼 [July 2024] Joined United Imaging Intelligence as a Senior Research Scientist.
🎓 [May 2024] Defended my Ph.D. thesis.
📚 1 paper accepted at CVPR 2023: Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation.
📚 1 paper accepted at ICCV 2021: Assignment-Space-Based Multi-Object Tracking and Segmentation.

Some of my recent publications are listed below.

	PolypSegTrack: Unified Foundation Model for Colonoscopy Video Analysis A Choudhuri, Z Gao, M Zheng, B Planche, T Chen, Z Wu MICCAI, 2025 arXiv A foundation model for joint detection, segmentation, classification, and tracking of polyps in endoscopy videos.
	Order-aware Interactive Segmentation B Wang, A Choudhuri, M Zheng, Z Gao, B Planche, A Deng, Q Liu, T Chen, Z Wu ICLR, 2025 arXiv Transformer-based model for fine-grained interactive segmentation.
	OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning A Choudhuri, G Chowdhary, A Schwing NeurIPS, 2024 Project / arXiv Video instance segmentation and captioning of open-world and close-world objects.
	Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation A Choudhuri, G Chowdhary, AG Schwing CVPR, 2023 Project / Paper Simple approach for multiple video segmentation tasks using context-aware relative object queries (CAROQ).
	Assignment-Space-Based Multi-Object Tracking and Segmentation A Choudhuri, G Chowdhary, AG Schwing ICCV, 2021 Project / Paper Globally optimal solution for multi-object tracking and segmentation (MOTS) over the space of assignments.

Workshop Organizer: ICCV 2025 Workshop on Advanced Perception in Autonomous Healthcare, 2025
Reviewer: CVPR, ECCV, ICCV, NeurIPS, ICML, ICLR, ICRA, JFR, AAAI
Teaching: Computer Systems and Programming (ECE 220), UIUC, Fall 2021
Mentoring:
- Bin Wang, Ph.D. student, Northwestern University, Summer 2024
- Yuhao Su, Ph.D. student, Northeastern University, Summer 2025

Template adopted from here.