
Sophia Bano
UCL, United Kingdom
Robot vision and scene understanding for minimally invasive surgery.
An ECCV workshop on data as the bottleneck for robust surgical computer vision, from curated multi-endoscope datasets to large multimodal clinical benchmarks and surgical visual understanding.
A focused program of invited talks, peer-reviewed papers, and dataset spotlights.
Reliable medical systems depend on data quality as much as model novelty. Intraoperative video is scarce, expensive to annotate, and ethically restricted. DCA-MI advances data-centric methods and benchmarks as first-class research contributions, with particular attention to surgical scene understanding, endoscopic geometry, and clinical translation.
"The bottleneck is rarely the model. It is almost always the data."
Organizing Committee

UCL, United Kingdom
Robot vision and scene understanding for minimally invasive surgery.
DKFZ / Heidelberg University, Germany
Surgical data science, benchmarking, and reproducible evaluation.
Universidad de Zaragoza, Spain
Visual SLAM, deformable SLAM for endoscopy, EndoMapper.
CUHK, Hong Kong
Medical and surgical AI across MICCAI, IPCAI, and ICRA.
How robust visual perception supports minimally invasive surgical workflows.
How dataset design, provenance, and reproducible evaluation shape trustworthy medical AI.
How calibrated surgical data enables mapping, reconstruction, and deformable scene representations.
How surgical datasets connect perception research to deployable clinical systems.
Featured advisor
NCT Dresden professor working on surgical data science, computer-assisted surgery, robotic vision, and AI-enabled clinical translation.
Invited speaker
UCL researcher focused on robot vision and scene understanding for minimally invasive surgery.
Invited speaker
DKFZ and Heidelberg University researcher in surgical data science, benchmarking, and reproducible evaluation.
Invited speaker
Universidad de Zaragoza researcher known for visual SLAM, deformable SLAM for endoscopy, and EndoMapper.
Invited speaker
Chinese University of Hong Kong researcher working on medical and surgical AI.
Each spotlight treats a dataset as a research contribution: acquisition decisions, ground-truth design, split strategy, and evaluation pitfalls.

A Multi-Endoscope Dataset for Surgical 3D Perception
iMED focuses the workshop on synchronized, dual-view surgical scenes with deformable tissue and specular highlights. The dataset is designed to stress-test visual geometry, photogrammetry, and endoscopic reconstruction beyond rigid-world assumptions.

Clinical Large-scale Integrative Multimodal Benchmark
A multimodal clinical benchmark spanning imaging, language, time series, graph, and multimodal patient data.
arXiv:2503.07667
Surgical Visual Understanding Dataset
Robot-assisted surgical video with tool and task labels, built for visual understanding under operating-room scale and imbalance.
arXiv:2501.09209
Figure sources: iMED local paper assets; CLIMB Figure 2 from arXiv:2503.07667; SurgVU Figure 1 from arXiv:2501.09209.
OpenReview information, templates, and final submission instructions become available.
Full papers, extended abstracts, and dataset submissions due by 23:59 AOE.
Decisions sent by email with presentation type.
Final PDFs uploaded and program frozen.
DCA-MI 2026 meets during ECCV on September 8-9.
We invite contributions across curation, augmentation, restoration, 3D perception, learning with limited or imperfect data, and clinical translation. Submissions may be empirical, methodological, or position-style, or may introduce new datasets and benchmarks with reproducible baselines.
We are especially interested in work that shows how upstream data decisions propagate into downstream clinical and surgical performance.
Full papers, extended abstracts, and dataset / benchmark submissions are welcome. Final page limits and template links will be posted with the OpenReview call.
Each submission receives technical and domain review, with dataset papers evaluated for provenance, license clarity, and reproducibility.
All datasets must document source, consent basis, licensing, and known coverage limits. Submissions with unclear protected-data handling may be desk-rejected.
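As a rough illustration of the documentation requirement above, authors could run a simple completeness check on their dataset description before submitting. The sketch below is not an official workshop tool: the `DatasetCard` class, its field names, and the `missing_fields` helper are hypothetical names chosen to mirror the four items listed (source, consent basis, licensing, and known coverage limits).

```python
from dataclasses import dataclass, fields


@dataclass
class DatasetCard:
    """Hypothetical record of the four items a dataset submission must document."""
    source: str = ""
    consent_basis: str = ""
    license: str = ""
    coverage_limits: str = ""


def missing_fields(card: DatasetCard) -> list[str]:
    """Return the names of fields that are empty or contain only whitespace."""
    return [f.name for f in fields(card) if not getattr(card, f.name).strip()]


# Placeholder values only; the license is left blank on purpose
# to show what an incomplete card looks like.
example = DatasetCard(
    source="Placeholder: where and how the recordings were acquired",
    consent_basis="Placeholder: consent or waiver under which data was collected",
    license="",
    coverage_limits="Placeholder: known gaps in procedures, sites, or devices",
)

print(missing_fields(example))
```

An empty list from `missing_fields` only means every item has some text; it does not replace the reviewers' assessment of provenance, license clarity, or protected-data handling.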
Welcome and workshop overview from the organizers.
Invited talk on the data bottleneck for robust medical and surgical AI.
Interactive poster session and attendee discussion.
Four accepted papers, 15 minutes each.
Lunch break.
Invited talk on endoscopic geometry, mapping, and scene representation.
iMED: Multi-Endoscope Dataset | 20 min
CLiMB: Benchmark for Colonoscopy SLAM | 20 min
SurgVU: Surgical Visual Understanding | 20 min
Second poster block and hallway discussion.
Invited talk on translating surgical data into robust clinical systems.
Four accepted papers, 15 minutes each.
Closing notes and next steps from the organizers.

UCL, United Kingdom
Robot vision and scene understanding for minimally invasive surgery.
DKFZ / Heidelberg University, Germany
Dataset scarcity, design, curation, and reproducible surgical data science.
Universidad de Zaragoza, Spain
SLAM, neural rendering, deformable reconstruction, and EndoMapper.
CUHK, Hong Kong
Autonomous surgery, clinical translation, and medical computer vision.
Primary contact | Intuitive Surgical

iMED dataset lead | UCL Hawkes Institute

CLiMB benchmark lead | Universidad de Zaragoza

Organizer | Intuitive Surgical

Organizer | Intuitive Surgical

Computer Vision & Medical Imaging engineer at Intuitive Surgical, specializing in advanced imaging and robotic-assisted procedures. CMU Robotics Institute alumnus; presenter and reviewer across IEEE TRO, IROS, CVPR, and ICML.
shuoqi.chen@intusurg.com

Research Scientist | Intuitive Surgical

SurgVU lead | Intuitive Surgical
Featured advisor
Professor for Translational Surgical Oncology at NCT Dresden, working on surgical data science, computer-assisted surgery, robotic vision, and AI-enabled clinical translation.
Robot vision
UCL researcher focused on robot vision and scene understanding for minimally invasive surgery.
Surgical data science
DKFZ and Heidelberg University researcher in surgical data science, benchmarking, and reproducible evaluation in medical AI.
Visual SLAM
Universidad de Zaragoza researcher known for visual SLAM, ORB-SLAM, deformable SLAM for endoscopy, and the EndoMapper dataset.
Medical AI
Chinese University of Hong Kong researcher working on medical and surgical AI, with recent work across MICCAI, IPCAI, and ICRA.