Loading...

Near-Field Photometric Stereo for 3D Endoscopic Reconstruction

Indian Institute of Technology Madras, RBCDSAI | Oct 2021 – Mar 2023
Advisor: Prof. Kaushik Mitra

Conventional endoscopy misses ~25% of gastrointestinal lesions because it relies on color contrast, yet many clinically significant lesions are defined by surface topography. Photometric stereo can recover pixel-wise depth and surface normals from varying illumination without camera motion — but endoscopic scenes violate its core assumptions: light sources are millimeters from tissue, depth range is large relative to source distance, and per-pixel illumination direction varies drastically across the image.

Capture hardware. Left: Benchtop setup with 24-LED ring (4cm radius) and DSLR camera, later replaced with FLIR.

Over 18 months, I designed several capture setups (benchtop 24-LED ring and ex-vivo 3D-printed stand), we built a Unity pipeline generating 5,244 synthetic endoscopic scenes with ground-truth normals and depth, trained a near-field photometric stereo model (Lichy et al., CVPR 2022), and systematically evaluated it across four hardware configurations converging toward real endoscopic geometry — from a 4cm-radius benchtop ring down to 6 LEDs mounted directly on an endoscope at 1cm radius. The configuration with 8-LED ring, 1.25cm radius at the endoscope opening produced the strongest dense reconstructions; the direct-LED attempt failed due to LED directionality, establishing a practical lower bound on source geometry.

Input scene

Predicted depth map

3D reconstruction

Key insight: Reconstruction quality depended more on hardware design choices — LED directionality, radius-to-distance ratio, training-deployment geometry matching — than on model architecture. Cast shadows remain the primary failure mode, motivating fusion with complementary depth methods.

Code & Data: Available upon request through Prof. Kaushik Mitra's Computational Imaging Lab

Website credits to Takuya Matsuyama3D model credits to Tzeshi