3D reconstruction – Classical SfM vs. Deep Learning (NeRF)

Overview: This project explores 3D reconstruction techniques by comparing a classical Structure from Motion (SfM) approach with a deep learning-based Neural Radiance Fields (NeRF) model. We reconstructed a building’s 3D structure using feature matching, camera pose estimation, and triangulation, then compared the study by generating photorealistic novel views using NeRF, a neural network-based volumetric rendering technique.

GitHub Repository: View Source Code on GitHub

Key Features:

  • Structure from Motion (SfM): Uses classical computer vision techniques like feature detection (SIFT), fundamental & essential matrices, PnP, and bundle adjustment. Generates a 3D point cloud representing the building’s geometry.
  • Neural Radiance Fields (NeRF): Uses a deep neural network to synthesize realistic novel views from a sparse set of 2D images. Implements Tiny NeRF, an efficient version of NeRF, for volumetric scene representation.
  • Dataset: A set of images captured with a GoPro Hero 3 of a building at Levine Hall, UPenn, corrected for fisheye lens distortion.
  • Camera Pose Estimation: Extracted using Essential Matrix Decomposition and PnP with RANSAC. Triangulated 3D points were refined using Non-Linear Triangulation and Bundle Adjustment.

Results & Impact

SfM Approach: Successfully reconstructed a 3D point cloud of the building, though it remained sparse due to dataset limitations.

NeRF Approach: Generated high-quality novel views, offering a realistic visualization of the scene, but without explicit 3D coordinates.

Comparison:

  • SfM excels in accurate spatial measurements and geometric scene reconstruction.
  • NeRF provides photorealistic renderings but lacks geometric data.
  • Combining both techniques can enhance 3D mapping and visualization for AR/VR and autonomous navigation.

Technologies Used:

  • Classical Computer Vision: OpenCV, SIFT, RANSAC, PnP, Bundle Adjustment
  • Deep Learning: Tiny NeRF, PyTorch, Positional Encoding
  • 3D Reconstruction: Poisson Surface Reconstruction, Camera Pose Estimation
  • Dataset: Custom GoPro Hero 3 dataset, Lego dataset (for NeRF)

This project evaluates classical vision-based 3D reconstruction and modern AI-driven rendering techniques, demonstrating the strengths and trade-offs of each approach in real-world applications.

Project Report

Below is the full project report. You can view it directly here or download it.

Download Report