Computer Vision Engineer | 3D Reconstruction, Gaussian Splatting & Generative AI | Perception, Unity3D & Spatial Computing
Aktualisiert am 16.04.2026
Profil
Freiberufler / Selbstständiger
Remote-Arbeit
Verfügbar ab: 16.04.2026
Verfügbar zu: 100%
davon vor Ort: 100%
Computer Vision
3D-Rendering
Generative AI
Gaussian Splatting
NeRF
Unity
WebGL
Point Cloud
Mesh Reconstruction
Scene Understanding
Python
C++
COLMAP
English
native
German
Advanced

Einsatzorte

Einsatzorte

Deutschland, Schweiz, Österreich
möglich

Projekte

Projekte

3 Monate
2026-02 - heute

Post-Training LOD Optimization for Gaussian Splatting

  • Developing post-training level-of-detail strategies for large-scale indoor Gaussian scenes without retraining the model.
  • Constructing room-level spatial partitions over trained splats using COLMAP structure and boundary estimation.
  • Implementing renderer-side frustum culling and region-aware activation to reduce active splat count and improve runtime performance on resource-constrained devices.
4 Monate
2026-01 - heute

GeoFuse-SFM: Geometry-Aware Dense Reconstruction from Sparse Structure-from-Motion

  • Built a full incremental SfM pipeline from scratch supporting single- and multi-camera reconstruction systems.
  • Directed geometry-aware seed-based dense depth propagation from sparse SfM points.
  • Introduced reference-anchored multi-view fusion to eliminate view-dependent reconstruction artifacts.
  • Applied triangulation-angle and reprojection error filtering for robust dense reconstruction.
2 Monate
2022-07 - 2022-08

Object Tracking using YOLOv8

  • Implemented and evaluated object detection pipelines using YOLOv3 and YOLOv8 for multi-object scenarios.
  • Integrated DeepSORT to enable real-time multi-object tracking across video sequences.
  • Contributed to the DeepSORT open-source project by updating the codebase for compatibility with latest Python and TensorFlow versions; contribution accepted via merged pull request.
4 Monate
2022-05 - 2022-08

Monocular-Omni-Human-Pose-Estimation

  • Built and annotated a synthetic top-view human pose dataset with structured JSON/XML labels using LabelMe.
  • Optimized a HRNet-based pose estimation model for monocular top-view inputs, reaching ~75% pose accuracy.
  • Performed keypoint evaluation and reduction (17 ? 13) to improve stability and suitability for downstream research experiments.

Aus- und Weiterbildung

Aus- und Weiterbildung

3 Jahre 9 Monate
2020-10 - 2024-06

Studium - Embedded Systems

Master, Technical University of Chemnitz (Germany)
Master
Technical University of Chemnitz (Germany)
Thesis: on request
4 Jahre
2016-08 - 2020-07

Studium - Electronics

Bachelor, Mumbai University (India)
Bachelor
Mumbai University (India)
Thesis: on request

Position

Position

Computer Vision Engineer

Kompetenzen

Kompetenzen

Top-Skills

Computer Vision 3D-Rendering Generative AI Gaussian Splatting NeRF Unity WebGL Point Cloud Mesh Reconstruction Scene Understanding Python C++ COLMAP

Produkte / Standards / Erfahrungen / Methoden

Profile
Computer Vision Engineer specializing in neural 3D scene representations, Gaussian Splatting, and large-scale reconstruction systems. Experienced in building Neural based pipelines, multi-view geometry systems, and synthetic data generation for real-world applications. Focused on research-driven development of scalable 3D understanding and spatial computing systems, with growing interest in extending these approaches to dynamic and temporally consistent representations.

Skills
  • 3D Vision & Geometry: NeRFs, Gaussian Splatting, SFM, SMPL/ SMPL-X, Multiview Geometry, Point Clouds, Camera Calibration
  • Systems & Tools:  Unity3D, Blender, MeshLab, NeRF Studio, Meshroom, EasyMocap, ARKit, RoomPlan, CVAT, LabelMe
  • Vision Libraries & Platforms: OpenCV, Open3D, Panda3D, Trimesh, ROS2, Docker, GCP, Segment Anything, VLMs, Vision Transformer
  • Programming & ML Frameworks: Python, C++, PyTorch, TensorFlow, Three.js, TensorBoard, Weights & Biases

Experience
10/2025 - 11/2025
Kukan, Bendorf (Germany)
Computer Vision Engineer (Contract)
  • Developed a room-scale geometry capture pipeline using ARKit and RoomPlan, converting spatial depth data into structured point clouds and refined mesh representations
  • Trained and deployed 3D Gaussian Splat models from captured imagery, integrating splat-based photorealistic rendering into a Three.js web viewer for interactive spatial exploration
  • Created a collision-aware navigation system using BVH-accelerated physics. Facilitated human-like room-to-room traversal instead of direct teleportation

07/2024 - 06/2025
Kaptura GmbH, Bendorf (Germany)
Computer Vision Engineer
  • Developed NeRF and Gaussian Splatting pipelines for a large-scale multi-camera 3D scanning system, (~250 cameras) improving reconstruction stability and fidelity
  • Built a 2D?3D camera - LED calibration & tracking pipeline with custom markers and homography-based correction; reduced calibration time (~2.5 hours to ~30 minutes)
  • Applied SfM and multi-view geometry to refine camera poses and mitigate drift, scale & alignment errors; reduced RMSE (~4.8 to ~2.1) and stabilized reconstructions
  • Investigated reconstruction of reflective and transparent objects using hybrid rasterization and ray-tracing approaches
  • Containerized applications with Podman and integrated ROS and Streamlit to control scanning system components efficiently

10/2021 - 06/2024
Technical University of Chemnitz (Germany)
Research Assistant - Computer Vision
  • Generated synthetic 3D datasets and reconstructed indoor environments (~500 scenes) using Unity3D and NeRF pipelines
  • Developed simulation datasets for autonomous driving perception experiments (~50?70 outdoor scenes under varied conditions) using Unity3D
  • Built perception pipelines for 3D object recognition, segmentation, and tracking using point clouds, meshes and monocular 3D detection methods
  • Supported dataset preparation and annotation workflows for human pose and object understanding tasks

01/2023 - 02/2024
Technical University of Chemnitz (Germany)
Research Experience - Computer Vision
  • Built a NeRF and Gaussian Splatting based large-scale dataset (~600K images) for fisheye-view human reconstruction and pose estimation
  • Trained neural rendering and pose estimation models using multi-GPU workflows; implemented fine-tuning strategies to extend reconstructions across multiple poses without full retraining
  • Integrated SAM, GroundingDINO, and Vision Transformers for automated human segmentation and scene understanding
  • Generated SMPL-X / SMPL-H meshes and evaluated downstream 2D/3D pose estimation pipelines for reconstruction accuracy and robustness

Einsatzorte

Einsatzorte

Deutschland, Schweiz, Österreich
möglich

Projekte

Projekte

3 Monate
2026-02 - heute

Post-Training LOD Optimization for Gaussian Splatting

  • Developing post-training level-of-detail strategies for large-scale indoor Gaussian scenes without retraining the model.
  • Constructing room-level spatial partitions over trained splats using COLMAP structure and boundary estimation.
  • Implementing renderer-side frustum culling and region-aware activation to reduce active splat count and improve runtime performance on resource-constrained devices.
4 Monate
2026-01 - heute

GeoFuse-SFM: Geometry-Aware Dense Reconstruction from Sparse Structure-from-Motion

  • Built a full incremental SfM pipeline from scratch supporting single- and multi-camera reconstruction systems.
  • Directed geometry-aware seed-based dense depth propagation from sparse SfM points.
  • Introduced reference-anchored multi-view fusion to eliminate view-dependent reconstruction artifacts.
  • Applied triangulation-angle and reprojection error filtering for robust dense reconstruction.
2 Monate
2022-07 - 2022-08

Object Tracking using YOLOv8

  • Implemented and evaluated object detection pipelines using YOLOv3 and YOLOv8 for multi-object scenarios.
  • Integrated DeepSORT to enable real-time multi-object tracking across video sequences.
  • Contributed to the DeepSORT open-source project by updating the codebase for compatibility with latest Python and TensorFlow versions; contribution accepted via merged pull request.
4 Monate
2022-05 - 2022-08

Monocular-Omni-Human-Pose-Estimation

  • Built and annotated a synthetic top-view human pose dataset with structured JSON/XML labels using LabelMe.
  • Optimized a HRNet-based pose estimation model for monocular top-view inputs, reaching ~75% pose accuracy.
  • Performed keypoint evaluation and reduction (17 ? 13) to improve stability and suitability for downstream research experiments.

Aus- und Weiterbildung

Aus- und Weiterbildung

3 Jahre 9 Monate
2020-10 - 2024-06

Studium - Embedded Systems

Master, Technical University of Chemnitz (Germany)
Master
Technical University of Chemnitz (Germany)
Thesis: on request
4 Jahre
2016-08 - 2020-07

Studium - Electronics

Bachelor, Mumbai University (India)
Bachelor
Mumbai University (India)
Thesis: on request

Position

Position

Computer Vision Engineer

Kompetenzen

Kompetenzen

Top-Skills

Computer Vision 3D-Rendering Generative AI Gaussian Splatting NeRF Unity WebGL Point Cloud Mesh Reconstruction Scene Understanding Python C++ COLMAP

Produkte / Standards / Erfahrungen / Methoden

Profile
Computer Vision Engineer specializing in neural 3D scene representations, Gaussian Splatting, and large-scale reconstruction systems. Experienced in building Neural based pipelines, multi-view geometry systems, and synthetic data generation for real-world applications. Focused on research-driven development of scalable 3D understanding and spatial computing systems, with growing interest in extending these approaches to dynamic and temporally consistent representations.

Skills
  • 3D Vision & Geometry: NeRFs, Gaussian Splatting, SFM, SMPL/ SMPL-X, Multiview Geometry, Point Clouds, Camera Calibration
  • Systems & Tools:  Unity3D, Blender, MeshLab, NeRF Studio, Meshroom, EasyMocap, ARKit, RoomPlan, CVAT, LabelMe
  • Vision Libraries & Platforms: OpenCV, Open3D, Panda3D, Trimesh, ROS2, Docker, GCP, Segment Anything, VLMs, Vision Transformer
  • Programming & ML Frameworks: Python, C++, PyTorch, TensorFlow, Three.js, TensorBoard, Weights & Biases

Experience
10/2025 - 11/2025
Kukan, Bendorf (Germany)
Computer Vision Engineer (Contract)
  • Developed a room-scale geometry capture pipeline using ARKit and RoomPlan, converting spatial depth data into structured point clouds and refined mesh representations
  • Trained and deployed 3D Gaussian Splat models from captured imagery, integrating splat-based photorealistic rendering into a Three.js web viewer for interactive spatial exploration
  • Created a collision-aware navigation system using BVH-accelerated physics. Facilitated human-like room-to-room traversal instead of direct teleportation

07/2024 - 06/2025
Kaptura GmbH, Bendorf (Germany)
Computer Vision Engineer
  • Developed NeRF and Gaussian Splatting pipelines for a large-scale multi-camera 3D scanning system, (~250 cameras) improving reconstruction stability and fidelity
  • Built a 2D?3D camera - LED calibration & tracking pipeline with custom markers and homography-based correction; reduced calibration time (~2.5 hours to ~30 minutes)
  • Applied SfM and multi-view geometry to refine camera poses and mitigate drift, scale & alignment errors; reduced RMSE (~4.8 to ~2.1) and stabilized reconstructions
  • Investigated reconstruction of reflective and transparent objects using hybrid rasterization and ray-tracing approaches
  • Containerized applications with Podman and integrated ROS and Streamlit to control scanning system components efficiently

10/2021 - 06/2024
Technical University of Chemnitz (Germany)
Research Assistant - Computer Vision
  • Generated synthetic 3D datasets and reconstructed indoor environments (~500 scenes) using Unity3D and NeRF pipelines
  • Developed simulation datasets for autonomous driving perception experiments (~50?70 outdoor scenes under varied conditions) using Unity3D
  • Built perception pipelines for 3D object recognition, segmentation, and tracking using point clouds, meshes and monocular 3D detection methods
  • Supported dataset preparation and annotation workflows for human pose and object understanding tasks

01/2023 - 02/2024
Technical University of Chemnitz (Germany)
Research Experience - Computer Vision
  • Built a NeRF and Gaussian Splatting based large-scale dataset (~600K images) for fisheye-view human reconstruction and pose estimation
  • Trained neural rendering and pose estimation models using multi-GPU workflows; implemented fine-tuning strategies to extend reconstructions across multiple poses without full retraining
  • Integrated SAM, GroundingDINO, and Vision Transformers for automated human segmentation and scene understanding
  • Generated SMPL-X / SMPL-H meshes and evaluated downstream 2D/3D pose estimation pipelines for reconstruction accuracy and robustness

Vertrauen Sie auf Randstad

Im Bereich Freelancing
Im Bereich Arbeitnehmerüberlassung / Personalvermittlung

Fragen?

Rufen Sie uns an +49 89 500316-300 oder schreiben Sie uns:

Das Freelancer-Portal

Direktester geht's nicht! Ganz einfach Freelancer finden und direkt Kontakt aufnehmen.