An end-to-end semantic segmentation pipeline built with PyTorch, focused on human-object interaction. The system uses an LR-ASPP MobileNetV3 architecture optimized for consumer GPUs. It features a custom OOM-safe training loop that automatically reduces batch size and image dimensions to avoid out-of-memory crashes, and it supports automatic mixed precision (AMP) and gradient accumulation.
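
The fallback behaviour of the OOM-safe loop can be illustrated with a small, framework-agnostic sketch. This is not the project's actual code: `TrainConfig`, `shrink`, and `run_with_oom_backoff` are hypothetical names, and the halving of batch size, the 3/4 image-size step, the rounding to multiples of 32, and the rule that gradient accumulation grows to keep the effective batch size roughly constant are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Tuple, Type


@dataclass(frozen=True)
class TrainConfig:
    """Hypothetical container for the settings the loop is allowed to shrink."""
    batch_size: int
    image_size: int
    accum_steps: int = 1  # micro-batches accumulated per optimizer step


def shrink(cfg: TrainConfig, min_batch_size: int = 1,
           min_image_size: int = 128) -> TrainConfig:
    """Return a cheaper config: halve the batch first, then reduce the image."""
    if cfg.batch_size > min_batch_size:
        new_bs = max(min_batch_size, cfg.batch_size // 2)
        # Assumption: raise accumulation so batch_size * accum_steps stays
        # roughly constant (ceiling division).
        effective = cfg.batch_size * cfg.accum_steps
        new_accum = -(-effective // new_bs)
        return TrainConfig(new_bs, cfg.image_size, new_accum)
    if cfg.image_size > min_image_size:
        # Assumption: scale the side by 3/4 and round down to a multiple of 32.
        scaled = (cfg.image_size * 3 // 4) // 32 * 32
        return TrainConfig(cfg.batch_size, max(min_image_size, scaled),
                           cfg.accum_steps)
    raise MemoryError("configuration cannot be reduced any further")


def run_with_oom_backoff(
    train_epoch: Callable[[TrainConfig], None],
    cfg: TrainConfig,
    oom_errors: Tuple[Type[BaseException], ...] = (MemoryError,),
    max_retries: int = 8,
) -> TrainConfig:
    """Call train_epoch, shrinking cfg after each out-of-memory failure.

    Returns the first configuration that completed without an OOM error.
    """
    for _ in range(max_retries + 1):
        try:
            train_epoch(cfg)
            return cfg
        except oom_errors:
            cfg = shrink(cfg)  # raises MemoryError once nothing can shrink
    raise MemoryError("still out of memory after all retries")


if __name__ == "__main__":
    # Stand-in for a real epoch: pretend memory use scales with batch * pixels.
    def fake_epoch(c: TrainConfig) -> None:
        if c.batch_size * c.image_size ** 2 > 4 * 512 * 512:
            raise MemoryError("simulated OOM")

    print(run_with_oom_backoff(fake_epoch, TrainConfig(16, 512)))
```

In a PyTorch setting, one would presumably pass `oom_errors=(torch.cuda.OutOfMemoryError,)` (available in recent PyTorch releases) and call `torch.cuda.empty_cache()` before retrying; the AMP forward and backward pass would live inside `train_epoch`.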

Computer Vision Engineer — Architecture, Pipeline Development