Here are some of the projects I’ve worked on, ranging from research in 3D computer vision to practical applications of generative AI.

Research Projects

Convolution Augmented Transformers for 3D Data Processing - Master’s Thesis

PyTorch | Point Cloud Processing | Deep Learning

Developed a novel architecture combining convolutional operations with transformer self-attention mechanisms for improved 3D point cloud understanding.

Key achievements:

  • Designed a custom convolutional block that enhances self-attention for 3D data
  • Achieved improved performance on ModelNet and ShapeNet datasets
  • Demonstrated better computational efficiency compared to pure transformer approaches

Technologies: PyTorch, CUDA, Point Cloud Library (PCL), Weights & Biases


3D Object Part Segmentation using Self-supervised Learning

PyTorch | Self-supervised Learning | 3D Vision

Created a self-supervised learning framework for 3D object part segmentation, eliminating the need for expensive manual annotations.

Key achievements:

  • Implemented SimCLR-based contrastive learning for 3D point clouds
  • Adapted PointNet architecture for feature extraction
  • Achieved comparable results to supervised methods on ShapeNet dataset
  • Reduced annotation requirements by 90%

Technologies: PyTorch, PointNet, SimCLR, Open3D


Deep Neural Network Based Action Recognition with Transformers

PyTorch | Video Understanding | Transformers

Developed a transformer-based model for action recognition in video sequences, focusing on temporal modeling and efficiency.

Key achievements:

  • Designed efficient temporal attention mechanisms for video processing
  • Evaluated on multiple datasets: Jester, Something-Something v2, and Kinetics
  • Achieved state-of-the-art results on hand gesture recognition tasks
  • Reduced inference time by 40% compared to 3D CNN baselines

Technologies: PyTorch, TimeSformer, Video Processing Libraries

Hackathon Projects

Paperwork Topology Sorter

Pisano Hackathon - 1st Prize | MongoDB | Express

Developed a web application to help users organize documents based on topological dependencies.

Key achievements:

  • Built efficient graph-based sorting algorithm
  • Implemented real-time collaborative features
  • Won first place among 50+ teams

Technologies: Node.js, MongoDB, Express, React

Interested in collaborating on a project or have questions about any of these works? Get in touch!