My Projects

100 Days of CUDA Kernels

January 2025 - Present

Everyday coding challenge to develop CUDA kernels from scratch.

LLM Library

January 2025

Code for my library to create simple LLMs for research purposes.

Creativity

December 2024

A modular library focused on generative AI experimentation and development.

YOLO Workflow

October 2024 - Present

Workflow and methodology for YOLO models.

RWKV_Learning

September 2024

Project mostly for learning experience.I implement different RWKV architectures

Energy Based Models

September 2024

Energy Based Models that I implemented based on various papers and courses/talks by different researchers

RNNs

July 2024

Developed several RNNs in PyTorch as a learning project, focusing on implementing mathematical formulas into models.

ConvNets

July 2024

Built several ConvNets as foundational components, providing a great learning experience on various architectures.

MathModel

May 2024

Developed an image2latex model for extracting LaTeX formulas from images and an LLM for explaining mathematical concepts.

LLama 2

April 2024

Constructed the LLama 2 architecture entirely from the ground up. Throughout this project, I gained knowledge about KV-caching and ROPE

Custom-GPT

April 2024

GPT-2 based model trained on selected books from open libraries, leveraging existing GPT-2 weights for better language understanding and improved output quality.

Stable Diffusion 1.5

March 2024

The Stable Diffusion architecture, implemented in PyTorch, was used for inference with version 1.5 weights from Hugging Face, providing key insights into building complex models.

Generative Adversarial Networks (GANs) Projects

January-February 2024

Implemented various GAN architectures including: WGAN, DCGAN, Linear GAN, C-GAN, CycleGAN, VAE Model, Pix2Pix Model for different image generation tasks.

Text to Image CLIP

March 2024

Text-to-Image pipeline based on CLIP architecture and used pretrained weights.

Transformer

March 2024

Created Transformer architecture for English-German text translation, fine-tuning with HuggingFace's open weights.

NN from Scratch

December 2024

Created a simple MLP neural network using numpy

Neural Style Transfer

November 2023

Using VGG-16 pretrained feature maps for creating artworks through styletransfer.

License Plate Detection

November 2023

Fine-tuned YOLOv8 and crafted custom multi-object detection model for specialized tasks.

Age/Gender/Race Classifier

October 2023

PyTorch CNN with ResNet50 backbone for accurate multi-class classification.

Bus Truck Detection

October 2023

Developed FR-CNN model for robust object detection between busses and trucks.

Road Semantic Segmentation :

October 2023

Implemented U-NET with VGG-16 backbone for precise street semantic segmentation.

CNN Tumour Detection

September 2023

Achieved exceptional accuracy in tumor detection using diverse backbone architectures(VGG-16, ResNet50, MobileNetV3, Inception).