This repository includes different types of computer vision projects that was done using OpenCV package of python and tensorflow. Some of them are personal projects and remaining are done by following tutorials.
- AI Response Drone for SAR Respository
- Gender based cleaning algorithm Repository
- Face Mask 2 level protection for public safety Repository
- Stabilizing videos using python Repository
- Smile detection using OpenCV Repository
- Different images transformation Repository
- Edge and contour Detection Repository
- Face dectection using OpenCV Repository
- Warp perspective using Opencv Repository
- Virtual Paint Repository
- Face Mask detection Repository
- Image captioning using Flicker dataset Repository
- Neural Style Transfer Repository
- Creating videos using text prompts using stable diffusion Repository
- DCGAN Repository
- Lane-Detection-and-Segmentation-for-Autonomous-Vehicles Repository
- Object-Detection-and-Tracking-for-Autonomous-Vehicles Repository
- Automated-Segmentation-of-Brain-Tumors-in-MRI-Scans Repository
- Object tracking using YOLO v8 Repository
- Canny edge detection from scratch
- Created an algorithm from scratch to detect the edge and compared the results obtained from the OpenCV python package. This includes implementing fuction for each of the steps that are undertaken within the the canny edge detection before reaching the final results. Repository
- Hough transform and corner detection
- This includes hough transform to identify different shapes and the implementation of the harris corner detection. Repository
- Image stitching and panorama creation
- This assignment includes the process of creating a panorama by identifying the key points in different image and combining them to obtain the stitched image. Repository
- 2D Face Reconstruction
- This assignment includes the process of reconstructing a face with the help of different number of eigen vectors. This work compares the reconstruction ability of different combination of the eigen reconstructed faces. Repository
- Narrating the Unseen: Real-Time Video Descriptions for Visually Impaired Individuals
- This is the final project of the computer vision course. The paper includes the use of creation of the model setup where the visually impaired people can understand their surroundings in the best way compared to all other existing methods. The system uses GPT-4_vision to do the image captioning of the surrounding in real time. Repository