Capable Senior Technical Architect and Data Scientist with robust background in designing and implementing complex technical solutions. Proven track record in leading cross-functional teams to deliver innovative architecture projects. Demonstrated expertise in system integration and project management.
Task 1: Custom Combined Loss function in Faster RCNN model of Detectron2
The task was to add a new head to the Faster RCNN model of Detectron2 to implement a custom combined loss function so that a custom structural constraint can be applied on the model.
Project Highlights:
Technologies used
Python, Pytorch, Tensorboard, FasterRCNN, Detectron2.
Task 2: Calculating Keypoints from Bounding Boxes and Segmentation Masks in Detectron2
This task involved the calculation of Keypoints from Segmentation Masks and Bounding Box information.
Project Highlights:
Technologies used
Python, Pytorch, Tensorboard, MaskedRCNN, Detectron2.
Project 1: Generating Test cases from PDF, Excel and JSON data
This project was about automatically generating Test cases from PDF data using large language models. The problem was solved using Google's AI Gemini-flash-2.0 LLMmodel. The solution involved usage of Prompt Engineering, RAG architecture and Langchain pipelines.
Project Highlights:
Technologies used
Large Language Model (Gemini-flash2.0), RAG, Langchain, Python.
Project 2: Identifying and measuring the Degraded Road Markings
Since annotating degraded markings is impossible (because of the random distribution of the degrades and the lack of information of the annotators about every degeneration) found a novel solution which can solve both the problems. The solution was successfully adopted in the product.
Project Highlights:
Technologies used:
YOLOv8, COCO, Bounding-Box, Masking, Contouring, Polygon Determination, IOU, Mask Area, Custom Loss Function, Custom Image Transformations, Pytorch, LangChain, Onnx, Detectron2, FiftyOne (Voxel51), Unsupervised Clustering, Auto Labeling, Transfer Labeling.
Project 3: Intravascular Ultrasound Image Analysis
Developed a simple Intravascular Ultrasound Image Analyzer Model to predict plaque regions within the blood vessels. Although the training was supervised and respectively simpler but the real challenge was in collecting labeled IVUS images as these are medical documents and mustn't be shared publicly. Moreover, it's illegal to buy the images from any source as well. So used a Triple GAN model to simultaneously generate and classify synthetic IVUS images for classification.
Project Highlights:
Technologies used:
Radial to Polar Image Conversion, Logical Image Splitting to create Strips, Triple GAN model training, Synthetic Strip Generation and Classification
Project 4: C/C++ Static Code Analyzer
Developed an AIML based static code analyzer for C/C++ projects. This project served as an simple but powerful alternative for existing code analyzers. The purpose of the analyzer was to predict vulnerable regions of code so that we can stress the test cases around those areas.
Project Highlights:
Technologies used:
Graphs and Trees, Recursion, Seq-2-Seq AttentionEncoder Decoder Model, Reinforcement Learning.
Project 5: Horizontal and Vertical Vector Search
A challenging but very common use case for search. The situation is when we have tabular data with variations in both the columns and rows. Variations can range from Text Variations, Number Variation, Date Variation, Keyword Variations etc. Currently the state of the art tech to solve such problems is using LLMs (Paid, Unpaid, Downloadable, CloudBased) that depends on the number of parameters and its variations. Solved the problem with a unique approach using Vector Databases and Ranking Based Hybrid Search. The entire solution was multi-threaded (Python Lock was used) where one thread was dedicated in accepting, processing and updating the VectorDB and the other thread is dedicated to the CRUD operations triggered from the user actions via a Menu. Python Signals were used for this.
Project Highlights:
Used Qdrant’s Hybrid search feature.
Crafted complicated queries by adding query points.
Allocated rank values to increase or decrease query points.
Technologies used:
Qdrant, ChromaDB, FTS, Ranking, Hybrid Search, Custom Embedding Logic, Modifying Standard Search Algorithm of Qdrant, Multi-threading, Python Locks, Python Signals
I unfortunately met an accident and had to take a break of 2 years from regular work. I spent these 2 years in personal development healthwise and skillwise. Now I perfectly healthy and working at my 100% capacity.
Project 1: Generic PDF Parser
Extremely interesting and one of my favorite self-made creation. Its not hidden from any NLP engineer how difficult but important it is to be accurately extract information from PDFs. Normal PDF parsing - Block of Text or even Text inside Images are pretty simple but what about extracting tabular information from a PDF into an Excel file? Although there are tools which claim that they can do it but none of them work in all cases, especially when the table's layout and orientation is very different from usual plain tables. Developed the most efficient table data extractor from any PDF with any orientation or layout. The interesting part in this work was its entirely developed using traditional algorithms and not AI.
Project Highlights:
Technologies used:
Core Python, Numpy, Dataframes, Recursion, Graphs and Trees, PyMuPDF, PyTesseract.
Project 2: Siamese Model for Image Identification
Had to identify automobile parts from manuals. The challenge was that the same part had multiple 3D views from multiple angles in many places of the manual. Some similar, some different. There was no information about the geometrical numbers of each subparts as well. And there was no training data to train a classifier which could identify apart irrespective of it view. So, instead of supervised learning, chose contrastive learning to at least predict if two parts are same or not, albeit without predicting the class of the part. Then cluster similar parts and compare with ground truth to manually label the cluster.
Technologies used:
DeTr, Siamese Contrastive Learning.
Project 3: Document Intelligence Contract Review
This product was primarily used for reviewing of contracts by EY auditors. The Product provided contextual AI hints about topics inside a contract, thus, drastically reducing the time and effort of the auditors. Used NLP based techniques such as - Word embedding, NER (Named Entity Recognition), OCR (Optical Character Recognition) - ABBYY and Poppler etc, Online Learning.
Used libraries like - numpy, scipy, sklearn, keras, tensorflow, opencv, pandas etc. Was also involved in the deployment activities which included technologies like - Azure ML Studio, Docker, RabbitMQ, Flask, Kubernetes, CI-CD pipelines
etc.
Model Development