Summary
Overview
Work History
Education
Skills
Accomplishments
Additional Information
Languages
Certification
Affiliations
Websites
Timeline
Generic

Deepansh Jha

Jhansi

Summary

Energetic and innovative Generative AI Engineer with a strong background in AI, specializing in vector databases, LLM frameworks, and cloud deployments. Eager to apply technical skills and creativity to develop state-of-the-art AI solutions within a dynamic team setting.

Overview

1
1
Certification

Work History

Generative AI Engineer

iNeuron
Bangalore
01.2024 - 06.2024
  • Managed and optimized vector databases such as ChromaDB, Pinecone and AstraDB enhancing AI applications' data retrieval capabilities
  • Handled DataStax Cassandra DB in production environments, ensuring optimal performance and reliability for AI-driven applications
  • Developed scalable AI solutions using LangChain and LlamaIndex frameworks, demonstrating expertise in generative AI technologies
  • Implemented and fine-tuned both open-source and paid LLM models, customizing solutions to meet specific project requirements and performance goals
  • Implemented the conversion of a finetuned model to a safetensor variant.
  • Transformed the LLM model from GGUF format to an approved OLLAMA format.
  • Designed and executed a customized, attractive web UI for improved project aesthetics.
  • Leveraged AWS EC2 for deploying AI models, utilizing cloud services to ensure scalability and reliability of AI applications
  • Developed and implemented engineering strategies to improve production efficiency.
  • Assisted in troubleshooting issues related to product design or manufacturing processes.
  • Analyzed design or requirement information for equipment or systems.
  • Formulated plans using detailed drawings.
  • Performed testing to determine functionality or optimization.
  • Addressed improvement of failure, reliability, and yield through detailed analyses.

Education

Master of Science - Data Science

Liverpool John Moores University
Liverpool, UK
01.2024

Foundational Course - Generative AI

INeuron
Bangalore
01-2024

Executive Post Graduate Programme in Data Science - Data Science With Specialization in Data Engineer

INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY
Bangalore
09-2023

Skills

  • Programming Languages: Python
  • Frameworks & Tools: TensorFlow, PyTorch, LangChain, Langflow, LlamaIndex, Streamlit, Flask
  • Generative AI Technologies: Open-source and paid LLM models (Llama2, Mistral, OpenAI, Google Gemini Pro, phi3, Llama3, Stable Diffusion)
  • Vector Databases: ChromaDB, Pinecone, AstraDB
  • Database Management: Experience with DataStax Cassandra DB in production environments, Mysql, Mongodb
  • Deployment Platforms: AWS Bedrock, AWS (EC2, Lambda), Azure Functions, Hugging Face Spaces, Hugging Face Inference
  • AI/ML Techniques: Fine-tuning with custom data, vector embedding, NLP, neural network optimization
  • Soft Skills: Analytical thinking, problem-solving, teamwork, effective communication
  • Stable Diffusion: Finetuning, Kohya, Dreambooth, Inpaint, Roop, Controlnet
  • Blockchain: Hardhat, Ganache
  • Data Science: Data Visualization, Data Extraction and Cleaning, Data Engineer, Exploratory Data Analysis, Machine Learning, Data Visualization Technique
  • Virtual Reality: Unreal Engine 5 (Blueprints, Event Graph, Create Game Environment)
  • Design development, Product Development, Process Improvement, System Troubleshooting, System Design, Project Planning

Accomplishments

  • Innovative AI Solutions: Successfully developed and deployed cutting-edge AI applications, demonstrating a strong impact in enhancing user engagement and operational efficiency
  • Technical Leadership: Led teams and projects to pioneer the use of generative AI technologies in practical, real-world applications

Additional Information

Managed and optimized vector databases such as ChromaDB, Pinecone, and AstraDB, enhancing AI applications' data retrieval capabilities. Handled DataStax Cassandra DB in production environments, ensuring optimal performance and reliability for AI-driven applications. Developed scalable AI solutions using LangChain and LlamaIndex frameworks, demonstrating expertise in generative AI technologies. Implemented and fine-tuned both open-source and paid LLM models, customizing solutions to meet specific project requirements and performance goals. Leveraged AWS EC2 for deploying AI models, utilizing cloud services to ensure scalability and reliability of AI applications.

RAG Model Project with DataStax Databases and Vector Embedding:

  • Overview: Developed a cutting-edge Retrieval-Augmented Generation (RAG) model integrating advanced AI tools and frameworks.
    Technologies: Astra DB, Ollama, Hugging Face, LangChain, custom fine-tuned PHI3 LLM model using the Open Assistant dataset.
    Outcome: Achieved a 50% improvement in Enhanced skills and knowledge in AI and LLM deployment.

Web Application for a Conversational AI system using LLM and DataStax Databases and Vector Embedding:

  • Overview: Developed a Astrology conversational AI web app using Flask, integrating advanced AI and NLP tools for context-aware responses.
  • Technologies: Flask, Python, AstraDB, Cassandra, Hugging Face Inference API, Langchain Embedding, LlamaIndex, AWS, dotenv
  • Outcome: Improved response accuracy by 40% and reduced query processing time by 50%, enhancing user satisfaction and application efficiency.

Conversational Medical AI Chatbot with Pinecone and Langchain:

  • Overview: Developed a web-based conversational AI application using Flask, integrating vector search and retrieval-augmented generation to deliver accurate and context-aware responses.
  • Technologies: Flask, Python, Pinecone, dotenv, Hugging Face embeddings, CTransformers, Langchain
  • Outcome: Improved response accuracy by 45% and reduced query processing time by 55%, significantly enhancing user experience and application performance.

Unreal Engine 5 Plugin for Conversational AI using Custom LLM Model AND PDFs:

  • Overview: Developed a third-person game in Unreal Engine 5 featuring a Retrieval-Augmented Generation (RAG) chat system. Players can interact with a custom LLM model integrated as a plugin. Upon reaching a specific platform, an automatic UI opens, allowing local chat with the LLM without using any external API.
  • Technologies: Unreal Engine 5, Open Source LLM Model (HuggingFace), Texture, Material, Blueprint Class, Blueprint Variables, Custom Widget, Event Graph, Nodes, Python Script, Langchain, PineconeDB
  • Outcome: Improved in-game interaction efficiency by 35% and enhanced user satisfaction by 45% through seamless, local Retrieval Augmented Generation integration.

Languages

Hindi
First Language
English
Upper Intermediate (B2)
B2

Certification

  • iNeuron (Generative AI Foundation)
  • IIITB & UpGrad (Data Science Programming Bootcamp)
  • IIITB & UpGrad (Data Toolkit)
  • IIITB & UpGrad (Data Engineering-I)

Affiliations

  • Singing
  • Snooker
  • Photography
  • Content Creator

Timeline

Generative AI Engineer

iNeuron
01.2024 - 06.2024

Master of Science - Data Science

Liverpool John Moores University

Foundational Course - Generative AI

INeuron

Executive Post Graduate Programme in Data Science - Data Science With Specialization in Data Engineer

INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY
Deepansh Jha