Sri Ram Relangi

Generative AI & Machine Learning Engineer

Professional Summary

Innovative and results-driven Generative AI and Machine Learning Engineer with 3 years of experience delivering high-impact AI solutions across NLP, computer vision, and real estate applications. Proven expertise in developing scalable models and automating workflows that enhance business efficiency, accuracy, and growth.

Work Experience

Generative AI Engineer- Data Scientist

Citi Group - Dallas, USA

Apr 2025 - Present

  • Architected a production-grade RAG system using Multi-Component Pipelines (MCP) to deliver context-rich, low-latency responses.
  • Developed agentic frameworks with dynamic tool use and memory-aware planning for autonomous multi-step reasoning.
  • Led the design and deployment of an LLM-powered AI console, achieving a 40% boost in task completion and reducing hallucination.

Generative AI Engineer- Data Scientist

Visual Technologies - Dallas, USA

Jul 2024 - Mar 2025

  • Designed advanced generative AI models (GPT-4, Stable Diffusion), increasing content generation efficiency by 70%.
  • Developed custom prompt engineering strategies, optimizing LLM effectiveness by 50%.
  • Implemented scalable RAG pipelines with Pinecone/FAISS, reducing query response times by 40%.
  • Built multimodal generative AI solutions combining vision transformers and diffusion models, improving content quality by 60%.

Data Scientist- Machine Learning Engineer

Prosway - Memphis, USA

Aug 2024 - Oct 2024

  • Engineered a real estate recommender system with 75% prediction accuracy for a hedge fund.
  • Architected a Firebase-based data storage solution for real-time property analysis.
  • Developed robust RESTful APIs for automated data acquisition, reducing manual intervention by 60%.

Graduate Assistant – Machine learning & Big Data

University of North Texas - Denton, USA

Jul 2023 - May 2024

  • Accelerated student success by 30% by creating interactive ML and Big Data coursework.
  • Guided students in ML projects, leading to a 25% increase in research paper quality and a 20% rise in student job placements.

Data Scientist- Programmer Analyst

Cognizant - Chennai, India

Dec 2020 - Jul 2022

  • Developed automation solutions with Java & Selenium, reducing manual errors by 70% in billing workflows.
  • Optimized SQL queries on 1 million+ records, improving query speed by 40%.
  • Improved demand forecasting precision by 15% through data analysis in Python and SQL.

Technical Skills

Generative AI & LLMs

Agentic FrameworksRAG PipelinesPrompt EngineeringFine-TuningLLM EvaluationLangChain & LangGraphVector DatabasesOpenAI & Hugging FaceDiffusion ModelsMulti-modal Models

ML & Data Science

PythonTensorFlow & KerasPyTorchScikit-learnPandas & NumPyComputer Vision (CV)Natural Language ProcessingPredictive AnalysisA/B TestingDeep LearningComputer VisionHadoop & Spark

Tools & Platforms

AWS (SageMaker, EC2, S3)Google Cloud (GCP)DockerKubernetesSQLFlask & StreamlitCI/CD (GitHub Actions)DatabricksAirflow

Key Projects

Multi-Agent Personal Assistant

  • Architected a hierarchical multi-agent system with an orchestrator for autonomous goal decomposition and execution.
  • Achieved a 90% reduction in manual intervention by implementing dynamic tool-use (Google Workspace, Web Search).
Agentic AIGPT-4n8nOrchestration

AI Travel Itinerary Generator

  • Engineered an autonomous agent using LangGraph for multi-step reasoning to decompose user requests into structured travel plans.
  • Leveraged Gemini Pro's function calling for structured JSON generation and integrated with a Streamlit front-end.
Gemini ProLangChainLangGraphStreamlit

Context-Aware Medical Chatbot

  • Developed a medical chatbot using LLMs and a RAG pipeline to provide accurate, context-driven medical insights.
  • Implemented Pinecone for efficient, low-latency vector-based retrieval from a corpus of medical information.
LLMRAGPineconeLangChain