KRISHNA VAMSI

AI/ML Engineer
US.

About

Highly accomplished AI/ML Engineer with a proven track record of designing and deploying cutting-edge Generative AI solutions for document intelligence, fraud detection, and enterprise automation across automotive and banking sectors. Expert in Large Language Models (LLMs), NLP, and MLOps, I excel at transforming complex projects from prototyping to production, leveraging cloud deployment and explainability to deliver significant, measurable business impact in highly regulated environments.

Work

Toyota North America
|

Machine Learning Engineer

Texas, US

Summary

Led the development and deployment of GenAI-powered solutions, optimizing service operations, document processing, and customer support workflows within a regulated automotive environment.

Highlights

Developed and deployed a GenAI-powered service operations assistant, summarizing subscriber documents and extracting structured data for automated case resolution, incorporating PromptLayer and Guardrails AI for hallucination mitigation.

Engineered an automated policy summarizer using fine-tuned T5 models, converting 100+ pages of complex policy documents into concise highlights for field technicians and support engineers.

Designed and implemented production-grade GenAI solutions with GPT, T5, and LLaMA, automating customer support workflows and document summarization, reducing average handling time by 30% through LoRA and PEFT fine-tuning.

Constructed Retrieval-Augmented Generation (RAG) pipelines using LangChain, FAISS, and OpenAI APIs, integrating knowledge graphs to power telecom-specific assistants with real-time query capabilities.

Developed enterprise-grade LLM-based tools for knowledge base search, plan eligibility reasoning, and root cause analysis, enabling 24/7 AI-driven decision support via semantic search.

Orchestrated scalable, secure deployment of LLM workloads on Amazon Bedrock, Docker, and Kubernetes (EKS), integrating real-time inference APIs into internal platforms and enabling serverless deployments via AWS Lambda.

Established and monitored robust MLOps pipelines using MLflow, SageMaker, Airflow, and AWS DevOps, automating training, evaluation, retraining, and deployment of classification and forecasting models at scale with full model versioning and drift detection.

KeyBank
|

Machine Learning Engineer

Texas, US

Summary

Designed and implemented advanced machine learning solutions for real-time fraud detection and risk management, significantly enhancing security and compliance within the banking sector.

Highlights

Developed and deployed a real-time fraud detection system utilizing Graph Neural Networks (GNNs), XGBoost, and anomaly detection to identify multi-account fraud rings and atypical banking transaction patterns.

Engineered scalable machine learning pipelines in Python, TensorFlow, and Vertex AI, integrating high-throughput data and automating model retraining across BigQuery, SQL, and Airflow.

Designed and implemented graph-based feature engineering and transaction embeddings, integrating time series features with XGBoost classifiers for accurate, real-time fraud prediction.

Reduced false positive rates by 45% and minimized fraud losses through advanced analytics and real-time decisioning within KeyBank's risk management operations.

Implemented robust MLOps practices using MLflow, Docker, and Vertex AI Pipelines for version control, drift monitoring, and audit-ready compliance, ensuring regulatory adherence.

Enhanced model explainability with SHAP and LIME, providing interpretable risk scores for front-line staff and facilitating FDIC, AML, and KYC regulatory compliance.

Collaborated with internal stakeholders to analyze transaction data, report confirmed fraud activity, and document detailed findings, supporting robust risk mitigation and compliance initiatives.

efficiently
|

Data Scientist

Hyderabad, Telangana, India

Summary

Spearheaded the development and optimization of AI products, driving significant automation and cost reduction through advanced machine learning and NLP techniques.

Highlights

Spearheaded the development and optimization of an AI product (Virtual Assistant) leveraging advanced machine learning and deep learning techniques to enhance operational efficiency.

Implemented AI-driven automation, reducing workforce dependency by 85% and significantly cutting operational costs while accelerating transaction turnaround times.

Developed scalable data processing pipelines using PySpark, optimizing ETL workflows for large-scale structured and unstructured datasets and improving data efficiency in AWS.

Engineered a custom deep learning BERT-based model to accurately classify over 200 documents based on OCR-extracted text, utilizing advanced word tokenization and semantic understanding.

Designed custom NLP models with Tesseract, AWS Textract, spaCy, and NLTK to extract and analyze critical financial data, enabling automated financial stability assessments.

Built end-to-end automated ML model evaluation pipelines using Python, SQL, and PySpark, integrated with AWS S3, SageMaker, and Redshift for continuous performance monitoring and real-time data accessibility.

Developed a configurable PySpark utility to generate client-specific XML reports by reading JSON files, enhancing reporting flexibility.

Created on-demand tables on S3 using Lambda Functions and AWS Glue with Python and PySpark, facilitating dynamic data access and transformation.

Google
|

Machine Learning Intern

India

Summary

Contributed to the development and optimization of machine learning models for search ranking and video recommendations, applying advanced methodologies in a research-driven environment.

Highlights

Contributed to a key internship initiative, developing and optimizing machine learning models for Google's search ranking and YouTube video recommendation systems.

Implemented neural networks in TensorFlow within Google's infrastructure and engineered efficient Python/SQL data preprocessing pipelines for training data preparation.

Applied A/B testing and visualization dashboards to monitor model performance and optimized inference speed, enhancing user experience and system scalability.

Collaborated with cross-functional teams to debug and enhance ML pipeline components, adopting advanced methodologies and industry best practices in a research-driven setting.

Education

Acharya Bangalore Business School
India

Bachelor of Business Administration

Finance

Texas A&M University, Commerce
Commerce, Texas, United States of America

Master of Science

Business Analytics

Awards

Google Developer Expert (ML)

Awarded By

Google Developer Experts program

Recognized by Google for exemplary expertise in Google Developer and open-source technologies, joining a global network of over 1,000 experienced Google technology experts, influencers, and thought leaders.

Certificates

Academy Accreditation - Generative AI
Data Science Essentials With Python
Machine learning
SQL (Advanced)
Problem Solving
Building RAG Apps
Building AI Agents with MongoDB
Oracle Cloud Infrastructure 2025 Certified Generative AI Professional
Model Context Protocol (MCP) - Anthropic

Skills

Programming Languages

Python, SQL, PySpark, Spark-SQL, R, Java.

Machine Learning & Deep Learning

TensorFlow, XGBoost, Graph Neural Networks (GNN), BERT, T5, LLaMA, GPT, OpenAI APIs, Mistral.

Natural Language Processing

spaCy, NLTK, AWS Textract, Tesseract OCR.

GenAI & LLM Deployment

Retrieval-Augmented Generation (RAG), LangChain, FAISS, LoRA, PEFT, PromptLayer, Guardrails AI.

Cloud & MLOps

AWS (SageMaker, S3, Lambda, SQS, Redshift, Glue, Bedrock), Vertex AI, BigQuery, Docker, Docker Compose, Kubernetes (EKS), Airflow, MLflow, AWS DevOps.

Model Monitoring & Explainability

MLflow, drift detection, SHAP, LIME.

Enterprise Integration

Knowledge graphs, internal ticketing systems, secure APIs, compliance (FDIC, AML, KYC).

Automation Tools

PySpark utilities, automated XML generation, on-demand table creation.

Interests

Artificial Intelligence & Machine Learning

Google Developer Student Club, AI Hackathons, Google Cloud Lab, Machine Learning.

Projects

AI-Powered Tumor Detection System

Summary

Developed a prototype tumor detection system leveraging Convolutional Neural Networks (CNNs) during an AI hackathon.