Sunil Belde's profile

ML Engineer - GenAI/LLM Automation

About

Here is a little background

I’m an ML Engineer 🤖 driven by curiosity for how intelligence can be systematized — from algorithms to automation. My work sits at the intersection of Generative AI 🧠, large language models, and data engineering ⚙️, where I design frameworks that help machines understand human intent and translate it into action.

I love building AI systems that don’t just process data but reason with it — combining retrieval, prompting, and evaluation to make workflows adaptive and self-improving. My interests span code generation, multi-agent orchestration, and the broader challenge of making machine learning infrastructure more explainable and efficient ☁️.

To me, building AI isn’t just about automation; it’s about alignment, accessibility, and the art of simplifying complexity ✨.

Experience

Capital One

ML Engineer - GenAI/LLM Automation

Leveraging Generative AI and LLMs to revolutionize ETL automation and enhance data engineering workflows.

GenAI, LLMs, ETL Automation, RAG, Prompt Engineering, Data Engineering, Python

Jul 2024 - Present

1y 4m

Key Highlights

  • Built a GenAI-powered platform that auto-generates end-to-end ETL pipeline code (PySpark/Scala) deployable via GitHub, enabling business users with SQL knowledge to build production-grade data pipelines through a UI
  • Designed modular pipeline components (Source, Joiner, Filter, Transform, Target) and orchestrated them with DAG logic using Python and NetworkX
  • Implemented a Retrieval-Augmented Generation (RAG) system using SimSearch and LlamaIndex for context-aware prompt injection; used Few-Shot and Chain-of-Thought prompting to guide LLM-based code generation
  • Integrated an automated evaluation engine to validate generated code with retry-feedback loops, improving reliability and reducing developer intervention
  • Collaborated cross-functionally to evaluate GenAI research, design trade-offs, and architecture
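
As a rough illustration of the DAG orchestration idea mentioned above, here is a minimal sketch, not the production code. The component names come from the list; the wiring between them, the `execution_order` helper, and the use of Python's standard-library `graphlib` in place of NetworkX are all assumptions made to keep the example self-contained.

```python
from graphlib import TopologicalSorter

# Hypothetical wiring: each key maps a component to the set of
# components it depends on (its upstream nodes in the DAG).
pipeline = {
    "Source_A": set(),
    "Source_B": set(),
    "Joiner": {"Source_A", "Source_B"},
    "Filter": {"Joiner"},
    "Transform": {"Filter"},
    "Target": {"Transform"},
}


def execution_order(dag):
    """Return one valid run order; raises graphlib.CycleError if the graph has a cycle."""
    return list(TopologicalSorter(dag).static_order())


order = execution_order(pipeline)
print(order)  # sources come first, Target comes last
```

A topological sort guarantees that every component runs only after all of its upstream components, which is the property a code generator needs when stitching the generated steps into one pipeline.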

University of Illinois Chicago

Graduate Research Assistant

Performed advanced NLP research to extract insights from large-scale social media datasets.

NLP, Text Mining, Transformers, Data Analysis, Visualization

Jan 2023 - May 2024

1y 4m

Key Highlights

  • Built end-to-end NLP pipelines using Hugging Face Transformers, spaCy, and scikit-learn to analyze political trends from 10M+ tweets
  • Implemented scalable modules for scraping, text normalization, sentiment analysis, topic modeling (LDA), and visualization with Pandas, Matplotlib, and Seaborn
  • Contributed insights to published academic research

Docskiff (A Jaggaer Company)

Machine Learning Engineer

Engineered intelligent document processing systems to optimize contract metadata extraction.

Document Intelligence, OCR, Computer Vision, NER, PyTorch, Automation

Jun 2021 - Jul 2022

1y 1m

Key Highlights

  • Built document intelligence pipelines combining NER, OCR, and NLP to extract contract metadata using PyTorch, Tesseract, and regex-based postprocessing
  • Increased classification throughput by 30% through document segmentation using Faster R-CNN
  • Enhanced OCR quality with custom image preprocessing techniques (denoising, deskewing) to improve text accuracy

Applied AI

Machine Learning Intern

Completed an intensive ML program and developed real-world projects in Machine Learning and NLP.

Machine Learning, Deep Learning, NLP, CV

May 2020 - Jul 2021

1y 2m

Key Highlights

  • Completed a 12-month ML program with 250+ hours of content and 30+ coding assignments on Machine Learning, Deep Learning, NLP, and CV
  • Built end-to-end ML and NLP projects with real-world datasets and published technical blogs

Acheron Software Consultancy

Associate Software Developer

Developed enterprise-grade software solutions and optimized infrastructure for cost efficiency.

Backend Development, Spring Boot, Kubernetes, DevOps, ELK Stack, Cloud

Jan 2019 - Apr 2020

1y 3m

Key Highlights

  • Built enterprise-grade applications using Spring Boot, MySQL, and Angular 8, delivering production-ready backend APIs and responsive UIs
  • Set up a centralized observability stack using the ELK suite (Elasticsearch, Logstash, Kibana) on GCP
  • Migrated legacy systems to Kubernetes using Helm, reducing deployment overhead and cutting infra costs by 15%

Skills

Artificial Intelligence

≈ 2 Years XP

LLMs, RAG, LangChain, Agentic Workflows, Prompt Engineering

Data Science

≈ 5+ Years XP

NLP, CV, Data Modeling, PyTorch, TensorFlow

Data Engineering & Pipelines

≈ 2 Years XP

PySpark, SQL, ETL, DAGs, Data Integration

Software Development

≈ 6 Years XP

Python, Java, Angular, Spring Boot, System Design

Cloud & DevOps

≈ 3 Years XP

AWS, Docker, Kubernetes, CI/CD, ELK Stack

Data Visualization & Tools

≈ 3 Years XP

Tableau, Power BI, Kibana, MLflow, Three.js

Contact

I'd love to hear from you. Reach out anytime.