Skip to content
View singhvks's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report singhvks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
singhvks/README.md



Vikas Kumar Singh

AI • Machine Learning • Generative AI Systems

Building production-grade AI systems that scale


👨‍💻 About Me

I work at the intersection of Machine Learning, Distributed Systems, and Generative AI, focusing on turning research ideas into reliable production platforms.

Currently working as a Data Science Manager, while staying deeply hands-on with:

  • AI system architecture
  • model development
  • scalable ML infrastructure
  • production deployment

🚀 What I Work On

🧠 Generative AI Systems

  • Retrieval Augmented Generation (RAG)
  • LLM orchestration pipelines
  • Prompt engineering & evaluation
  • AI agents and tool-based workflows

⚙️ Machine Learning Platforms

  • Scalable model training pipelines
  • Feature engineering at scale
  • Experiment tracking frameworks
  • Production MLOps workflows

📊 Distributed Data Systems

  • PySpark based data pipelines
  • Streaming architectures
  • Data lakehouse systems
  • Large-scale data processing

🏗 AI Infrastructure

  • FastAPI model serving
  • GPU inference optimization
  • Scalable microservices
  • Containerized deployments

🎯 Current Focus

  • Designing end-to-end GenAI platforms
  • Building LLM inference systems on Linux/Ubuntu
  • Exploring Snap-based AI model deployment
  • Architecting scalable ML platforms on cloud

⭐ Featured Projects

📊 LLM-Augmented Weekly Business Review (WBR) System

Enterprise analytics platform that automates weekly and monthly executive business reviews using LLM-generated insights over TB-scale transactional data.

flowchart LR
    A[Transactional Data Sources] --> B[Data Lake / Warehouse]
    B --> C[OLAP Analytics Pipeline]
    C --> D[Feature Engineering Layer]
    D --> E[LLM Insight Generation]
    E --> F[FastAPI Service Layer]
    F --> G[Executive Dashboards / Reports]
Loading

Highlights

  • OLAP-first analytics pipeline for high-volume business reporting
  • Feature engineering for consistent enterprise metrics
  • LLM-based narrative insight generation with cost/latency optimization
  • FastAPI architecture separating analytics, services, and AI inference
  • Governance via caching, rate limiting, and budget controls

➡️ Full case study


🗺 Geospatial ML Platform for Retail Site Selection & Sales Forecasting

Production geospatial ML system supporting new retail site selection and 3-year sales forecasting for a $10B+ retailer with 2,500+ locations.

flowchart LR
    A[Geospatial Datasets] --> B[Databricks + Spark Feature Engineering]
    B --> C[Trade Area Modeling]
    C --> D[XGBoost Forecasting Models]
    D --> E[Explainability Layer SHAP]
    E --> F[MLflow Model Registry]
    F --> G[Serverless Model Serving]
    G --> H[Real Estate Decision Platform]
Loading

Highlights

  • Large-scale geospatial feature engineering using Spark and Databricks
  • Dual trade-area modeling using drive-time isochrones and radial distance
  • Cold-start modeling via clustering-based “statistical twins”
  • Multi-category XGBoost forecasting architecture
  • Explainability-first design using SHAP for executive trust
  • Production deployment using MLflow, Delta Lake, and serverless serving

➡️ Full case study


🛠 Tech Stack

Languages

PythonSQL

Data & Machine Learning

PySparkPandasScikit-learnTensorFlowPyTorch

Generative AI

LLM APIsRAG frameworksVector Search

Infrastructure

DockerKubernetesFastAPILinux / Ubuntu

Cloud

GCPDistributed ML Platforms


🌍 Interests

Outside work:

🏍 Motorcycling through mountain roads
🧠 Exploring emerging AI technologies
👥 Building high-performance engineering teams


🤝 Open To

Opportunities to build large-scale AI platforms and Generative AI systems.

Roles

  • Senior AI / ML Engineer
  • AI Architect
  • Applied AI Leadership

Locations

  • India
  • Middle East

📌 Philosophy

Great AI systems are not just models.

They are well-designed systems built with:

  • strong data foundations
  • reliable infrastructure
  • disciplined engineering

Pinned Loading

  1. agentic-compliance-api agentic-compliance-api Public

    Python

  2. XML_data_extraction XML_data_extraction Public

    DATA MINING - from XML files - API testing

    Python 1 1

  3. Predict-the-category Predict-the-category Public

    ADABOOST - MULTICLASS CLASSIFICATION - MACHINE LEARNING - PYTHON : Predict category of problem solved as part of Piramal Hackathon

    Python

  4. customer_segmentation customer_segmentation Public

    K MEANS CLUSTERING - Categorizing customers for marketing campaign

    Python