The Principal Architect

Governance.
Scale.
Intelligence.

Transforming Generative AI from experimental research into ISO-compliant enterprise infrastructure.

I am a Product Engineer at Tata Consultancy Services and an AWS Certified AI Practitioner. I specialize in the ‘Last Mile’ of AI—engineering custom Large Language Models (LLMs) that are secure, cost-efficient, and aligned with NIST AI Risk Management Frameworks.

Ishav Verma, Principal AI Architect, professional portrait
Scroll
01

The Engineering Philosophy

Beyond Wrappers

Real value lies in training and fine-tuning custom architectures—Llama, Gemma, Qwen—not just wrapping APIs. I build from the model layer up, ensuring every parameter serves a business objective.

Strict Governance

AI without safety is liability. I implement ISO/IEC 42001 standards to harden models against prompt injection and data leakage, ensuring compliance from day one.

Operational Excellence

Reducing inference latency by 30% through advanced batching and caching strategies. Production AI must be fast, reliable, and cost-effective at scale.

30% Latency Reduction
ISO 42001 Compliance Standard
NIST RMF Risk Framework
02

Selected Works

Technical Briefs from the Field

BRIEF 01 Research

Cognitive Gradient Sparsification

White Paper ↗

Domain ML Optimization / Sparse Training
Architecture PyTorch / MNIST / Custom Gradient Masks
Innovation A novel training framework that applies cognitive-inspired sparsification to gradient updates, achieving measurable accuracy retention with significantly reduced computational overhead—validated on MNIST benchmarks with honest trade-off analysis.
BRIEF 02 Research

LLM-FROM-SCRATCH

GitHub Repo ↗

Domain AI Architecture / Transformers
Architecture PyTorch / RoPE / GQA
Innovation A scalable, decoder-only Transformer laboratory built entirely from scratch with sophisticated internal caching.
BRIEF 03 Research

HD-VDTP

White Paper ↗

Domain Data Compression / Cryptography
Architecture 100% Native Python Arrays
Innovation Scalable offline protocol packing arbitrary binary files into highly dense .y4m video streams without external deps.
BRIEF 04 Research

Auditable LLM Chain

White Paper ↗

Domain AI Audit & Security
Architecture AES-GCM & Hash Chains
Innovation Tamper-proof interaction auditing using cryptographic chaining to ensure robust local LLM integrity.
BRIEF 05 Deployed

The Knowledge Engine

RegionGPT

Domain Enterprise Knowledge Retrieval
Architecture RAG + Vector Search (FAISS)
Innovation Integrated with RPA to allow AI to execute tasks for tourists, not just answer questions about destinations but also to book tickets & hotels. (Implemented before MCP was a thing)
BRIEF 06 Implemented

AI Career Copilot

Project Report ↗

Domain EdTech / Career Intel
Architecture Streaming AI / AES-256-GCM
Innovation Extracts PDF resumes to dynamically generate roadmaps, matching jobs, and host scored mock AI interviews.
BRIEF 07 Implemented

AI Enabled HMS

Product Report ↗

Domain Healthcare Administration
Architecture GenAI / WebSockets
Innovation Embeds LLM for clinical decisions and intercepts medication supply issues weeks ahead with proactive alerts.
BRIEF 08 Implemented

Mental Health Platform

Assessment & Support System

Domain Healthcare / Machine Learning
Architecture PyTorch / XGBoost / Llama-3.1
Innovation Dual-dashboard platform using AI for real-time sentiment analysis and patient mental health classification.
BRIEF 09 Implemented

PolicyGen

AI-Assisted Legal Doc Gen

Domain Legal Tech / Automation
Architecture SHA-256 Integriy Checks
Innovation Automates creation of HR/legal docs integrating 35+ verified Startup India templates via smart forms.
BRIEF 10 Deployed

Cyber-Defense Grid

Intrusion Detection System

Domain Cybersecurity / Anomaly Detection
Architecture Deep Learning on KDD Datasets
Metric Optimized F1 Scores via class rebalancing and feature selection.
BRIEF 11 Prototype

Edge Biometrics

Smart Bowling Machine

Domain Computer Vision / Sports Tech
Architecture Lightweight Pose Estimation on Edge
Innovation Real-time OpenCV inferencing — instant athletic feedback, zero cloud latency.
03

Professional Timeline

2025 — Present

Tata Consultancy Services

Product Engineer — Generative AI

Architecting secure LLM pipelines and leading AI Governance initiatives for enterprise-scale deployments.

2022 — 2025

Independent Consultant

AI Solutions Architect

Designed and delivered end-to-end ML lifecycles across EdTech, Healthcare, Legal Tech, and Cybersecurity verticals—building RAG pipelines, LLM-integrated platforms, and cloud-native deployments on AWS and Cloudflare for diverse client portfolios, before joining TCS full-time.

2023 — 2024

Centre for Research, Innovation & Entrepreneurship

Research Fellow (SDE Intern)

Reproducible ML pipelines and Dockerized training environments for academic research infrastructure.

04

Verified Competence

The Standards That Define My Practice

Featured

AWS Certified AI Practitioner

+ Cloud Practitioner

2026

Oracle Certified Generative AI Professional

Cloud Infrastructure

2025

McKinsey & Company Forward Program

Leadership & Strategy

Graduate

IBM Enterprise Design Thinking

Co-Creator

Certified

Google Cloud Certified

Professional Data Analytics

2024

Zyxel Certified Network Engineer

Enterprise Networking

Certified

National Winner — Smart India Hackathon

Recognized for technical excellence in AI for Mental Health. Government of India grant awarded.