Vikas Anand - AI Engineer | vikasanand.dev

Vikas
Anand

AI Engineer turning data into decisions — building RAG pipelines, LLM systems, and enterprise data platforms that scale.

0 Years experience

0 Analysts served

30min RAG latency achieved

Building intelligent systems that actually ship.

I'm a Senior AI/ML Engineer based in London with 8+ years of experience designing and delivering production AI systems across insurance, media, and fintech. I don't just prototype — I own the full lifecycle: architecture, infrastructure, deployment, and monitoring.

I built a natural-language analytics interface used by 200+ analysts, a RAG pipeline that cut research time from 30 minutes to under a minute, and a real-time ML recommendation engine for personalised cross-sell campaigns. At Sky UK, I scaled their primary conversational AI across help and sales journeys with production guardrails — prompt injection filtering, PII redaction, and RAG grounding.

I'm currently building FinRAG Eval — an open-source evaluation framework for financial RAG systems — as my contribution to the community and signal of depth for what comes next.

LinkedIn Profile →

The stack I ship with.

AI & LLMs

RAG Pipelines LangChain Azure OpenAI HuggingFace LLM Evaluation Prompt Engineering QLoRA / PEFT DeBERTa / NLI

Data Engineering

Snowflake Azure Synapse SQL PostgreSQL + pgvector MongoDB Data Quality (DMFs) sentence-transformers

MLOps & Cloud

Azure MLOps MLflow Docker Kubernetes Terraform Azure DevOps CI/CD AWS · GCP

Languages & Frameworks

Python FastAPI TypeScript Go Java Streamlit Next.js

Things I've built.

FinRAG Eval

⭐ Open Source

Hallucination detection · Citation accuracy · Retrieval quality — in one pipeline

A production-grade RAG evaluation framework built for financial documents. Uses claim-level decomposition with a local NLI model (DeBERTa-v3), fuzzy citation matching, and a Streamlit dashboard for visual inspection. Adapter-based design means you can swap one module to benchmark any RAG system. Automated dataset construction from real SEC 10-K/10-Q filings with LLM-generated QA candidates and human-in-the-loop review.

PythonAzure OpenAIDeBERTaStreamlitHuggingFaceSEC EDGAR

GitHub

ROI Copilot

Full-Stack SaaS

Zero hard failures across PDF, DOCX, XLSX, CSV

AI deal intelligence platform. Document intelligence pipeline combining Azure OpenAI assumption extraction with regex fallback, plus sentence-transformers + pgvector for semantic retrieval. Multi-tenant FastAPI backend (JWT, RBAC, 19 Postgres tables), 3-queue Celery worker pipeline, and a safe AST-based formula engine with topological dependency resolution.

PythonFastAPINext.js 14pgvectorCelery

QLoRA Financial Fine-Tuning

ML Research

Mistral-7B · 4-bit NF4 · rank-32 LoRA

Fine-tuned Mistral-7B on a custom SEC EDGAR pipeline — API ingestion, HTML parsing, section extraction, company-aware splits — for financial document summarisation. MLflow experiment tracking with config-driven reproducibility throughout.

Mistral-7BPEFTMLflowSEC EDGARPyTorch

GitHub

Sky UK RAG Pipeline

Enterprise · Production

30 min → under 1 min · 1M+ UK customers

Scaled Sky's primary conversational AI across help and sales journeys on web and mobile. Architected multi-tenant LLM serving with production guardrails — prompt injection filtering, PII redaction, intent safety classification, and retrieval-augmented knowledge grounding.

AzureRAGLLM GuardrailsPII RedactionPython

Learning & Experiments

Concurrency Bug Analyser

Lock graph + DFS cycle detection via Python AST & Go go/ast for deadlocks, races, goroutine leaks across 12 hazard types.

ML-Learnings

Jupyter notebooks covering core ML topics — from fundamentals to applied experiments.

Marine Insurance Predictor

Predictive modelling for marine insurance risk using classical ML approaches.

Where I've worked.

Zensar Technologies Jun 2021 – Present

↳ NFU Mutual (Jun 2022 – Present)

Senior AI/ML Engineer

Architect and delivery owner for 4 production AI systems at a top-20 UK insurer. Rated "Exceeds High Bar"; recognised by client CTO as primary technical liaison.
Built a natural-language analytics interface on Snowflake Cortex used by 200+ analysts, replacing routine SQL requests with self-serve conversational queries.
Shipped a RAG pipeline (Azure Functions, GPT-4o, hybrid vector search) over underwriting documents — cut per-query research time from ~30 min to under 1 min. Owned Terraform IaC, CI/CD, prompt versioning, and automated faithfulness evaluation.
Built a Snowflake-native ML prediction system end-to-end and a real-time recommendation engine using behavioural clustering for personalised cross-sell campaigns.

↳ Sky UK (Jun 2021 – Jun 2022)

Senior AI/ML Engineer

Scaled Sky's primary conversational AI chatbot across help and sales journeys, serving 1M+ UK accounts.
Architected multi-tenant LLM serving with production guardrails: prompt injection filtering, PII redaction, intent safety classification, and RAG knowledge grounding.

Accenture Innovation Hub Sep 2017 – May 2021

Senior Software Engineer

Top-rated (5/5) all review cycles with multiple spot bonuses. Led a team of 4 to automate Hyperledger Fabric deployments on Azure (K8s, Helm, Ansible), cutting deploy time from 3 days to under an hour.
Backend tech lead for myNav Cloud (Node.js, Azure) serving 270+ enterprise clients with automated cloud migration assessments.

Education & Certifications

Indian Institute of Science (IISc), Bangalore

Certificate in Data & AI · 2025

Bangalore Institute of Technology

B.Eng. Computer Science · 2017

Certifications: Generative AI with LLMs (deeplearning.ai) · AWS Developer Associate · Kubernetes for Administrators · Salesforce AI Associate · Dataiku GenAI Practitioner

Writing about what I build.

Coming Soon ~8 min read #RAG #LLMs #OpenSource #AI

How I Built FinRAG Eval: A Production-Grade RAG Evaluation Framework

RAG evaluation is deceptively hard. String matching doesn't capture semantic accuracy, and LLM-as-judge alone is expensive and inconsistent. Here's how I combined claim-level decomposition, local NLI models, and fuzzy citation matching to build an evaluation pipeline that actually reflects what users care about.

Hallucination Detection Citation Accuracy DeBERTa SEC Filings

More posts in the pipeline — covering QLoRA fine-tuning, Snowflake data quality patterns, and enterprise RAG in production.

Let's work together.

Open to roles globally

I'm looking for Staff / Senior AI Engineer roles at ambitious companies — FAANG, AI-first startups, or strong mid-tier tech. I bring 8+ years of proven delivery, not just prototypes.

Based in London, open to remote and relocation. Happy to talk about RAG, LLM systems, data platforms, or anything in between.

View LinkedIn recommendations →

Vikas
Anand

Building intelligent systems that actually ship.

8+ years in AI/ML engineering

Insurance · Media · Fintech

Open-source contributor

The stack I ship with.

Things I've built.

FinRAG Eval

ROI Copilot

QLoRA Financial Fine-Tuning

Sky UK RAG Pipeline

Concurrency Bug Analyser

ML-Learnings

Marine Insurance Predictor

Where I've worked.

Writing about what I build.

How I Built FinRAG Eval: A Production-Grade RAG Evaluation Framework

Let's work together.

VikasAnand

Building intelligent systems that actually ship.

8+ years in AI/ML engineering

Insurance · Media · Fintech

Open-source contributor

The stack I ship with.

Things I've built.

FinRAG Eval

ROI Copilot

QLoRA Financial Fine-Tuning

Sky UK RAG Pipeline

Concurrency Bug Analyser

ML-Learnings

Marine Insurance Predictor

Where I've worked.

Writing about what I build.

How I Built FinRAG Eval: A Production-Grade RAG Evaluation Framework

Let's work together.

Vikas
Anand