Tag: RAG

Generative AI

Fine-Tuning vs RAG vs Prompting: Which One, and When (GenAI Series, Part 15)

Dr. Pranay Jha

June 18, 2026

Prompting steers, RAG adds facts, fine-tuning changes behaviour. The one question that decides which to use, a side-by-side comparison, and why to escalate in order of cost.
Continue Reading
Generative AI

Vector Databases: How Semantic Search Really Works (GenAI Series, Part 14)

Dr. Pranay Jha

June 18, 2026

A vector database stores embeddings and finds the closest in meaning, fast, across millions of items. How semantic search, ANN indexes like HNSW and IVF, and pgvector work.
Continue Reading
Generative AI

RAG: How to Stop Your AI Making Things Up (GenAI Series, Part 13)

Dr. Pranay Jha

June 18, 2026

Retrieval-augmented generation lets a model answer from your own documents by fetching the relevant passages at question time. How RAG works, and why it beats fine-tuning for facts.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

VMware Private AI Agent Builder: Composing Models, Knowledge Bases and Prompts (Private AI Series, Part 15)

Dr. Pranay Jha

June 15, 2026

Agent Builder in VMware Private AI Services lets you compose a model endpoint, a knowledge base and prompt instructions into a grounded agent. Here is what it actually does, where it sits, and where the agentic hype gets ahead of reality.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

Building a RAG Pipeline on VMware Private AI: 7 Failures That Quietly Break Retrieval (Private AI Series, Part 14)

Dr. Pranay Jha

June 15, 2026

Most RAG failures on VMware Private AI Foundation are not the LLM. Here are the seven pipeline failures that quietly wreck retrieval quality on PAIF 9, and how I fix each one in the field.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

Vector Databases in VMware Private AI: Running pgvector on Data Services Manager (Private AI Series, Part 13)

Dr. Pranay Jha

June 15, 2026

A reference-architecture look at the retrieval tier of VMware Private AI: where DSM-managed PostgreSQL with pgvector sits, how to place and size it, and whether to index with HNSW or IVFFlat.
Continue Reading

Architect’s Toolkit

About the Author

Dr Pranay Jha

Dr. Pranay Jha is a Cloud and AI Consultant with 18+ years of experience in hybrid cloud, virtualization, and enterprise infrastructure transformation. He specializes in VMware technologies, multi-cloud strategy, and Generative AI solutions. He holds a PhD in Computer Applications with research focused on Cloud and AI, has published multiple research papers, and has been a VMware vExpert since 2016 and a VMUG Community Leader.

Dr. Pranay Jha

Tag: RAG

Fine-Tuning vs RAG vs Prompting: Which One, and When (GenAI Series, Part 15)

Vector Databases: How Semantic Search Really Works (GenAI Series, Part 14)

RAG: How to Stop Your AI Making Things Up (GenAI Series, Part 13)

VMware Private AI Agent Builder: Composing Models, Knowledge Bases and Prompts (Private AI Series, Part 15)

Building a RAG Pipeline on VMware Private AI: 7 Failures That Quietly Break Retrieval (Private AI Series, Part 14)

Vector Databases in VMware Private AI: Running pgvector on Data Services Manager (Private AI Series, Part 13)

Architect’s Toolkit

VMware Cloud Foundation

Nutanix

AI & Cloud-Native Platform

Architecture & Design

About the Author

Dr Pranay Jha

You May Have Missed

VKS: The Verdict and When to Use It vs Alternatives (VKS Series, Part 17)

VKS Day-2 Operations: Backup, Multi-Tenancy and Capacity (VKS Series, Part 16)

Troubleshooting VKS: The Failure Modes That Actually Bite (VKS Series, Part 15)

Running GPU and AI Workloads on VKS (VKS Series, Part 14)

Deploying Applications on VKS with GitOps: Argo CD, Flux and Helm (VKS Series, Part 13)