Tag: GenAI Series

Generative AI

“Looks Good” Isn’t Enough: Evaluating GenAI Output (GenAI Series, Part 18)

Dr. Pranay Jha

June 18, 2026

Fluent is not the same as correct. How to evaluate GenAI output properly: build a golden set, choose human, automatic or model-graded scoring, and run it as a harness.
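As a rough illustration of the golden-set-plus-harness idea, here is a minimal sketch using automatic exact-match scoring. The golden set, the `stub_model` function, and the scoring rule are all hypothetical stand-ins, not taken from the post; a real harness would call an actual model and might use human or model-graded scoring instead.

```python
from typing import Callable

# Hypothetical golden set: (prompt, expected answer) pairs.
GOLDEN_SET = [
    ("What is 2 + 2?", "4"),
    ("Capital of France?", "Paris"),
    ("Opposite of hot?", "cold"),
]


def exact_match(output: str, expected: str) -> bool:
    """Automatic scorer: case-insensitive match after trimming whitespace."""
    return output.strip().lower() == expected.strip().lower()


def run_harness(model: Callable[[str], str], golden_set=GOLDEN_SET) -> float:
    """Run every golden prompt through the model and return the pass rate."""
    passed = sum(exact_match(model(prompt), expected) for prompt, expected in golden_set)
    return passed / len(golden_set)


def stub_model(prompt: str) -> str:
    """Stand-in for a real model call (assumption); one answer is wrong on purpose."""
    canned = {
        "What is 2 + 2?": "4",
        "Capital of France?": "paris ",
        "Opposite of hot?": "warm",
    }
    return canned[prompt]


if __name__ == "__main__":
    print(f"pass rate: {run_harness(stub_model):.2f}")
```

Because the harness takes the model as a plain function, the same golden set can be re-run unchanged whenever the prompt or the model changes.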
Generative AI

Multimodal AI: When One Model Handles Text, Images, and Audio (GenAI Series, Part 17)

Dr. Pranay Jha

June 18, 2026

Multimodal models handle text, images, and audio at once by turning every input into vectors in one shared space. How vision-language models work, and what they make possible.
Generative AI

AI Agents: What Actually Works, and What’s Hype (GenAI Series, Part 16)

Dr. Pranay Jha

June 18, 2026

An AI agent is a model in a loop that plans, calls tools, and observes results. What agents genuinely do well today, and why reliability, not intelligence, is the real bottleneck.
Generative AI

Fine-Tuning vs RAG vs Prompting: Which One, and When (GenAI Series, Part 15)

Dr. Pranay Jha

June 18, 2026

Prompting steers, RAG adds facts, fine-tuning changes behaviour. The one question that decides which to use, a side-by-side comparison, and why to escalate in order of cost.
Generative AI

Vector Databases: How Semantic Search Really Works (GenAI Series, Part 14)

Dr. Pranay Jha

June 18, 2026

A vector database stores embeddings and finds the closest in meaning, fast, across millions of items. How semantic search, ANN indexes like HNSW and IVF, and pgvector work.
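To make "finds the closest in meaning" concrete, here is a minimal sketch of brute-force semantic search using cosine similarity. The three-dimensional vectors and item names are invented for illustration; real embeddings have hundreds or thousands of dimensions, and ANN indexes such as HNSW and IVF exist to approximate this exhaustive scan quickly at scale.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


# Hypothetical toy embeddings (assumption: hand-picked so related words sit close).
ITEMS = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "invoice": [0.0, 0.1, 0.9],
}


def nearest(query, items=ITEMS, k=2):
    """Exhaustive search: score every stored vector and keep the top k names."""
    ranked = sorted(
        items,
        key=lambda name: cosine_similarity(query, items[name]),
        reverse=True,
    )
    return ranked[:k]


if __name__ == "__main__":
    print(nearest([1.0, 0.0, 0.0]))  # ['cat', 'kitten']
```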
Generative AI

RAG: How to Stop Your AI Making Things Up (GenAI Series, Part 13)

Dr. Pranay Jha

June 18, 2026

Retrieval-augmented generation lets a model answer from your own documents by fetching the relevant passages at question time. How RAG works, and why it beats fine-tuning for facts.
Generative AI

Prompt Engineering That Actually Works (GenAI Series, Part 12)

Dr. Pranay Jha

June 18, 2026

Prompt engineering is not secret incantations; it is clear communication. The four moves that do most of the work, system vs user prompts, and the anti-patterns that waste tokens.
Generative AI

Why AI Models Make Things Up (and What Temperature Does) (GenAI Series, Part 11)

Dr. Pranay Jha

June 18, 2026

AI models generate by sampling likely words from a probability distribution. Why that produces confident hallucinations, what the temperature setting really does, and how to reduce them.
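A minimal sketch of what temperature does to that distribution, assuming the common formulation in which the model's raw scores (logits) are divided by the temperature before a softmax. The logit values below are made up for illustration.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; lower temperature sharpens the peak."""
    scaled = [value / temperature for value in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in scaled]
    total = sum(exps)
    return [value / total for value in exps]


if __name__ == "__main__":
    logits = [2.0, 1.0, 0.1]  # hypothetical scores for three candidate words
    for temperature in (0.5, 1.0, 2.0):
        probs = softmax_with_temperature(logits, temperature)
        print(temperature, [round(p, 3) for p in probs])
```

At low temperature almost all probability lands on the top-scoring word; at high temperature the distribution flattens, so less likely words are sampled more often.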
Generative AI

The Context Window, and Why Models Forget (GenAI Series, Part 10)

Dr. Pranay Jha

June 18, 2026

The context window is everything an AI can see at once. Why models have no memory between turns, why longer prompts cost more, and why details get lost in the middle.
Generative AI

Training vs Inference: Why Using AI Is the Real Cost (GenAI Series, Part 9)

Dr. Pranay Jha

June 18, 2026

Training builds a model once in three stages; inference runs it on every request, forever. Why the recurring inference bill, not the headline training cost, decides AI economics.
Generative AI

Attention, the Idea That Made Modern AI Work (GenAI Series, Part 8)

Dr. Pranay Jha

June 18, 2026

How attention lets every word in a sentence weigh every other word, why it replaced slow left-to-right models, and why running in parallel is what let AI scale.
Generative AI

How Words Become Numbers: Tokens and Embeddings (GenAI Series, Part 7)

Dr. Pranay Jha

June 18, 2026

A model does math, not language. How tokenizing chops text into chunks and embedding turns each into a vector of meaning-coordinates, where similar ideas sit close together.

About the Author

Dr. Pranay Jha

Dr. Pranay Jha is a Cloud and AI Consultant with 18+ years of experience in hybrid cloud, virtualization, and enterprise infrastructure transformation. He specializes in VMware technologies, multi-cloud strategy, and Generative AI solutions. He holds a PhD in Computer Applications with research focused on Cloud and AI, has published multiple research papers, and has been a VMware vExpert since 2016 and a VMUG Community Leader.
