Category: VMware & Cloud

AI Stack, VCF, VMware & Cloud

Networking for VMware Private AI Workloads: Segmentation, Ingress and the East-West Path (Private AI Series, Part 26)

Dr. Pranay Jha

June 17, 2026

Model serving lives or dies on the network nobody designed. Here is how to segment AI namespaces with NSX, expose inference endpoints through the Gateway API and the load balancer, and keep RAG east-west traffic fast and private.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

NVIDIA NIM Operator on VMware Private AI: The Reference Architecture for Declarative Model Serving (Private AI Series, Part 25)

Dr. Pranay Jha

June 17, 2026

The NIM Operator is the Kubernetes-native control plane for model serving on VMware Private AI. Here is how its CRDs, caching and autoscaling actually fit together, and the vGPU constraint that bites multi-GPU models.
Continue Reading
AI Stack, VCF, VMware & Cloud

VMware Private AI Foundation Upgrade: Moving from VCF 9.0 to 9.1 Without Breaking Your GPUs (Private AI Series, Part 24)

Dr. Pranay Jha

June 16, 2026

A practical 9.0 to 9.1 upgrade runbook for VMware Private AI Foundation, plus a closing verdict on the platform after 24 parts. The order of operations, the vGPU driver branch trap, and host-by-host GPU domain remediation.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

Troubleshooting VMware Private AI Foundation: 7 Failures That Actually Bite (Private AI Series, Part 23)

Dr. Pranay Jha

June 16, 2026

The seven failures I hit most often on VMware Private AI Foundation with NVIDIA, from a dark GPU on the ESXi host to a NIM pod crashing on CUDA out of memory, with the real error strings and the checks that isolate each layer.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

VMware Private AI MLOps: Built-In Model Lifecycle vs DIY MLflow and KServe (Private AI Series, Part 22)

Dr. Pranay Jha

June 15, 2026

Two ways to run model lifecycle on VMware Private AI: the built-in Model Store and Model Runtime, or a DIY MLflow and KServe stack on VKS. Here is when each one wins, and the verdict.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

How to Benchmark LLM Inference on VMware Private AI with genai-perf (Private AI Series, Part 21)

Dr. Pranay Jha

June 15, 2026

A practical runbook for benchmarking NIM inference on VMware Private AI Foundation: the metrics that matter, the concurrency sweep that exposes the real latency-throughput curve, and how to pick an operating point you can defend.
Continue Reading
AI Stack, VCF, VMware & Cloud

Is VMware Private AI Actually Private? A Security and Data Privacy Reality Check (Private AI Series, Part 20)

Dr. Pranay Jha

June 15, 2026

On-prem Private AI keeps your data in the building, but the breach risk is inside the cluster. How vDefend microsegmentation, confidential computing and RBAC in VCF 9.1 actually secure a Private AI pipeline.
Continue Reading
AI Stack, VCF, VMware & Cloud

Air-Gapped VMware Private AI Foundation: Mirroring, AMT and the Bootstrap Problem (Private AI Series, Part 19)

Dr. Pranay Jha

June 15, 2026

Deploying VMware Private AI Foundation in a fully disconnected enclave: what to mirror, how the artifact mirroring tool (AMT) fits, the Harbor bootstrap problem, and how to validate offline NIM and GPU before handover.
Continue Reading
AI Stack, VCF, VMware & Cloud

VMware Private AI Sizing and Cost: GPU Memory Math, Capacity Planning and TCO (Private AI Series, Part 18)

Dr. Pranay Jha

June 15, 2026

How to size a VMware Private AI platform from the workload up: GPU memory math, the KV cache trap, a model-to-card matrix, and the four-layer cost model that actually decides the business case.
Continue Reading
AI Stack, VCF, VMware & Cloud

GPU Monitoring with VCF Operations for VMware Private AI: The Signals That Actually Catch a Failing Workload (Private AI Series, Part 17)

Dr. Pranay Jha

June 15, 2026

VCF Operations gives you GPU dashboards out of the box, but the metric most teams trust is the one that lies. Here is what to watch on a Private AI Foundation estate, why GPU utilization misleads, and the hardware-health signals the default dashboards never surface.
Continue Reading
AI Stack, Automation, VMware & Cloud

Self-Service AI Catalog Items with VCF Automation for VMware Private AI (Private AI Series, Part 16)

Dr. Pranay Jha

June 15, 2026

How to publish self-service GPU catalog items for VMware Private AI Foundation with the VCF Automation Quickstart, plus the namespace, vGPU class and quota bindings that decide whether the catalog is safe to hand out.
Continue Reading
AI Stack, AI/ML, VMware & Cloud

VMware Private AI Agent Builder: Composing Models, Knowledge Bases and Prompts (Private AI Series, Part 15)

Dr. Pranay Jha

June 15, 2026

Agent Builder in VMware Private AI Services lets you compose a model endpoint, a knowledge base and prompt instructions into a grounded agent. Here is what it actually does, where it sits, and where the agentic hype gets ahead of reality.
Continue Reading

Architect’s Toolkit

About the Author

Dr Pranay Jha

Dr. Pranay Jha is a Cloud and AI Consultant with 18+ years of experience in hybrid cloud, virtualization, and enterprise infrastructure transformation. He specializes in VMware technologies, multi-cloud strategy, and Generative AI solutions. He holds a PhD in Computer Applications with research focused on Cloud and AI, has published multiple research papers, and has been a VMware vExpert since 2016 and a VMUG Community Leader.

Dr. Pranay Jha

Category: VMware & Cloud

Networking for VMware Private AI Workloads: Segmentation, Ingress and the East-West Path (Private AI Series, Part 26)

NVIDIA NIM Operator on VMware Private AI: The Reference Architecture for Declarative Model Serving (Private AI Series, Part 25)

VMware Private AI Foundation Upgrade: Moving from VCF 9.0 to 9.1 Without Breaking Your GPUs (Private AI Series, Part 24)

Troubleshooting VMware Private AI Foundation: 7 Failures That Actually Bite (Private AI Series, Part 23)

VMware Private AI MLOps: Built-In Model Lifecycle vs DIY MLflow and KServe (Private AI Series, Part 22)

How to Benchmark LLM Inference on VMware Private AI with genai-perf (Private AI Series, Part 21)

Is VMware Private AI Actually Private? A Security and Data Privacy Reality Check (Private AI Series, Part 20)

Air-Gapped VMware Private AI Foundation: Mirroring, AMT and the Bootstrap Problem (Private AI Series, Part 19)

VMware Private AI Sizing and Cost: GPU Memory Math, Capacity Planning and TCO (Private AI Series, Part 18)

GPU Monitoring with VCF Operations for VMware Private AI: The Signals That Actually Catch a Failing Workload (Private AI Series, Part 17)

Self-Service AI Catalog Items with VCF Automation for VMware Private AI (Private AI Series, Part 16)

VMware Private AI Agent Builder: Composing Models, Knowledge Bases and Prompts (Private AI Series, Part 15)

Architect’s Toolkit

VMware Cloud Foundation

Nutanix

AI & Cloud-Native Platform

Architecture & Design

About the Author

Dr Pranay Jha

You May Have Missed

VKS: The Verdict and When to Use It vs Alternatives (VKS Series, Part 17)

VKS Day-2 Operations: Backup, Multi-Tenancy and Capacity (VKS Series, Part 16)

Troubleshooting VKS: The Failure Modes That Actually Bite (VKS Series, Part 15)

Running GPU and AI Workloads on VKS (VKS Series, Part 14)

Deploying Applications on VKS with GitOps: Argo CD, Flux and Helm (VKS Series, Part 13)