TL;DR · Key Takeaways
- Hybrid with VCF 9 is not one stretched control plane across clouds. It is three separate planes: consistent infrastructure, workload mobility, and unified operations. Design each one deliberately.
- The connectivity spine is the NSX transit gateway. VCF 9.1 adds an isolated VPN service inside the transit gateway, multiple external connections, and multiple transit gateways per tenant, so you can route hybrid traffic without bolting on external edge appliances.
- HCX (now packaged as VCF Operations Workload Mobility) is what actually moves and stretches workloads to Azure VMware Solution, Google Cloud VMware Engine, Oracle Cloud VMware Solution and VMware Cloud on AWS.
- The hyperscaler VCF services lag your on-prem version. Plan for vSphere 8-era targets while your data center runs 9.x, and design around the gap rather than assuming parity.
- The two costs that surprise people: HCX Network Extension tromboning, and cloud egress. Both are architecture decisions, not line items you can fix later.
The word “hybrid” does a lot of quiet damage in design workshops. Someone says it, everyone nods, and three people in the room are picturing three different architectures. One imagines a single cluster stretched across a data center and a cloud region. One imagines lift-and-shift migrations that never come back. One imagines a single pane of glass that bills both sides. None of them is wrong, and that is exactly the problem: VCF 9 hybrid cloud is not one capability. It is three planes that you wire up separately, and the failures I see in the field almost always trace back to a team that designed one plane and assumed the other two came free.
What “hybrid” actually buys you in VCF 9
Break the marketing word into the three planes it really means, because each has a different owner, a different failure mode, and a different design artifact.
Consistent infrastructure. The same SDDC stack runs on-prem and in the cloud: vSphere, vSAN, NSX. This is the genuine value of the hyperscaler VCF services. Your operational muscle memory, your NSX segments, your storage policies, all transfer. But “consistent” does not mean “identical version,” and that distinction is where most of the pain lives. More on that below.
Workload mobility. Moving VMs between the two estates, in both directions, with the network coming along for the ride. This is HCX. It is the plane people underestimate, because a one-time migration looks easy in a demo and behaves very differently when you stretch a production L2 segment and leave it stretched for nine months.
Unified operations. One place to see cost, capacity, health and lifecycle across both estates. In VCF 9 this is VCF Operations, which can register VMware Cloud on AWS, Azure VMware Solution, Google Cloud VMware Engine, Oracle Cloud VMware Solution and IBM Cloud as cloud providers and pull their consumption into the same capacity and cost views as your private cloud. This is the plane that justifies the word “hybrid” to your finance team, and the one teams forget to design until the first surprise invoice.
The reference topology
Here is the shape I draw on the whiteboard for almost every hybrid VCF 9 engagement. On-prem you have your VCF 9 instance with its management domain and one or more VI workload domains. In the cloud you have a provider-managed VCF service. Two data paths connect them: a routed path through the NSX transit gateway and its VPN for normal east-west and north-south traffic, and an HCX path for migration and stretched networking. Sitting above both, not in the data path, is VCF Operations as the single observability and cost plane.
The thing to internalize from this picture: the routed path and the HCX path do different jobs and have different lifecycles. The routed path is permanent infrastructure. The HCX network extension should be temporary. Teams that treat the HCX extension as permanent infrastructure are the ones who call me eight months later asking why latency-sensitive apps are misbehaving.
The connectivity layer
VCF 9.1 reworked hybrid connectivity around the NSX transit gateway, and this is the part of the release that genuinely changes how you design. A transit gateway can now attach to multiple external connections, you can run multiple transit gateways per tenant, and there is a VPN service that lives inside the transit gateway itself. In plain terms, you can build isolated, multi-site routing with static routes and custom NAT without standing up external routing appliances to glue it together. The Remote Networks field on each external connection decides which prefixes route where, with a default route option when no remote network is specified.
For the mobility plane, HCX is the tool, now packaged and licensed as VCF Operations Workload Mobility. It pairs an on-prem HCX Connector with an HCX Cloud Manager at the target and gives you the migration types you actually care about: bulk migration for batches, Replication Assisted vMotion (RAV) for large fleets with minimal downtime, and Network Extension to stretch an L2 segment so a VM keeps its IP after it moves. HCX works across the full set of VMware cloud services, so the same playbook applies whether the target is Azure VMware Solution, Google Cloud VMware Engine, Oracle Cloud VMware Solution or VMware Cloud on AWS.
If you want the detailed migration mechanics, I covered them in how to migrate workloads into VCF 9 with HCX and vMotion. This post is about where those paths sit in the overall design, not the click-by-click.
Design matrix: where does each plane really live?
The single most useful artifact from a hybrid design workshop is a matrix that pins each plane to a concrete component, names the owner, and writes down the assumption that has to hold. Skip the matrix and you get hand-waving; build it and the gaps reveal themselves.
| Dimension | On-Prem VCF 9 | Hyperscaler VCF service | Assumption to validate |
|---|---|---|---|
| Control plane | You own SDDC Manager, vCenter, NSX Manager | Provider-managed; limited admin scope | Which operations are blocked on the cloud side |
| Version parity | VCF 9.0 / 9.1 on your schedule | Often a vSphere 8-era release; provider-driven cadence | HCX and feature compatibility across the version gap |
| Routing | NSX transit gateway + VPN (9.1) | Provider gateway (ExpressRoute, Interconnect, Direct Connect) | MTU, BGP/static design, overlapping CIDRs |
| Mobility | HCX Connector | HCX Cloud Manager (bulk, RAV, NE) | Whether L2 extension is temporary or permanent |
| Operations | VCF Operations (native) | Registered as a cloud provider in VCF Operations | Billing data granularity and refresh interval |
| Cost driver | Hardware + VCF licensing | Per-host subscription + egress | Egress volume from extended segments and DR replication |
The rightmost column is the one that earns its keep. Most hybrid designs fail not because a component was wrong but because an unstated assumption (“the cloud side will be on the same version,” “egress will be negligible,” “we can do that operation in the cloud SDDC”) turned out to be false after the contract was signed.
What actually bites you in production
Version skew is the default, not the exception. Your on-prem estate will be on VCF 9.0 or 9.1 while the hyperscaler service runs a vSphere 8-era build on the provider’s cadence. That gap is normal and manageable, but only if you design for it: check the HCX interoperability matrix for your specific source and target builds before you commit, and do not assume a 9.1-only networking feature is available on the cloud side. In practice the provider sets the upgrade calendar, not you, so treat the cloud version as a fixed input to the design rather than something you can dial in.
HCX network extension tromboning. When you stretch an L2 segment and a migrated VM in the cloud talks to a gateway that still lives on-prem, traffic hairpins back across the link and out again. Two hops over a wide-area link for what should be a local conversation. Mobility Optimized Networking (MON) exists precisely to fix this by letting the cloud side route locally, and you should enable it for any extension that will carry real east-west traffic. The deeper point: a stretched segment is a migration aid with a shelf life. Plan the cutover that moves the gateway and retires the extension. If you cannot name the date you will collapse the stretch, you have not finished the design.
Egress is an architecture decision. Inbound is cheap, outbound is not. A DR pattern that replicates continuously from cloud back to on-prem, or a chatty app split across the link, can generate egress charges that dwarf the compute subscription. Decide deliberately which direction your data flows and how often, and model it before you build, because you cannot refactor your way out of a bad data-gravity decision after the workloads have landed.
My take
For most enterprises the honest reason to go hybrid with VCF 9 is operational consistency and a real migration on-ramp, not bursting. The single stretched control plane that the word “hybrid” conjures up is mostly a mirage; what you actually get, and what is genuinely valuable, is the same operating model on both sides plus a clean path to move workloads between them.
For the operations plane that ties this together, see VCF 9 multi-instance fleet management, and for the on-prem multi-site building block that the hybrid pattern extends, see VCF 9 stretched clusters and multi-site design.
What I’d Do
Design the three planes separately and on purpose. Make the routed transit gateway path your permanent spine, treat every HCX network extension as a temporary structure with a named retirement date, and stand up VCF Operations as the cost and capacity plane on day one rather than after the first invoice. Write the assumption column of the design matrix in ink and get the provider to confirm each one, especially version cadence and egress. Do that, and “hybrid” stops being a word three people in the room each interpret differently and becomes an architecture you can actually operate. What is the assumption in your own hybrid design that nobody has written down yet?
References
- Transit Gateway Connectivity Options in VCF 9.1 (VMware Cloud Foundation Blog)
- VMware Cloud Foundation: Extending the Data Center into Public Cloud
- VCF Operations Workload Mobility (HCX)
- VCF Operations Cloud Providers Overview (Broadcom TechDocs)









