Document
Comparison Brief · R1
Subject
Rafay + OpenClaw vs. SARAH AI Suite
Prepared For
Scale Growth Advisors LLC
Classification
Confidential · Partner & Investor Discussion
Architecture Comparison Brief

An orchestration layer
Vs.
A Sovereign Appliance

Rafay sells a Kubernetes-based control plane that turns rented GPUs into a token-metered cloud, and positions the open-source OpenClaw agent as the demand driver it monetises. SARAH AI Suite is the opposite shape — a fully integrated agentic-AI appliance running on NVIDIA DGX GB300 in our own Mini Data Centre, delivered over a Global Private Enterprise IP Network, with off-grid power as primary and the public grid demoted to redundancy. Different category. Different defensibility.

01Executive Summary

What one SARAH Hyperscale appliance delivers

Headline numbers from Schedule 1 of the Hyperscale White-Label Partner Agreement. Every figure is a specification, not a marketing claim.

30,000
concurrent voice AI agents on a single appliance
< 400 ms
first-word voice latency, end-to-end
99.95%
uptime SLA · < 15 min RTO · < 10 min RPO
17 / 23
voice languages / text languages at native fluency
309
enterprise connectors · API endpoints included
02Differentiators

Four lines that separate the two stacks

Rafay is a control plane that runs on top of someone else's infrastructure. OpenClaw is an open-source agent loop. Useful tools — but neither addresses what an enterprise actually needs to run mission-critical conversational AI in production.

01 · Power

Off-grid by design

Our Mini Data Centre runs on Magnetic Generators primary, Microsonic Generators secondary, Battery Backup tertiary. The public grid is a redundant fallback only. Rafay runs on whatever power the customer happens to have.

02 · Network

Global Private Enterprise IP Network

10GE–400GE private fibre from client site to our Mini Data Centre. One dedicated VLAN per customer. Zero public-internet hop. Rafay rides over whatever network the customer brings.

03 · Hardware

Sovereign DGX GB300 stack

We host the NVIDIA DGX GB300 platform with Lightmatter Passage™ L20 photonic interconnect, dedicated to one customer — no shared scheduler, no noisy neighbour, no rented-pool surprises. Rafay orchestrates whatever GPUs the customer rents.

04 · Stack

Whole platform, not a layer

Dual-brain LLM, voice AI in 17 languages, full SIP PBX with WebRTC, 9 AI departments and 34,792,085 Live Enterprise Features, Connectors & APIs are in the box. Rafay sells the pipes; OpenClaw is a loop on top. Customers still have to assemble the rest.

03Architecture

Two fundamentally different shapes

One is a control plane sold to organisations who run AI infrastructure. The other is the AI infrastructure, sold to the organisations who consume it.

Rafay + OpenClaw

Orchestration layer over rented GPUs

  • Rafay AI Factory and Token Factory sit on top of GPUs the customer rents from a neocloud or hyperscaler.
  • The customer remains responsible for the underlying data centre, power, network, hardware refresh and physical security.
  • An OpenAI-compatible inference API is exposed to internal teams; metering and chargeback are handled by Rafay.
  • OpenClaw is positioned as an open-source agent that "drives usage" — Rafay monetises that usage; OpenClaw itself is not Rafay's product.
  • Voice AI, SIP PBX, telephony, multi-channel inbox and enterprise connectors are out of scope; the customer integrates third parties.
  • Multi-tenant Rafay control plane shared across every Rafay customer; the underlying GPU pool is typically shared too.
  • Pricing: platform fee plus per-GPU, per-CPU or per-node consumption. Sales-led, opaque.
SARAH AI Suite

End-to-end sovereign appliance

  • NVIDIA DGX GB300 with Lightmatter Passage™ L20 photonic interconnect, dedicated to the customer, hosted in our Mini Data Centre.
  • Power is off-grid: Magnetic Generators primary, Microsonic Generators secondary, Battery Backup tertiary, public grid as redundancy only.
  • Delivered over a Global Private Enterprise IP Network — 10GE–400GE private fibre, one VLAN per client site, zero public-internet hop.
  • Dual-brain architecture — 1-trillion-parameter Deep Thinker + 235-billion-parameter Doer — running simultaneously on the same hardware.
  • Full voice AI: 17 voice languages, 23 text languages, sub-400 ms first-word latency, mid-conversation language switching, 3 GB GPU VRAM per call.
  • Full SIP PBX with WebRTC, unified multi-channel inbox, bank-grade KYC, sentiment-based routing, 34,792,085 Live Enterprise Features, Connectors & APIs with API endpoints included — included.
  • Pricing: one-time Capex, locked retail, zero per-token, per-call, per-seat or per-minute fees. Ten percent annual maintenance from year two.
04Feature Comparison

Line by line

Each row reflects what is in the product today, not a roadmap. Sources are listed at the bottom of this document.

Dimension Rafay + OpenClaw SARAH AI Suite
Where does it run? Whatever GPUs the customer rents on a public cloud, a neocloud, or in their own colo. Our own Mini Data Centre on NVIDIA DGX GB300 with Lightmatter Passage™ L20 photonic interconnect.
Power Public grid. Customer-supplied UPS at best. No independent generation. Off-grid: Magnetic Generators + Microsonic Generators + Battery Backup; public grid is the redundant fallback, not the primary.
Network Customer's network, with public-internet egress for every inference API call. Global Private Enterprise IP Network — 10GE–400GE private fibre to client site, one VLAN per customer, zero public-internet hop.
Tenancy Multi-tenant control plane across every customer; the GPU pool is typically shared too. Dedicated sovereign appliance per customer — no shared inference pool, no shared scheduler, no contention.
Reasoning model Bring your own. Rafay does not ship a proprietary model. 1-trillion-parameter Deep Thinker (MoE) + 235 B Doer, both running in parallel on the same hardware.
Voice AI Out of scope. Token Factory exposes an inference API only. Full voice AI: 17 voice languages, sub-400 ms first-word latency, mid-conversation language switching, voice cloning from a 30-second sample.
Telephony / PBX Not included. Customer integrates Twilio, 8x8 or Genesys separately. Full SIP PBX with WebRTC, 9 AI departments, 36 permission controls per extension, real-time interpreter on *9.
Enterprise integrations None natively. Platform integrates with NVIDIA NIM and OpenAI-compatible clients. 34,792,085 Live Enterprise Features, Connectors & APIs built from scratch — API endpoints included across CRM, ERP, PMS, GDS, HR, Healthcare, Aviation, Accounting, Payments, IoT.
Agentic loop Open-source OpenClaw is positioned as a third-party demand driver. Rafay does not own it. Agentic execution is part of the platform — the Doer calls tools, runs CRM updates, processes payments and sends invoices mid-conversation.
Audio path Audio leaves the customer's premises over the public internet to reach the inference API. Audio never leaves the private fibre. Only signed text traverses the Global Private Enterprise IP Network.
Onboarding Customer-led. Rafay provides reference architectures; partner integrators do the build. 6-week turnkey onboarding by the Australian engineering team that built the Suite — hardware, photonic commissioning, 30,000-agent load test, voice cloning, UAT, cutover.
SLA & response Standard support 8AM–8PM EST; 24/7 enterprise tier is an additional 20% surcharge. 99.95% uptime, < 15 min RTO, < 10 min RPO, every-10-minute backups, 394+ restore points, P1 response in 15 minutes, 24/7 by the engineers who built it.
Concurrent capacity Whatever the customer's rented GPU pool holds; scales by buying more rented GPUs. 30,000 concurrent voice AI agents on a single Hyperscale appliance, made possible by photonic acceleration.
Pricing model Platform fee plus per-GPU, per-CPU or per-node consumption. Sales-led, no public tiers. One-time Capex with locked retail. Zero per-token, per-call, per-seat or per-minute fees. Ten percent annual maintenance from year two.
Compliance posture SOC-compliant control plane. Underlying GPU compliance is the cloud provider's responsibility. Single-stack audit trail. Designed in conformance with SOC 2 Type II, ISO 27001, GDPR, CCPA, HIPAA, PCI DSS.
Brand control Rafay is the platform brand the customer's developers interact with. Full white-label. Partner brand only — SARAH, iDesks, SGA, NVIDIA and Lightmatter are never disclosed to End Customers.
05Power Independence

The grid is our backup. Not our lifeline.

Every other "sovereign AI" vendor in the market depends on the public power grid for its primary feed. We inverted that assumption. Four independent power layers — with the public grid demoted to last-resort redundancy.

1 Primary

Magnetic Generators

200 kW per Mini Data Centre. Runs continuously, independent of any grid event, fuel supply or weather front.

2 Secondary

Microsonic Generators

Acoustic-resonance generation. Engages automatically if the magnetic feed is taken offline for maintenance.

3 Tertiary

Battery Backup

Sized to ride out any conceivable transient on both generation layers. The appliance never sees the event.

4 Redundancy

Public Grid

The mains feed is the last fallback, not the first. If everything else fails, the grid is still there. If the grid fails, everything else still runs.

06The Honest Verdict

Different category. Different defensibility.

Rafay and OpenClaw are useful open-source-adjacent building blocks. Run on a rented public-cloud VPS, they will get you a developer demo. They will not get you a sovereign enterprise contract. Once the conversation matters — when audio cannot leave the premise, when the grid cannot be a single point of failure, when the customer wants one signed contract for the whole stack — the architecture decides everything. A sovereign, GB300-class appliance in our own Mini Data Centre, delivered over a Global Private Enterprise IP Network, powered by Magnetic Generators with the grid demoted to redundancy, is simply a different category of system than a multi-tenant agent on a rented GPU. — Scale Growth Advisors · Architecture Brief R1

Ready to put it in front of a customer?

30 minutes with Chris Ismail and the Australian engineering team that built the SARAH AI Suite. We walk through the Mini Data Centre topology, the Global Private Enterprise IP Network design and the Hyperscale Appliance commercial structure — end-to-end, no slideware.

Schedule a 60-minute review

Sources & Methodology

  1. Rafay product positioning, AI Factory and Token Factory feature claims, customer logos and partner list: rafay.co (Platform, Token Factory, AI Factory, GPU PaaS for Sovereign Clouds, GPU PaaS for Neoclouds pages, retrieved May 2026).
  2. Rafay's positioning of OpenClaw as a third-party agentic demand driver: rafay.co/platform/rafay-platform-token-factory-and-openclaw.
  3. Rafay pricing posture (platform fee plus per-GPU / per-CPU / per-node consumption, sales-led): rafay.co/platform/pricing.
  4. SARAH AI Suite specifications, SLA, onboarding programme, support tiering, integration list and capacity figures: Schedule 1 (Specification), Schedule 2 (Capabilities and Integrations) and Schedule 3 (Support Boundary) of the SARAH Hyperscale Appliance White-Label Partner Agreement, document reference SGA-WLPA-ENT-2026-R3.
  5. SARAH AI Suite commercial structure: Sections 4 (Acquisition and Price) and 5 (Revenue Share) of SGA-WLPA-ENT-2026-R3.