SCX.ai Business Manager Primer
A practical guide to deploying production-grade AI with Australian Sovereignty, predictable cost, and measurable efficiency.
1. The AI Stack in Plain English
Think in three layers:
What you buy (Services)
- Inference-as-a-Service: APIs that turn prompts into answers. Pay per token.
- RAG Vault: Managed retrieval to ground answers in your docs.
- Fine-tuning: LoRA adapters to add tone/skills.
- GLP: Real-time filtering for safety & leakage.
- Secure Tool Runner: Safe execution of tool/DB calls.
What runs it (Models)
- Foundation models (Llama, Gemma, GPT-OSS): do the reasoning.
- Agents: Models + rules + tools.
- Embeddings: Numeric fingerprints for search.
- RAG: Fetches snippets for grounded answers.
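To make "numeric fingerprints" concrete: an embedding turns each document into a vector, and search ranks documents by how close their vectors are to the query's. A minimal sketch in pure Python (the toy 3-dimensional vectors and document names are made up for illustration; real embedding models emit hundreds of dimensions):

```python
import math

def cosine_similarity(a, b):
    """Similarity of two embedding vectors: 1.0 means identical direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy 3-dimensional "fingerprints"; real models use hundreds of dimensions.
docs = {
    "leave policy":   [0.9, 0.1, 0.0],
    "expense policy": [0.8, 0.2, 0.1],
    "cafeteria menu": [0.0, 0.1, 0.9],
}
query = [0.85, 0.15, 0.05]  # embedding of "How much annual leave do I get?"

# Rank documents by similarity to the query vector (what RAG retrieval does).
ranked = sorted(docs, key=lambda d: cosine_similarity(query, docs[d]), reverse=True)
print(ranked[0])  # → leave policy
```

This nearest-vector lookup is the core of what the RAG Vault does before snippets are handed to the model.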
What it runs on (Chips)
- Accelerators (SambaNova RDUs): for high throughput.
- GPUs: For flexibility.
- Facilities: Australian Sovereign Cloud (for compliance).
Why this matters: You get production-grade AI with Australian Sovereignty, predictable cost, and measurable efficiency (tokens/kWh), without building a data centre or hiring a research lab.
2. How Workflows Actually Run
Request pipeline: Auth → PII Redaction → Sovereignty Check → Local Retrieve → Locked Version → Safety → Deterministic Egress → Audit Bus (≤100 ms p95 latency)
The Lifecycle of a Prompt
- Your app calls SCX.ai with a prompt (and RAG context).
- GLP pre-filter removes PII/secrets and blocks injection.
- The router picks an approved model under policy.
- The model answers; if needed, it retrieves via RAG or calls the Secure Tool Runner.
- GLP post-filter checks the response for compliance.
- Return answer to app; everything logged with version/model.
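The lifecycle above can be sketched end to end. Every function and heuristic below is an illustrative placeholder, not the actual SCX.ai API:

```python
def handle_prompt(prompt: str, context: list[str]) -> dict:
    """Illustrative sketch of the prompt lifecycle; NOT the real SCX.ai API."""
    audit = []

    # 1. GLP pre-filter: redact PII/secrets, block injection (crude stand-in).
    cleaned = prompt.replace("SECRET", "[REDACTED]")
    audit.append({"stage": "glp_pre", "redacted": cleaned != prompt})

    # 2. Router picks an approved model under policy (toy length heuristic).
    model = "small" if len(cleaned) < 200 else "standard"
    audit.append({"stage": "route", "model": model})

    # 3. Model answers, grounded in the retrieved RAG snippets.
    answer = f"[{model}] answer grounded in {len(context)} snippets"

    # 4. GLP post-filter: compliance check on the response.
    audit.append({"stage": "glp_post", "passed": True})

    # 5. Return to caller; the audit trail records model and stages.
    return {"answer": answer, "audit": audit}

result = handle_prompt("Summarise SECRET project status", ["status.pdf#p1"])
print(result["audit"])
```

The point of the sketch: every stage appends to the audit trail, so the final answer is always traceable to the model and filters that produced it.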
Key metrics:
- Speed: p95 latency
- Quality: accept rate
- Cost: $ / 1M tokens
- ESG: tokens/kWh & gCO₂e
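p95 latency means the time under which 95% of requests complete, so a few slow outliers don't hide a fast typical experience. A short sketch of computing it from raw timings (nearest-rank method; sample values are made up):

```python
import math

def p95(latencies_ms):
    """95th percentile: 95% of requests complete at or below this latency."""
    ordered = sorted(latencies_ms)
    # Nearest-rank method: take the value at the 95% position.
    index = max(0, math.ceil(0.95 * len(ordered)) - 1)
    return ordered[index]

# 100 samples: 95 fast requests plus 5 slow outliers.
samples = [40] * 95 + [400] * 5
print(p95(samples))  # → 40, despite the 400 ms outliers
```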
3. Making Good Commercial Decisions
- Right-size models: small for classification, standard for reasoning, premium only when truly necessary.
- Estimate cost: (avg input + retrieved context + output tokens) × requests/month → tokens/month → cost.
- Buy reserved throughput for peak hours; throttle or queue the rest.
- Cache frequent retrievals and common answers to cut spend and latency.
- Use LoRA when prompts/RAG aren't enough; treat adapters like software releases.
- Report tokens/kWh alongside $/1M tokens to align finance and ESG.
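The caching advice is cheap to act on: wrapping a retrieval call in a memoising cache means repeated questions never pay for embedding and search twice. A minimal sketch using Python's standard library (the `retrieve` function is a stand-in, not a real SCX.ai call):

```python
from functools import lru_cache

@lru_cache(maxsize=1024)
def retrieve(query: str) -> str:
    """Stand-in for an expensive RAG retrieval; cached per unique query."""
    # In production this would hit the RAG Vault and pay for embedding + search.
    return f"snippets for: {query}"

retrieve("leave policy")           # miss: does the expensive work
retrieve("leave policy")           # hit: served from cache, no extra spend
print(retrieve.cache_info().hits)  # → 1
```

In a real deployment you would also bound cache lifetime, since stale snippets can undermine grounded answers.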
Relative cost by model tier:
- Light: 1×
- Standard: 3.5×
- Premium: 9×
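The token formula and the tier multipliers combine into a simple monthly estimator. The base $/1M-token rate below is a made-up placeholder, not SCX.ai pricing:

```python
# Relative cost multipliers from the tiers above.
TIER_MULTIPLIER = {"light": 1.0, "standard": 3.5, "premium": 9.0}
BASE_RATE_PER_1M = 1.00  # placeholder $/1M tokens for the light tier, NOT real pricing

def monthly_cost(avg_input, avg_context, avg_output, requests_per_month, tier):
    """(avg input + retrieved context + output) × requests → tokens/month → $."""
    tokens_per_request = avg_input + avg_context + avg_output
    tokens_per_month = tokens_per_request * requests_per_month
    rate = BASE_RATE_PER_1M * TIER_MULTIPLIER[tier]
    return tokens_per_month / 1_000_000 * rate

# Example: 500 input + 1,500 RAG context + 300 output tokens, 200k requests/month.
print(f"${monthly_cost(500, 1500, 300, 200_000, 'standard'):.2f}")  # → $1610.00
```

Note how retrieved context dominates the per-request token count here, which is why caching frequent retrievals cuts spend so directly.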
4. Sovereign Security & Compliance
- Identity (OIDC/SAML)
- Keys (KMS/HSM)
- Policy Engine
- Model Registry
- Audit Logs
- GLP Pre/Post Filters
- Sovereign RAG
- Model Endpoint
- Tool Runner
- Deterministic Egress
- Control vs Data Plane: Identity and policy are separate from execution.
- Deterministic Egress: Outputs return only to caller; tools allow-listed only.
- GLP Guardrails: Enforced on input/output; logged with reason codes.
- Version-locked: Signed artifacts, reproducible builds, one-click rollback.
- Audit by default: Immutable logs tie answer to model/policy/tools.
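Deterministic egress with allow-listed tools can be illustrated in a few lines. The tool names and log shape here are hypothetical, chosen only to show the pattern:

```python
# Hypothetical allow-list: the only endpoints the Tool Runner may invoke.
ALLOWED_TOOLS = {"crm.lookup", "docs.search"}
audit_log: list[dict] = []  # stand-in for the immutable audit bus

def run_tool(tool: str, args: dict) -> str:
    """Refuse any tool call that is not explicitly allow-listed, and log why."""
    if tool not in ALLOWED_TOOLS:
        audit_log.append({"tool": tool, "allowed": False, "reason": "not_on_allowlist"})
        raise PermissionError(f"tool '{tool}' is not allow-listed")
    audit_log.append({"tool": tool, "allowed": True})
    return f"result of {tool}"

run_tool("docs.search", {"q": "leave policy"})             # permitted
try:
    run_tool("external.webhook", {"url": "attacker.site"})  # blocked before egress
except PermissionError as err:
    print(err)
```

The key property: the denial is logged with a reason code before the exception is raised, so blocked attempts are auditable, not silent.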
5. What Your Team Needs to Do
Nominate an AI product owner (owns outcomes and KPIs).
Appoint a data steward (owns corpus quality, chunking, retention).
Involve security early (GLP rules, egress lists, key handling).
Start with one use case (60–90 days to 'boringly good' production).
Publish SLOs & dashboards (latency, cost, grounded answers).
Plan rollbacks (prompts and models), then practice them.
6. Quick Wins by Industry
Glossary (Business-Focused)
Before you deploy, be ready to answer:
- What outcome and KPI in 90 days?
- Small, standard, or premium model, and why?
- Which corpus, chunking plan, and filters?
- Which GLP rules (PII, secrets, jailbreaks)?
- Which endpoints, credentials, and allow-lists?
- What p95 requirement and peak plan?
- Tokens/month, $/1M tokens, STUs, cache strategy?
- Version lock, rollback, and audit export plan?
- Tokens/kWh and gCO₂e/token reporting?
Data remains in AU
Ready to Deploy Sovereign AI?
Start your journey with Australian-hosted, production-grade AI infrastructure today.