Nodex — GPU Infrastructure
Hong Kong · NVIDIA H800

GPU infrastructure for AI workloads.

Bare-metal H800 clusters, OpenAI-compatible inference, and managed fine-tuning — operated from Hong Kong.

H800
NVIDIA · 80GB HBM3
400Gb/s
InfiniBand fabric
99.5%
Uptime SLA
HKSTP
Tier-3 datacenter

What we provide.

01 / Bare-metal

GPU Server Rental

Dedicated NVIDIA H800 servers and multi-node clusters with InfiniBand interconnect. Full root access, persistent storage, custom OS images.

H800 · 8-GPU nodes NVLink + IB
02 / API

Inference API

OpenAI-compatible endpoint for open-weight language models. Drop-in replacement for existing applications. Token-based billing, no minimum commitment.

Open-weight models REST + streaming
03 / Training

Managed Fine-Tuning

Pre-configured training infrastructure for language and diffusion models up to 70B parameters. Bring your dataset, receive a trained model.

LoRA · Full FT Distributed training
04 / Media

Video & Audio AI

Aggregated access to leading generative video, voice cloning, and dubbing services through a unified API. Single contract, consolidated billing in USD.

Generation · Dubbing Unified API

Current-generation hardware. Predictable performance.

Hong Kong Science Park

Tier-3 datacenter operations at HKSTP. Robust power and cooling, multi-carrier connectivity, physical access controls.

NVIDIA H800 SXM

80GB HBM3 memory per GPU, 8-GPU nodes with NVLink, multi-node clusters connected via 400 Gb/s NDR InfiniBand fabric.

OpenAI-compatible API

Inference endpoints speak the OpenAI protocol. Switch existing applications by changing one environment variable. Streaming, function calling, embeddings supported.

Privacy by design

Customer workloads run in isolated tenant environments. We do not train on customer data, log prompts, or retain model outputs. Operated under HK PDPO.

List prices in USD.

Volume discounts and dedicated reservations available — contact us for committed-use agreements.

Product
Spec
On-demand
Sustained
H800 · Single GPU
80GB HBM3
$3.20/hr
$2.40/hr
H800 · 8-GPU node
640GB · NVLink
$24.80/hr
$18.40/hr
H800 · Cluster
4–16 nodes · IB
Custom
Inference · Input
per 1M tokens
$0.27
$0.20
Inference · Output
per 1M tokens
$1.10
$0.85

> Sustained pricing applies after 720 GPU-hours/month of committed usage.

Get in touch.

Office
Suite 1404, Tung Wai Commercial Building
109-111 Gloucester Road, Wan Chai
Hong Kong SAR
Hours
Mon–Fri · 09:00–18:00 HKT

For inquiries about pricing, technical specifications, capacity, or onboarding — please reach out by email. We typically respond within one business day.

Made on
Tilda