Bare-metal H800 clusters, OpenAI-compatible inference, and managed fine-tuning — operated from Hong Kong.
Dedicated NVIDIA H800 servers and multi-node clusters with InfiniBand interconnect. Full root access, persistent storage, custom OS images.
OpenAI-compatible endpoint for open-weight language models. Drop-in replacement for existing applications. Token-based billing, no minimum commitment.
Pre-configured training infrastructure for language and diffusion models up to 70B parameters. Bring your dataset, receive a trained model.
Aggregated access to leading generative video, voice cloning, and dubbing services through a unified API. Single contract, consolidated billing in USD.
Tier-3 datacenter operations at HKSTP. Robust power and cooling, multi-carrier connectivity, physical access controls.
80 GB HBM3 memory per GPU, 8-GPU nodes with NVLink, and multi-node clusters connected via a 400 Gb/s NDR InfiniBand fabric.
Inference endpoints speak the OpenAI API protocol. Switch an existing application over by changing a single environment variable (the base URL). Streaming, function calling, and embeddings are supported.
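The one-variable switch can be sketched with nothing but the Python standard library: the request below follows the OpenAI chat-completions wire format, and only the base URL changes. The endpoint URL and model name are placeholders, not published values for this service.

```python
# Minimal sketch of redirecting an OpenAI-protocol client via one
# environment variable. URL and model name below are placeholders.
import json
import os
import urllib.request

os.environ["OPENAI_BASE_URL"] = "https://inference.example.hk/v1"  # placeholder

def build_chat_request(model: str, messages: list[dict]) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style /chat/completions request."""
    base = os.environ["OPENAI_BASE_URL"].rstrip("/")
    body = json.dumps({"model": model, "messages": messages}).encode()
    return urllib.request.Request(
        f"{base}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {os.environ.get('OPENAI_API_KEY', '')}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request(
    "llama-3-70b-instruct",  # placeholder model name
    [{"role": "user", "content": "Hello"}],
)
```

An application already using the official OpenAI SDK needs no code change at all, since the SDK reads its base URL from the environment.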
Customer workloads run in isolated tenant environments. We do not train on customer data, log prompts, or retain model outputs. Operated under HK PDPO.
Volume discounts and dedicated reservations available — contact us for committed-use agreements.
> Sustained pricing applies after 720 GPU-hours/month of committed usage — equivalent to one GPU running continuously for a 30-day month, or one 8-GPU node for 90 hours.
For inquiries about pricing, technical specifications, capacity, or onboarding — please reach out by email. We typically respond within one business day.