> embedded ops · devops · sre · dataops

Your DevOps team on contract. For Web3, AI, ZK and DePIN.

We know where to source hardware fast and how to ship nodes fast. We run your infra, so you don't spend four months hiring SREs only to lose them in eight.

team@ximtrx:~$ devops status --client=$YOU
@maxc · UTC+3 · k8s · validators
@ru-sre · UTC+0 · vLLM · GPU ops
@zksre · UTC-5 · provers · ZK
147 runbooks · 19d since last Sev-1

> who it's for

Web3 / Validators · RPC · Testnets

Testnet next quarter. Hiring an SRE who actually knows Cosmos SDK is a 4-month process, plus equity. We get your validators live in 3 regions in 5 days, with slashing alerts and a signed uptime SLA.

AI / LLM Inference · Fine-tuning

You raised, you bought GPUs, and now the bill is bleeding you dry. Your ML team can train but doesn't want to babysit vLLM, OOMs and Triton at 3 AM. We run the inference layer (autoscaling, tracing, cost-per-token), so your researchers stop being on-call.

ZK / Prover farms

Proof generation is GPU-bound, deadline-bound, parallel. Dropped jobs = missed blocks. We build the farm, the queue, the retries, and per-circuit benchmarks on SP1 / RISC Zero / Boundless / Brevis.

DePIN / Distributed networks

The network pays for uptime, not excuses. Managing 500 nodes across 10 regions by hand is a part-time job no one on your team signed up for. We onboard, monitor and rebalance the fleet, and reconcile rewards weekly. We work with Filecoin, Akash, io.net, Render and Gensyn.

> how we engage

01 · Discovery
1 call. Scope, stack, regions, deadlines, SLA targets.
02 · Plan
One-page deployment plan in 48h. Architecture, hardware sourcing, milestones, budget.
03 · Deploy
Delivery via Terraform. Repo + IaC + monitoring + runbooks as one package.
04 · Operate
Signed SLA. 24/7 on-call. Post-mortem after every Sev-1.

> what we run for ourselves

Map of the XIMTRX fleet

We run our own fleet: 132 nodes across 12 countries, 99.982% uptime over the last 90 days. This isn't the product. It's the training ground. Every dashboard, runbook and on-call rotation you'd get from us is battle-tested on infra we pay for ourselves.
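For scale, an uptime percentage over a fixed window converts to a downtime budget with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes uptime is measured in wall-clock minutes and reported as a single fleet-wide figure, which the text above does not specify.

```python
MINUTES_PER_DAY = 24 * 60


def downtime_minutes(uptime_percent: float, window_days: int) -> float:
    """Minutes of downtime implied by an uptime percentage over a window."""
    total_minutes = window_days * MINUTES_PER_DAY
    return total_minutes * (1 - uptime_percent / 100)


def uptime_percent(downtime_min: float, window_days: int) -> float:
    """Uptime percentage implied by a number of downtime minutes."""
    total_minutes = window_days * MINUTES_PER_DAY
    return 100 * (1 - downtime_min / total_minutes)


if __name__ == "__main__":
    # 99.982% over 90 days leaves roughly 23 minutes of downtime.
    print(f"{downtime_minutes(99.982, 90):.1f} min")  # prints "23.3 min"
```

Under those assumptions, 99.982% over 90 days corresponds to about 23 minutes of total downtime.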

[ See the fleet → ]

> cases

ZK rollup · 6 mo · validator ops + prover farm

slashing: 0 · downtime: 11 min/90d

LLM startup · 4 mo · vLLM cluster in 3 regions

cost/token: −60% · p95 latency: 380ms

DePIN sub-operator · ongoing · 200 nodes in 8 regions

uptime: 99.94% · reward tier: top-10%

Incentivized testnet · 8 wk · 50 nodes burst

top-5 operator · onboarding in 72h

> stack we operate

Web3: Cosmos SDK · Geth · Reth · OP Stack · Arbitrum Orbit · Polygon CDK · EigenDA · Celestia
AI / LLM: vLLM · Triton · TensorRT-LLM · NVIDIA H100 / A100 · Ray · Kubeflow
ZK: SP1 · RISC Zero · Boundless · Brevis · Jolt · Halo2
DePIN: Filecoin · Akash · Render · io.net · Gensyn
Platform: Kubernetes · Terraform · Ansible · Prometheus · Grafana · Loki · OpenTelemetry · PagerDuty

> FAQ

No, we don't sell nodes. We are a managed DevOps team: we deploy and operate infrastructure on cloud or bare metal that you own or that we source on your behalf. You pay for the team, not the nodes.

A single L1/L2 validator: 5-10 business days end-to-end. GPU inference cluster across 3 regions: 10-14 days. Burst up to 100 nodes for an incentivized testnet: 72h.

A mix of providers: tier-1 clouds (AWS / GCP / Azure / Hetzner / OVH), bare-metal partners (Latitude.sh, OpenMetal), and regional providers in 12+ countries. We pick by latency, price and supply window.

Retainer + project work in cash. Tokens optional, case-by-case. Equity-only engagements: no.

You hold the keys. We use an HSM/KMS workflow where keys never leave your control. We sign, we don't custody.

Response SLAs are tiered. Default: p95 first response within 15 min for Sev-1, 1h for Sev-2. Higher tiers come with dedicated on-call.
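As a rough illustration of how a p95 first-response target can be checked against incident data, here is a minimal sketch. It assumes the nearest-rank percentile method and a plain list of first-response times in minutes; the names `SLA_TARGET_MIN`, `p95` and `meets_sla` are hypothetical, and the text above does not say how the percentile is actually measured.

```python
import math

# Default-tier targets from the text, in minutes (Sev-2: 1h = 60 min).
SLA_TARGET_MIN = {"sev1": 15, "sev2": 60}


def p95(values: list[float]) -> float:
    """95th percentile using the nearest-rank method (an assumption)."""
    if not values:
        raise ValueError("need at least one response time")
    ordered = sorted(values)
    rank = math.ceil(0.95 * len(ordered))  # 1-based rank
    return ordered[rank - 1]


def meets_sla(severity: str, response_minutes: list[float]) -> bool:
    """True if the p95 first-response time is within the tier's target."""
    return p95(response_minutes) <= SLA_TARGET_MIN[severity]


if __name__ == "__main__":
    sev1_responses = [4, 6, 9, 12, 14]  # made-up sample data
    print(meets_sla("sev1", sev1_responses))  # prints "True"
```

With very few incidents, the nearest-rank p95 is simply the slowest response, so a single late page is enough to miss the target.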

Kubernetes-first, Terraform for IaC, Prometheus + Grafana + Loki for observability, PagerDuty for on-call. We adapt to the client's environment.

The one-page deployment plan has a fixed turnaround (48h). Price depends on scope: send a short brief, we respond within 24h. Beyond that: hourly or monthly.

Often yes; it depends on region and GPU type. Send the spec and we'll quote the supply window within 24h.

An NDA is standard. In public case studies, details are anonymized.