The bigger server. A proper rack-mount with two power supplies, two network cables, and hot-swap drives, built so a single failure doesn’t knock anyone offline. It runs the biggest model at full precision for 30 to 100 lawyers across one or more offices. Built for full-service Québec firms and the handful of Bay Street-adjacent firms in the province. Nothing leaves the building.
30 to 100 lawyers across M&A, litigation, real estate, tax, and labour. The kind of firm that sees questionnaires from banks, insurers, and pension funds every week — and where the partnership wants the same answer on every one.
Montréal plus Québec City, Ottawa, or Toronto: one Firm-tier Box serves every associate over the corporate VPN. Per-office latency stays under 50 ms within the province.
Where AI downtime affects a litigation deadline or a closing. Deterministic failover under 30 seconds. Optional HA pair for single-rack redundancy.
Firms that started on the smaller Practice server and have grown past 30 lawyers. We swap the server for the price difference, re-train on the new hardware over a weekend, and the lawyers don’t lose a day of work.
Needs a proper rack room or sound-isolated closet
2× C19 (30 A) on dedicated 208-240 V circuits. Recommended UPS: 3500 VA on each feed for 10 minutes of runtime.
With up to 160 GB of GPU memory across two A100s and a PCIe Gen 5 interconnect, the server hosts Gemma 4 70B FP16, a Gemma 4 12B agent, and Gemma 4 Embedding concurrently, with efficient tensor parallelism. The H200 NVL upgrade lifts that to 282 GB of HBM3e.
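For a rough sense of how that memory gets used, here is a minimal sketch of the weight-memory arithmetic. It counts weights only (parameters times bytes per parameter) and ignores KV cache and runtime overhead. The 4-bit precision for the 12B agent is an assumption made for illustration, since the page does not state it, and the embedding model is left out because its size is not given. The function names are hypothetical.

```python
def weights_gb(params_billion: float, bits_per_param: int) -> float:
    """Approximate weight memory in GB (10^9 bytes): parameters x bytes per parameter."""
    return params_billion * bits_per_param / 8


def headroom_gb(total_gb: float, models: list[tuple[float, int]]) -> float:
    """GPU memory left for KV cache and runtime overhead once all weights are loaded."""
    return total_gb - sum(weights_gb(p, b) for p, b in models)


# Each model is (parameters in billions, bits per parameter).
main_model = (70, 16)  # 70B at FP16, as stated in the spec
agent_model = (12, 4)  # ASSUMPTION: 12B agent quantized to 4 bits; precision is not stated

for label, total in [("2x A100", 160), ("2x H200 NVL", 282)]:
    left = headroom_gb(total, [main_model, agent_model])
    print(f"{label}: {left:.0f} GB left after weights")
```

The 70B model at FP16 needs about 140 GB for weights alone, so under these assumptions the two-A100 configuration runs with limited headroom and the H200 NVL upgrade leaves far more room for long contexts and concurrent requests.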
Recommended for production workloads. In-flight request queue replication means no lost prompts during failover. The $29 / lawyer / month subscription covers both servers — only the hardware is doubled.
The firm has 30 or more lawyers (or is on its way there)
The work has hard deadlines you cannot miss — litigation calendars, closings, regulator filings
Different practice groups need their own model, isolated from each other
A regulator or a client contract requires a second server that takes over automatically if the first one fails
You want the biggest model at full precision (the more accurate version)
Otherwise: the Practice tier may fit the firm better
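The checklist above can be read as a simple decision rule. The sketch below is one possible reading, not the vendor's sizing method: it assumes that any single item on the list is enough to point to the Firm tier, which the page does not state outright. The `FirmProfile` fields and `recommend_tier` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class FirmProfile:
    lawyers: int
    hard_deadlines: bool = False         # litigation calendars, closings, regulator filings
    isolated_group_models: bool = False  # practice groups need separate models
    failover_required: bool = False      # regulator or client contract demands automatic takeover
    wants_full_precision: bool = False   # biggest model at full precision


def recommend_tier(profile: FirmProfile) -> str:
    """Return "Firm" if any checklist item applies, otherwise "Practice".

    ASSUMPTION: one matching item is sufficient; the page does not say
    whether the items are meant as "any of" or "all of".
    """
    signals = [
        profile.lawyers >= 30,
        profile.hard_deadlines,
        profile.isolated_group_models,
        profile.failover_required,
        profile.wants_full_precision,
    ]
    return "Firm" if any(signals) else "Practice"


print(recommend_tier(FirmProfile(lawyers=45)))
print(recommend_tier(FirmProfile(lawyers=12)))
print(recommend_tier(FirmProfile(lawyers=12, failover_required=True)))
```

A firm that is "on its way" to 30 lawyers is a judgment call that this sketch does not capture; the 45-minute sizing call covers that case.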
Most full-service firms in this band have a server room and an IT director. If the firm already runs GPU-capable enterprise hardware (or has approved capex for it), we deploy onto it. The Firm spec on this page is the reference: 2× RTX 6000 Ada / A100, dual EPYC, 512 GB ECC, 4U rack with enterprise redundancy.
If the partnership wants to add private AI without the IT scope expanding, we source the Firm-spec server, install it on-site, configure the network, and own the runbook. Hardware sits on a separate, transparent quote at OEM list — your call once you see the spec. We don’t mark up the GPU.
If the firm grows past 100 lawyers, we swap the chassis for the National cluster and re-train on the new hardware. No re-platforming charge. The per-lawyer line stays the same.
A 45-minute call within one business day. We’ll tell you whether the Firm tier is the right size for your firm — and if it isn’t, we’ll tell you which one is. No pitch. No deck. No payment today.