The server we install for most of our firms. It sits quietly in your own server room — about the size of a small filing cabinet — and serves 10 to 30 lawyers across two practice groups. We train it on every contract, memo, and matter your firm has handled. Your lawyers ask it instead of pasting client files into ChatGPT. Nothing leaves the building. Ever.
10 to 30 lawyers, often across two practice groups. A major client just sent the firm an AI questionnaire. The managing partner already suspects associates are using ChatGPT off the corporate network — and the partnership wants the answer to be the firm’s, not Microsoft’s.
Smaller firms whose clients are major banks, pension funds, or insurers — clients who treat your firm like a 500-lawyer firm when it comes to confidentiality. Their AI questionnaires arrive the same week the partnership realises associates have been using ChatGPT.
A notarial office or a mid-size in-house legal department lives under the same Loi 25 rules and the same professional-secrecy obligations as a law firm. The Practice tier handles them with the same server, the same training on their files, and the same nothing-leaves-the-building promise.
Firms that started on the smaller Foundation server (5 to 10 lawyers) and have grown past it. We swap the server for the price difference, re-train on the new hardware over a weekend, and the lawyers don’t lose a day of work.
Quieter than a normal server because it’s water-cooled.
Dual-corded: 208–240 V / 20 A on each feed. Recommended UPS: 2000 VA on each feed for 15 minutes of runtime.
With 64 GB of combined VRAM across the two RTX 5090s, the server hosts Gemma 4 27B as the serving model, Gemma 4 Embedding, and a Gemma 4 4B side agent concurrently under vLLM’s dynamic partitioning, with comfortable KV-cache headroom.
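For IT teams who want to check the memory budget themselves, the arithmetic can be sketched roughly as follows. This is a back-of-the-envelope estimate, not a measurement. The 64 GB total and the 27B and 4B parameter counts come from the spec above. The embedding model size, the bytes per parameter, and the 4 GB runtime overhead are illustrative assumptions, and real headroom depends on the precision the models are served at.

```python
# Rough VRAM budget sketch. Only the 64 GB total and the 27B / 4B parameter
# counts come from the spec; every other number is an illustrative assumption.

TOTAL_VRAM_GB = 64.0  # 2 x RTX 5090 at 32 GB each


def weight_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate weight footprint in GB (1 GB taken as 1e9 bytes)."""
    return params_billion * bytes_per_param


def kv_headroom_gb(models: dict, total_gb: float = TOTAL_VRAM_GB,
                   overhead_gb: float = 4.0) -> float:
    """VRAM left for KV cache after weights and an assumed runtime overhead."""
    used = sum(weight_gb(params, width) for params, width in models.values())
    return total_gb - overhead_gb - used


# (parameters in billions, bytes per parameter)
models_8bit = {
    "serving model (27B)": (27.0, 1.0),        # assumed 8-bit weights
    "embedding (assumed 0.3B)": (0.3, 2.0),    # size is a placeholder guess
    "side agent (4B)": (4.0, 1.0),             # assumed 8-bit weights
}

# Same models, all held at 16-bit precision for comparison.
models_16bit = {name: (params, 2.0) for name, (params, _) in models_8bit.items()}

if __name__ == "__main__":
    for label, models in (("8-bit", models_8bit), ("16-bit", models_16bit)):
        print(f"{label}: {kv_headroom_gb(models):.1f} GB left for KV cache")
```

Under these assumptions the estimate is sensitive to weight precision: a negative result means the chosen precision does not fit in 64 GB, so the precision actually deployed is the figure to confirm during the audit.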
The server can handle 25 lawyers using it at the exact same second. In a 30-lawyer firm, that’s every associate drafting, every partner reviewing, and the assistants transcribing — all at once, without anyone waiting in line.
For firms where the AI ends up in front of clients and a few hours of downtime isn’t acceptable. Two servers running side by side; if one ever fails, the second takes over in under 30 seconds — nobody loses a question. The per-lawyer line stays the same: $29 covers both servers.
If the firm already runs GPU-capable hardware (or has approved budget through its usual IT channel), we deploy onto it. The asset stays on the firm’s books. Nothing extra on the Bilbs line. The Practice spec on this page is a reference for what works comfortably for 10–30 lawyers.
No IT director, no server room yet? After the audit we source the Practice-spec server, install it on-site, configure the network, and own the runbook. Hardware sits on a separate, transparent quote at OEM list price — your call once you see the spec. We don’t mark up the GPU.
If the firm grows past 30 lawyers, we swap the chassis for the Firm spec and re-train on the new hardware over a weekend. No re-platforming charge. The per-lawyer line stays the same.
A 45-minute call within one business day. We’ll tell you whether the Practice tier is the right size for your firm — and if it isn’t, we’ll tell you which one is. No pitch. No deck. No payment today.