Available for new projectsOne client at a timeBased in the UK

I ship the production AI your team will inherit.

Independent software engineer. Multi-provider AI gateways, agent orchestration, real-time voice, evaluation harnesses, infrastructure as code. Twelve open-source repos. Fifty-seven long-form posts. One client at a time, every commit on me.

By the numbers

live

12

Open-source repos

MIT-licensed

57

Technical posts

with charts + citations

10+

Years in production

shipping software

< 50ms

AI failover handoff

across 14 engines

Self hosting, new playbook

Turn the old PC in your cupboard into a NAS with ZimaOS.

Replace Google Drive, iCloud, Plex Pass, and a paid password manager with one machine you already own. Add a local AI model on the same box for free. The full step by step guide, hardware specs, install, Immich, Jellyfin, Tailscale, Ollama, mobile setup, free vs paid.

Up to
£420
saved / year
From
0
if you have a PC
In about
1hr
to set up
ZimaOS running on a small home NAS
Flagship · open source · MIT

SarmaLink-AI

Multi-provider AI backend with 14-engine failover. OpenAI-compatible proxy, persistent memory, image generation, live web search. Now with cross-repo plugin system and Manus integration.

  • · Sub-50ms handoff when an engine returns 5xx or rate-limits
  • · Powered by DeepSeek V3.2 (685B), Gemini 3, GPT-OSS 120B
  • · Intent-based plugin auto-routing to ten sibling repos
  • · Manus webhook persistence for long-running agentic tasks
  • · White-label guide with copy-paste v0 prompt
36
engines
7
providers
14
max failover
41ms
fastest token
10
plugins
£0
frontier tier
How I work

One project. One client. At a time.

Most agencies are running ten projects simultaneously. Your work gets squeezed between everyone else's deadlines. I do the opposite: I work with a single client until the job is done, then take on the next.

It means I'm expensive and selective. But it also means the work is consistently excellent, and you always know exactly where things stand.

Your project gets my full attention, not shared with four others

You talk directly to me, not an account manager

I know your codebase, your goals, and your constraints inside out

No briefing a junior developer who has never seen your product

Faster decisions, faster progress, better results

What I build

Six lanes I have shipped repeatedly enough to publish the underlying tools as open source. Each tile links to the matching service page and the open-source repo it draws from.

AI gateways and chat backends

Multi-provider failover, OpenAI-compatible proxies, persistent memory, image generation, live tools. Single-provider risk in production is a pager waiting to happen.

Agent orchestration

Durable multi-agent workflows with deterministic replay, journaled state in Postgres, tool/token/wall-clock budgets, BullMQ queue, Inspector UI. Workflows that survive restarts and pass audit.

Real-time voice and RAG

Sub-second WebRTC voice loops with pluggable STT/LLM/TTS adapters and barge-in. End-to-end RAG starters with cited streaming answers, ten-minute clone-to-ship.

Evals, observability, audit

Datasets as files, scorers as functions, traces in DuckDB, regression mode that fails CI when a release loses ground. OpenTelemetry across every workflow step.

Production infrastructure

Helm charts for Next.js with full observability stack preconfigured. Terraform composing Vercel + Supabase + Cloudflare + DigitalOcean. Eight minutes from empty cluster to fully observable.

Full-stack web applications

Next.js App Router, TypeScript, Supabase Postgres with Row Level Security, Resend, Vercel. Server actions, ISR, OG images, JSON-LD, the boring tests passed.

Why SarmaLinux?

I'm a solo engineer who partners directly with founders, CTOs, and technical leaders.

No account managers. No handoffs. Just focused, high-quality work.

When you work with SarmaLinux, you work with me, and I bring the same care to your project as if it were my own.

Join the SarmaLinux Builder Community

SarmaLinux is not only a technology studio. It is also a growing community of engineers, developers and builders collaborating on meaningful technology projects.

Engineers & Developers

Connect with skilled professionals working on meaningful projects

AI & Automation

Collaborate on AI systems and automation platforms

Experimental Tools

Build and test new software tools together

Knowledge Sharing

Learn from others and share your expertise

Members collaborate on AI systems, automation platforms, cloud architecture and experimental software tools. Anyone interested in building impactful technology is welcome to participate.

Join the Community

Twelve open-source repos

Every project I publish solves a problem I hit in real client work and decided to write down properly. All MIT licensed.

AI gateways, agent orchestration, voice loops, evaluation harnesses, RAG starters, OCR pipelines, MCP servers, and the infrastructure glue underneath them.

Playbooks

The way I actually build things

Long form, opinionated walkthroughs of the systems I run for myself and for clients. Real commands, real config, real warnings about what tends to go wrong.

AI engineeringNew
28m

Building a multi-engine LLM gateway with failover

How Sarmalink-AI routes a single OpenAI-shaped request across 14 backends, fails over in under 50ms, and never returns a 500. Health checks, intent routing, cost guards.

Read playbook
AI engineeringNew
25m

Voice agent stack with sub-second round trip

Mediasoup, Whisper, Groq, OpenTTS. The architecture I use for natural voice agents that respond in under a second on a £6 VPS.

Read playbook
AI engineeringNew
24m

Production MCP server, from skeleton to deployed

A real Model Context Protocol server: FastAPI, tools, resources, auth, deployment, observability. Built on the mcp-server-toolkit pattern.

Read playbook
AI engineeringNew
26m

Running Ollama on a Tesla P40, the actual setup

Bringing an old datacentre GPU back to life for local LLM inference. Cooling, driver gotchas, model picks (Gemma 3 12B, Llama 3.3, Qwen 2.5 Coder), real tokens-per-second.

Read playbook
Self-hostingNew
24m

Deploying a Next.js app to a £6 VPS without Vercel

When the Vercel bill starts to bite, here is the exact recipe to move a production Next.js 16 app to a self-hosted VPS without losing any features.

Read playbook
Stack setupNew
20m

Cloudflare R2 as S3-compatible storage for a Next.js app

Replace S3 in your stack with R2: zero egress fees, S3-compatible API, signed uploads from the browser. The pattern I use for invoices, blog images, contracts.

Read playbook
Stack setupNew
22m

Stripe billing in a Next.js app, in 200 lines

Pricing tiers, customer portal, webhooks, recurring usage. The smallest amount of code that gets you a real, live, SCA-compliant billing flow.

Read playbook
Stack setupNew
22m

Magic link auth with Supabase, no foot guns

The exact magic-link flow I use on the admin and client portals: server actions, cookie session, callback handling, RLS hardening. With the four mistakes I made first.

Read playbook
DatabasesNew
20m

Postgres backups that actually restore

Daily pg_dump, point in time recovery, off site to R2, monthly restore drills. Most teams have backups. Far fewer have backups that work.

Read playbook
Stack setupNew
22m

A pragmatic Terraform stack for solo founders

What to put in Terraform, what to leave in the dashboard, and how to keep the diff small. The exact stack I use for Vercel, Supabase, Cloudflare, DigitalOcean.

Read playbook
Self-hosting
35m

Turn an old PC into a NAS with ZimaOS

Replace Google Drive, iCloud, Plex, and a password manager with one box you already own. Hardware, install, Immich, Jellyfin, Tailscale, local Ollama AI, free vs paid.

Read playbook
Self-hosting
20m

Deploying n8n on a VPS

A production-ready n8n install on a £6/mo VPS, Docker, Caddy, Postgres, backups, the lot. The setup I run for myself.

Read playbook
Stack setup
18m

Setting up Supabase with Vercel

Wiring Supabase to a Next.js app on Vercel without the auth foot-guns. RLS, server actions, type-safe queries.

Read playbook
Automation
22m

Migrating from Zapier to n8n

Cutting a £400/mo Zapier bill to £6 without breaking the business. A real migration plan with patterns and pitfalls.

Read playbook
AI engineering
25m

Building a Claude-powered receipt scanner

A working OCR receipt to JSON pipeline using Claude vision, with the prompt and post-processing.

Read playbook
Databases
24m

Zero-downtime database migrations

The pattern I use to ship schema changes in production without a maintenance window. Add, deploy, backfill, switch, drop.

Read playbook
Self-hosting
20m

Replacing SaaS with self-hosted tools

The five SaaS subscriptions I cancelled last year, and what they were replaced with. Real numbers, real trade offs.

Read playbook

Technology for Good

Technology should not only power businesses, it should also help communities.

Charities

Helping charitable organizations operate more effectively with modern tools

Education

Supporting education programs with digital platforms and automation

Community Organizations

Enabling community groups to focus on their mission, not administration

SarmaLinux supports initiatives where modern technology can help charities, education programs and community organizations operate more effectively.

Automation systems, AI tools and digital platforms can help organizations focus on their mission instead of administrative work. Organizations working on meaningful causes are welcome to reach out.

charity@sarmalinux.com

Let's build something good.

You've got a problem. I solve problems with software for a living.The fastest way to find out if we can work together is to talk.

Stack I build with

Next.js 16TypeScriptPythonSupabasePostgreSQLVercelCloudflareDigitalOceanResendn8nClaudeOpenAITailwind v4