Explainer · Local Inference

Fine-Tune Gemma 4 with Unsloth

Unsloth has published a fine-tuning guide for Google's Gemma 4 open model family. A free cloud laptop, a small synthetic dataset, and under 20 minutes of training put a domain-specific model inside reach of any small UK team willing to spend an afternoon on it.

R
RAR Editor
Published June 2026 · 5 min read
The Quick Version
  • Fine-tuning teaches a model your tone, templates and specialist vocabulary, so it stops sounding like everyone else's assistant.
  • Unsloth has published a fine-tuning guide for Google's Gemma 4 family of open AI models, working on free cloud laptops.
  • Unsloth claims around 1.5x faster training and roughly 60% less memory than standard setups, with no accuracy loss.
  • Free cloud-hosted notebooks and a new no-code web UI let a small team start without a workstation.
  • A small synthetic dataset and a short training run on the free tier is enough to see a clear difference.

Unsloth, a library for training AI models more efficiently, has published a fine-tuning guide for Google’s Gemma 4, a family of open-weights models, this month. The guide covers all five variants in the line-up.

According to the Unsloth documentation, the library trains Gemma 4 roughly 1.5x faster and uses around 60% less memory than standard setups, with no accuracy loss claimed. The guide documents several Gemma 4-specific training bugs that Unsloth says it has fixed at the library level.

What Unsloth released alongside it

The team launched Unsloth Studio, a no-code local app for loading, training and export, that runs on macOS, Windows and Linux. Free cloud-hosted notebooks cover the smaller variants; the larger ones ship with high-memory hardware notebooks.

What the Ideas2IT team did with it

The engineering team at Ideas2IT walked through the same path on a free cloud laptop and documented every step. Their target was a small HR policy assistant, fine-tuned on synthetic conversations generated from the firm’s own policy documents. They used Unsloth’s data tool to turn raw PDFs into a structured training set, fine-tuned the smallest Gemma 4 variant, and exported the result.

Their published result: a small synthetic dataset, trained for a short run on the free tier, produced a model that stopped inventing policy details. The fine-tuned model pulled in the specific policy terms and eligibility conditions from the training set, where the base model produced only generic HR language. The full pipeline, from raw PDFs to a deployable model, fits inside a single notebook.

20 minof training on a free cloud laptop was enough to turn the smallest Gemma 4 variant into a model that stopped inventing policy details.

What to do with this

For a small UK firm wondering whether fine-tuning is worth an afternoon, three questions frame the test:

  • Is the task narrow and repetitive? A fine-tuned small model is at its best on a fixed format: customer replies, contract clauses, internal Q&A, classification, structured summarisation. It is not a replacement for a frontier model on open-ended reasoning.
  • Is the format as important as the content? If your house style, product names or policy terms are the part that keeps breaking under prompting, a fine-tune is the lever.
  • Can a human review the training set? A synthetic dataset generated unsupervised will bake in the generator’s confusions. Five minutes of human review before the run is worth the effort.

The hardware bar is now low enough that a sole trader can test the thesis on a free cloud laptop, no workstation required. The data-residency argument is the one to put in front of a sceptical partner or DPO: once trained, the model runs locally on kit the firm already owns, and the source documents never have to leave the building during inference. If the experiment works, you have a model that sounds like your firm. If it does not, you have lost an afternoon.

Sources & quotes

Every quotation in this article is verbatim from a named source — click any 1 to see where it came from. It's part of how we keep an AI-run newsroom honest. How we verify →

  1. Gemma 4 Fine-tuning Guide | Unsloth Documentation
  2. Fine-Tune Gemma 4 E2B with Unsloth on a Free T4 GPU
Filed under Explainer · Local Inference

Continue Reading