A Gemma 4 fine-tune targets marketing copy

A community-built writer for marketing copy

A new AI fine-tuned specifically for marketing and creative writing has reportedly beaten Google’s open-weights Gemma 4 base model by 290 Elo points on the EqBench3 benchmark, according to a post on the r/LocalLLaMA forum. The build points to what targeted fine-tuning on open-weights models can now buy a UK team — and to the licence shift that made the underlying model commercially usable in the first place.

It is fine-tuned on Gemma 4 31B, the largest dense variant in the family that Google DeepMind released on 2 April 2026, as recorded in a technical overview of the release. The 31B is the workstation flagship: small enough to fit a single rented H100, large enough for serious reasoning. That is the combination a narrow creative-writing fine-tune needs — enough headroom to hold a brand voice across thousands of words, cheap enough to retrain in an afternoon.

+290Elo points over base Gemma 4 31B on EqBench3 — a community-reported gain on a benchmark that scores stylistic range, narrative coherence and avoidance of generic phrasing.

What Gemma 4 changed

The fine-tune is only possible because of a quiet shift in Gemma 4’s licensing. Previous Gemma models shipped under a custom Google licence with usage caps and acceptable-use restrictions. Gemma 4 ships under Apache 2.0 — the same licence used by Qwen, DeepSeek and most of the rest of the open-weight ecosystem. No monthly active user caps, no enforcement clauses, full commercial freedom. For UK teams that had to route Gemma 3 past legal before piloting it, that is the headline change.

The benchmark deltas over Gemma 3 27B, drawn from Google’s Gemma 4 model card, explain why the family is a credible base for a creative-writing fine-tune:

Mathematics (AIME 2026): 89.2% for 31B vs 20.8% for Gemma 3 27B.
Agentic tool use (τ²-bench): 76.9% vs 16.2%.
Competitive code (LiveCodeBench v6): 80.0% vs 29.1%.
Graduate-level science (GPQA Diamond): 84.3% vs 42.4%.

A marketing-copy fine-tune builds on reasoning gains like these. Better instruction-following, better long-form coherence, better tool use — all useful inputs to a writer-focused model that has to hold a brand voice across thousands of words. A technical overview of the Gemma 4 release places the licence change alongside the benchmark jump as the reason enterprise teams are finally evaluating Gemma.

Why a narrow fine-tune works now

Gemma 4 supports native system prompts, structured JSON output and a configurable thinking mode — the three pieces a creative-writing fine-tune needs most. The chat template is standard (system, user and assistant roles), so off-the-shelf training stacks like Unsloth and TRL plug straight in. The 31B dense variant is small enough to fine-tune on a single rented GPU overnight, and quantised weights shrink the inference footprint further.

The community build targets EqBench3 specifically, which scores models on stylistic range, narrative coherence and avoidance of generic phrasing. A 290 Elo gain on that benchmark suggests the fine-tune has shed the typical marketing opener clichés and learned to handle brand voice, register shifts and longer-form structure. For a UK marketing team, that maps directly onto the day-to-day: product pages, email sequences, LinkedIn posts, ad variants.

We have already covered how to fine-tune Gemma 4 with Unsloth and the hardware side in Gemma 4 on your own hardware — the new fine-tune fits the same workflow.

The variant being fine-tuned is Gemma 4 31B Dense — the workstation flagship. It has 30.7B parameters across 60 layers, a 1024-token sliding window, and a 256K-token total context. The vision encoder adds roughly 550M parameters for image input. Sampling defaults are temperature 1.0, top_p 0.95, top_k 64. Thinking mode is toggled by adding a control token to the system prompt; removing it disables reasoning and speeds generation. The model uses a hybrid attention pattern that interleaves local sliding-window attention with full global attention, with the final layer always global. Unified Keys/Values on global layers and Proportional RoPE keep long-context memory in check. For a creative-writing fine-tune, start from the instruction-tuned 31B checkpoint — pre-trained is the wrong starting point for chat-style output. The 12B Unified variant is a smaller, cheaper alternative at 11.95B parameters, with no separate vision or audio encoder.

Try the fine-tune, or build your own

For a UK team running marketing for a few brands, two paths open up.

Try the published fine-tune. If the community build is hosted on Hugging Face or Kaggle, it will run on the same infrastructure as the base 31B — a single rented H100, or a quantised version on a 48GB consumer card. Test it against your existing copy workflow on three jobs: a product page, an email sequence, a LinkedIn post. Measure how much of the AI draft you have to rewrite before it’s shippable.

Fine-tune Gemma 4 yourself. The 12B Unified variant is the cheaper target — Apache 2.0, 11.95B parameters, runs on a single 24GB consumer GPU when quantised. With a few hundred examples of your own brand copy, plus the human-edited rewrites you wish it had produced, you can fine-tune over a weekend. Existing Unsloth and TRL recipes carry over unchanged. The realistic outcome: a model that knows your house voice well enough to produce a first draft that needs light editing, not a rewrite.

A year ago, a custom model was a procurement project. Today it is a weekend job on a rented GPU — and the Apache 2.0 shift on Gemma 4 is what makes it commercially safe to ship the result.

Sources & quotes

Every quotation in this article is verbatim from a named source — click any ¹ to see where it came from. It's part of how we keep an AI-run newsroom honest. How we verify →

Filed under News · Models

A Gemma 4 fine-tune targets marketing copy

A community-built writer for marketing copy

What Gemma 4 changed

Why a narrow fine-tune works now

Try the fine-tune, or build your own

Sources & quotes

Continue Reading

Gemma 4 outpaces Qwen 3.6 on code review

Ai2 ships Tmax-27B terminal agent

A tiny local model can sort tickets