Maya chooses her shaping model

A digital services project by Flexion

closed

Final Project

llm-integration

GitHub #59

User Story

As Maya, in order to choose which LLM drives form shaping and see how different models perform on that task, I want shaping variants to be selectable from Settings → Variants, backed by a quantitative benchmark.

Preconditions

#58 (Story 10 variant picker) merged to main

Acceptance Criteria

New evaluation kind shaping-commands scores a variant against scripted intents with expected Command[] outputs (precision/recall on command-kind + args)
Variants registered: shaping/haiku, shaping/sonnet (promoted baseline), shaping/opus
Shaping tab in Settings → Variants renders all three variants with descriptions and Learn more links
Provenance recorded in shaping-log.json entries (extend existing entry schema with variantId + modelId)
<VariantBadge task="shaping" ...> rendered wherever shaping output is shown (review/compare views)
New catalog suite catalog/experiments/shaping-model-comparison/ with _suite.md, haiku.md, sonnet.md, opus.md, each containing metrics, approach, and findings
catalog/experiments/_roadmap.md updated: row marked shipped, one-line finding added
Existing catalog/experiments/shaping-architecture/ entries remain intact (architecture story is separate from model comparison)
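
The provenance criterion above could be sketched roughly as follows. This story does not show the existing shaping-log.json entry schema, so `ShapingLogEntry` and its fields are hypothetical stand-ins, as are the helper name `withProvenance` and the placeholder model ID; only `variantId` and `modelId` come from the criterion itself.

```typescript
// Hypothetical stand-in for the existing shaping-log.json entry.
// The real schema is not shown in this story.
interface ShapingLogEntry {
  timestamp: string;
  intent: string;
}

// The two provenance fields named in the acceptance criteria.
interface ShapingProvenance {
  variantId: string; // e.g. "shaping/sonnet"
  modelId: string; // provider model identifier (placeholder values below)
}

// Extending by intersection keeps the existing fields untouched.
type ShapingLogEntryWithProvenance = ShapingLogEntry & ShapingProvenance;

// Hypothetical helper: returns a new entry and leaves the original unchanged.
function withProvenance(
  entry: ShapingLogEntry,
  provenance: ShapingProvenance,
): ShapingLogEntryWithProvenance {
  return { ...entry, ...provenance };
}

const baseEntry: ShapingLogEntry = {
  timestamp: "2025-01-01T00:00:00Z",
  intent: "example intent",
};

const loggedEntry = withProvenance(baseEntry, {
  variantId: "shaping/sonnet",
  modelId: "example-model-id", // placeholder, not a real model ID
});
```

If older log entries without these fields must still parse, marking both fields optional on read would be one way to handle that; the story does not say which approach the project takes.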

Success Metrics

Meaningful recall/precision separation between variants across ~6 scripted intents (same set as the shaping-architecture qualitative comparison)
Picker UI renders cleanly on every screen that renders shaping output
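
To make the precision/recall metric concrete, here is a minimal sketch of how one scripted intent might be scored. The `Command` shape, the function names, and the example command kinds are assumptions for illustration; the story names `Command[]` but does not define it, and the project's actual scorer may match commands differently.

```typescript
// Hypothetical Command shape; the story does not define it.
type Command = { kind: string; args: Record<string, unknown> };

type Score = { precision: number; recall: number };

// Build a comparison key from the kind, optionally including args.
// Top-level arg keys are sorted so key order does not affect matching;
// nested objects are compared as serialized, so their key order matters.
function commandKey(cmd: Command, includeArgs: boolean): string {
  if (!includeArgs) return cmd.kind;
  const sortedArgs = Object.keys(cmd.args)
    .sort()
    .map((k) => [k, cmd.args[k]]);
  return `${cmd.kind}:${JSON.stringify(sortedArgs)}`;
}

// Multiset matching: each expected command can be matched at most once.
function scoreCommands(
  expected: Command[],
  actual: Command[],
  includeArgs = true,
): Score {
  const remaining = new Map<string, number>();
  for (const cmd of expected) {
    const key = commandKey(cmd, includeArgs);
    remaining.set(key, (remaining.get(key) || 0) + 1);
  }

  let matched = 0;
  for (const cmd of actual) {
    const key = commandKey(cmd, includeArgs);
    const left = remaining.get(key) || 0;
    if (left > 0) {
      matched += 1;
      remaining.set(key, left - 1);
    }
  }

  // Convention chosen here: an empty side scores 1 only if the other is empty too.
  const bothEmpty = expected.length === 0 && actual.length === 0;
  return {
    precision: actual.length === 0 ? (bothEmpty ? 1 : 0) : matched / actual.length,
    recall: expected.length === 0 ? (bothEmpty ? 1 : 0) : matched / expected.length,
  };
}

// Example with made-up command kinds.
const expectedCommands: Command[] = [
  { kind: "addField", args: { id: "a" } },
  { kind: "removeField", args: { id: "b" } },
];
const actualCommands: Command[] = [
  { kind: "addField", args: { id: "a" } },
  { kind: "addField", args: { id: "c" } },
];

const strictScore = scoreCommands(expectedCommands, actualCommands);
const kindOnlyScore = scoreCommands(expectedCommands, actualCommands, false);
```

Averaging these per-intent scores across the scripted intents would give one precision/recall pair per variant, which is the separation this metric asks for.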

Notes

Seeded intents already exist in catalog/experiments/shaping-architecture/_suite.md — reuse them as the benchmark corpus
src/services/forms/shaping/registry.ts already exists with a Sonnet-only entry — extend it, don’t replace
The picker’s filling/mapping tabs stay empty until their respective stories land
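
The "extend it, don't replace" note might look something like the sketch below. The contents of src/services/forms/shaping/registry.ts are not shown in this story, so the `ShapingVariant` shape, the field names, the placeholder model IDs, the descriptions, and the choice to point Learn more links at the catalog suite files are all assumptions.

```typescript
type ShapingVariantId = "shaping/haiku" | "shaping/sonnet" | "shaping/opus";

// Hypothetical entry shape; the real registry type may differ.
interface ShapingVariant {
  id: ShapingVariantId;
  modelId: string; // placeholder values only
  description: string;
  learnMoreHref: string;
  promoted: boolean; // true for the promoted baseline
}

// Stand-in for the existing Sonnet-only registry.
const existingRegistry: ShapingVariant[] = [
  {
    id: "shaping/sonnet",
    modelId: "example-sonnet-model-id",
    description: "Promoted baseline for form shaping.",
    learnMoreHref: "catalog/experiments/shaping-model-comparison/sonnet.md",
    promoted: true,
  },
];

// Extend rather than replace: the existing entry is reused as-is.
const shapingRegistry: ShapingVariant[] = [
  ...existingRegistry,
  {
    id: "shaping/haiku",
    modelId: "example-haiku-model-id",
    description: "Haiku variant for form shaping.",
    learnMoreHref: "catalog/experiments/shaping-model-comparison/haiku.md",
    promoted: false,
  },
  {
    id: "shaping/opus",
    modelId: "example-opus-model-id",
    description: "Opus variant for form shaping.",
    learnMoreHref: "catalog/experiments/shaping-model-comparison/opus.md",
    promoted: false,
  },
];

// Hypothetical lookup for the default variant shown in the picker.
function getPromotedVariant(registry: ShapingVariant[]): ShapingVariant {
  const promoted = registry.find((variant) => variant.promoted);
  if (!promoted) throw new Error("No promoted shaping variant registered");
  return promoted;
}

const defaultVariant = getPromotedVariant(shapingRegistry);
```

Spreading the existing array keeps the Sonnet entry byte-for-byte identical, so anything already depending on it should keep working while the two new variants are added.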

Definition of Done

Acceptance criteria met
Tests pass (bun run check)
Type checking passes
Threat model updated if security-relevant
CI pipeline green
Deployed and demoable

Forms Lab — LLM-Assisted Forms Platform