Maya's extractions learn from curated examples

A digital services project by Flexion

About Flexion

We build digital services for federal, state, and local government agencies. Learn more.

Open source

This project is developed in the open. View the source code on GitHub.

Forms Lab

Home
Forms
Catalog
Sign in

Color theme

In this section

← Back to Catalog
All Stories
#2 Maya signs in to access form authoring
#3 Maya uploads a PDF and reviews the extracted specs
#4 Maya shapes the form experience
#5 Maya reviews her proposed changes before publishing
#6 Carlos fills out a published form
#7 Maya receives a completed PDF
#8 Maya refines the data model with LLM assistance
#9 Carlos completes complex sections through conversation
#14 Developer establishes and maintains a living threat model
#48 Evaluator experiences a guided walkthrough of the project
#52 LLM responds to review comments to evolve forms
#59 Maya chooses her shaping model
#60 Carlos's conversation uses a chosen model
#61 Maya verifies AcroForm mapping
#62 Maya's extractions cite the law
#63 Maya's extractions learn from curated examples
#64 Maya's extractions use a tuned prompt
#65 Maya's extractions use our fine-tuned model
#66 Maya extracts via structured tool-use
#71 Developer navigates LLM integrations via a screaming service layer
#73 Maya's extractions use a tuned prompt (prompt optimization)
#74 Maya's extractions cite the law (RAG)
#75 Run live shaping model evaluation
#87 Maya authors a SNAP form guided by policy corpus and auto-evaluation

Catalog
Stories
Maya's extractions learn from curated examples

closedFinal Project

llm-integration

GitHub #63

User Story

As Maya, in order to improve extraction quality on complex government forms by teaching the model what good output looks like, I want an extraction variant that uses curated few-shot examples without requiring fine-tuning.

Preconditions

#58 (Story 10 variant picker) merged to main

Acceptance Criteria

New variant extraction/few-shot-sonnet that prepends 2-3 canonical (PDF description → spec) exemplar pairs to the extraction prompt
Exemplars committed under src/services/extraction/exemplars/ (or similar); each includes a short description, a compact spec, and rationale for inclusion
Extraction tab in Settings → Variants lists the few-shot variant
Evaluation run comparing sonnet baseline vs few-shot-sonnet on all three fixtures with the LLM-judge scorer
Both scorers (deterministic + LLM-judge) reported
New catalog page catalog/experiments/pdf-field-extraction/few-shot-sonnet.md with exemplar descriptions, approach, metrics, and findings on what the exemplars seemed to help with
catalog/experiments/_roadmap.md updated with shipped status and one-line finding

Success Metrics

Non-trivial delta (positive or negative) in at least one metric vs the Sonnet baseline — teaches us something either way
Exemplars are documented well enough that another contributor could add more

Notes

Class topic: prompt conditioning (Ch 8)
Keep exemplar count small (2-3) to avoid blowing token budget
Consider exemplars that deliberately demonstrate edge cases (nested groups, sensitivity labels, conditional fields)

Definition of Done

Acceptance criteria met
Tests pass
Type checking passes
CI pipeline green
Deployed and demoable

Home
Catalog
Design System

Forms Lab — LLM-Assisted Forms Platform