Skip to primary interaction
Healthcare

AI Triage + Appeal Drafting

Triage denied claims, draft cited appeals, and estimate recovery

  • Classifies denied claims as recover, write-off, or borderline
  • Drafts cited appeal letters for human review in seconds
  • Estimates overturn probability, resolution timing, and recoverable value

Primary interaction

Selected denial flows through the appeal draft, triage score, and overturn estimate.

Workflow

Triage, draft, cite, recover

The selected denial drives the draft, classifier, citations, and recovery estimate.

TriageDraftCiteRecover
Drafter composing letter…
355ms
This surface is temporarily unavailable.

Low-dollar denials are often ignored because labor costs more than recovery. The drafter changes that math.

Loading economics…
Under the hood

Capability proof

Capability proof

Denial recovery workflow

Service model

Live denial-management service with triage, drafting, prediction, and metrics surfaces.

Intelligence layer

Scores denied claims, drafts cited appeals, and estimates overturn likelihood.

Operational state

Uses structured denial data with payer, claim, policy, and recovery fields.

Human control

Appeal drafts remain reviewable before use.

Business value

Makes low-dollar or borderline denials economically worth pursuing.

The compounding signal

AI Triage + Appeal Drafting — Live AI Demo

The visible value is recovered spend. The compounding value is the labeled signal: every triage decision teaches routing, every drafted appeal improves template quality, and every overturn prediction tests payer-history assumptions. This is how a drafting workflow becomes reusable operating intelligence.

Architecture

Estimated monthly cost at portfolio traffic: under $1/month (Lambda free tier + minimal API Gateway requests).

LayerTechPurpose
Synthetic dataJSON · 8 payers · 5 appeal templates · 40 denials · 40 triage results · 35 appeal drafts · 35 outcome predictionsReal WPC ANSI CARC/RARC codes and real CPT codes; pre-computed AI outputs (triage decisions, drafted letters with citations, overturn predictions); one deterministic seed (20260515).
ComputeAWS Lambda (nodejs20.x · 512MB)Single function, route-based dispatch across 8 endpoints (payers, appeal-templates, denials, denials/{id}, triage, draft-appeal, predict-outcome, metrics).
AI surface #1 — Triage classifierPre-computed model output + 120-380ms latency simulationModels a gradient-boosted classifier scoring recoverability. Features: CARC + RARC + payer + CPT + dollar amount + days since denial + prior appeal count + clinical doc availability. Returns decision + confidence + expected economics + feature contributions.
AI surface #2 — Appeal drafter (agentic)Pre-computed LLM-agent output + 600-1400ms latency simulationThe headline agentic moment. Composes a citation-laden appeal letter from template + payer policy citations + regulatory references + supporting-doc checklist. Production would be a fine-tuned classifier-then-drafter pipeline or a managed RAG service over payer policy corpus.
AI surface #3 — Outcome predictorPre-computed survival-analysis output + 180-450ms latency simulationModels a survival-analysis classifier predicting overturn probability and days-to-resolution as a joint distribution with confidence interval. Anchored to payer-history baselines.
APIAPI Gateway (HTTP API)Shared with the other healthcare demos on the same SAM stack.
FrontendReact + CSS modules · EDGE design tokens · shared DenialPicker across AI surfacesLibrary overview, three AI surfaces with independent default denials, denial inbox with filter/detail split-pane, appeal template library with click-to-expand cards.
DeployAWS SAM · us-east-2`sam build && sam deploy`; CloudFormation manages all resources.

Appeal template library

Failed to load appeal templates.

What this demo is, and isn't

What this demo is, and isn't

  • All patient names, member IDs, claim IDs, and denial records are synthetic. CARC and RARC codes are real WPC ANSI X12 835 values; CPT and HCPCS codes are real. The data behind every code is fictional; no PHI, no real denials, no real appeal letters.
  • The triage classifier is the economic gatekeeper. Its outputs are pre-computed in the synthetic dataset. A production deployment would call a gradient-boosted classifier or a fine-tuned distilbert with features like CARC + RARC + payer + CPT + dollar amount + days since denial + prior appeal count + clinical doc availability. The decision shape (recover / borderline / write-off), the confidence score, the per-feature contribution breakdown, and the expected-economics arithmetic are all modeled on what a production classifier would emit — not narrative text dressed up to look like a model output.
  • The appeal drafter is the headline agentic surface, and the drafted letters in the synthetic dataset are pre-computed to keep the demo deterministic and fast. A production deployment would call a fine-tuned LLM (Claude or GPT) with the relevant template + denial provenance + payer policy corpus + clinical context as input, and emit the same output shape (letter text + citations array + supporting docs list + word count). The template structure is the durable contract that the rest of the system builds against; the LLM output is the variable piece.
  • Citation references in the drafted letters use synthetic policy numbers and section references. The format of the citation (e.g. Payer Medical Policy MP-{policy_number} Section {section}) is real; the specific MP- numbers and section references are synthetic. In a real deployment the drafter would resolve actual payer policy URLs and section anchors from a maintained corpus.
  • The outcome predictor models a survival-analysis classifier. It outputs overturn probability with a confidence interval AND expected days-to-resolution with a range — a joint distribution. Production would fit this against a labeled corpus of historical appeals per payer / per CARC / per template. The CI widths in the synthetic dataset are illustrative; real CIs would be calibrated against historical model error.
  • The 5 appeal templates are illustrative, not authoritative. Real RCM organizations maintain template libraries that are frequently larger (often 15-30 named templates) and are tuned to each payer's preferred appeal format. The template structure shown here — required sections, citation format spec, supporting-doc checklist — is the right shape; the specific 5 templates are common archetypes selected for demo clarity.
  • Economics is grounded in plausible real-world ratios. The 10-20x cost difference between front-end correction and downstream appeal is real and durable. The write-off threshold shift from $200-500 to $25-75 is a function of how much labor the drafter removes from the appeal-and-submit workflow. The $1M-$3M annual additional recoverable assumes a mid-size health system; smaller organizations would see proportionally less. Real organizations vary widely — calibrate the absolute dollars to the specific deployment.
  • The risk patterns and CARC mappings in this dataset reference taxonomy node IDs (e.g. tax-medical-necessity-denial) for narrative consistency with the denial-spine demo. The IDs are referenced as plain strings here — there's no runtime join across demos. In a real deployment the taxonomy is shared infrastructure; here the linkage is illustrative.
  • The recover-rate metric (~65% in this dataset) is a function of how the synthetic denials were sampled. A real deployment's recover-rate would land lower if the underlying payer mix is heavy on contractual write-offs or patient-responsibility, higher if the payer mix is heavy on medical-necessity and auth denials. The decision logic is the durable piece; the rate is a function of the input distribution.

Library overview