Cross-Cutting Tools

Document to Database

Executable orchestrator that turns tokenized/extracted CRE document content (rent rolls, T-12s, operating statements, Prose Frontier narrative artifacts) into validated, typed, auditable, target-model-ready database payloads.

Open GitHub source

No packaged download — skills install from the open-source plugin repo. Read the SKILL.md and bundled files below before you install.

How to install a skill →
01 · Problem

Executable orchestrator that turns tokenized/extracted CRE document content (rent rolls, T-12s, operating statements, Prose Frontier narrative artifacts) into validated, typed, auditable, target-model-ready database payloads.

Derived from the skill’s “Skill description” section.

02 · Who & When

Explicit triggers:

  • "turn these documents into a database" / "ingest this data room into our schema"
  • "document to warehouse" / "load this deal package into the model"
  • "stand up the database-ready payload for these extracted rent rolls and T-12s"

Implicit triggers:

  • Extracted/tokenized CRE document content (from document-to-data-room-extractor, the rent-roll/T-12 readers, or any OCR / PDF-table / LLM-extraction step) must become typed, validated, provenance-stamped records before it can feed underwriting, reconciliation, or a warehouse.
  • A mixed data room of rent rolls, operating statements, and narrative artifacts must be classified and routed to the right specialized reader, then graded and reconciled as one ingestion run.

Do NOT activate for:

  • A single rent roll where the specialized reader is the right entry point — use rent-roll-to-database.
  • A single T-12 or operating statement — use t12-to-database / operating-statement-to-database.
  • Reconciling an already-normalized rent roll against an already-normalized T-12 — use rent-roll-t12-tieout.
  • Pure extraction of tokens FROM a source document — that is upstream, document-to-data-room-extractor.
  • Rent-roll ANALYSIS (rollover, WALT, mark-to-market) — use rent-roll-analyzer.

Derived from the skill’s “When to Activate” section.

03 · How It's Done Today

Not documented yet for this skill.

04 · What This Skill Changes

A canonical payload {doc_type, records, leases, units, aggregates, periods, issues} where each record carries the provenance bundle (a strict superset of the 8-column warehouse contract) with source_ref in data-room/<doc>#<anchor> form, pii_class, and redaction_status. Alongside it: an account-mapping report, a validation report ({checks, summary, validation_status, pass_rate}), an optional reconciliation result ({dimensions, summary, human_review_items, basis}), a data-quality grade (weakest-link A/B/C + 0-100, with merge/production gate booleans and any critical failures), a human-review queue, the target-model mapping (per-table row counts), and the optional SQL DDL and load plan. All numbers are reproducible from the inputs.

Derived from the skill’s “Output Format” section.

05 · Risks & Caveats

Not documented yet for this skill.