AVAILABLE FOR WORK·SUWANEE, GA
15Y+ IN & AROUND AZURELAST SHIP THIS WEEK
Work / Fintech analytics
◆ SHIPPED Fintech analytics

Thousands of PDFs in. Boardroom-ready risk signals out.

A full-stack financial analytics platform, built solo: a serverless document pipeline turns thousands of raw invoices a month into clean, validated records, and a SQL-backed metrics layer turns those records into C-suite dashboards, drill-downs, and predictive risk scores. The unglamorous part — getting messy PDFs to trustworthy numbers — is exactly where the engineering lives.

TypeScript React Azure SQL Server
TanStack QueryRechartsAzure AI
◆ Under NDA — architecture and engineering approach only. No client names, no real figures; the dashboard graphic below is illustrative.
1000s
invoices processed / month
50+
SQL stored procedures
3
derived risk metrics
1
human-in-the-loop reviewer
fintech
Executive analytics dashboard — KPIs, revenue trend, monthly invoices, and portfolio risk mix (illustrative)
// The problem

The numbers leadership wanted were trapped inside PDFs.

The data that mattered most — margins, concentration, who's quietly drifting away — was buried in thousands of invoices and payment documents arriving every month as PDFs. Pulling a single executive answer meant someone re-keying figures by hand, and every dashboard built on that was only as honest as the last manual export.

The goal was a platform where raw documents become validated records automatically, those records feed a governed metrics layer, and leadership gets dashboards and drill-downs they can actually trust — without anyone retyping a number. That meant solving extraction, validation, and modeling as one pipeline, not three disconnected scripts.

// How it works

Documents in, validated metrics out.

A serverless pipeline does the extraction; SQL does the math; React does the storytelling.

01 INGEST
Thousands of invoice and payment PDFs arrive each month. Azure Functions pick them up serverlessly — no always-on box, scale that follows the document volume.
02 EXTRACT
Azure Document Intelligence reads each PDF into structured fields. The interesting work is the validation around it — confidence thresholds, cross-checks, and shape rules that decide what's trustworthy.
03 REVIEW
Anything the pipeline isn't confident about is routed to a human-in-the-loop reviewer. People spend their time on the genuine edge cases, not the thousands of clean documents.
04 MODEL
Validated records land in SQL Server, where 50+ stored procedures do the heavy lifting — aggregations, period comparisons, and the derived financial metrics computed close to the data.
05 PRESENT
A React + TypeScript front end with TanStack Query and Recharts renders C-suite dashboards and drill-downs — fast to load, cache-aware, and built to move from headline to underlying record in a click.
// The interesting part

The risk metrics, not the chart library.

Three derived signals computed on top of the validated data — generic by design, no client specifics.

MARGIN

Effective Margin Rate

A normalized margin computed across the portfolio rather than read off any single invoice — built in SQL so the definition is consistent everywhere it's shown, from the executive headline down to the drill-down.

CONCENTRATION

Client Concentration Risk

How dependent the book is on its largest relationships. A structural risk signal that's easy to ignore when you only look at totals, and obvious once it's modeled and surfaced as a first-class metric.

FLIGHT RISK

Client Flight Risk Score

A predictive score that flags relationships trending toward churn from changes in behavior over time — the one signal leadership most wants early, derived on the same validated foundation as everything else.

The deliberate choice throughout was to compute metrics in SQL, close to the validated data, instead of scattering business logic across the front end. One definition of "margin" or "concentration," reused by every dashboard and every drill-down — so a number on the executive screen and a number three clicks deep can never quietly disagree.

◆ Architecture only — this one's under NDA

Because the platform is under NDA, this write-up stays at the level of architecture and engineering approach: the document pipeline, the human-in-the-loop validation, and how the risk metrics are derived. No real client names, figures, or screenshots — the dashboard above is an illustrative mock. Want to talk through the approach? Reach out →