RCTK Studies

Published, methodology-first benchmark evaluations and engagement case studies of AI systems — the same evidence-based process used in client engagements, applied to questions teams actually face before and after deploying AI. Each study includes full methodology, statistical analysis, and limitations. A new study is published every few months.

Latest Study · June 11, 2026

From 82% to 95% on Existing Hardware — Assessing and Improving an On-Premise AI Assistant

An assessment-first engagement took a regulated, on-premise AI assistant from 82.2% to a deployed 94.8% answer accuracy on the same single GPU — adding a zero-critical-error auto-accept capability that handled roughly 1,100 answers in its first production month.

82% → 95%

accuracy on the same GPU

9/9

pre-registered predictions held

~1,100

auto-accepts, zero critical errors

Read the full study

Recent Study · April 6, 2026

Where AI Beats Traditional OCR — and Where It Still Needs Human Review

A six-stage evaluation across 1,497 real documents found that AI extraction beat traditional OCR by 9.25 percentage points in accuracy, and that a confidence-based auto-accept layer cut document processing time by 53–58%.

1,497

documents evaluated

+9.25 pp

accuracy over OCR

53–58%

processing time saved

Read the full study

Recent Study · March 18, 2026

Reclaiming AI Document Search Quality Through Configuration Testing and Parameter Sweeps

Four rounds of systematic testing — 16,143 evaluations across 40+ configurations — produced a single evidence-based RAG setup, overturning several "best practice" assumptions along the way.

16,143

evaluations run

40+

configurations tested

4.37/5

judged helpfulness

Read the full study