Skip to main content

RCTK Studies

Published, methodology-first benchmark evaluations and engagement case studies of AI systems — the same evidence-based process used in client engagements, applied to questions teams actually face before and after deploying AI. Each study includes full methodology, statistical analysis, and limitations. A new study is published every few months.

Latest Study ·

From 82% to 95% on Existing Hardware — Assessing and Improving an On-Premise AI Assistant

An assessment-first engagement took a regulated, on-premise AI assistant from 82.2% to a deployed 94.8% answer accuracy on the same single GPU — adding a zero-critical-error auto-accept capability that handled roughly 1,100 answers in its first production month.

82% → 95%
accuracy on the same GPU
9/9
pre-registered predictions held
~1,100
auto-accepts, zero critical errors
Read the full study