
Ryan Bennett
AI Detection Researcher · False Positive Analyst · Content Integrity Specialist · LLM Pattern Reviewer
London
"The number that matters in AI detection isn't the accuracy rate on clean AI text — any decent model gets that right. The number that matters is the false positive rate on human writing: ESL writers, technical authors, students who write formally. Those are the people who get wrongly accused. That's what I test first on every tool."
— Ryan Bennett, Winston AI Detector FreeThe researcher who tests what happens when the detector is wrong
Ryan Bennett is an AI detection researcher and content integrity specialist with six years of experience covering AI writing tools, detection algorithms, and the specific failure modes that make free detection tools unreliable in practice. He holds a B.Sc. in Computer Science with a specialization in Natural Language Processing from King's College London, giving him the statistical grounding to distinguish between meaningful detection signals and noise — a distinction that most published accuracy claims ignore.
Before joining the Winston AI Detector Free team, Ryan worked as a content integrity analyst at a digital media company, where he developed systematic testing protocols for AI detection platforms and documented their real-world false positive rates across different content types. That work taught him the gap between benchmark performance and actual performance on the kinds of text real users submit — formal academic writing, technical documentation, ESL content, and heavily edited drafts.
At Winston AI Detector Free, Ryan reviews and tests AI detection tools by running them against the same problem inputs that trip up less careful implementations: non-native English, highly formalized prose, template-heavy content, and lightly AI-edited human drafts. His goal is to give users an honest picture of when to trust a detection score and when to look closer.
Six scenarios where AI detectors get it wrong — and how Ryan tests them
The most important quality metric for an AI detector isn't how accurately it flags AI text — it's how rarely it flags human text incorrectly. Ryan built a 200+ scenario test set specifically around these failure modes.