AI Detection · Mechane

AI detection tools are software systems designed to identify whether a piece of text was written by a human or generated by an AI model. The basic idea is intuitive: if AI writing has characteristic patterns — certain word choices, sentence structures, rhythms — then a classifier trained on enough examples should be able to spot them. Some tools work by scoring the statistical predictability of the text; others look for stylistic signatures. The underlying assumption is that AI-generated language is, in some measurable way, different from human-generated language.

The problem is that this assumption does not hold reliably enough to act on. AI detectors produce false positives at a significant rate — flagging human writing as machine-generated — particularly for non-native English speakers, for highly formal writing styles, and for texts that happen to use common or predictable phrasings. They also produce false negatives: AI-generated text that has been lightly edited, translated, or run through paraphrasing tools often passes undetected. The field's practitioners largely agree that current detectors should not be used to make consequential judgements about authorship.

Part of the difficulty is structural. Large language models are trained to predict the most probable next token given a context — which is another way of saying they are trained to write in ways that feel natural and human. As models improve, the statistical distance between human and AI text narrows. A detector trained on one model's outputs may perform poorly on a different model, or on a later version of the same one. Detection is, in a sense, chasing a moving target that gets harder to hit as the technology advances.

What this means practically is that a detection score — whether 20% or 80% — tells you something about probability, not truth. It is evidence to weigh, not a verdict to act on. The more interesting question AI detection keeps raising is not 'did a machine write this?' but 'does it matter if it did?' That question turns out to be much harder, and far more interesting, than the percentage on the screen.

The Magnificent Paradox