Penroll
← All posts

Penroll Blog

AI candidate ranking: what it does, what it doesn't, and where it fails

Β·8 min read

Honest field notes on how AI ranking actually performs against real CV stacks β€” including the failure modes vendors don't advertise.

What AI ranking is, in one sentence

You upload a job description and a stack of CVs; an AI model produces a 0–100 match score and a recommendation (Interview / Maybe / Pass) for each candidate, with a short summary explaining why. Done well, it cuts a 200-CV stack to a 20-CV shortlist in five minutes.

Done badly, it embeds bias, hallucinates qualifications, and tells you what you wanted to hear.

What it is good at

In our internal testing across roughly 8,000 CVs against 60 job descriptions, modern AI ranking does three things reliably:

What it is not good at

The honest framing: AI ranking is a first-pass triage, not a hiring decision. It tells you which 20 of 200 CVs deserve the next 30 minutes of your time.

The bias problem, addressed properly

Done naively, AI ranking will learn that your past hires have been (e.g.) male graduates of three universities, and quietly reproduce that pattern. This is illegal in most jurisdictions and morally indefensible everywhere.

A serious AI ranking pipeline:

  1. Strips protected characteristics from the CV text before scoring β€” name, gender, age, nationality, photo. Or instructs the model to ignore them.
  2. Forbids the model from using inferred protected attributes. Names hint at gender and ethnicity; the system prompt has to explicitly exclude those.
  3. Audits scores by demographic group. If the score distribution differs significantly across groups for the same role, something is wrong.

Penroll's CV ranking is built around these constraints β€” they are non-negotiables in the system prompt, not features. If a vendor cannot tell you in writing how their model handles protected characteristics, do not use it for any candidate-facing decision.

When ranking actively fails

Three failure modes to watch for:

How to use it, day to day

Three steps:

  1. Write the most specific job description you can β€” required skills, years, scale, business context. The output quality is bounded by the input.
  2. Run the ranking. Read the summary and the red flags on every candidate, not just the score. The score is the signal; the prose is where the value is.
  3. Treat anyone marked Interview as a 30-minute call, anyone marked Maybe as a 5-minute screen, anyone marked Pass as a polite rejection email.

That last step is what most teams skip β€” and where the time savings actually come from. Auto-rejection is the unsexy part of AI ranking that pays for itself.

Try it yourself

Sign up to Penroll, upload a real job description and ten CVs, and see whether the output matches the read you'd give them yourself. If it does, you've just saved an hour. If it doesn't, you've learned something about either your job description or the model.

Try Penroll free

AI-generated job posts, ranked candidates, and country-aware offer letters β€” all in one tool. Five free credits, no card required.

Start hiring β†’