Buyer’s guide
A pre-employment assessment is a standardised, scored method, taken before a hiring decision, for measuring something that predicts how someone will do the job. Here are the eight main types and what the evidence actually says about each. Then how to combine them, and how to keep the whole thing fair.
By Michael Hodge for SeeMyPersonality. Written from the peer-reviewed selection literature.
It’s a structured way to gather job-relevant evidence about a candidate before you decide. The thing being measured might be ability, knowledge, personality, judgement or a specific skill. What makes it an assessment, rather than a hunch, is consistency: the same questions, the same scoring, every candidate. So you compare people, not the impression each one happened to leave in the room.
That’s the whole promise. A freewheeling chat tends to reward whoever is most like the interviewer, or most relaxed on the day. A scored method holds everyone to the same yardstick. It won’t tell you everything about a person, and it shouldn’t try to. It narrows the noise so your judgement has something steady to push against.
This guide is neutral by design. We cover all eight testing types, say plainly where each is strong and where it isn’t, then get into the part most guides skip: how to combine methods for non-overlapping signal, and how to stay on the right side of EEOC and ADA rules. Background and reference checks, and composite methods like assessment centres, sit outside the frame here on purpose: they’re screening or bundled methods rather than the single, scored, predictive tests this guide is about.
Written from the peer-reviewed selection literature and checked against the sources cited throughout. General information, not legal advice.
The plain reason: a structured, scored method beats a gut read from an unstructured chat. That isn’t a slogan, it’s one of the most replicated findings in work psychology. Schmidt and Hunter’s 1998 review, drawing on eighty-five years of selection research, showed that consistent, job-related methods predict performance far better than reviewing a CV and trusting your instincts.
Honesty matters here, because many guides stop at the flattering 1998 numbers. They shouldn’t. Sackett, Zhang, Berry and Lievens (2022) re-estimated those validities, correcting for range restriction the proper way, and several figures came down. Cognitive ability, long treated as the runaway predictor, dropped most: from the high estimates older sources quote to around .31 operational validity, now sitting behind a well-built structured interview at roughly .42. The methods still work. They just aren’t magic, and the old league table needs rewriting.
So the real takeaway isn’t “tests are powerful.” It’s subtler and more useful: a consistent method reduces noise and bias compared with a freewheeling interview. It standardises the input, so two interviewers can’t score the same answer three ways. That’s the mechanism worth paying for.
If you’ve read that cognitive ability is “the number-one predictor,” you’ve met a figure the field has since revised down. Don’t over-weight any single method on the strength of an older meta-analysis. The Sackett 2022 re-analysis is the current leading reference point, and what it clearly favours is combining methods over crowning one.
No single type does everything. Each measures a different quality and predicts a different slice of performance, which is exactly why most teams use two or three together.
Personality measures read the stable patterns in how someone thinks, feels and behaves at work. The model with the strongest evidence base is the Big Five, also called the Five-Factor Model: openness, conscientiousness, extraversion, agreeableness and emotional stability. It’s the framework academic psychology keeps returning to, and it carries the strongest contemporary measurement evidence, as Soto and John (2017) set out in developing the BFI-2. The same five dimensions also recur across very different societies: Schmitt and colleagues (2007) mapped them across 56 nations.
The hiring evidence is real and specific. Barrick and Mount’s 1991 meta-analysis found that conscientiousness predicts performance across nearly every job family, and that extraversion helps in sales and management. Their 1991 work was careful to say agreeableness didn’t generalise as a predictor across the board. Where it does earn its keep is narrower: later research from Mount, Barrick and Stewart (1998) ties it to team-heavy roles where cooperation is the job. That trait-by-role pattern is exactly what a good behaviour-based hiring assessment captures.
A note on what counts as research-grade. Big Five and IPIP-NEO measures sit on a long validity record. Popular type-sorting tools, the kind that put you in a four-letter box, don’t carry the same evidence for selection, and it’s fair to say so plainly without turning anyone into a villain. If you want to feel the difference yourself, you can try the free Big Five test yourself and see how trait scores read against a type label.
Best for: understanding role fit, working style, collaboration and onboarding. Not for ranking people as better or worse, because there’s no such thing as a better personality. Use normal-range traits, never as a pass/fail gate on their own. Personality informs a hiring decision; it doesn’t define it. This is the category our own pre-employment personality test sits in, and if you want to see how it stacks up against the alternatives, you can compare the best personality tests for hiring first.
These measure reasoning, learning speed and how someone handles unfamiliar information. For a long time the field treated general mental ability as the single strongest predictor of performance. The 2022 Sackett re-analysis is the reason to soften that claim: with range restriction corrected properly, the estimate came down to roughly .31. Cognitive ability still earns its place, it’s just no longer the runaway figure older sources quote.
Best for complex, fast-changing roles where the learning curve is steep and people need to absorb new material quickly. The serious watch-out is fairness: cognitive tests carry the highest adverse-impact risk of the common methods, so pair them carefully and keep clear documentation of why the role needs that level of reasoning. If you want to see the consumer end of this category, a consumer cognitive ability test shows the format candidates recognise.
The most face-valid category, and often the most reassuring to candidates: can the person already do the task, or do they know the specific body of knowledge the role needs? A coding exercise, a writing brief, a bookkeeping problem, a language check. Nobody argues these aren’t relevant.
Best for roles with a clear technical core. The honest limit is that they measure present skill, not potential. A talented career-changer who’d grow into the role in three months can look weaker on day one than someone who’s simply done it before. Where growth and interest matter as much as current ability, pair a skills test with something forward-looking, even a career interest assessment, so you’re not mistaking a head start for a ceiling.
A category many guides gloss over. An SJT presents realistic on-the-job dilemmas and asks what the candidate would most likely do. What it reads is judgement and interpersonal effectiveness in context, which is hard to get at any other way. Strong for customer-facing, people-heavy or values-sensitive roles, including anything that hinges on how someone handles conflict when the easy option and the right option diverge.
The catch is build quality. An SJT is only as good as the job analysis behind it. Write the scenarios from the actual work and it measures something. Buy a generic one off the shelf and it measures very little.
These look at tendencies around dependability, rule-following and counterproductive behaviour. Best suited to roles with cash handling, safety stakes or unusually high trust. They can do real work in the right place.
Framing matters more here than anywhere else. Candidates can find integrity questions intrusive, and a clumsy rollout reads as suspicion. Treat the result as one signal among several, never a verdict on someone’s character, and be ready to explain what you’re measuring and why.
The highest-fidelity method, and one many guides under-cover. A work sample asks the candidate to perform a slice of the actual job under realistic conditions. Its strength is structural: it samples behaviour directly rather than inferring it from a questionnaire. You watch the work instead of guessing at it.
Best for almost any role you can excerpt into a short, fair task. The cost is time, for the candidate doing it and the reviewer scoring it, and the discipline it demands. Without a fixed rubric, a work sample slides straight back into opinion, which defeats the point of running one.
Most teams don’t think of the interview as an assessment type. They should. A structured interview means the same questions, in the same order, scored against defined anchors, for every candidate. The evidence is consistent: structured interviews out-predict unstructured ones, one of the more durable findings to survive the 2022 re-analysis, and it’s the structure doing the work, not the interviewer’s intuition. In that re-analysis a well-built structured interview came out as the strongest single predictor of the methods examined at about .42, ahead of cognitive ability — though the authors note interview validity varies widely with build quality, which is why the structure, not the format, is what does the work.
Best for every hire, because it’s the cheapest validity upgrade most teams can make. You don’t need new software to start, just a fixed question set and a rating scale. The one condition: you have to actually score it. Notes and vibes aren’t structure.
Last, the category that needs the heaviest legal care. Physical and role-specific tests measure capacities genuinely required to perform the job: lifting, stamina, vision, a sensory or functional requirement the work actually depends on.
Best for safety-critical and physically demanding roles. These carry the highest ADA exposure, and the rules on timing and demonstrable job-relevance are strict. You have to be able to document why each requirement is essential, which is covered in the legal section below.
The same eight types, side by side, so you can match a method to the role rather than the other way round.
| Type | Best for | Main watch-out |
|---|---|---|
| Personality assessment (Big Five) | Role fit, working and collaboration style, management and onboarding. | Use normal-range traits only. Never a sole pass/fail gate; no trait is a better or worse personality. |
| Cognitive ability / aptitude | Complex, fast-changing roles where the learning curve is steep. | Highest adverse-impact risk of the common methods. Pair it carefully and document the job-relevance. |
| Skills & job-knowledge tests | Roles with a clear technical core: coding, writing, bookkeeping, language. | Measures present skill, not potential. Can screen out career-changers who would grow into the role. |
| Situational judgement test (SJT) | Customer-facing, people-heavy or values-sensitive roles. | Only valid if built from a real job analysis. A generic SJT measures little. |
| Integrity & honesty assessment | Cash-handling, safety-critical and high-trust roles. | Can feel intrusive. Treat results as one signal, never a verdict on someone’s character. |
| Work-sample test / job simulation | Almost any role you can excerpt into a short, fair, scorable task. | Costs candidate and reviewer time. Standardise scoring or it drifts back into opinion. |
| Structured interview | Every hire. The cheapest reliability and validity upgrade most teams can make. | Only works if you actually score against the anchors, rather than taking freeform notes. |
| Physical / role-specific test | Safety-critical and physically demanding roles. | Highest ADA exposure. Strict rules on timing and demonstrable job-relevance. |
The skill isn’t picking the single best test. It’s assembling two or three that each tell you something the others miss.
Do a quick job analysis. Decide the two or three things that actually predict success in this specific role before you look at any tool.
Each added method should tell you something the others don’t. That’s incremental validity: you’re buying new information, not a second copy of what you already know.
For most knowledge and people-facing roles, a scored structured interview plus a Big Five measure is strong and low adverse-impact. Add cognitive or skills where the role warrants it.
Here’s incremental validity in plain language. If you already run a cognitive test, adding a second one barely moves the needle, because the two largely measure the same thing. Adding a personality measure does move it, because it reads a different quality. You’re after signals that don’t overlap, so each method earns its place by telling you something new.
A sensible default for many roles: a scored structured interview, a Big Five personality measure, and (where the role warrants it) a cognitive or skills test. Personality plus a structured interview is a strong, low adverse-impact combination for most knowledge and people-facing work, which is why it’s the pairing we keep coming back to. For specific roles you can sharpen the personality layer further, with a sales-fit assessment, a leadership style read, or a culture-fit check as worked examples of the same science applied to a particular hire.
If you’re weighing the personality tools specifically, read our roundup of the best personality tests for hiring, strengths and limits included, with the Big Five set against type-sorting tools like DISC. To watch the combine-for-signal idea play out on a real role, look at the Big Five role-fit assessment we built: paste a job description, get a role-fit profile plus a structured interview guide with 1-5 scoring anchors and a 30/60/90 onboarding brief, all from one input.
A test is only as fair as the way you use it. Under the EEOC-enforced Uniform Guidelines on Employee Selection Procedures, any selection procedure must be justifiable as job-related. That single principle, job-relatedness, is the spine of everything below.
Adverse impact. If a method screens out a protected group at a meaningfully different rate, it isn’t automatically illegal, but the burden shifts to you to show the method genuinely predicts the work. Physical and cognitive tests carry the most risk, which is the practical reason to pair them carefully and document the job-relevance rather than assume it.
ADA and normal-range traits. A workplace personality measure should assess ordinary work behaviour, how someone plans, collaborates, handles pressure. The moment it starts to resemble a medical or mental-health screen, it can trigger ADA limits on what you may ask and when. Keep the instrument firmly in normal-range territory.
Consistency. Every candidate for the role gets the same assessment, scored the same way, under the same conditions. Ad-hoc exceptions, the extra question for one person, the waived test for another, are exactly where bias and liability creep in.
Documentation. Keep the job analysis, the scoring rubric and the rationale on file. The test is blunt and worth applying to every method you use: if you can’t explain why it’s job-related, don’t use it.
We build our own tool to be EEOC-aware, with transparent, documentable scoring you can show a candidate and keep on file. But the employer owns lawful use of any assessment, ours included. The documentation, the consistency, the job analysis are yours to keep. None of this is legal advice; for a regulated or high-stakes role, take proper counsel.
Long, opaque or irrelevant assessments cost you good candidates, who simply drop out. The people most able to walk away are often the ones you most wanted. So candidate experience isn’t a courtesy, it’s a retention question for your pipeline.
Be transparent. Tell people what you’re measuring, why it’s relevant to this role, and how long it takes. A short, clearly job-relevant task reads as serious. A two-hour personality quiz with no explanation reads as a hurdle someone built to look thorough.
The best assessments give something back. When a candidate finishes a well-made Big Five measure and receives a transparent personality report, language for how they work, the process stops feeling like data extraction and starts feeling like a fair exchange. That goodwill is real, and it survives whether or not you make the hire.
It’s a standardised method, taken before a hiring decision, for measuring something that predicts how well someone will do a job: their ability, knowledge, personality, judgement or a specific skill. Because every candidate answers the same questions and is scored the same way, you end up comparing people on a consistent basis rather than on the impression each one happened to leave in the room.
The eight you’ll meet most often are personality assessments (best mapped by the Big Five), cognitive ability and aptitude tests, skills and job-knowledge tests, situational judgement tests, integrity tests, work-sample tests and job simulations, structured interviews, and physical or role-specific tests. Each measures a different thing, so most teams use two or three together rather than relying on any single one.
People use the terms loosely, but there’s a useful distinction. Pre-employment screening is the wider net: anything you check before hiring, including background and reference checks, right-to-work verification and the assessments themselves. Pre-employment testing usually means the scored, predictive instruments specifically: ability, personality, skills, judgement. So testing is one part of screening. This guide is about the testing side, the methods that try to predict how someone will actually do the job.
It depends on the method and how you use it. Decades of research, summarised in Schmidt and Hunter’s 1998 review and re-estimated more carefully by Sackett and colleagues in 2022, show that structured, scored methods predict job performance better than an unstructured chat. No assessment is a crystal ball. A well-chosen one reduces noise and bias; it doesn’t remove judgement.
Yes, when they’re job-related and used consistently. In the US, the EEOC-enforced Uniform Guidelines on Employee Selection Procedures treat a selection procedure as something you must be able to justify as predictive of the job. If a method screens out a protected group at a meaningfully different rate, you need evidence it genuinely measures something the role requires. Keep your job analysis, scoring rubric and rationale on file. This is general information, not legal advice.
There isn’t a single winner that fits every role, and older sources that crowned cognitive ability the runaway predictor were working from validity estimates that the 2022 Sackett re-analysis brought down. In that re-analysis a well-built structured interview came out around .42 operational validity against cognitive ability’s .31, the strongest single predictor of the methods examined. The authors add an honest caveat: interview validity varies widely with how well the interview is built. Even so, the stronger move is to combine two or three low-overlap methods so each one tells you something the others miss.
No, and it shouldn’t. A Big Five measure tells you about how someone tends to work and collaborate, which is genuinely useful, but it’s one input, not a verdict. Used well, it shapes your interview questions and your onboarding plan. Used as a standalone pass/fail gate, it’s both weaker and harder to defend than a combination of methods.
Start from the job, not the catalogue. Do a quick analysis of what actually predicts success in this specific role, pick the two or three signals that matter most, then choose methods that measure those things without overlapping each other. For many knowledge and people-facing roles, a scored structured interview plus a Big Five personality measure is a strong, low-adverse-impact default.
Adverse impact happens when a selection method screens out people from a protected group at a substantially higher rate than others, even unintentionally. It doesn’t make a test illegal by itself, but it shifts the burden to you to show the method is genuinely job-related. Physical and cognitive tests carry the highest risk, which is why you pair them carefully and document why each requirement is essential.
They dislike long, opaque or irrelevant ones, and good people drop out of those. A short, clearly job-relevant task that you explain in advance reads as serious and respectful. The best assessments even give the candidate something back, language for how they work, which is one reason a transparent Big Five report usually leaves people feeling the process was fair.
Reviewed for accuracy against the cited selection literature. This page is general information about pre-employment testing and screening, not legal advice.
Let the job pick the methods, keep them consistent, and you can defend every call you make. The fastest way to feel that working is to start from a real opening and watch the signals stack up.
Paste a JD into the Big Five hiring assessment and get a candidate role-fit profile, a structured interview guide with scoring anchors, and a 30/60/90 onboarding brief. Built on real psychometrics, transparent by design.