Skip to Content

What is reliability and validity in assessments?

Written by
Ben Schwencke
Updated
decorative gradient bars

Reliability and validity are critical in ensuring results are dependable and measure what they are supposed to. Reliability refers to the consistency, precision and accuracy of a measurement instrument (like an assessment). Whereas validity refers to whether a tool measures what it is intended to measure and whether the results are meaningful and applicable to the concept being studied.

In the context of psychometric testing, reliability and validity are related, but are ultimately separate constructs. Put simply, reliability relates to the precision, accuracy, and replicability of psychometric test scores.

Validity however, answers the question "does this assessment actually measure the construct it claims to?". As a result, reliability is required for validity, but not necessarily the other way around. For example, if a student completes a psychometric assessment 10 times, and gets the exact same score each time, the assessment can be said to show "reliability".

However, just because it's reliable doesn't mean it's valid and actually measuring what it's supposed to.

What is reliability in assessments?

From a classical test theory perspective, there are two primary forms of reliability: Test-retest reliability, and internal consistency.

Test-retest reliability involves giving the assessment to a group of participants two or more times, and evaluating the differences between each attempt. If the scores differ significantly between attempts, the test lacks reliability. If the tests scores are broadly similar (not necessarily identical), then the test can be said to be reliable.

Internal consistency relates to the relationships between the individual items in the test and the overall score. With internally consistent tests, high scores on each specific item should correlate with a higher score on the assessment overall. Conversely, low scores on each specific item should correlate negatively with the score overall. This suggests that each question individually is measuring the same psychological construct, suggesting that the assessment is reliable.

If the tests scores are broadly similar (not necessarily identical), then the test can be said to be reliable.

What is validity and all the different types explained

Validity relates to whether or not a psychometric assessment measures its intended psychological construct. Although validity requires reliability, as unreliable tests cannot measure anything at all, reliability does not guarantee validity. There are various forms of validity, which include:

  1. Face validity: Whether or not an assessment appears to measure the intended psychological construct.
  2. Content validity: Whether an assessment measures all aspects of a particular psychological construct.
  3. Convergent validity: Whether scores on an assessment correlate positively with another similar assessment designed to measure that same construct.
  4. Divergent validity: Whether scores on an assessment correlates negatively / not at all with another assessment designed to measure an unrelated construct.
  5. Criterion-related validity: Whether an assessment is able to predict real-world outcomes which are hypothesised to be associated with that specific construct i.e. job performance, training performance, employee retention etc.

To show that a test is "valid", multiple forms of validation are required, especially for newer assessments and less established psychological constructs. As part of the R&D process for psychometric assessments, psychometricians conduct many studies investigating the reliability and validity of the assessment, presenting the findings in a technical manual or academic journal article.

Conclusion and next steps

At Test Partnership all our assessments are assessed thoroughly for reliability and validity when undergoing research and development. This ensures our assessments are the perfect tool to use for candidate selection and that results will be fair and consistent.

Find out more about the science of our assessments.

author profile ben schwencke
Primary author

Ben Schwencke

Chief psychologist at Test Partnership. MSc in Organisational Psychology with over ten years experience in psychometric testing.