If you are looking for a personality test, please click here to find our solutions.
Many organizations make use of Personality Assessments and many individuals have tried one or multiple assessments. Today, we will answer all questions related to Personality Assessments and their background. Below, you will see different questions you might be interested in:
The History of Personality Tests
Personality tests as we know them today are often comprised of questions that ask how a person typically feels or behaves, built as specialised self-reports (Holt et al., 2015). However, history of personality tests dates back to WWI, following a need for detecting recruits prone to psychoneurosis (Gregory, 2015). In 1919, Robert Woodworth developed the Woodworth Personal Data Sheet (WPDS), consisting of items aimed at distinguishing army recruits with a proneness to psychopathology (Gibby & Zickar, 2008). This pioneering method of contrasting responses of normal and psychiatrically ill subjects became an applied practice in development of later personality tests during the 1930’s.
The next major personality test development took place in 1930, when Louis and Thelma Thurstone developed an inventory of neurosis, the Thurstone Personality Schedule (Thurstone & Thursone, 1930). This was the first personality inventory to apply internal consistency reliability, correlating each item with the test’s total score, quantifying the utility of each item in the scale (Gregory, 2015). A year after, Robert G. Bernreuter developed the Bernreuter Personality Inventory (Bernreuter, 1931), a more refined test measuring four dimensions of personality: neurotic tendency, self-sufficiency, introversion/extroversion, and dominance/submission. The major innovation of this inventory was the discovery that a single test item could contribute to more than one scale within a test (Gregory, 2015).
In the beginning of the 1940’s, personality tests were recognised as useful tools also for assessing a “normal” functioning spectrum (Gregory, 2015). Initially conceived to facilitate psychiatric diagnosis, the Minnesota Multiphasic Personality Inventory (MMPI) was constructed in 1940 by S. R. Hathaway and J. C. McKinley (Hathaway & McKinley, 1940). The MMPI was a highly respected researched test and has later been revised and is widely used in the U.S. (Gregory, 2015).
Although MBTI has gained popularity over the years, it is not considered a valid framework for understanding personality (Stein & Swan, 2019).
Trait theorists seeking to identify the structure of personality applied factor analysis for disentangling the many aspects and dimensions of personality (Holt et al., 2015). This method allowed for identification of how personality traits are organised and related to each other (Gregory, 2015). By using this method, Raymond B. Cattell (1949) identified 16 personality factors and developed the Sixteen Personality Factor Questionnaire. However, further research grouped the dimensions, reducing the amount of personality factors, and the leading current trait theory is the Five Factor Model, also termed “Big Five”, suggesting that personality has a five-dimensional structure (Holt et al., 2015). These five dimensions have been conceptualised as the five-factor model of personality, and is broadly termed by the acronym OCEAN:
- Openness to Experience
- Conscientiousness
- Extraversion
- Agreeableness,
- Neuroticism
The Big Five framework of personality has gained support and recognition, as research indicates that the five-factor model captures a valid and useful representation of the structure of human traits (Gregory, 2015). This model has inspired several personality scales and assessments (deRaad & Perugini, 2002). One of these is the influential Revised NEO Personality Inventory (NEO-PI-R), first developed and published by Robert R. McCrae & Paul T. Costa in 1978 (Gregory, 2015), with later revisions (Costa, 1991; McCrae & Costa, 1987). NEO-PI-R measures the five major personality dimensions including six specific traits within each dimension (Gregory, 2015).
The development of reliable and valid personality tests is the product of a long history, starting with the need for identifying army recruits during WWI, to the current broad use in selection procedures and for employee development purposes. Psychometric properties of a personality test are of great importance for determining the utility of the test, as a test used in workplace situations should be valid and reliable to a satisfactory degree in order to be of value.
What is a Personality Test?
Personality is a set of organised psychological traits and mechanisms within an individual that remain relatively stable over time and across contexts, influencing interactions, and adaptations to intrapsychic, physical, and social environments (Larsen et al., 2022).

Personality traits describe the average tendencies of a person- and explain the reasons why individuals act a certain way (Larsen et al., 2022). This implies that the personality of an individual can be useful for understanding and predicting differences across individuals. The unique personality of an individual has the potential to explain why they prefer certain lifestyles, careers, or activities. For instance, a person that is highly extraverted may prefer a career that allows for a lot of interaction with others, while a personscoring lower on the extraversiondimension may prefer to work more independently, allowing for a larger focus on other aspects than interaction with others while at work.
Personality tests are standardised psychometric tests determining a person’s set of preferences, and measures individual differences, including motives, preferences, and predispositions for certain behaviours (Lundgren et al., 2016). Research within personality psychology indicates that personality tends to remain relatively stable over time, and across different contexts (Lundgren et al., 2016). Therefore, personality tests can provide us with important information that can aid in predicting behaviour in a range of situations (e.g., forensic, clinical, organisational, andeducational)(APA, 2011).
Most personality tests administered in an occupational setting are developed as self-reporting measures of specific traits or dispositions, requiring respondents to answer questions about their personality, or select items that describe themselves (Larsen et al., 2022; APA, 2011). Other personality tests can take form as projective tests, measuring both the conscious and unconscious aspects of personality (APA, 2011). Personality tests in the workplace are often implemented in the process of personnel selection procedures, integrity testing, and development (Larsen et al., 2022).
The majority of personality tests can be divided into either type- or trait-tests. Type (or typology) tests conceptualise personalities as a limited set of distinct, non-overlapping personality types (Gerrig & Zimbardo, 2009). Trait tests, on the other hand, describe personality characteristics as varying in strength, which allows for comparison with people in a relevant norm group (Gerrig & Zimbardo, 2009). Trait personality tests are usually based on the Big Five trait taxonomy (Furnham et al., 2009; Larson et al., 2002; Wolff & Kim, 2012), based on the Five Factor Model of personality (Costa & McCrae, 1996; Goldberg, 1990; McCrae & Costa, 1999).
The Five Factor Model, also referred to as Big Five or OCEAN (an acronym referring to each personality dimension), is widely replicated across countries and researchers, and propose five prominent dimensions of personality that is prevalent across individuals (Larsen et al., 2022):
- Openness to experience
- Conscientiousness
- Extraversion
- Agreeableness
- Emotional stability
As there are a myriad of personality tests available on the market, psychometric qualities of the test (e.g., measures of reliability and validity) provide important information for distinguishing between tests, and their utility in the process in which it is applied (Lundgren et al., 2016).
Using personality tests for recruitment
Fundamental to the success of any organization is selecting, recruiting, and choosing the right people. The basic is about gathering information and based on data and information making the right hiring choice.
In terms of recruitment, a lot of processes are in place that have to be gone through, including: job specifications, identifying potential candidates, personal characteristics and last but not least, hiring. This refers to what is often called person-job fit, watching personal characteristics and abilities of a candidate and the demands of a job. When in the recruitment process it must be planned and executed in order to create a successful and positive candidate experience. A positive candidate experience can have an increasingly important effect on the brand and ability of attracting future talent.

Evidence has been proven to show that testing and use of questionnaires can be powerful predictors of performance. Implementing and making use of structured interviews as well as personality tests can minimise the chances of poor hiring decisions. Poor hiring decisions may often occur based on judgement and evaluation bias.
Understanding that hiring decisions may be affected by bias and personal opinions makes recruiters call for a more systematic and structured approach, where decision-making is based on solid data.
The first step for any recruits in identifying the type of applicant needed for the job. This can be done by means of a person-job fit, determining the demands of a job, which identifies:
- Essential job tasks
- Required skills
- Required knowledge
- Personal characteristics (Personality)
Applying and using a personality test for candidates can be done in one of two ways:
- Allowing candidates to take the test or questionnaire after meeting the candidate. This allows for only giving tests to those candidates that make it to later round of candidates
- Giving or allowing candidates to take a test before meeting with the candidates. This is a means of using the test to filter and refine the pool of candidates, only calling in those relevant to the job search.
Assessment and Feedback

Assessment is the process of collecting information and analysing the information based on the candidates qualifications and the demands and needs of the organization. Additionally to using tests, interviews are a typical and critical part of the assessment process. Interviews are a great way to evaluate skills, knowledge and characteristics that are required for a specific job. One of the main challenges with interviews is that there is a larger chance of being susceptible to personal judgment as well as evaluation bias. In order to increase the value of an interview as a recruiter, one should commit to using a set of structured and “standard” questions. This ensures accurate and a fair comparison to all candidates.
Why use a structured approach for data collection? In order to progressively narrow the candidate pool, moving down to next level of assessment and finally hiring the job candidate.
Can I practice or improve at a Personality Test?
A personality test measures the preferences of behaviour. That means that you, as an individual, are not able to answer “right or wrong”, however there might be certain behavioral traits that are more appropriate or attractive in certain situations.
As mentioned above a personality test often shows or measures what we call the behavioural preference. The behavioural preference doesn’t describe a person or a candidates only way of behaving. However, it does indicate an individual's most typical or common behaviour.
If you have not been exposed to a Personality test, but you’re interested in understanding what type of questions might be asked here are a few examples:
Select one statement which describes you “the most” and one that describes you “the least”:
| Most like me | Least like me | |
| I like to have other people around me | X | |
| I prefer to hear what others have to say before I make my mind up | X | |
| I am good at making my point in discussions | X | |
| I enjoy discussing theoretical ideas | X | 
Assess the quality of a Personality test - Reliability
It can be a difficult task to figure out whether the test tools available on the market are of high enough quality to create real value within your company. Fortunately, local agencies that evaluate test quality do exist - for example the British Psychological Society (BPS) in the UK or DNV in Norway. An accreditation by these agencies signifies that a test meets several stringent quality requirements assessed by independent experts, and this is a very good benchmark. But what do you do in countries without such agencies?
What do the reliability numbers say about the precision and quality of a personality test?
In order to fully assess the quality of a test in terms of reliability, it is recommended to look at more than one factor. It would be easier if we could measure the quality of a personality test by looking at a single digit number. Unfortunately, this is not possible, due to the complex nature of occupational tests. Condensing a test's reliability to just one number does not really make sense. It would be equivalent to evaluating a car solely based on its miles pr. gallon (MPG). First, there are many other elements which are important for whether it is a good car, such as safety, horsepower or size. Second, there are several different standards for assessing a car's MPG. Likewise, you can’t evaluate a test's reliability from one number alone. The most important factor in measuring if the car is good or not, is to understand what you need the car for – you need a context.
To do this, it is important to understand what reliability really is. Essentially, reliability describes precision over both time and place. However, to better explain the concept, I’ll instead describe how we study reliability – this gives a good picture of what we are dealing with.
Reliability in tests (in general) can be investigated in three different ways:
- How well do the elements within the test fit together? (Internal consistency)
- Do test takers get the same result if they take the test several times? (Test-retest reliability)
- Are different people or versions of the test coming up with the same results (Inter-rater reliability or parallel versions)
Internal consistency. Internal consistency is investigated by quantifying how well each item in a scale is connected to another item within the same scale. A frequently used statistic is the Alpha coefficient, also called Cronbach's Alpha. Alpha captures the degree to which a scale is consistent in measuring the underlying concept of interest. What we want here is that the items on a scale share as much variance as possible. Alpha captures this shared variance. It ranges between 0 to 1, with values below .65 considered a minimum and values around 0.9 generally considered optimal. When people ask for "a test's reliability", they often refer to the Alpha coefficient as one of the most used measures of the level of precision of a test.
Test-retest reliability. It may, in some cases, be more relevant to know if the test is stable over time. In other words, do people get the same result if they retake the test after e.g. three months? For example, this is important information if you’re re-testing employees after a period. Using a test with poor test-retest reliability, you won’t know if differences in results are due to test inaccuracy or whether the employee has in fact changed behavior.
Test-retest reliability is typically stated as a correlation coefficient between the different times the test was taken, and ranges between 0 and 1. The higher the association between outcomes on a test on time x and time y, the better (i.e. coefficients closer to 1 are better). Practically speaking, values around 0.7 are good.
Inter-rater reliability. Inter-rater reliability is investigated by having different individuals to assess the same person. An example could be a 360-degree survey, asking several different people to assess the same person. The likeness of their scoring indicates the accuracy of the tool used. Obviously, personal evaluations confound the results and affect precision. But if the tool used is reliable, you should see a solid correlation between outcome on a test, independent on who administered the test. Preferably, we want to see association above 0.6.
Parallel versions of the same test are not often used for businesses, as it requires developing the “same” test twice and then investigating how similar the results from both tests are when they are filled out by the same person. Here you will typically want to see correlational values higher than 0.8.
Understand the numbers in their right context
The reliability of a personality test is by nature a complex topic. In the end, you want to know whether a personality test is “good” or not. However, that cannot be established based on one number alone. As a minimum you’ll need the context. An Alpha of 0.7 is certainly not "good", but if we are talking about test-retest reliability, then that number would actually indicate that the test is stable over time.
To make matters even more complicated, knowledge about reliability is not enough to assess the overall quality of a test. You also need to look at validity – which we will take a closer look in an upcoming article.
Our Personality test, OPTO has been accredited and approved by BPS and DNV. You can read more about it here.
What about faking results in a personality test?

How do you prevent faking in personality tests?
Those dealing with and in charge of hiring in an organization will relate to the fact that employee performance levels vary across jobs. This calls for use of selection and recruitment tools that can predict and identify differences for job performance. However, will the recruiter be able to trust that the tool will be unbiased and correctly identify the behaviors and traits of interest?
Personality testing is growing and is becoming an important tool of use in many organizations. They are often used for screening and evaluating the person-job fit and evidence shows that personality tests do predict job performance (Judge et al. 2002, Judge et al. 2013, Barrick et al. 2001).
What is the goal and use of a personality test? To accurately and precisely be able to describe and explain characteristics of behavior. Concerns about the manner of reporting do arise. Since the assessment is conducted by means of self-reporting many worry whether candidates will attempt or try to answer in a manner to seem more socially desirable. This has been an ongoing discussion within personality testing. The rise of concern causes organizations to minimize the risk of faking a personality test. How is that done? We can only speak from our experience and knowledge.
Dealing with concerns about faking
Our personality test, OPTO comes with in-built technology called Spotlight. Here is a brief summary of how it works. Spotlight is based on theory built on cognitive psychology as well as the theory about decision making. This theory states that responding to a cognitively demanding answer (effortful thinking) is qualitatively better than answering or giving a more immediate answer (intuitive thinking). Therefore, an item in a questionnaire may be framed differently multiple times in order for the candidate to re-evaluate their initial answer.
Spotlight presents items in a forced choice format, which is known to minimize and even eliminate faking. This is due to the fact that the candidate is forced to prioritize. To learn more about OPTO, click here.