In today's competitive marketplace and complex legal environment, employers face the challenge of attracting, developing, and retaining the best employees. Michael Eisner, CEO of the Disney Corporation, recognized the impact of personnel decisions on a business' bottom-line when he remarked, "My inventory goes home every night." This guide is to help managers and human resource (HR) professionals use assessment practices that are the right choices for reaching their organizations' HR goals. It conveys the essential concepts of employment testing in easy-to-understand terms so that managers and HR professionals can
This Guide is structured around a set of assessment principles and their applications. The information is organized so that readers from a variety of backgrounds will find the information presented in a clear and useful manner.
| The Guide is designed to provide accurate and important information regarding testing as part of a personnel assessment program. It gives general guidelines and must not be viewed as legal advice. |
Testing and Assessment: An Employer's Guide to Good Practices (Guide) was produced and funded by the Skills Assessment and Analysis Program in the U.S. Department of Labor, Employment and Training Administration, Office of Policy and Research (OPR) under the direction of Gerard F. Fiala, Administrator. The Skills Assessment and Analysis Program is directed by Donna Dye, Personnel Research Psychologist, who provided technical direction and support for this Guide.
The Guide was prepared under Department of Labor Grants with the North Carolina Employment Security Commission, Southern Assessment Research and Development Center and National O*NET Consortium; the New York Department of Labor; and the Utah Department of Employment Security. The Guide was completed under the direction of David Rivkin. Mr. Rivkin also served as editor of the Guide. Authors of this Guide were Syed Saad, Gary W. Carter, Mark Rothenberg, and Enid Israelson. Grateful acknowledgment is made to Phil Lewis, Patrice Gilliam-Johnson, Jonathan Levine, and Brenda Dunn for their contribution. Thanks are also given to Ann Kump, Helen Tannenbaum, Don Kreger, Kristin Fiske, and Marilyn Silver whose valuable suggestions were very much appreciated. Grateful acknowledgment is also made to Suzan Chastain, Department of Labor, Office of the Solicitor, Division of Civil Rights, and Hilary R. Weinerand and Cynthia Misicka of the Equal Employment Opportunity Commission for consultant review and insights into the final preparation of this Guide.
[back to Testing page] [back to Job page]
Personnel assessment is a systematic approach to gathering information about individuals. This information is used to make employment or career-related decisions about applicants and employees.
Assessment is conducted for some specific purpose. For example, you, as an employer, may conduct personnel assessment to select employees for a job. Career counselors may conduct personnel assessment to provide career guidance to clients.
Chapter Highlights
1. Personnel assessment tools: tests and procedures
2. Relationship between the personnel assessment process and tests and procedures
3. What do tests measure?
4. Why do organizations conduct assessment?
5. Some situations in which an organization may benefit from testing
6. Importance of using tests in a purposeful manner
7. Limitations of personnel tests and procedures-fallibility of test scores.
| Principles of Assessment Discussed Use assessment tools in a purposeful manner Use the whole-person approach to assessment. |
1. Personnel assessment tools: tests and procedures
Personnel assessment tools differ in
All assessment tools used to make employment decisions, regardless of their format, level of standardization, or objectivity, are subject to professional and legal standards. For example, both the evaluation of a resume and the use of a highly standardized achievement test must comply with applicable laws. Assessment tools used solely for career exploration or counseling are usually not held to the same legal standards.
2. Relationship between the personnel assessment process and tests and procedures
3. What do tests measure?
Constructs cannot be seen or heard, but we can observe their effects on other variables. For example, we don't observe physical strength but we can observe people with great strength lifting heavy objects and people with limited strength attempting, but failing, to lift these
| Tests, inventories, and procedures are assessment tools that may be used to measure an individual's abilities, values, and personality traits. They are components of the assessment process. | |
|
|
|
Assessment process Systematic approach to combining and evaluating all the information gained from testing and using it to make career or employment-related decisions. |
objects. Such differences in characteristics among people have important implications in the employment context. Employees and applicants vary widely in their knowledge, skills, abilities, interests, work styles, and other characteristics. These differences systematically affect the way people perform or behave on the job.
These differences in characteristics are not necessarily apparent by simply observing the employee or job applicant. Employment tests can be used to gather accurate information about job-relevant characteristics. This information helps assess the fit or match between people and jobs. To give an example, an applicant's score on a mechanical test reflects his or her mechanical ability as measured by the test. This score can be used to predict how well that applicant is likely to perform in a job that requires mechanical ability, as demonstrated through a professionally conducted job analysis. Tests can be used in this way to identify potentially good workers.
Some tests can be used to predict employee and applicant job performance. In testing terms, whatever the test is designed to predict is called the criterion. A criterion can be any measure of work behavior or any outcome that can be used as the standard for successful job performance. Some commonly used criteria are productivity, supervisory ratings of job performance, success in training, tenure, and absenteeism. For example, in measuring job performance, supervisory ratings could be the criterion predicted by a test of mechanical ability. How well a test predicts a criterion is one indication of the usefulness of the test.
4. Why do organizations conduct assessment?
5. Some situations in which an organization may benefit from testing
6. Importance of using tests in a purposeful manner
| Principle of Assessment Use assessment tools in a purposeful manner. It is critical to have a clear understanding of what needs to be measured and for what purpose. |
7. Limitations of personnel tests and procedures-fallibility of test scores
There will be cases where a test score or procedure will predict someone to be a good worker, who, in fact, is not. There will also be cases where an individual receiving a low score will be rejected, who, in fact, would actually be capable and a good worker. Such errors in the assessment context are called selection errors. Selection errors cannot be completely avoided in any assessment program.
Why do organizations conduct testing despite these errors? The answer is that appropriate use of professionally developed assessment tools on average enables organizations to make more effective employment-related decisions than use of simple observations or random decision making.
Using a single test or procedure will provide you with a limited view of a person's employment or career-related qualifications. Moreover, you may reach a mistaken conclusion by giving too much weight to a single test result. On the other hand, using a variety of assessment tools enables you to get a more complete picture of the individual. The practice of using a variety of tests and procedures to more fully assess people is referred to as the whole-person approach to personnel assessment. This will help reduce the number of selection errors made and will boost the effectiveness of your decision making. This leads to an important principle of assessment.
| Principle of Assessment Do not rely too much on any one test to make decisions. Use the whole-person approach to assessment. |
[back to Testing page] [back to Job page]
The number of laws and regulations governing the employment process has increased over the past four decades. Many of these laws and regulations have important implications for conducting employment assessment. This chapter discusses what you should do to make your practices consistent with legal, professional, and ethical standards.
Chapter Highlights
1. Title VII of the Civil Rights Act (CRA) of 1964, as amended in 1972; Tower Amendment to Title VII
2. Age Discrimination in Employment Act of 1967 (ADEA)
3. Equal Employment Opportunity Commission (EEOC) - 1972
4. Uniform Guidelines on Employee Selection Procedures - 1978; adverse or disparate impact, approaches to determine existence of adverse impact, four-fifths rule, job-relatedness, business necessity, biased assessment procedures
5. Title I of the Civil Rights Act (CRA) of 1991
6. Americans with Disabilities Act (ADA) - 1990
7. Record keeping of adverse impact and job-relatedness of tests
8. The Standards for Educational and Psychological Testing - 1985; The Principles for the Validation and Use of Personnel Selection Procedures - 1987
9. Relationship between federal, state, and local employment laws.
| Principles of Assessment Discussed Use only assessment instruments that are unbiased and fair to all groups. |
The general purpose of employment laws and regulations is to prohibit unfair discrimination in employment and provide equal employment opportunity for all. Unfair discrimination occurs when employment decisions are based on race, sex, religion, ethnicity, age, or disability rather than on job-relevant knowledge, skills, abilities, and other characteristics. Employment practices that unfairly discriminate against people are called unlawful or discriminatory employment practices.
The summaries of the laws and regulations in this chapter focus on their impact on employment testing and assessment. Before you institute any policies based on these laws and regulations, read the specific laws carefully, and consult with your legal advisors regarding the implications for your particular assessment program.
1. Title VII of the Civil Rights Act (CRA) of 1964 (as amended in 1972); Tower Amendment to Title VII
| ï recruitment ï transfer ï performance appraisal ï disciplinary action |
ï hiring ï training ï compensation ï termination |
ï job classification ï promotion ï union or other membership ï fringe benefits. |
Employers having 15 or more employees, employment agencies, and labor unions are subject to this law.
The Tower Amendment to this act stipulates that professionally developed workplace tests can be used to make employment decisions. However, only instruments that do not discriminate against any protected group can be used. Use only tests developed by experts who have demonstrated qualifications in this area.
2. Age Discrimination in Employment Act of 1967 (ADEA)
ADEA covers employers having 20 or more employees, labor unions, and employment agencies. Certain groups of employees are exempt from ADEA coverage, including public law enforcement personnel, such as police officers and firefighters. Uniformed military personnel also are exempt from ADEA coverage.
3. Equal Employment Opportunity Commission (EEOC)-1972
4. Uniform Guidelines on Employee Selection Procedures-1978; adverse or disparate impact, approaches to determine existence of adverse impact, four-fifths rule, job-relatedness, business necessity, biased assessment procedures
The Guidelines cover all employers employing 15 or more employees, labor organizations, and employment agencies. They also cover contractors and subcontractors to the federal government and organizations receiving federal assistance. They apply to all tests, inventories and procedures used to make employment decisions. Employment decisions include hiring, promotion, referral, disciplinary action, termination, licensing, and certification. Training may be included as an employment decision if it leads to any of the actions listed above. The Guidelines have significant implications for personnel assessment.
One of the basic principles of the Uniform Guidelines is that it is unlawful to use a test or selection procedure that creates adverse impact, unless justified. Adverse impact occurs when there is a substantially different rate of selection in hiring, promotion, or other employment decisions that work to the disadvantage of members of a race, sex, or ethnic group.
Different approaches exist that can be used to determine whether adverse impact has occurred. Statistical Techniques may provide information regarding whether or not the use of a test results in adverse impact. Adverse impact is normally indicated when the selection rate for one group is less than 80% (4/5) that of another. This measure is commonly referred to as the four-fifths or 80% rule. However, variations in sample size may affect the interpretation of the calculation. For example, the 80% rule may not be accurate in detecting substantially different rates of selection in very large or small samples. When determining whether there is adverse impact in very large or small samples, more sensitive tests of statistical significance should be employed.
When there is no charge of adverse impact, the Guidelines do not require that you show the job-relatedness of your assessment procedures. However, you are strongly encouraged to use only job-related assessment tools.
If your assessment process results in adverse impact, you are required to eliminate it or justify its continued use. The Guidelines recommend the following actions when adverse impact occurs:
Note that for the continued use of assessment instruments or procedures that cause adverse impact, courts have required justification by business necessity as well as validity for the specific use. The issue of business necessity is specifically addressed in Title I of the Civil Rights Act of 1991 (see next section).
An assessment procedure that causes adverse impact may continue to be used only if there is evidence that
Demonstrating job-relatedness of a test is the same as establishing that the test may be validly used as desired. Chapter 3 discusses the concept of test validity and methods for establishing the validity or job-relatedness of a test.
Demonstrating the business necessity of using a particular assessment instrument involves showing that its use is essential to the safe and efficient operation of the business and there are no alternative procedures available that are substantially equally valid to achieve the business objectives with a lesser adverse impact.
Another issue of importance discussed in the Uniform Guidelines relates to test fairness. The Uniform Guidelines define biased or unfair assessment procedures as those assessment procedures on which one race, sex, or ethnic group characteristically obtains lower scores than members of another group and the differences in the scores are not reflected in differences in the job performance of members of the groups.
The meaning of scores on an unfair or biased assessment procedure will differ depending on the group membership of the person taking the test. Therefore, using biased tests can prevent employers from making equitable employment decisions. This leads to the next principle.
| Principle of Assessment Use only assessment instruments that are unbiased and fair to all groups. |
Use of biased tools may result in unfair discrimination against members of the lower scoring groups. However, use of fair and unbiased tests can still result in adverse impact in some cases. If you are developing your own test or procedure, expert help may be advisable to make sure your procedure is fair to all relevant groups. If you are planning to purchase professionally developed assessment tools, first evaluate the fairness of those you are considering by reading the test manuals and consulting independent reviews.
5. Title I of the Civil Rights Act of 1991
As noted previously, the Act specifically requires demonstration of both the job-relatedness and business necessity of assessment instruments or procedures that cause adverse impact. The business necessity requirement, set forth in Title I of the CRA of 1991, is harder to satisfy in defending challenged practices than a business purpose test suggested by the Supreme Court earlier.
Another important provision relates to the use of group-based test score adjustments to maintain a representative work force. The Act prohibits score adjustments, the use of different cut-off scores for different groups of test takers, or alteration of employment-related test results based on the demographics of the test takers. Such practices, which are referred to as race norming or within-group norming, were used by some employers and government agencies to avoid adverse impact.
The Act also makes compensatory and punitive damages available as a remedy for claims of intentional discrimination under Title VII and the ADA.
6. Americans with Disabilities Act (ADA) - 1990
A qualified individual with a disability is one who can perform the essential functions of a job, with or without reasonable accommodation.
The ADA has major implications for your assessment practices.
Your organization should develop a written policy on conducting testing and assessment of individuals with disabilities. This will help ensure compliance with the provisions of the ADA.
If you need assistance in complying with the ADA, there are several resources you may contact.
7. Record keeping of adverse impact and job-relatedness of tests
8. The Standards for Educational and Psychological Testing - 1985; The Principles for the Validation and Use of Personnel Selection Procedures-1987
The Standards primarily address psychometric issues while the Principles primarily address the problems of making decisions in employee selection, placement, promotion, etc. The major concern of the Standards is general; the primary concern of the Principles is that performance on a test . . . is related to performance on a job or other measures of job success.
Compatibility of the Standards and the Principles with the Uniform Guidelines
The Uniform Guidelines were intended to be consistent with generally accepted professional standards for validating and evaluating standardized tests and other selection procedures. In this regard, the Guidelines specifically refer to the Standards.
It is strongly encouraged that you develop familiarity with both the Standards and the Principles in addition to the Uniform Guidelines. Together, they can help you conduct personnel assessment in a manner consistent with legal and professional standards.
9. Relationship between federal, state, and local employment laws
[back to Testing page] [back to Job page]
Test reliability and validity are two technical properties of a test that indicate the quality and usefulness of the test. These are the two most important features of a test. You should examine these features when evaluating the suitability of the test for your use. This chapter provides a simplified explanation of these two complex ideas. These explanations will help you to understand reliability and validity information reported in test manuals and reviews and use that information to evaluate the suitability of a test for your use.
Chapter Highlights
1. What makes a good test?
2. Test reliability
3. Interpretation of reliability information from test manuals and reviews
4. Types of reliability estimates
5. Standard error of measurement
6. Test validity
7. Methods for conducting validation studies
8. Using validity evidence from outside studies
9. How to interpret validity information from test manuals and independent reviews.
|
Principles of Assessment Discussed Use only assessment procedures and instruments that have been demonstrated to be valid for the specific purpose for which they are being used. Use assessment tools that are appropriate for the target population. |
1. What makes a good test?
The degree to which a test has these qualities is indicated by two technical properties: reliability and validity.
2. Test reliability
How do we account for an individual who does not get exactly the same test score every time he or she takes the test? Some possible reasons are the following:
These factors are sources of chance or random measurement error in the assessment process. If there were no random errors of measurement, the individual would get the same test score, the individual's "true" score, each time. The degree to which test scores are unaffected by measurement errors is an indication of the reliability of the test.
Reliable assessment tools produce dependable, repeatable, and consistent information about people. In order to meaningfully interpret test scores and make useful employment or career-related decisions, you need reliable tools. This brings us to the next principle of assessment.
| Principle of Assessment Use only reliable assessment instruments and procedures. In other words, use only assessment tools that provide dependable and consistent information. |
3. Interpretation of reliability information from test manuals and reviews
| The reliability of a test is indicated by the reliability coefficient. It is denoted by the letter "r," and is expressed as a number ranging between 0 and 1.00, with r = 0 indicating no reliability, and r = 1.00 indicating perfect reliability. Do not expect to find a test with perfect reliability. Generally, you will see the reliability of a test as a decimal, for example, r = .80 or r = .93. The larger the reliability coefficient, the more repeatable or reliable the test scores. Table 1 serves as a general guideline for interpreting test reliability. However, do not select or reject a test solely based on the size of its reliability coefficient. To evaluate a test's reliability, you should consider the type of test, the type of reliability estimate reported, and the context in which the test will be used. |
|
||||||||||||
4. Types of reliability estimates
The discussion in Table 2 should help you develop some familiarity with the different kinds of reliability estimates reported in test manuals and reviews.
Table 2. Types of Reliability Estimates
|
Test-retest reliability indicates the repeatability of test scores with the passage of time. This estimate also reflects the stability of the characteristic or construct being measured by the test. Some constructs are more stable than others. For example, an individual's reading ability is more stable over a particular period of time than that individual's anxiety level. Therefore, you would expect a higher test-retest reliability coefficient on a reading test than you would on a test that measures anxiety. For constructs that are expected to vary over time, an acceptable test-retest reliability coefficient may be lower than is suggested in Table 1. |
|
Alternate or parallel form reliability indicates how consistent test scores are likely to be if a person takes two or more forms of a test. A high parallel form reliability coefficient indicates that the different forms of the test are very similar which means that it makes virtually no difference which version of the test a person takes. On the other hand, a low parallel form reliability coefficient suggests that the different forms are probably not comparable; they may be measuring different things and therefore cannot be used interchangeably. |
|
Inter-rater reliability indicates how consistent test scores are likely to be if the test is scored by two or more raters. On some tests, raters evaluate responses to questions and determine the score. Differences in judgments among raters are likely to produce variations in test scores. A high inter-rater reliability coefficient indicates that the judgment process is stable and the resulting scores are reliable. Inter-rater reliability coefficients are typically lower than other types of reliability estimates. However, it is possible to obtain higher levels of inter-rater reliabilities if raters are appropriately trained. |
|
Internal consistency reliability indicates the extent to which items on a test measure the same thing. A high internal consistency reliability coefficient for a test indicates that the items on the test are very similar to each other in content (homogeneous). It is important to note that the length of a test can affect internal consistency reliability. For example, a very lengthy test can spuriously inflate the reliability coefficient. Tests that measure multiple characteristics are usually divided into distinct components. Manuals for such tests typically report a separate internal consistency reliability coefficient for each component in addition to one for the whole test. Test manuals and reviews report several kinds of internal consistency reliability estimates. Each type of estimate is appropriate under certain circumstances. The test manual should explain why a particular estimate is reported. |
The SEM is a useful measure of the accuracy of individual test scores. The smaller the SEM, the more accurate the measurements.
When evaluating the reliability coefficients of a test, it is important to review the explanations provided in the manual for the following:
For more information on reliability, consult the APA Standards, the SIOP Principles, or any major textbook on psychometrics or employment testing. Appendix A lists some possible sources.
6. Test validity
It is important to understand the differences between reliability and validity. Validity will tell you how good a test is for a particular situation; reliability will tell you how trustworthy a score on that test will be. You cannot draw valid conclusions from a test score unless you are sure that the test is reliable. Even when a test is reliable, it may not be valid. You should be careful that any test you select is both reliable and valid for your situation.
A test's validity is established in reference to a specific purpose; the test may not be valid for different purposes. For example, the test you use to make valid predictions about someone's technical proficiency on the job may not be valid for predicting his or her leadership skills or absenteeism rate. This leads to the next principle of assessment.
| Principle of Assessment Use only assessment procedures and instruments that have been demonstrated to be valid for the specific purpose for which they are being used. |
Similarly, a test's validity is established in reference to specific groups. These groups are called the reference groups. The test may not be valid for different groups. For example, a test designed to predict the performance of managers in situations requiring problem solving may not allow you to make valid or meaningful predictions about the performance of clerical employees. If, for example, the kind of problem-solving ability required for the two positions is different, or the reading level of the test is not suitable for clerical applicants, the test results may be valid for managers, but not for clerical employees.
Test developers have the responsibility of describing the reference groups used to develop the test. The manual should describe the groups for whom the test is valid, and the interpretation of scores for individuals belonging to each of these groups. You must determine if the test can be used appropriately with the particular type of people you want to test. This group of people is called your target population or target group.
| Principle of Assessment Use assessment tools that are appropriate for the target population. |
Your target group and the reference group do not have to match on all factors; they must be sufficiently similar so that the test will yield meaningful scores for your group. For example, a writing ability test developed for use with college seniors may be appropriate for measuring the writing ability of white-collar professionals or managers, even though these groups do not have identical characteristics. In determining the appropriateness of a test for your target groups, consider factors such as occupation, reading level, cultural differences, and language barriers.
Recall that the Uniform Guidelines require assessment tools to have adequate supporting evidence for the conclusions you reach with them in the event adverse impact occurs. A valid personnel tool is one that measures an important characteristic of the job you are interested in. Use of valid tools will, on average, enable you to make better employment-related decisions. Both from business-efficiency and legal viewpoints, it is essential to only use tests that are valid for your intended use.
In order to be certain an employment test is useful and valid, evidence must be collected relating the test to a job. The process of establishing the job relatedness of a test is called validation.
7. Methods for conducting validation studies
The three methods of validity-criterion-related, content, and construct-should be used to provide validation support depending on the situation. These three general methods often overlap, and, depending on the situation, one or more may be appropriate. French (1990) offers situational examples of when each method of validity may be applied.
First, as an example of criterion-related validity, take the position of millwright. Employees' scores (predictors) on a test designed to measure mechanical skill could be correlated with their performance in servicing machines (criterion) in the mill. If the correlation is high, it can be said that the test has a high degree of validation support, and its use as a selection tool would be appropriate.
Second, the content validation method may be used when you want to determine if there is a relationship between behaviors measured by a test and behaviors involved in the job. For example, a typing test would be high validation support for a secretarial position, assuming much typing is required each day. If, however, the job required only minimal typing, then the same test would have little content validity. Content validity does not apply to tests measuring learning ability or general problem-solving skills (French, 1990).
Finally, the third method is construct validity. This method often pertains to tests that may measure abstract traits of an applicant. For example, construct validity may be used when a bank desires to test its applicants for "numerical aptitude." In this case, an aptitude is not an observable behavior, but a concept created to explain possible future behaviors. To demonstrate that the test possesses construct validation support, ". . . the bank would need to show (1) that the test did indeed measure the desired trait and (2) that this trait corresponded to success on the job" (French, 1990, p. 260).
Professionally developed tests should come with reports on validity evidence, including detailed explanations of how validation studies were conducted. If you develop your own tests or procedures, you will need to conduct your own validation studies. As the test user, you have the ultimate responsibility for making sure that validity evidence exists for the conclusions you reach using the tests. This applies to all tests and procedures you use, whether they have been bought off-the-shelf, developed externally, or developed in-house.
Validity evidence is especially critical for tests that have adverse impact. When a test has adverse impact, the Uniform Guidelines require that validity evidence for that specific employment decision be provided.
The particular job for which a test is selected should be very similar to the job for which the test was originally developed. Determining the degree of similarity will require a job analysis. Job analysis is a systematic process used to identify the tasks, duties, responsibilities and working conditions associated with a job and the knowledge, skills, abilities, and other characteristics required to perform that job.
Job analysis information may be gathered by direct observation of people currently in the job, interviews with experienced supervisors and job incumbents, questionnaires, personnel and equipment records, and work manuals. In order to meet the requirements of the Uniform Guidelines, it is advisable that the job analysis be conducted by a qualified professional, for example, an industrial and organizational psychologist or other professional well trained in job analysis techniques. Job analysis information is central in deciding what to test for and which tests to use.
8. Using validity evidence from outside studies
The Uniform Guidelines, the Standards, and the SIOP Principles state that evidence of transportability is required. Consider the following when using outside tests:
To ensure that the outside test you purchase or obtain meets professional and legal standards, you should consult with testing professionals. See Chapter 5 for information on locating consultants.
9. How to interpret validity information from test manuals and independent reviews
Test manuals and reviews should describe
| Table 3. General Guidelines for Interpreting Validity Coefficients | |
| Validity coefficient value | Interpretation |
|---|---|
| above .35 | very beneficial |
| .21 - .35 | likely to be useful |
| .11 - .20 | depends on circumstances |
| below .11 | unlikely to be useful |
- level of adverse impact associated with your assessment tool
- selection ratio (number of applicants versus the number of openings)
- cost of a hiring error
- cost of the selection tool
- probability of hiring qualified applicant based on chance alone.
Here are three scenarios illustrating why you should consider these factors, individually and in combination with one another, when evaluating validity coefficients:
Now, let's change the situation.
Here is another scenario that shows why you need to consider multiple factors when evaluating the validity of assessment tools.
Again, these examples demonstrate the complexity of evaluating the validity of assessments. Multiple factors need to be considered in most situations. You might want to seek the assistance of a testing expert (for example, an industrial/organizational psychologist) to evaluate the appropriateness of particular assessments for your employment situation.
When properly applied, the use of valid and reliable assessment instruments will help you make better decisions. Additionally, by using a variety of assessment tools as part of an assessment program, you can more fully assess the skills and capabilities of people, while reducing the effects of errors associated with any one tool on your decision making.
[back to Testing page] [back to Job page]
This chapter briefly describes different types of assessment tools and procedures that organizations commonly use to conduct personnel assessment. Included are techniques such as employment interviews and reference checks, as well as various types of professionally developed assessment instruments. This chapter also includes a discussion of the use of medical tests and drug and alcohol testing in the workplace. Table 4, which appears at the end of this chapter, contains a brief description of the advantages and disadvantages of different types of assessment instruments.
Chapter Highlights
1. Mental and physical ability tests
2. Achievement tests
3. Biodata inventories
4. Employment interviews
5. Personality inventories
6. Honesty and integrity measures
7. Education and experience requirements (including licensing and certification)
8. Recommendations and reference checks
9. Assessment centers
10. Medical examinations
11. Drug and alcohol tests
It takes a good deal of knowledge and judgment to properly use assessment tools to make effective employment-related decisions. Many assessment tools and procedures require specialized training, education, or experience to administer and interpret correctly. These requirements vary widely, depending on the specific instruments being used. Check with the test publisher to determine whether you and your staff meet these requirements. To ensure that test users have the necessary qualifications, some test publishers and distributors require proof of qualifications before they will release certain tests.
1. Mental and physical ability tests
Examples of some mental abilities are verbal, quantitative, and spatial abilities. Physical ability tests usually encompass abilities such as strength, endurance, and flexibility.
Although mental ability tests are valid predictors of performance in many jobs, use of such tests to make employment decisions often results in adverse impact. For example, research suggests that mental abilities tests adversely impact some racial minority groups and, if speed is also a component of the test, older workers may be adversely impacted. Similarly, use of physical ability tests often results in adverse impact against women and older persons. See Chapter 7 for strategies to minimize adverse impact in your assessment program.
2. Achievement tests
3. Biodata inventories
Some individuals might provide inaccurate information on biodata inventories to portray themselves as being more qualified or experienced than they really are. Internal consistency checks can be used to detect whether there are discrepancies in the information reported. In addition, reference checks and resumes can be used to verify information.
4. Employment interviews
Regardless of the extent to which the interview is structured or unstructured, the skill of the interviewer can make a difference in the quality of the information gathered. A skillful, trained interviewer will be able to ask job-relevant follow-up questions to clarify and explore issues brought up during the interview.
It is unlawful to ask questions about medical conditions and disability before a conditional job offer. Even if the job applicant volunteers such information, you are not permitted to pursue inquiries about the nature of the medical condition or disability. Instead, refocus the interview so that emphasis is on the ability of the applicant to perform the job, not on the disability. In some limited circumstances, you may ask about the need for reasonable accommodation.
Where disability is concerned, the law requires that employers provide reasonable accommodations (meaning a modification or adjustment) to a job, the work environment or the way things are usually done so that qualified individuals with a disability are not excluded from jobs that they can perform. These legal requirements apply to all selection standards and procedures, including questions and rating systems used during the interview process.
Following a structured interview format can help interviewers avoid unlawful or inappropriate inquiries where medical conditions, disability, and age are concerned. For additional information on the ADA, see the EEOC Technical Assistance Manual on the Employment Provisions of the Americans with Disabilities Act and the EEOC ADA Enforcement Guidance: Preemployment Disability -Related Questions and Medical Examinations.
It is important to note that inquiries about race, ethnicity, or age generally are not expressly prohibited under the law, but usually serve no credible purpose in an interview. These types of questions are also closely scrutinized by organizations, including regulatory agencies, interested in protecting the civil rights of applicants.
5. Personality inventories
Some personality inventories have been developed to determine the psychological attributes of an individual for diagnostic and therapeutic purposes. These clinical tools are not specifically designed to measure job-related personality dimensions. These tests are used in only very limited employment situations, primarily with jobs where it is critical to have some idea about an applicant's state of mind, such as in the selection of law enforcement officers or nuclear power plant workers. This distinction between clinical and employment-oriented personality inventories can be confusing. Applicants asked to take personality tests may become concerned even though only employment-oriented personality inventories will be administered.
If a personality inventory or other assessment tool provides information that would lead to identifying a mental disorder or impairment, the tool is considered a medical exam under the ADA. The ADA permits medical examinations of applicants and employees only in limited circumstances.
There are a few additional concerns about personality tests. Since there are usually no right or wrong answers to the test items, test takers may provide socially desirable answers. However, sophisticated personality inventories often have "lie-scales" built in, which allow such response patterns to be detected. There is also a general perception that these tests ask personal questions that are only indirectly relevant to job performance. This may raise concern on the part of test takers that such tests are an invasion of privacy. Some of these concerns can be reduced by including personality tests only as one part of a broader assessment program.
6. Honesty and integrity measures
All the legitimate concerns and cautions of personality testing apply here. For instance, test takers may raise privacy concerns or question the relevance of these measures to job performance. If you choose to use an honesty test to select people for a particular job, you should document the business necessity of such a test. This would require a detailed job analysis, including an assessment of the consequences of hiring a dishonest individual. Make certain that your staff have the proper training and qualifications to administer and interpret integrity tests.
It is generally recommended that these tests be used only for pre-employment screening. Using the test with present employees could create serious morale problems. Using current employees' poor scores to make employment decisions may have legal repercussions when not substantiated by actual counterproductive behavior.
All honesty and integrity measures have appreciable prediction errors. To minimize prediction errors, thoroughly follow up on poor-scoring individuals with retesting, interviews, or reference checks. In general, integrity measures should not be used as the sole source of information for making employment decisions about individuals.
A number of states currently have statutes restricting the use of honesty and integrity measures. At least one state has an outright ban on their use. Consult regulations in your state that govern the use of honesty and integrity tests before using them.
7. Education and experience requirements (including licensing and certification)
Requirements for experience and education should be job-related. If the requirements you set result in adverse impact, you will have to demonstrate that they are job-related and justified by business necessity. However, in some cases job-relatedness might be difficult to demonstrate. For example, it is difficult to show that exactly 3 years of experience is necessary or demonstrate that a high school degree is required for a particular job.
8. Recommendations and reference checks
These verification procedures generally do not help separate potentially good workers from poor workers. This is because they almost always result in positive reports. However, use of these measures may serve two important purposes
9. Assessment centers
Assessors must be appropriately trained. Their skills and experience are essential to the quality of the evaluations they provide. Assessment centers apply the whole-person approach to personnel assessment. They can be very good predictors of job performance and behavior when the tests and procedures making up the assessment center are constructed and used appropriately.
It can be costly to set up an assessment center. Large companies may have their own assessment centers; mid-size and smaller firms sometimes send candidates to private consulting firms for evaluation.
10. Medical examinations
The Americans with Disabilities Act outlines when and in what manner medical exams can be used in employment-related situations. For additional information on the ADA, see Chapter 2 of the Guide, the EEOC Technical Assistance Manual on the Employment Provisions of the Americans with Disabilities Act, the EEOC ADA Enforcement Guidance: Preemployment Disability - Related Questions and Medical Examinations, and the EEOC Uniform Guidelines on Employee Selection Procedures. Some major points regarding medical exams are described below.
If your organization uses medical information to make personnel decisions, you should develop a written policy on medical testing to ensure compliance with relevant federal, state, and local laws. For additional information on the ADA, see the EEOC Technical Assistance Manual on the Employment Provisions of the Americans with Disabilities Act, and the EEOC ADA Enforcement Guidance: Preemployment Disability - Related Questions and Medical Examinations.
11. Drug and alcohol tests
Current use, possession, or distribution of illicit drugs does not qualify as a "disability" under the ADA. You may prohibit the use of such drugs at the workplace, and you may administer drug tests to applicants and employees alike. You may deny employment to an applicant and discipline or discharge an employee currently engaged in illegal drug use. However, you may not discriminate against a former drug addict who has successfully undergone rehabilitation and does not currently use illicit drugs.
If your organization is in the public sector, federal courts have generally upheld the use of random drug tests only when applied to safety-sensitive positions. This federal restriction does not apply if you are a private employer. However, state or local laws and collective bargaining agreements pertaining to drug testing may impose restrictions on your drug testing policy.
Some legal medications or even some foods can produce a positive reading on a drug screening test for an individual who, in fact, has not used illegal drugs. To minimize such errors, it is advisable to have a formal appeals process, and also provisions for retesting with a more sensitive drug test when necessary.
Under the ADA, a test for the illegal use of drugs is not considered a medical exam, but a test for alcohol use is. Therefore, you must follow the ADA rules on medical exams in deciding whether and when to administer an alcohol test to applicants or employees.
Alcoholism may qualify as a disability under the ADA, and hence an individual with this condition may be extended protection. However, organizations may discipline individuals who violate conduct or performance standards that are related to the job. Organizations also may discharge, or deny employment to individuals whose use of alcohol impairs job performance or compromises safety to the extent that he or she can no longer be considered a "qualified individual with a disability."
If your organization uses drug or alcohol tests to make personnel decisions, you should develop a written policy governing such a program to ensure compliance with all relevant federal, state, and local laws. Most states require written consent of employees and applicants before drug or alcohol tests can be administered. Consult the ADA, the EEOC Technical Assistance Manual on the Employment Provisions of the Americans with Disabilities Act, the EEOC ADA Enforcement Guidance: Preemployment Disability - Related Questions and Medical Examinations, and the EEOC Uniform Guidelines on Employee Selection Procedures, as well as your state and local laws when developing a drug or alcohol testing program.
| Table 4. Main Advantages and Disadvantages of Different Types of Assessment Instruments |
||
|
|
||
| Type of assessment instrument | Advantages | Disadvantages |
|
|
||
| Mental Ability tests |
|
|
|
|
||
| Achievement/proficiency tests |
|
|
|
|
||
| Biodata inventories |
|
|
|
|
||
| Employment interviews |
|
|
|
|
||
| Personality inventories |
|
|
|
|
||
| Honesty/integrity measures |
|
|
|
|
||
| Education and experience requirements |
|
|
|
|
||
| Recommendations and reference checks |
|
|
|
|
||
| Assessment centers |
|
|
|
|
||
| Medical examinations |
|
|
|
|
||
| Drug and alcohol tests |
|
|
|
|
||
[back to Testing page] [back to Job page]
Previous chapters described a number of types of personnel tests and procedures and use of assessment tools to identify good workers and improve organizational performance. Technical and legal issues that have to be considered in using tests were also discussed. In this chapter, information and procedures for evaluating tests will be presented.
Chapter Highlights
1. Sources of information about tests
2. Standards for evaluating a test-information to consider to determine suitability of a test for your use
3. Checklist for evaluating a test.
| Principle of Assessment Use assessment instruments for which understandable and comprehensive documentation is available. |
1. Sources of information about tests
The MMY series covers nearly all commercially available psychological, educational, and vocational tests published for use with English-speaking people. There is a detailed review of each test by an expert in the field. A brief description of the test covering areas such as purpose, scoring, prices, and publisher is also provided.
The MMY is published by the Buros Institute of Mental Measurements. The Buros Institute also makes test reviews available through a computer database. This database is updated monthly via an on-line computer service. This service is administered by the Bibliographic Retrieval Services (BRS).
If you are considering hiring a consultant, it is important to evaluate his or her qualifications and experience beforehand. Professionals working in this field generally have a Ph.D. in industrial/organizational psychology or a related field. Look for an individual with hands-on experience in the areas in which you need assistance. Consultants may be found in psychology or business departments at universities and colleges. Others serve as full-time consultants, either working independently, or as members of consulting organizations. Typically, professional consultants will hold memberships in APA, SIOP, or other professional organizations.
Reference libraries should contain the publications discussed above as well as others that will provide information about personnel tests and procedures. The Standards for Educational and Psychological Testing and the Principles for the Validation and Use of Personnel Selection Procedures can also help you evaluate a test in terms of its development and use. In addition, these publications indicate the kinds of information a good test manual should contain. Carefully evaluate the quality and the suitability of a test before deciding to use it. Avoid using tests for which only unclear or incomplete documentation is available, and tests that you are unable to thoroughly evaluate. This is the next principle of assessment.
| Principle of Assessment Use assessment instruments for which understandable and comprehensive documentation is available. |
2. Standards for evaluating a test-information to consider to determine suitability of a test for your use
The following basic descriptive and technical information should be evaluated before you select a test for your use. In order to evaluate a test, you should obtain a copy of the test and test manual. Consult independent reviews of the test for professional opinions on the technical adequacy of the test and the suitability of the test for your purposes.
3. Checklist for evaluating a test
CHECKLIST FOR EVALUATING A TEST
[back to Testing page] [back to Job page]
Proper administration of assessment instruments is essential to obtaining valid or meaningful scores for your test takers. This chapter discusses how to administer assessment instruments so that you can be certain that the results will be valid and fair.
Chapter Highlights
1. Training and qualifications of administration staff
2. Following instructions and guidelines stated in the test manual
3. Ensuring suitable and uniform assessment conditions
4. How much help to offer test takers
5. Test anxiety
6. Alternative assessment methods for special cases
7. Providing reasonable accommodation in the assessment process to people with disabilities
8. Administering computer-based tests
9. Obtaining informed consent of test takers and a waiver of liability claims
10. Maintaining assessment instrument security
11. Maintaining confidentiality of assessment results
12. Testing unionized employees
| Principles of Assessment Discussed Ensure that administration staff are properly trained. Ensure that testing conditions are suitable for all test takers. Provide reasonable accommodation in the assessment process for people with disabilities. Maintain assessment instrument security. Maintain confidentiality of assessment results. |
1. Training and qualifications of administration staff
Only those staff who can administer the test in a professional and satisfactory manner should be assigned test administration duties. Test administrators should be well organized and observant, speak well, and be able to deal comfortably with people. They should also be trained to handle special situations with sensitivity. For example, they should know how to respond to a test taker's request for an accommodation and be able to calm down those who may become overly anxious about taking a test. This leads to our next principle of assessment.
| Principle of Assessment Ensure that administration staff are properly trained. |
2. Following instructions and guidelines stated in the test manual
3. Ensuring suitable and uniform assessment conditions
| Principle of Assessment Ensure that testing conditions are suitable for all test takers. |
To maintain the integrity of test results, administrators need to be alert to test takers' activities throughout the session. For example, some individuals may lose their place in the test booklet or put answers in the wrong column on the answer sheet. Others may try to copy answers from someone else. An alert administrator will be able to correct these situations quickly before they invalidate the test takers' responses.
4. How much help to offer test takers
Some instruments allow the administrator to clarify the directions and practice exercises, but prohibit help with the actual test questions. This is generally true for ability and achievement tests. However, other assessment tools, such as interest inventories or biodata instruments, may allow for more assistance with the assessment. In general, test takers should not be coached on how best to answer test questions. Administrators should not offer more information than what is indicated in the instructions. If they do, some individuals will be given an unfair advantage.
5. Test anxiety
6. Alternative assessment methods for special cases
7. Providing reasonable accommodation in the assessment process to people with disabilities
| Principle of Assessment Provide reasonable accommodation in the assessment process for people with disabilities. |
Accommodation in the assessment process may involve ensuring physical accessibility to the test site, modifying test equipment or tests, or providing qualified assistance. Giving extra time on certain kinds of tests to test takers with dyslexia or other learning disability, and administering a larger print version of a test to a person who is visually impaired are examples of reasonable accommodation. Note, however, that providing a reader for a reading comprehension test, or extra time for a speeded test could invalidate the test results. You should become familiar with what accommodations can be made for different conditions or circumstances without invalidating the test. Provide all test takers with descriptive information about the test in advance, so that they will have ample opportunity to request needed accommodations. When the need for accommodation is not obvious, you may ask for reasonable documentation of the disability functional limitations for which accommodation is needed. The test taker, test manual, the test publisher, and several professional associations (listed in Chapter 2 and Appendix A) can help you determine what the appropriate reasonable accommodations are for particular situations. If an accommodation cannot be made without invalidating the test, alternative assessment strategies, such as a review of past job experience, a review of school records, or a brief job tryout, must be considered.
8. Administering computer-based tests
Some computer-based tests are adaptive. Adaptive tests, as opposed to conventional tests, present test questions based on the responses of the test taker to previous questions, and so adjust for his or her level of ability. This allows for a more reliable measure of ability with fewer items administered.
Advantages to computer-based testing include
! Administration procedures are the same for all test takers.
! The need for test administrators is reduced.
! Results can be available immediately.
! The test can be administered without delay to walk-in applicants.
Disadvantages of computer-based testing include
! A computer is needed for each test taker.
! Some test takers may feel uncomfortable using a computer; this could raise anxiety levels and adversely affect scores of these individuals.
9. Obtaining informed consent of test takers and a waiver of liability claims
Obtaining written consent does not relieve the organization of legal liability if applicable laws are violated.
10. Maintaining assessment instrument security
| Principle of Assessment Maintain assessment instrument security. |
Security measures are also required when you use computer-based tests. Establish a password procedure for accessing computerized test materials, and secure all related computer disks and manuals. Many computerized test developers encode test items and answer keys so that items cannot easily be read if electronic files are stolen. When tests are used over a long period of time, it becomes increasingly likely that some test questions will leak out. To help maintain security, test developers periodically introduce new alternate forms. If alternate forms of the test are available, you can increase security by varying the form used.
11. Maintaining confidentiality of assessment results
| Principle of Assessment Maintain confidentiality of assessment results. |
As discussed in Chapters 2 and 4, under the ADA, medical information about employees and applicants is confidential and must be kept in a separate location from other personnel information.
12. Testing unionized employees
[back to Testing page] [back to Job page]
This chapter describes some of the most common assessment instrument scoring procedures. It also discusses how to properly interpret results, and how to use them effectively. Other issues regarding the proper use of assessment tools are also discussed.
Chapter Highlights
1. Assessment instrument scoring procedures
2. Test interpretation methods: norm and criterion-referenced tests
3. Interpreting test results
4. Processing test results to make employment decisions-rank-ordering and cut-off scores
5. Combining information from many assessment tools
6. Minimizing adverse impact
| Principle of Assessment Ensure that scores are interpreted properly. |
1. Assessment instrument scoring procedures
2. Test interpretation methods: norm and criterion-referenced tests
The test manual will usually provide detailed descriptions of the norm groups and the test norms. To ensure valid scores and meaningful interpretation of norm-referenced tests, make sure that your target group is similar to the norm group. Compare the educational level, the occupational, language and cultural backgrounds, and other demographic characteristics of the individuals making up the two groups to determine their similarity.
For example, consider an accounting knowledge test that was standardized on the scores obtained by employed accountants with at least 5 years of experience. This would be an appropriate test if you are interested in hiring experienced accountants. However, this test would be inappropriate if you are looking for an accounting clerk. You should look for a test normed on accounting clerks or a closely related occupation.
A particular test score is generally chosen as the minimum acceptable level of competence. How is a level of competence chosen? The test publisher may develop a mechanism that converts test scores into proficiency standards, or the company may use its own experience to relate test scores to competence standards.
For example, suppose your company needs clerical staff with word processing proficiency. The test publisher may provide you with a conversion table relating word processing skill to various levels of proficiency, or your own experience with current clerical employees can help you to determine the passing score. You may decide that a minimum of 35 words per minute with no more than two errors per 100 words is sufficient for a job with occasional word processing duties. If you have a job with high production demands, you may wish to set the minimum at 75 words per minute with no more than 1 error per 100 words.
It is important to ensure that all inferences you make on the basis of test results are well founded. Only use tests for which sufficient information is available to guide and support score interpretation. Read the test manual for instructions on how to properly interpret the test results. This leads to the next principle of assessment.
| Principle of Assessment Ensure that scores are interpreted properly. |
3. Interpreting test results
4. Processing test results to make employment decisions-rank-ordering and cut-off scores
The rank-ordering of test results, the use of cut-off scores, or some combination of the two is commonly used to assess the qualifications of people and to make employment-related decisions about them. These are described below.
Rank-ordering is a process of arranging candidates on a list from highest score to lowest score based on their test results. In rank-order selection, candidates are chosen on a top-down basis.
A cut-off score is the minimum score that a candidate must have to qualify for a position. Employers generally set the cut-off score at a level which they determine is directly related to job success. Candidates who score below this cut-off generally are not considered for selection. Test publishers typically recommend that employers base their selection of a cut-off score on the norms of the test.
5. Combining information from many assessment tools
A key decision in using the total assessment approach is determining the relative weights to assign to each assessment instrument in the program.
Figure 3 is a simple example of how assessment results from several tests and procedures can be combined to generate a weighted composite score.
| Assessment instrument | Assessment score (0-100) | Assigned weight | Weighted total |
|---|---|---|---|
| Interview | 80 | 8 | 640 |
| Mechanical ability test | 60 | 10 | 600 |
| H.S. course work | 90 | 5 | 450 |
| Total Score: 1,690 | |||
6. Minimizing adverse impact
[back to Testing page] [back to Job page]
It is important to remember that an assessment instrument, like any tool, is most effective when used properly and can be very counterproductive when used inappropriately. In previous chapters you have read about the advantages of using tests and procedures as part of your personnel assessment program. You have also read about the limitations of tests in providing a consistently accurate and complete picture of an individual's employment-related qualifications and potential. This chapter highlights some important issues and concerns surrounding these limitations. Careful attention to these issues and concerns will help you produce a fair and effective assessment program.
Chapter Highlights
1. Deciding whether to test or not to test
2. Viewing tests as threats and invasions of privacy
3. Fallibility of test scores
4. Appeals process and retesting
5. Qualifications of assessment staff
6. Misuse or overuse of tests
7. Ensuring both efficiency and diversity
8. Ethnic, linguistic, and cultural differences and biases
9. Testing people with disabilities
1. Deciding whether to test or not to test
Sometimes a more vigorous employee training program will help to improve individual and organizational performance without expanding your current selection procedures. Sometimes a careful review of each candidate's educational background and work history will help you to select better workers, and sometimes using additional tests will be beneficial.
Consider how much additional time and effort will be involved in expanding your assessment program. As in every business decision, you will want to determine whether the potential benefits outweigh the expenditure of time and effort. Be sure to factor in all the costs, such as purchase of tests and staff time, and balance these against all the benefits, including potential increases in productivity.
In summary, before expanding your assessment program, it is important to have a clear picture of your organization's needs, the benefits you can expect, and the costs you will incur.
2. Viewing tests as threats and invasions of privacy
Fear or mistrust of tests can lower the scores of some otherwise qualified candidates. To reduce these feelings, it is important to take the time to explain a few things about the testing program before administering a test. Any explanation should, at a minimum, cover the following topics:
! why the test is being administered
! fairness of the test
! confidentiality of test results
! how the test results will be used in the assessment process.
3. Fallibility of test scores
It is, therefore, important not to rely entirely on any one assessment instrument in making employment decisions. Using a variety of assessment tools will help you obtain a fuller and more accurate picture of an individual. Consider such information as an evaluation of a person's education, work experience and other job-relevant factors in addition to standardized test results.
4. Appeals process and retesting
There are external circumstances or conditions that could invalidate the test results. These may include the test taker's state of mind or health at the time of the test, the conditions under which the test is given, and his or her familiarity with particular questions on the test. To give some specific examples, a person who has a child at home with the measles may not be able to concentrate on taking a vocabulary test. Someone sitting next to a noisy air conditioner may also not be able to concentrate on the test questions. On another day, under different circumstances, these individuals might obtain a different score.
If you believe that the test was not valid for an individual, you should consider a retest. If other versions of the test are not available, consider alternative means of assessment. Check the test manual for advice from the publisher regarding retesting. It is advisable to develop a policy on handling complaints regarding testing and appeals for retesting, so that these concerns can be resolved fairly and consistently.
5. Qualifications of assessment staff
6. Misuse or overuse of tests
In addition, test results usually provide specific information that is valid for a specific amount of time. Therefore, it is unlikely to be appropriate to consider an employee for a promotion based on his or her test scores on a proficiency test taken 5 years earlier.
The test manual and independent reviews of the test remain your best guides on administering, scoring, and interpreting the test.
7. Ensuring both efficiency and diversity
To help ensure both efficiency and diversity in your workforce, apply the whole-person approach to assessment. Use a variety of assessment tools to obtain a comprehensive picture of the skills and capabilities of applicants and employees. This approach to assessment will help you make sure you don't miss out on some very qualified individuals who could enhance your organization's success.
8. Ethnic, linguistic, and cultural differences and biases
Before selecting new tests, consider the composition of your potential candidate population. Are the tests appropriate for all of them? The test manuals may provide assistance in determining this. If you need further clarification, contact the test publisher.
There may be cases where appropriate standardized tests are not available for certain groups. You may have to rely on other assessment techniques, such as interviews and evaluations of education and work experience, to make your employment decisions.
9. Testing people with disabilities
Accommodation may involve ensuring physical accessibility to the test site, modifying test equipment or tests, or providing other forms of assistance. Giving extra time for certain kinds of tests to test takers with dyslexia or other learning disabilities and administering a braille version of a test for the blind may be examples of reasonable accommodation. See Chapters 2 and 6 for further discussions on testing people with disabilities.
[back to Testing page] [back to Job page]
Employers can effectively use personnel assessment instruments to measure job-relevant skills and capabilities of applicants and employees. These tools can help to identify and select better workers and can help improve the quality of an organization's overall performance. To use these tools properly, employers must be aware of the inherent limitations of any assessment procedure, as well as the legal issues involved in assessment.
The guide is organized around 13 important assessment principles and their applications. This final chapter brings all the principles together. They are listed below in the order of their appearance in the text, with the chapter number in parentheses. Together, the 13 principles provide a comprehensive framework for conducting an effective personnel assessment program.
Use assessment tools in a purposeful manner (Chapter 1) Assessment instruments, like other tools, are helpful when used properly but can be useless, harmful, or illegal when used inappropriately. Often, inappropriate use results from not having a clear understanding of what you want to measure and why you want to measure it. As an employer, you must first be clear about what you want to accomplish with your assessment program in order to select the proper tools to achieve those goals.
Your assessment strategies should be based on both an understanding of the kind of employment decisions to be made and the population to be assessed. Once you are clear on your purpose, you will be better able to select appropriate assessment tools, and use those tools in an effective manner. Only use tests that are appropriate for your particular purpose.
Use the whole-person approach to assessment (Chapter 1) An assessment instrument may provide you with important employment-related information about an individual. However, no assessment tool is 100% reliable or valid; all are subject to errors, both in measuring job-relevant characteristics and in predicting job performance. Moreover, a single assessment instrument only provides you with a limited view of a person's qualifications. Using a variety of tools to measure skills, abilities, and other job-relevant characteristics provides you with a solid basis upon which to make important career and employment-related decisions and minimizes adverse impact.
Use only assessment instruments that are unbiased and fair to all groups (Chapter 2) Using unbiased and fair tests will help you select a qualified and diverse workforce. Employment decisions based on tests that are biased are likely to lead to unfair and illegal discrimination against members of the lower scoring groups. You should review the fairness evidence associated with assessment instruments before selecting tools by examining the test manual and independent test reviews.
Use only reliable assessment instruments and procedures (Chapter 3) If a person takes the same test again, will he or she get a similar score, or a very different score? A reliable instrument will provide accurate and consistent scores. To meaningfully interpret test scores and make useful career or employment-related decisions, use only reliable tools. Test manuals will usually provide a statistic, known as the reliability coefficient, giving you an indication of a test's reliability. The higher the reliability coefficient, the more confidence you can have that the score is accurate.
Use only assessment procedures and instruments that have been demonstrated to be valid for the specific purpose for which they are being used (Chapter 3) Validity is the most important issue in selecting assessment tools. It refers to (1) the characteristic the assessment instrument measures, and (2) how well the instrument measures the characteristic. Validity is not a property of the assessment instrument itself; it relates to how the instrument is being used.
A test's validity is established in reference to a specific purpose; it may not be valid for different purposes. For example, a test that may be valid for predicting someone's "job knowledge," may not be valid for predicting his or her "leadership skills." You must be sure that the instrument is valid for the purpose for which it is to be used. Selecting a commercially developed instrument does not relieve you of this responsibility.
The test manual usually provides a statistic, the validity coefficient, that will give an indication of the test's validity for a specific purpose under specific circumstances. It measures the degree of relationship between test performance and job performance (i.e., job-relatedness of the test).
Use assessment tools that are appropriate for the target population (Chapter 3) An assessment tool is usually developed for use with a specific group; it may not be valid for other groups. For example, a test designed to predict the performance of office managers may not be valid for clerical workers. The skills and abilities required for the two positions may be different, or the reading level of the test may not be suitable for clerical workers. Tests should be appropriate for the individuals you want to test, that is, your target population.
The manual should indicate the group or groups the test is designed to assess. Your target population should be similar to the group on which the test was developed, or normed. In determining the appropriateness of an instrument for your target group, also consider such factors as reading levels, cultural backgrounds, and language barriers.
Use assessment instruments for which understandable and comprehensive documentation is available (Chapter 5) Are the instructions for administration and interpretation understandable? Is the information sufficiently comprehensive to evaluate the suitability of the instrument for your needs? Carefully evaluate the documentation provided by the test publisher to be sure that the tools you select do the job you want them to do and furnish you with the information you need. If the documentation is not understandable or complete, you run the risk of selecting inappropriate instruments.
Test manuals should provide information about both the development and psychometric characteristics of tests. They should cover topics such as procedures for administration, scoring and interpretation, the recommended uses of an instrument, the groups for whom the test is appropriate, and test norms. They should also include a description of the validation procedures used, and evidence of validity, reliability, and test fairness.
Ensure that administration staff are properly trained (Chapter 6) Assessment instruments must be administered properly to obtain valid results. Consult the test publisher and administration manual for guidelines on the qualifications and training required for test administrators. These requirements will vary depending on the nature and complexity of the test. Only suitable staff should be selected. Administrators should be given ample time to learn their responsibilities and should practice by administering tests to other staff before administering tests to applicants. Some test publishers may run training sessions for test administration and interpretation.
Administration staff should also be trained to handle special situations with sensitivity. An example would be responding to a request for accommodation based on a disability.
Ensure that testing conditions are suitable for all test takers (Chapter 6) There are various extraneous influences that may affect the reliability and validity of an assessment procedure. For example, noise in the testing room, poor lighting, inaccurate timing and damaged test equipment may adversely affect test takers. Staff should ensure that the testing environment is suitable and that administration procedures are uniform for all test takers.
Provide reasonable accommodation in the assessment process for people with disabilities (Chapter 6) To ensure that qualified individuals with disabilities have an equal chance to demonstrate their potential, accommodations in the assessment process may be necessary. Under the ADA, reasonable accommodation may involve ensuring physical accessibility to the test site, modifying test equipment or the testing process, or providing qualified assistance to the test taker. For example, administering a braille version of a test, allowing extra time to complete the test, or supplying a reader may be appropriate. It is important to become familiar with the types of accommodations that can be made without invalidating test results. If reasonable accommodation involving test administration cannot be made, consider alternative assessment strategies.
Maintain assessment instrument security (Chapter 6) All materials used in the assessment process, whether paper-and-pencil or computer-based, must be kept secure. Lack of security may result in some test takers having access to test questions beforehand, thus invalidating their scores. To prevent this, test users should, for example, keep testing materials in locked rooms or cabinets and limit access to those materials to staff involved in the assessment process. Security is also the responsibility of test developers. The security of a test may become compromised over time. To protect security, test developers periodically introduce new forms of tests.
Maintain confidentiality of assessment results (Chapter 6) Assessment results are highly personal. Employers must respect the test taker's right to confidentiality. Assessment results should only be shared with those who have a legitimate need to know. This would include staff involved in interpreting assessment results and making employment decisions. Personal information should not be released to other organizations or individuals without the informed consent of the test taker.
Ensure that scores are interpreted properly (Chapter 7) Tests are used to make inferences about people's characteristics, capabilities, and future performance. The inferences should be reasonable, well-founded, and not based upon stereotypes. If test scores are not interpreted properly, the conclusions drawn from them are likely to be invalid, thus leading to poor decision making. Ensure that there is solid evidence to justify your test score interpretations and the employment decisions you make based on those scores. The test manual should provide instructions on how to properly interpret test results.
[back to Testing page] [back to Job page]
The following list of reference material provides sources of information on specific topics and issues relating to personnel testing and assessment. The main text has referred to many of the publications listed below. Others are included as general reference documents and as recommended readings.
American Educational Research Association, American Psychological Association and National Council on Measurement in Education. 1985. Standards for Educational and Psychological Testing. Washington, DC: American Psychological Association.
Anastasi, A. 1988. Psychological Testing (6th edition). New York: Macmillan.
Arvey, R.D., and R.H. Faley. 1988. Fairness in Selecting Employees. Reading, MA: Addison-Wesley.
Boudreau, J. 1996. Cumulative Supplement to Employment Testing Manual. Boston: Warren, Gorham-Lamont.
Bruyäre, S.M., and J. O'Keeffe (eds.). 1994. Implications of the Americans with Disabilities Act for Psychology. Washington, DC: American Psychological Association.
Bureau of National Affairs. 1990. The Americans with Disabilities Act: A Practical and Legal Guide to Impact, Enforcement, and Compliance. Washington, DC: Author.
Bureau of National Affairs Policy and Practice Series. 1992-1995. Fair Employment Practices Manual #8. Washington, DC: Author.
Buros Institute of Mental Measurements. Various. Mental Measurements Yearbook. Lincoln, NE: University of Nebraska Press.
Buros Institute of Mental Measurements. Various. Tests in Print. Lincoln, NE: University of Nebraska Press.
Douglas, J.A., D.E. Feld, and N. Asquith. 1989. Employment Testing Manual. Boston, MA: Warren, Gorham-Lamont.
Equal Employment Opportunity Commission. 1978. The Office of Personnel Management, U.S. Department of Justice and U.S. Department of Labor (1979). Questions and Answers Clarifying and Interpreting the Uniform Guidelines on Employee Selection Procedures. 29 CFR Part 1607 (1988).
Equal Employment Opportunity Commission. 1978. The Office of Personnel Management, U.S. Department of Justice and U.S. Department of Labor (1979). Uniform Guidelines on Employee Selection Procedures. 41 CFR Part 603 (1978).
Equal Employment Opportunity Commission. 1992. A Technical Assistance Manual on the Employment Provisions (Title I) of the Americans with Disabilities Act. Washington, DC: U.S.Government Printing Office.
Equal Employment Opportunity Commission. 1992. EEOC Technical Assistance Manual on Employment Provisions of the Americans with Disabilities Act; ADA Enforcement Guidance: Preemployment Disability Related Questions and Medical Examinations.
Equal Employment Opportunity Commission and U.S. Department of Justice. 1991. Americans with Disabilities Act Handbook. Washington, DC: Author.
French, W.L. 1990. Human resources management (2nd edition). Houghton Mifflin Co.: Boston, MA.
Guion, R.M. 1997. Assessment, Measurement, and Prediction for Personnel Decisions. Mahwah, NJ: Lawrence Erlbaum Associates. Hogan J., and R. Hogan (eds.) 1984-1990. Business and Industry Testing: Current Practices and Test Reviews. Austin, TX: Pro-ed.
Keyser, D.J., and R.C. Sweetland (eds.). 1984-1988. Test Critiques. (Vols. 1-7). Kansas City, MO: Test Corporation of America.
Murphy, K.R., and C.O. Davidshofer. 1988. Psychological Testing: Principles and Applications. Englewood Cliffs, NJ: Prentice Hall.
Nunnally, J.C. 1978. Psychometric Theory (2nd edition). New York: McGraw-Hill.
Society for Industrial and Organizational Psychology, Inc. 1987. Principles for the Validation and Use of Personnel Selection Procedures (3rd edition). College Park, MD: Author.
U.S. Department of Justice. 1991. The Americans with Disabilities Act: Questions and Answers. Washington, DC: Civil Rights Division, U.S. Department of Justice.
U.S. Department of Labor, Employment and Training Administration. 1993. JTPA: Improving Assessment: A Technical Assistance Guide. Washington, DC: Author.
ability test A test that measures the current performance or estimates future performance of a person in some defined area of cognitive, psychomotor, or physical functioning.
achievement test A test that measures acquired knowledge or skills, usually as the result of previous instruction.
adverse impact A situation in which members of a particular race, sex, or ethnic group have a substantially lower rate of selection in hiring, promotion, or other employment decisions.
alternate forms Two or more forms of a test that are similar in nature and intended to be used for the same purpose.
assessment Any test or procedure used to measure an individual's employment or career-related qualifications or characteristics.
basic skills test Assessments of competence in reading, simple mathematics, and other skills that are widely required in training and employment settings.
coaching Instructional activities designed to improve the test performance of prospective test takers.
compensatory approach See counterbalanced approach.
concurrent validity See criterion-related validity.
construct A theoretical characteristic or concept (e.g., numerical ability, conscientiousness) that has been constructed to explain observable patterns of behavior.
construct-related validity The extent to which a test measures a specific theoretical construct, characteristic, or trait. In employment testing, this characteristic should be important for job success. Examples of constructs are mechanical ability and physical endurance.
content-related validity The extent to which the content of a test samples or represents the subject area or behavior it is intended to measure.
converted score A raw score that has been converted by numerical transformation (for example, to percentile ranks or standard scores) to facilitate comparison of individual scores with group norms.
correlation A statistic that indicates the degree to which two variables relate to each other, such as a test score and job performance, or one test with another test.
counterbalanced approach An approach to personnel assessment that allows high scores in one or more areas to be counterbalanced with low scores in another area.
criterion A measure of performance, such as productivity rate, accident rate, or supervisory ratings. Test scores are used to predict criteria.
criterion-related validity The degree to which scores on an assessment instrument correlate with some external criterion, such as job performance. When the assessment instrument and the criterion are measured at about the same time, it is called concurrent validity; when the criterion is measured at some future time, it is called predictive validity.
derived score See converted score.
equivalent forms See alternate forms.
expectancy table A table that shows the probability of different criterion outcomes for each test score.
hurdles approach See multiple hurdles approach.
inventory A questionnaire or checklist that elicits information about an individual in such areas as work values, interests, attitudes, and motivation.
job analysis A systematic process used to identify the tasks, duties, responsibilities and working conditions associated with a job and the knowledge, skills, abilities and other characteristics required to perform that job.
mean The average score in a group of scores, computed by adding all the scores and dividing the sum by the number of cases.
median The middle score in a group of ranked scores. It is the point or score that divides the group into two equal parts. The median is also known as the 50th percentile.
multiple hurdles approach An approach to personnel assessment that requires a candidate to pass all tests in sequence in order to qualify.
normal curve A mathematical curve that is the basis of many statistical analyses. The curve is bilaterally symmetrical, with a single bell-shaped peak in the center. Most distributions of human traits, such as height, mathematical ability, and manual dexterity, approximate the normal curve.
norms Descriptive statistics that are used to summarize the test performance of a specified group, such as a sample of workers in a specific occupation. Norms are often assumed to represent a larger population, such as all workers in an occupation.
parallel forms See alternate forms.
percentile score The score on a test below which a given percentage of scores fall. For example, a score at the 65th percentile is equal to or higher than the scores obtained by 65% of the people who took the test.
predictive validity See criterion-related validity.
rank ordering The process of ranking individuals based on their relative test scores, from the highest to the lowest score.
raw score The obtained score on a test, usually determined by counting the number of correct answers.
reference group The group of individuals used to develop a test.
reliability The degree to which test scores are consistent, dependable, or repeatable.
reliability coefficient A coefficient of correlation that indicates the degree to which test scores are dependable, or repeatable.
standard deviation A statistic used to describe the variability within a set of scores. It indicates the extent to which scores vary around the mean or average score.
standard error of measurement (SEM) A statistic that gives an indication of the amount of error in a measurement system. It indicates a range within which a test taker's "true" score is likely to fall.
standard score A score that describes the location of a person's score within a set of scores in terms of its distance from the mean in standard deviation units.
standardized test A test developed using professionally prescribed methods that provides specific administration requirements, instructions for scoring and instructions for interpreting scores.
target group The population or group of individuals whom the employer wishes to assess.
test Any instrument or procedure that samples behavior or performance. A personnel or employment test is the general term for any assessment tool used to measure an individual's employment qualifications, capabilities, or characteristics.
validity The degree to which actions or inferences based on test results are meaningful or supported by theory and empirical evidence.
validity coefficient A numerical index that shows the strength of the relationship between a test score and a criterion, such as job performance.