Psychology Wiki
Psychology Wiki

Assessment | Biopsychology | Comparative | Cognitive | Developmental | Language | Individual differences | Personality | Philosophy | Social |
Methods | Statistics | Clinical | Educational | Industrial | Professional items | World psychology |

Statistics: Scientific method · Research methods · Experimental design · Undergraduate statistics courses · Statistical tests · Game theory · Decision theory


In psychometrics test bias is said to occur when a test yields higher or lower scores on average when it is administered to specific criterion groups such as people of a particular race or sex than when administered to an average population sample. Negative bias is said to occur when the criterion group scores lower than average and positive bias when they score higher.

The crux of the issue then is does this occur because there is a real difference in the attribute being measured or is this due to cultural test bias for example.



See also[]

  • Abad, F. J., Colom, R., Rebollo, I., & Escorial, S. (2004). Sex differential item functioning in the Raven's Advanced Progressive Matrices: Evidence for bias: Personality and Individual Differences Vol 36(6) Apr 2004, 1459-1470.
  • Abbott, M. L. (2006). ESL Reading Strategies: Differences in Arabic and Mandarin Speaker Test Performance: Language Learning Vol 56(4) Dec 2006, 633-670.
  • Ackerman, T. A. (1992). A didactic explanation of item bias, item impact, and item validity from a multidimensional perspective: Journal of Educational Measurement Vol 29(1) Spr 1992, 67-91.
  • Aguinis, H., & Smith, M. A. (2007). Understanding the impact of test validity and bias on selection errors and adverse impact in human resource selection: Personnel Psychology Vol 60(1) Spr 2007, 165-199.
  • Ajzen, Y., & Iagolnitzer, E. R. (1984). Awareness in the substance of the Body Focus Questionnaire: Perceptual and Motor Skills Vol 59(3) Dec 1984, 807-813.
  • Ajzen, Y., Iagolnitzer, E. R., & Bruchon-Schweitzer, M. (1985). Dimensionality of revisited body awareness: Perceptual and Motor Skills Vol 60(2) Apr 1985, 455-458.
  • Alfano, D. P., Paniak, C. E., & Finlayson, M. A. (1993). The MMPI and closed head injury: A neurocorrective approach: Neuropsychiatry, Neuropsychology, & Behavioral Neurology Vol 6(2) Apr 1993, 111-116.
  • Allan, R. G., Nassif, P. M., & Elliot, S. M. (1988). Bias issues in teacher certification testing. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Allbutt, J., Shafiullah, M., & Ling, J. (2006). The Relationship Between Self-Report Imagery Questionnaire Scores and Subtypes of Socially Desirable Responding: Visual and Movement Imagery: Journal of Mental Imagery Vol 30(1-2) Spr-Sum 2006, 1-20.
  • Allen, N. L., & Holland, P. W. (1993). A model for missing information about the group membership of examinees in DIF studies. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Allen-Taylor, S. L. (1987). A comparative analysis of alternative contingency-table methods for detecting item bias in standardized achievement tests: Dissertation Abstracts International.
  • Anderson, J. (1988). An informal reaction to these chapters. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Angoff, W. H. (1990). Philosophical issues of current interest to measurement theorists: Interdisciplinaria Revista de Psicologia y Ciencias Afines Vol 9(2) 1990, 59-69.
  • Angoff, W. H. (1993). Perspectives on differential item functioning methodology. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Aramburu-Zabala Higuera, L. (2001). Adverse impact in personnel selection: The legal framework and test bias: European Psychologist Vol 6(2) Jun 2001, 103-111.
  • Aramburu-Zabala, L. A. (2004). The anti-discrimination Directive (2000/78/EC); and its implications in personnel selection: Revista de Psicologia del Trabajo y de las Organizaciones Vol 20(2) 2004, 199-223.
  • Arce-Ferrer, A. J. (2006). An Investigation Into the Factors Influencing Extreme-Response Style: Improving Meaning of Translated and Culturally Adapted Rating Scales: Educational and Psychological Measurement Vol 66(3) Jun 2006, 374-392.
  • Archer, R. P. (2006). A Perspective on the Restructured Clinical (RC) Scale Project: Journal of Personality Assessment Vol 87(2) Oct 2006, 179-185.
  • Armstrong-Hall, J. G. (1997). An examination of gender bias on the eighth-grade MEAP science test as it relates to the hunter gatherer theory of spatial sex differences. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Aronowitz, A., Bridge, R. G., & Jones, P. (1985). Sex bias in the Self-Directed Search Investigative subscale: Journal of Vocational Behavior Vol 26(2) Apr 1985, 146-154.
  • Artelt, C., & Baumert, J. (2004). Comparability of Students' Reading Literacy Performance Measured with Items Originating from Different Language Backgrounds: Zeitschrift fur Padagogische Psychologie/ German Journal of Educational Psychology Vol 18(3-4) Nov 2004, 171-185.
  • Arvey, R. D., Miller, H. E., Gould, R., & Burch, P. (1987). Interview validity for selecting sales clerks: Personnel Psychology Vol 40(1) Spr 1987, 1-12.
  • Atkinson, L., & Cyr, J. J. (1985). Gender IQ differences among psychiatric patients: Canadian Journal of Behavioural Science/Revue canadienne des Sciences du comportement Vol 17(4) Oct 1985, 417-423.
  • Autor, D. H., Suyemoto, K. L., & Harder, D. W. (1988). Negative androgyny and self-esteem: Towards a confound-free scale: Psychological Reports Vol 63(2) Oct 1988, 643-650.
  • Baillie, A. J. (2005). Predictive gender and education bias in Kessler's Psychological Distress Scale (K10): Social Psychiatry and Psychiatric Epidemiology Vol 40(9) Sep 2005, 743-748.
  • Baron, J. (1997). Biases in the quantitative measurement of values for public decisions: Psychological Bulletin Vol 122(1) Jul 1997, 72-88.
  • Basskin, L. (2003). Statistical interpretation can also bias research evidence: BMJ: British Medical Journal Vol 327(7417) Oct 2003, 752.
  • Beal, A. L. (1988). Canadian content in the WISC--R: Bias or jingoism? : Canadian Journal of Behavioural Science/Revue canadienne des Sciences du comportement Vol 20(2) Apr 1988, 154-166.
  • Becker, M. L. (2000). Assessing depression in women: Is the BDI-II biased? Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Bell, T., Watson, M., Sharp, D., Lyons, I., & Lewis, G. (2005). Factors associated with being a false positive on the General Health Questionnaire: Social Psychiatry and Psychiatric Epidemiology Vol 40(5) May 2005, 402-407.
  • Benavente, A., Ato, M., & Lopez, J. J. (2006). Methods for detecting and assessing the interrater bias: Anales de Psicologia Vol 22(1) Jun 2006, 161-167.
  • Benbow, C. P., & Wolins, L. (1996). The utility of out-of-level testing for gifted seventh and eighth graders using the SAT-M: An examination of item bias. Baltimore, MD: Johns Hopkins University Press.
  • Bentz, S. K., Folks, J. M., Forgione, P. D., Jr., Gabrys, R. E., & Veselka, M. (1988). A series of brief perspectives on bias issues in teacher certification testing. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Bernal, E. M. (1975). A response to "Educational uses of tests with disadvantaged students." American Psychologist Vol 30(1) Jan 1975, 93-95.
  • Bernstein, I. H., Teng, G., Grannemann, B. D., & Garbin, C. P. (1987). Invariance in the MMPI's component structure: Journal of Personality Assessment Vol 51(4) Win 1987, 522-531.
  • Betz, N. E. (1993). Issues in the use of ability and interest measures with women: Journal of Career Assessment Vol 1(3) Sum 1993, 217-232.
  • Blair, R. J. (1996). Item bias analysis of the Leiter-R for English as a Second Language populations. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Bock, R. D. (1993). Different DIFs: Comment on the papers read by Neil Dorans and David Thissen. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Bond, L. (1993). Comments on the O'Neill & McPeek paper. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Bono, C., Ried, L. D., Kimberlin, C., & Vogel, B. (2007). Missing data on the Center for Epidemiologic Studies Depression Scale: A comparison of 4 imputation techniques: Research in Social & Administrative Pharmacy Vol 3(1) Mar 2007, 1-27.
  • Bornstein, R. A., Rosenberger, P., Harkness-Kling, K., & Suga, L. (1989). Content bias of the MacAndrew's Alcoholism Scale in seizure disorder patients: Journal of Clinical Psychology Vol 45(2) Mar 1989, 339-341.
  • Bowles, R. P. (2004). The Effect of Dropping Low Scores on Ability Estimates: Journal of Applied Measurement Vol 5(2) 2004, 178-188.
  • Bridgeman, B., & Schmitt, A. (1997). Fairness issues in test development and administration. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Brown, R. T., Reynolds, C. R., & Whitaker, J. S. (1999). Bias in mental testing since Bias in Mental Testing: School Psychology Quarterly Vol 14(3) Fal 1999, 208-238.
  • Bryant, D. U. (2005). The effects of differential item functioning on predictive bias. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Bryant, M. R., Juarez, J. R., & Swanson, R. G. (1991). The prevention of bias. New York, NY, England: Praeger Publishers.
  • Brzezinski, E. J. (1985). Anomalous item behavior in basic skills tests: Dissertation Abstracts International.
  • Buffin, R. C. (1986). Sompa's estimated learning potential, K-ABC's mental processing composite, and achievement in children of Afro-American descent: Dissertation Abstracts International.
  • Bugel, K. (1991). Sex differences in school achievement in the Netherlands: A survey of the literature and some new data: Pedagogische Studien Vol 68(8) Oct 1991, 350-370.
  • Bugel, K., & Glas, C. (1991). Item-specific differences in the performance of boys and girls on modern foreign language reading comprehension examinations: Tijdschrift voor Onderwijsresearch Vol 16(6) 1991, 337-351.
  • Burns, C. W. (1986). The validity of the Roberts Apperception Test for Children across ethnic groups and between sexes: Dissertation Abstracts International.
  • Burton, E., & Burton, N. W. (1993). The effect of item screening on test scores and test characteristics. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Butcher, J. N., Hamilton, C. K., Rouse, S. V., & Cumella, E. J. (2006). The Deconstruction of the Hy Scale of MMPI-2: Failure of RC3 in Measuring Somatic Symptom Expression: Journal of Personality Assessment Vol 87(2) Oct 2006, 186-192.
  • Cahan, S., & Gamliel, E. (2001). Prediction bias and selection bias: An empirical analysis: Applied Measurement in Education Vol 14(2) 2001, 109-123.
  • Cahan, S., & Gamliel, E. (2006). Definition and Measurement of Selection Bias: From Constant Ratio to Constant Difference: Journal of Educational Measurement Vol 43(2) Sum 2006, 131-144.
  • Calvo, M. G., Eysenck, M. W., & Castillo, M. D. (1997). Interpretation bias in test anxiety: The time course of predictive inferences: Cognition & Emotion Vol 11(1) Jan 1997, 43-63.
  • Camilli, G. (1993). The case against item bias detection techniques based on internal criteria: Do item bias procedures obscure test fairness issues? Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Camilli, G., & Shepard, L. A. (1985). A computer program to aid the detection of biased test items: Educational and Psychological Measurement Vol 45(3) Fal 1985, 595-600.
  • Camilli, G., & Smith, J. K. (1990). Comparison of the Mantel-Haenszel test with a randomized and a jackknife test for detecting biased items: Journal of Educational Statistics Vol 15(1) Spr 1990, 53-67.
  • Campbell, T., Dollaghan, C., Needleman, H., & Janosky, J. (1997). Reducing bias in language assessment: Processing-dependent measures: Journal of Speech & Hearing Research Vol 40(3) Jun 1997, 519-525.
  • Candell, G. L., & Drasgow, F. (1988). An iterative procedure for linking metrics and assessing item bias in item response theory: Applied Psychological Measurement Vol 12(3) Sep 1988, 253-260.
  • Candell, G. L., & Hulin, C. L. (1986). Cross-language and cross-cultural comparisons in scale translations: Independent sources of information about item nonequivalence: Journal of Cross-Cultural Psychology Vol 17(4) Dec 1986, 417-440.
  • Cantor, J. M. (1964). All-or-none style of thinking as a source of test bias: Psychological Reports 15(2) 1964, 355-358.
  • Carle, A. C. (2003). Three latent variable model assessments of measurement bias across gender on the Children's Depression Inventory. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Carswell, L., & White, W. F. (1984). Problems of reporting socioeconomic bias among reading scores and standardized reading tests: Perceptual and Motor Skills Vol 58(1) Feb 1984, 181-182.
  • Cartledge, G., Stupay, D., & Kaczala, C. (1988). Testing language in learning disabled and non-learning disabled Black children: What makes the difference? : Learning Disabilities Research Vol 3(2) Sum 1988, 101-106.
  • Castenell, L. A., & Castenell, M. E. (1988). Norm-referenced testing and low-income Blacks: Journal of Counseling & Development Vol 67(3) Nov 1988, 205-206.
  • Cauffman, E., & MacIntosh, R. (2006). A Rasch Differential Item Functioning Analysis of the Massachusetts Youth Screening Instrument: Identifying Race and Gender Differential Item Functioning Among Juvenile Offenders: Educational and Psychological Measurement Vol 66(3) Jun 2006, 502-521.
  • Chambers, C. T., Giesbrecht, K., Craig, K. D., Bennett, S. M., & Huntsman, E. (1999). A comparison of faces scales for the measurement of pediatric pain: Children's and parents' ratings: Pain Vol 83(1) Oct 1999, 25-35.
  • Chamblee, M. C. (1999). A monte carlo investigation of conditions that impact type I error rates of DFIT. (test bias, differential test functioning). Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Chang, L. (1993). The effects of instructional differences on item bias indices and a comparison of item bias detection procedures: Dissertation Abstracts International.
  • Chang, Y.-w. (1992). A comparison of unidimensional and multidimensional IRT approaches to test information in a test battery: Dissertation Abstracts International.
  • Chapman, P. L., & Mullis, A. K. (2002). Readdressing gender bias in the Coopersmith Self-Esteem Inventory-Short Form: Journal of Genetic Psychology Vol 163(4) Dec 2002, 403-409.
  • Chavez, D. J. (1987). Ethnic differences in adaptive behavior: Lifecycle profiles of mildly mentally retarded people: Dissertation Abstracts International.
  • Chernin, J., Holden, J. M., & Chandler, C. (1997). Bias in psychological assessment: Heterosexism: Measurement and Evaluation in Counseling and Development Vol 30(2) Jul 1997, 68-76.
  • Chernin, J., Holden, J. M., & Chandler, C. (1997). Rejoinder to Wohlgemuth's and Prince's responses to Bias in psychological assessment: Heterosexism: Measurement and Evaluation in Counseling and Development Vol 30(2) Jul 1997, 88-90.
  • Chilisa, B. (2000). Towards Equity in Assessment: Crafting gender-fair assessment: Assessment in Education: Principles, Policy & Practice Vol 7(1) Mar 2000, 61-81.
  • Chipman, S. F., Marshall, S. P., & Scott, P. A. (1991). Content effects on word problem performance: A possible source of test bias? : American Educational Research Journal Vol 28(4) Win 1991, 897-915.
  • Choca, J. P. (2004). Effect of Individual Variables. Washington, DC: American Psychological Association.
  • Choca, J. P., & Van Denburg, E. (1997). Effect of individual variables. Washington, DC: American Psychological Association.
  • Chung-Yan, G. A., & Cronshaw, S. F. (2002). A critical re-examination and analysis of cognitive ability tests using the Thorndike model of fairness: Journal of Occupational and Organizational Psychology Vol 75(4) Dec 2002, 489-509.
  • Clancy, E. A. (1997). Factors influencing the resubstitution accuracy in multivariate classification analysis: Implications for study design in ergonomics: Ergonomics Vol 40(4) Apr 1997, 417-427.
  • Clark, J. H. (1985). Racial test bias: An artifact of test development methodology? : Dissertation Abstracts International.
  • Clauser, B. E., Mazor, K., & Hambleton, R. K. (1991). Influence of the criterion variable on the identification of differentially functioning test items using the Mantel-Haenszel statistic: Applied Psychological Measurement Vol 15(4) Dec 1991, 353-359.
  • Coenen, M., & Vallen, T. (1991). Item bias in the CITO final primary school tests: Pedagogische Studien Vol 68(1) Jan 1991, 15-26.
  • Cohen, A. S., & Ibarra, R. A. (2005). Examining Gender-Related Differential Item Functioning Using Insights from Psychometric and Multicontext Theory. New York, NY: Cambridge University Press.
  • Cohen, A. S., & Kim, S.-h. (1993). A comparison of Lord's !x-2 and Raju's area measures in detection of DIF: Applied Psychological Measurement Vol 17(1) Mar 1993, 39-52.
  • Cole, N. S. (1980). Can We Be Neutral About Bias? : PsycCRITIQUES Vol 25 (11), Nov, 1980.
  • Cole, N. S. (1993). History and development of DIF. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Cole, N. S. (1997). Understanding gender differences and fair assessment in context. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Cole, N. S., & Moss, P. A. (1989). Bias in test use. New York, NY, England: Macmillan Publishing Co, Inc; American Council on Education.
  • Contrada, R. J., & Krantz, D. S. (1995). Measurement bias in health psychology research designs. Oxford, England: John Wiley & Sons.
  • Cook, W. A. (1996). Item validity of the MMPI-2 for a Hispanic and White clinical sample. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Cozart, G. W. (1998). A comparative analysis of WISC-III and SB-IV profiles for Black and White students referred for learning problems. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Crone, C. R. (1994). Differential distractor functioning: A log-linear analysis. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Cronshaw, S. F., Hamilton, L. K., Onyura, B. R., & Winston, A. S. (2006). Case for Non-Biased Intelligence Testing Against Black Africans Has Not Been Made: A Comment on Rushton, Skuy, and Bons (2004): International Journal of Selection and Assessment Vol 14(3) Sep 2006, 278-287.
  • Crow, L. W., & Piper, M. K. (1986). A study of field independent biased mental ability tests in community college science classes: Journal of Research in Science Teaching Vol 23(9) Dec 1986, 817-822.
  • Cunningham, C. L., Ferree, N. K., & Howard, M. A. (2003). Apparatus bias and place conditioning with ethanol in mice: Psychopharmacology Vol 170(4) Dec 2003, 409-422.
  • Cunningham, H. M. (1985). Instruments bias in assessment centers: Dissertation Abstracts International.
  • Cunningham, J. L. (1998). Learning disabilities. Washington, DC: American Psychological Association.
  • Cyr, J. J., & Atkinson, L. (1987). Test item bias in the WISC--R: Canadian Journal of Behavioural Science/Revue canadienne des Sciences du comportement Vol 19(1) Jan 1987, 101-107.
  • Dana, R. H. (2000). Culture and methodology in personality assessment. San Diego, CA: Academic Press.
  • Dash, U. N. (1983). Methods for examining item bias: A comparative review: Indian Psychologist Vol 2(1) Jun 1983, 21-28.
  • Davis, F. D., & Venkatesh, V. (1996). A critical assessment of potential measurement biases in the technology acceptance model: Three experiments: International Journal of Human-Computer Studies Vol 45(1) Jul 1996, 19-45.
  • de Jong, M., & Vallen, T. (1989). Linguistic and cultural sources of item bias for ethnic minority pupils in the CITO final primary schooltests: Pedagogische Studien Vol 66(10) Oct 1989, 390-402.
  • Delucchi, K. L. (1987). A comparison of chi-squared based procedures for the detection of biased items in educational and psychological tests: Dissertation Abstracts International.
  • Demsky, Y. I., Mittenberg, W., Quintar, B., Katell, A. D., & Golden, C. J. (1998). Bias in the use of standard American norms with Spanish translations of the Wechsler Memory Scale--Revised: Assessment Vol 5(2) Jun 1998, 115-121.
  • Derksen, M. A. (2001). Discipline, subjectivity and personality: An analysis of the manuals of four psychological tests: History of the Human Sciences Vol 14(1) Feb 2001, 25-47.
  • Diamond, E. E., & Elmore, P. B. (1986). Bias in achievement testing: Follow-up report of the AMECD Commission on Bias in Measurement: Measurement and Evaluation in Counseling and Development Vol 19(2) Jul 1986, 102-112.
  • Dibu-Ojerinde, O. O. (1991). Test item bias and implications for testing in Nigeria: Nigerian Journal of Guidance & Counselling Vol 4(1-2) Jul 1991, 108-116.
  • Donald, A. (1996). Verification bias: A pitfall in evaluating screening tests: Nursing Research Vol 45(6) Nov-Dec 1996, 350-352.
  • Donoghue, J. R., Holland, P. W., & Thayer, D. T. (1993). A Monte Carlo study of factors that affect the Mantel-Haenszel and standardization measures of differential item functioning. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Dorans, N. J. (2004). Freedle's Table 2: Fact or Fiction? : Harvard Educational Review Vol 74(1) Spr 2004, 62-72.
  • Dorans, N. J., & Holland, P. W. (1993). DIF detection and description: Mantel-Haenszel and standardization. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Douglas, J. A., Stout, W., & DiBello, L. V. (1996). A kernel-smoothed version of SIBTEST with applications to local DIF inference and function estimation: Journal of Educational and Behavioral Statistics Vol 21(4) Win 1996, 333-363.
  • Doverspike, D., & Barrett, G. V. (1984). An internal bias analysis of a job evaluation instrument: Journal of Applied Psychology Vol 69(4) Nov 1984, 648-662.
  • Downey, R. G., & Stockdale, M. S. (1987). Computer programs to compute Lord's item bias statistic for a three-parameter ICC: Educational and Psychological Measurement Vol 47(3) Fal 1987, 637-641.
  • Downs, S. L., & Silvestro, J. R. (1988). Increasing the benefits of teacher certification testing programs through the use of support systems. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Drasgow, F. (1987). Study of the measurement bias of two standardized psychological tests: Journal of Applied Psychology Vol 72(1) Feb 1987, 19-29.
  • Dreger, R. M. (1986). A Perspective on Perspectives: PsycCRITIQUES Vol 31 (1), Jan, 1986.
  • Duffy, J., Gunther, G., & Walters, L. (1997). Gender and mathematical problem solving: Sex Roles Vol 37(7-8) Oct 1997, 477-494.
  • Duhan, D. F., Keown, C. F., & Falkenberg, A. W. (1988). Effect of biasing an attitude scale: Acquiescence, reactance, or balancing? : Psychological Reports Vol 62(2) Apr 1988, 567-574.
  • Dunbar, J. L. (1999). Differential item performance by gender on the externalizing scales of the Behavior Assessment System for Children. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Dunn, J. C., & Watkinson, E. J. (1996). Problems with identification of children who are physically awkward using the TOMI: Adapted Physical Activity Quarterly Vol 13(4) Oct 1996, 347-356.
  • Edwards, O. W., & Oakland, T. D. (2006). Factorial Invariance of Woodcock-Johnson III Scores for African Americans and Caucasian Americans: Journal of Psychoeducational Assessment Vol 24(4) Dec 2006, 358-366.
  • Einspruch, E. L. (1989). An examination of selection bias in the Florida College Level Academic Skills Test: Dissertation Abstracts International.
  • Elder, C. (1997). What does test bias have to do with fairness? : Language Testing Vol 14(3) Nov 1997, 261-277.
  • Elder, C., McNamara, T., & Congdon, P. (2003). Rasch Techniques for Detecting Bias in Performance Assessments: An Example Comparing the Performance of Native and Non-native Speakers on a Test of Academic English: Journal of Applied Measurement Vol 4(2) 2003, 181-197.
  • Elliot, S. M. (1988). Preventing bias in teacher certification tests through valid job analysis procedures. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Elliott, R. (1988). Tests, abilities, race, and conflict: Intelligence Vol 12(4) Oct-Dec 1988, 333-350.
  • Elosua Oliden, P., Lopez Jauregui, A., & Egana Makazaga, J. (2000). Potential sources of bias in a numerical aptitude test: Psicothema Vol 12(3) Aug 2000, 376-382.
  • Elosua Oliden, P., Lopez Jauregui, A., & Torres Alvarez, E. (2000). Didactic developments and differential item functioning: Psicothema Vol 12(Suppl2) 2000, 198-202.
  • Emerling, F. (1990). An investigation of test bias in two nonverbal cognitive measures for two ethnic groups: Journal of Psychoeducational Assessment Vol 8(1) Mar 1990, 34-41.
  • Evans, J. A. (1997). Investigating the instructional sensitivity of distractors in items exhibiting DIF. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Fan, X., Willson, V. T., & Kapes, J. T. (1996). Ethnic group representation in test construction samples and test bias: The standardization fallacy revisited: Educational and Psychological Measurement Vol 56(3) Jun 1996, 365-381.
  • Fellers, G., McInnis, C., Cappelli, M., Cragg, S., & Vaillancourt, R. (1987). WAIS-R Information subtest item bias for Canadian high school students: Canadian Journal of Behavioural Science/Revue canadienne des Sciences du comportement Vol 19(1) Jan 1987, 108-114.
  • Fernando, K., Chard, L., Butcher, M., & McKay, C. (2003). Standardization of the Rey Complex Figure Test in New Zealand children and adolescents: New Zealand Journal of Psychology Vol 32(1) Jun 2003, 33-38.
  • Flannelly, L. T. (1996). Reducing judgment bias in test-taking situations. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Fleer, P. F. (1993). A Monte Carlo assessment of a new measure of item and test bias: Dissertation Abstracts International.
  • Floyd, R. L., Gathercoal, K., & Roid, G. (2004). No evidence for ethnic and racial bias in the tryout edition of the Merrill-Palmer Scale-Revised: Psychological Reports Vol 94(1) Feb 2004, 217-220.
  • Flynt, S. W., Warren, J. S., Morton, R. C., & Smith, F. H. (1997). Examining the question of gender bias in the Slosson Intelligence Test in relation to reading: Reading Psychology Vol 18(3) Jul-Sep 1997, 237-248.
  • Fouad, N. A. (1993). Cross-cultural vocational assessment: The Career Development Quarterly Vol 42(1) Sep 1993, 4-13.
  • Fox, J. D. (2003). From Products to Process: An Ecological Approach to Bias Detection: International Journal of Testing Vol 3(1) Mar 2003, 21-47.
  • Francis, L. J. (1997). Coopersmith's model of self-esteem: Bias toward the stable extravert? : Journal of Social Psychology Vol 137(1) Feb 1997, 139-142.
  • Freedle, R. O. (2003). Correcting the SAT's ethnic and social-class bias: A method for reestimating SAT scores: Harvard Educational Review Vol 73(1) Spr 2003, 1-43.
  • Freedle, R. O. (2004). The Truth and the Truthful Sages That Spin It: A Review of Dorans: Harvard Educational Review Vol 74(1) Spr 2004, 73-79.
  • Friedlander, F. (1964). Type I and Type II Bias: American Psychologist Vol 19(3) Mar 1964, 198-199.
  • Fuchs, D. (1987). Examiner familiarity effects on test performance: Implications for training and practice: Topics in Early Childhood Special Education Vol 7(3) Fal 1987, 90-104.
  • Furqon. (1994). The use of flexible logistic regression in assessing differential item functioning. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Gamiliel, E., & Cahan, S. (2007). Mind the gap: Between-group differences and fair test use: International Journal of Selection and Assessment Vol 15(3) Sep 2007, 273-282.
  • Garriga-Trillo, A. (1997). Are there other famous artefacts? : Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 695-701.
  • Gass, C. S. (1992). MMPI-2 interpretation of patients with cerebrovascular disease: A correction factor: Archives of Clinical Neuropsychology Vol 7(1) 1992, 17-27.
  • Geisinger, K. F. (2005). The Testing Industry, Ethnic Minorities, and Individuals With Disabilities. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Geisinger, K. F., & Carlson, J. F. (1998). Training psychologists to assess members of a diverse society. Washington, DC: American Psychological Association.
  • Gierl, M. J., Bisanz, J., Bisanz, G. L., Boughton, K. A., & Khaliq, S. N. (2001). Illustrating the utility of differential bundle functioning analyses to identify and interpret group differences on achievement tests: Educational Measurement: Issues and Practice Vol 20(2) Sum 2001, 26-36.
  • Giffin, M. E. (1984). Item bias detection methods for small samples: Dissertation Abstracts International.
  • Gignac, G. E. (2006). Evaluating subtest 'g' saturation levels via the single trait-correlated uniqueness (STCU) SEM approach: Evidence in favor of crystallized subtests as the best indicators of 'g': Intelligence Vol 34(1) Jan-Feb 2006, 29-46.
  • Gipps, C. V., & Murphy, P. (1994). A fair test? Assessment, achievement and equity. Buckingham, England: Open University Press.
  • Glutting, J. J. (1986). Potthoff bias analyses of K-ABC MPC and Nonverbal Scale IQs among Anglo, Black, and Puerto Rican kindergarten children: Professional School Psychology Vol 1(4) Fal 1986, 225-234.
  • Glutting, J. J., Oh, H.-J., Ward, T., & Ward, S. (2000). Possible criterion-related bias of the WISC-III with a referral sample: Journal of Psychoeducational Assessment Vol 18(1) Mar 2000, 17-26.
  • Goldstein, B. L., & Patterson, P. O. (1988). Turning back the Title VII clock: The resegregation of the American work force through validity generalization: Journal of Vocational Behavior Vol 33(3) Dec 1988, 452-462.
  • Gomez Benito, J., & Navas Ara, M. J. (1998). Gender-related impact and differential item functioning in a test of numerical ability: Psicothema Vol 10(3) Nov 1998, 685-696.
  • Good, R. H., & Salvia, J. (1988). Curriculum bias in published, norm-referenced reading tests: Demonstrable effects: School Psychology Review Vol 17(1) 1988, 51-60.
  • Gordon, E. W., & Terrell, M. D. (1982). The changed social context of testing: Annual Progress in Child Psychiatry & Child Development 1982, 291-299.
  • Gordon, R. P., Stump, K., & Glaser, B. A. (1996). Assessment of individuals with hearing impairments: Equity in testing procedures and accommodations: Measurement and Evaluation in Counseling and Development Vol 29(2) Jul 1996, 111-118.
  • Grayson, D. A., Mackinnon, A., Jorm, A. F., Creasey, H., & Broe, G. A. (2000). Item bias in the Center for Epidemiologic Studies Depression Scale: Effects of physical disorders and disability in an elderly community sample: Journals of Gerontology: Series B: Psychological Sciences and Social Sciences Vol 55B(5) Sep 2000, P273-P282.
  • Greer, T. G. (2004). Detection of Differential Item Functioning (DIF) on the SATV: A comparison of four methods: Mantel-Haenszel, logistic regression, simultaneous item bias, and likelihood ratio test. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Gregory, K. L. (1992). A reconsideration of bias in employment testing from the perspective of factorial invariance: Dissertation Abstracts International.
  • Gregory, S., & Lee, S. (1986). Psychoeducational assessment of racial and ethnic minority groups: Professional implications: Journal of Counseling & Development Vol 64(10) Jun 1986, 635-637.
  • Griffin, E. F. (1988). Revising the federal courts' analysis of Title VII employment test discrimination cases to include methods for detecting item bias: Dissertation Abstracts International.
  • Gutierrez-Clellen, V. F. (1996). Language diversity: Implications for assessment. Baltimore, MD: Paul H Brookes Publishing.
  • Guyatt, G. H., Cook, D. J., King, D., Norman, G. R., Kane, S.-L. C., & van Ineveld, C. (1999). Effect of the framing of questionnaire items regarding satisfaction with training on residents' responses: Academic Medicine Vol 74(2) Feb 1999, 192-194.
  • Haertel, E. H. (1990). Showing That Teacher Tests Are Free From Bias: PsycCRITIQUES Vol 35 (3), Mar, 1990.
  • Hambleton, R. K., Clauser, B. E., Mazor, K. M., & Jones, R. W. (1993). Advances in the detection of differentially functioning test items: European Journal of Psychological Assessment Vol 9(1) 1993, 1-18.
  • Hambleton, R. K., & Rogers, H. J. (1991). Evaluation of the plot method for identifying potentially biased test items. New York, NY: Kluwer Academic/Plenum Publishers.
  • Hamilton, L. S. (1999). Detecting gender-based differential item functioning on a constructed-response science test: Applied Measurement in Education Vol 12(3) 1999, 211-235.
  • Hamlin, R. H. (1986). A correlative study of Concept Mastery Test scores of prospective Anglo, Black, and Hispanic graduate students: Dissertation Abstracts International.
  • Hankins, J. A. (1990). The effects of variable entry for a Bayesian adaptive test: Educational and Psychological Measurement Vol 50(4) Win 1990, 785-802.
  • Hanson, D. J. (1984). Liberal-conservative bias in the Dogmatism Scale: Psychology: A Journal of Human Behavior Vol 21(1) 1984, 7-8.
  • Harrington, G. M. (1988). Two forms of minority-group test bias as psychometric artifacts with an animal model (Rattus norvegicus): Journal of Comparative Psychology Vol 102(4) Dec 1988, 400-407.
  • Harris, A. L., & Robinson, K. (2007). Schooling behaviors or prior skills? A cautionary tale of omitted variable bias within oppositional culture theory: Sociology of Education Vol 80(2) Apr 2007, 139-157.
  • Harris, D. J., & Kolen, M. J. (1989). Examining the stability of Angoff's delta item bias statistic using the bootstrap: Educational and Psychological Measurement Vol 49(1) Spr 1989, 81-87.
  • Harris, J. G., Tulsky, D. S., & Schultheis, M. T. (2003). Assessment of the non-native English speaker: Assimilating history and research findings to guide clinical practice. San Diego, CA: Academic Press.
  • Harty, H., Adkins, D. M., & Sherwood, R. D. (1984). Predictability of giftedness identification indices for two recognized approaches to elementary school gifted education: Journal of Educational Research Vol 77(6) Jul-Aug 1984, 337-342.
  • Harville, D. L. (1996). Ability test equity in predicting job performance work samples: Educational and Psychological Measurement Vol 56(2) Apr 1996, 344-348.
  • Haskell, R. E. (1998). Setting the record straight: American Psychologist Vol 53(11) Nov 1998, 1229-1230.
  • Haslam, N. (2006). Bias in psychopathology research: Current Opinion in Psychiatry Vol 19(6) Nov 2006, 625-630.
  • Haupt, H., & Oberhofer, W. (2006). Best affine unbiased representations of the fully restricted general Gauss-Markov model: Journal of Multivariate Analysis Vol 97(3) Mar 2006, 759-764.
  • Hautus, M. J., & Lee, A. (2006). Estimating sensitivity and bias in a yes/no task: British Journal of Mathematical and Statistical Psychology Vol 59(2) Nov 2006, 257-273.
  • Hay, D. A. (1997). Genetic research on intelligence is more than just psychometrics: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 702-710.
  • Hays, W. L. (1984). Review of Test Item Bias: PsycCRITIQUES Vol 29 (4), Apr, 1984.
  • Henry, M. (1988). ASAT and the TE score: A critique of "objective testing." Australian & New Zealand Journal of Sociology Vol 24(2) Jul 1988, 289-311.
  • Henry, P., Bryson, S., & Henry, C. A. (1990). Black student attitudes toward standardized tests: Does gender make a difference? : College Student Journal Vol 23(4) Win 1990, 346-354.
  • Hessels, M. G. P. (1996). Ethnic differences in learning potential test scores: Research into item and test bias in the Learning potential test for Ethnic Minorities: Journal of Cognitive Education Vol 5(2) 1996, 133-153.
  • Higgins, O. H. I. (1998). Item position effects and differential item functioning for African-American and White examinees completing the Arithmetic Reasoning subtest of the preliminary item tryout version of Form E of the General Aptitude Test Battery. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Hill, G., MacNeill, I., Aylesworth, R., McDowell, I., Forbes, W., & Kozak, J. (2001). Effects of screening errors and differential mortality on the estimation of the incidence of dementia in the Canadian Study of Health and Aging: International Psychogeriatrics Vol 13(Suppl1) 2001, 143-146.
  • Hills, J. R. (1989). Screening for potentially biased items in testing programs: Educational Measurement: Issues and Practice Vol 8(4) Win 1989, 5-11.
  • Hintze, J. M., Callahan, J. E., III, Matthews, W. J., Williams, S. A. S., & Tobin, K. G. (2002). Oral reading fluency and prediction of reading comprehension in African American and Caucasian elementary school children: School Psychology Review Vol 31(4) 2002, 540-553.
  • Hirsch, J. (1997). The triumph of wishfull thinking over genetic irrelevance: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 711-720.
  • Hishinuma, E. S. (1995). WISC-III accommodations: The need for practitioner guidelines: Journal of Learning Disabilities Vol 28(3) Mar 1995, 130-135.
  • Holland, J. L. (1985). Author biases, errors, and omissions in an evaluation of the SDS Investigative Summary Scale: A response to Aronowitz, Bridge, and Jones (1985): Journal of Vocational Behavior Vol 27(3) Dec 1985, 374-376.
  • Holland, P. W., & Wainer, H. (1993). Differential item functioning. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Hong, S. (1998). An investigation of the influence of internal test bias on test predictive validity. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Hong, S., & Roznowski, M. (2001). An investigation of the influence of internal test bias on regression slope: Applied Measurement in Education Vol 14(4) Oct 2001, 351-368.
  • Hopkins, W. D., & Fernandez-Carriba, S. (2000). The effect of situational factors on hand preferences for feeding in 177 captive chimpanzees (Pan troglodytes): Neuropsychologia Vol 38(4) 2000, 403-409.
  • Horn, J. (1997). On the mathematical relationship between factor or component coefficients and differences between means: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 721-728.
  • House, J. D. (1989). Age bias in prediction of graduate grade point average from Graduate Record Examination scores: Educational and Psychological Measurement Vol 49(3) Fal 1989, 663-666.
  • Hsu, L. M. (2002). Fail-safe Ns for one- versus two-tailed tests lead to different conclusions about publication bias: Understanding Statistics Vol 1(2) Apr 2002, 85-100.
  • Hu, C.-c. (1990). The effect of multidimensionality of data on item bias detection: Dissertation Abstracts International.
  • Huang, K. H., Watters, J. K., & Case, P. (1988). Psychological assessment and AIDS research with intravenous drug users: Challenges in measurement: Journal of Psychoactive Drugs Vol 20(2) Apr-Jun 1988, 191-195.
  • Hui-Xia, M., & Yao-Xian, G. (2003). Development of multiple achievement tests: Chinese Journal of Clinical Psychology Vol 11(2) May 2003, 81-85.
  • Hulin, C. L., & Mayer, L. J. (1986). Psychometric equivalence of a translation of the Job Descriptive Index into Hebrew: Journal of Applied Psychology Vol 71(1) Feb 1986, 83-94.
  • Hultquist, A. M., & Metzke, L. K. (1993). Potential effects of curriculum bias in individual norm-referenced reading and spelling achievement tests: Journal of Psychoeducational Assessment Vol 11(4) Dec 1993, 337-344.
  • Humphreys, L. B. (1997). Professor Schonemann is both right and wrong: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 729-732.
  • Humphreys, L. G. (1975). "Educational uses of tests with disadvantaged students": Addendum: American Psychologist Vol 30(1) Jan 1975, 95-96.
  • Humphreys, L. G. (1986). An analysis and evaluation of test and item bias in the prediction context: Journal of Applied Psychology Vol 71(2) May 1986, 327-333.
  • Hunt, D. M., Magruder, S., & Bolon, D. S. (1995). Questionnaire format bias: When are juxtaposed scales appropriate: A call for further research: Psychological Reports Vol 77(3, Pt 1) Dec 1995, 931-941.
  • Hunter, J. E., & Schmidt, F. L. (2000). Racial and gender bias in ability and achievement tests: Resolving the apparent paradox: Psychology, Public Policy, and Law Vol 6(1) Mar 2000, 151-158.
  • Ikeda, E. (1995). Raters' use of differential item functioning and the Mantel-Haenszel procedure in the item analysis of written test responses: Japanese Journal of Educational Psychology Vol 43(3) Sep 1995, 343-350.
  • Ironson, G., Homan, S., Willis, R., & Signer, B. (1984). The validity of item bias techniques with math word problems: Applied Psychological Measurement Vol 8(4) Fal 1984, 391-396.
  • Irwin, H. J. (2001). Age and sex differences in paranormal beliefs after controlling for differential item functioning: European Journal of Parapsychology Vol 16 2001, 102-106.
  • Jackson, G. D. (1975). On the report of the Ad Hoc Committee on Educational Uses of Tests with Disadvantaged Students: Another psychological view from the Association of Black Psychologists: American Psychologist Vol 30(1) Jan 1975, 88-93.
  • Jane, J. S. (2001). Gender bias in diagnostic criteria for personality disorders: An item response theory analysis. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Jensen, A. R. (1985). The nature of the Black-White difference on various psychometric tests: Spearman's hypothesis: Behavioral and Brain Sciences Vol 8(2) Jun 1985, 193-263.
  • Jiang, H., & Stout, W. (1998). Improved type I error control and reduced estimation bias for DIF detection using SIBTEST: Journal of Educational and Behavioral Statistics Vol 23(4) Win 1998, 291-322.
  • Johnson, S. T. (1988). Validity and bias in teacher certification testing. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Johnson, S. T. (1989). Test fairness and bias: Measuring academic achievement among Black youth. New Brunswick, NJ: Transaction Publishers.
  • Jones, R. N. (2003). Racial bias in the assessment of cognitive functioning of older adults: Aging & Mental Health Vol 7(2) Mar 2003, 83-102.
  • Jorm, A. F., Scott, R., Henderson, A. S., & Kay, D. W. (1988). Educational level differences on the Mini-Mental State: The role of test bias: Psychological Medicine Vol 18(3) Aug 1988, 727-731.
  • Jurgensen, C. E. (1955). Item weights in employee rating scales: Journal of Applied Psychology Vol 39(5) Oct 1955, 305-307.
  • Justiz, M. J., & Kameen, M. C. (1988). Demographic trends in our society and the implications for teacher certification testing policies. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Kadlec, H. (1997). The correlation between the mean difference vector and first principal component holds even with discrete variables containing no g: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 733-739.
  • Kaplan, E. P. (1993). Sex differences in WAIS--R subtests for a psychiatric inpatient population.-700 A: Dissertation Abstracts International.
  • Karkee, T. B. (2000). An investigation of IRT-adjusted grade point average in the prediction of college performance: Applications to prediction bias. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Kaufman, J. C. (2005). Nonbiased Assessment: A Supplemental Approach. Hoboken, NJ: John Wiley & Sons Inc.
  • Kearney, P., & Plax, T. G. (1997). Item desirability and the BAT checklist: A reply to Waltman and Burleson: Communication Education Vol 46(2) Apr 1997, 95-103.
  • Kehoe, J. F., & Tenopyr, M. L. (1994). Adjustment in assessment scores and their usage: A taxonomy and evaluation methods: Psychological Assessment Vol 6(4) Dec 1994, 291-303.
  • Kelderman, H. (1989). Item bias detection using loglinear IRT: Psychometrika Vol 54(4) Dec 1989, 681-697.
  • Kim, J. K., & Nicewander, W. A. (1993). Ability estimation for conventional tests: Psychometrika Vol 58(4) Dec 1993, 587-599.
  • Kim, M. S. (1992). Applications of multidimensional scaling to the study of test bias in internal construct validity: A complimentary approach to factor analysis: Dissertation Abstracts International.
  • Kim, S. (1993). An empirical investigation of the robustness of the Mantel-Haenszel procedure and sources of differential item functioning: Dissertation Abstracts International.
  • Kirby, W. N. (1988). Minority participation in Texas education. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Kline, R. B., Lachar, D., & Sprague, D. J. (1985). The Personality Inventory for Children (PIC): An unbiased predictor of cognitive and academic status: Journal of Pediatric Psychology Vol 10(4) Dec 1985, 461-477.
  • Knuckle, E. P., & Asbury, C. A. (1986). WISC-R discrepancy score directions and gender as reflected in neuropsychological test performance of Black adolescents: Journal of Research & Development in Education Vol 20(1) Fal 1986, 44-51.
  • Kok, F. (1988). Item bias and test multidimensionality. New York, NY: Plenum Press.
  • Kok, F. G., Mellenbergh, G. J., & Van der Flier, H. (1985). Detecting experimentally induced item bias using the iterative logit method: Journal of Educational Measurement Vol 22(4) Win 1985, 295-303.
  • Kondo-Brown, K. (2002). A FACETS analysis of rater bias in measuring Japanese second language writing performance: Language Testing Vol 19(1) Jan 2002, 3-31.
  • Korkmaz, M. (2006). The New Approaches in Scale Development: Methods of Differential Item Functioning (Item Bias) Based on Item Response Theory: Turk Psikoloji Yazilari Vol 9(18) Dec 2006, 63-80.
  • Krieg, E. F., Jr. (1999). Biases induced by coarse measurement scales: Educational and Psychological Measurement Vol 59(5) Oct 1999, 749-766.
  • Kristjansson, E., Aylesworth, R., McDowell, I., & Zumbo, B. D. (2005). A Comparison of Four Methods for Detecting Differential Item Functioning in Ordered Response Items: Educational and Psychological Measurement Vol 65(6) Dec 2005, 935-953.
  • Kupke, T., Revis, E. S., & Gantner, A. B. (1993). Hemispheric bias of the Mini-Mental State Examination in elderly males: Clinical Neuropsychologist Vol 7(2) Apr 1993, 210-214.
  • Kwate, N. O. A. (2001). Intelligence or misorientation? Eurocentrism in the WISC-III: Journal of Black Psychology Vol 27(2) May 2001, 221-238.
  • Lambert, N. M. (1986). Evidence on age and ethnic status bias in factor scores and the comparison score for the AAMD Adaptive Behavior Scale-School Edition: Journal of School Psychology Vol 24(2) Sum 1986, 143-153.
  • Lance, C. E., Newbolt, W. H., Gatewood, R. D., Foster, M. R., French, N. R., & Smith, D. E. (2000). Assessment center exercise factors represent cross-situational specificity, not method bias: Human Performance Vol 13(4) 2000, 323-353.
  • Lange, R., & Thalbourne, M. A. (2002). Rasch scaling paranormal belief and experience: Structure and semantics of Thalbourne's Australian Sheep-Goat Scale: Psychological Reports Vol 91(3,Pt2) Dec 2002, 1065-1073.
  • Larsen, J. D., Mascharka, C., & Toronski, C. (1987). Does the wording of the question change the number of headaches people report on a health questionnaire? : Psychological Record Vol 37(3) Sum 1987, 423-427.
  • Lautenschlager, G. J., Flaherty, V. L., & Park, D.-G. (1994). IRT differential item functioning: An examination of ability scale purifications: Educational and Psychological Measurement Vol 54(1) Spr 1994, 21-31.
  • Lautenschlager, G. J., & Mendoza, J. L. (1986). A step-down hierarchical multiple regression analysis for examining hypotheses about test bias in prediction: Applied Psychological Measurement Vol 10(2) Jun 1986, 133-139.
  • Lautenschlager, G. J., & Park, D.-g. (1988). IRT item bias detection procedures: Issues of model misspecification, robustness, and parameter linking: Applied Psychological Measurement Vol 12(4) Dec 1988, 365-376.
  • Lawlor, S., Richman, S., & Richman, C. L. (1997). The validity of using the SAT as a criterion for black and white students' admission to college: College Student Journal Vol 31(4) Dec 1997, 507-515.
  • Lawshe, C. H. (1987). Adverse impact: Is it a viable concept? : Professional Psychology: Research and Practice Vol 18(5) Oct 1987, 492-497.
  • Lawson, J. S., & Inglis, J. (1985). Learning disabilities and intelligence test results: A model based on a principal components analysis of the WISC--R: British Journal of Psychology Vol 76(1) Feb 1985, 35-48.
  • Leal, J. T. (1989). Sex differences in field dependence/independence: The development of a gender fair embedded figures test: Dissertation Abstracts International.
  • Lewis, C. (1993). A note on the value of including the studied item in the test score when analyzing test items for DIF. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Lewis, G. (1994). Assessing psychiatric disorder with a human interviewer or a computer: Journal of Epidemiology & Community Health Vol 48(2) Apr 1994, 207-210.
  • Lievens, F., Coetsier, P., Janssen, P. J., & Decaesteker, C. (2001). Predictive validity and gender bias of the admission exam "Medical and Dental Studies" in Flanders: A first evaluation: Pedagogische Studien Vol 78(1) 2001, 4-15.
  • Lievens, F., Reeve, C. L., & Heggestad, E. D. (2007). An examination of psychometric bias due to retesting on cognitive ability tests in selection settings: Journal of Applied Psychology Vol 92(6) Nov 2007, 1672-1682.
  • Lim, R. G., & Drasgow, F. (1990). Evaluation of two methods for estimating item response theory parameters when assessing differential item functioning: Journal of Applied Psychology Vol 75(2) Apr 1990, 164-174.
  • Linacre, J. M., & Wright, B. D. (2002). Construction of measures from many-facet data: Journal of Applied Measurement Vol 3(4) 2002, 486-512.
  • Lindsay, K. A., & Widiger, T. A. (1995). Sex and gender bias in self-report personality disorder inventories: Item analyses of the MCMI-II, MMPI, and PDQ--R: Journal of Personality Assessment Vol 65(1) Aug 1995, 1-20.
  • Linn, M. C., & Kessel, C. (2006). Assessment and Gender. New York, NY: Oxford University Press.
  • Linn, R. L. (1984). Selection bias: Multiple meanings: Journal of Educational Measurement Vol 21(1) Spr 1984, 33-47.
  • Linn, R. L. (1993). The use of differential item functioning statistics: A discussion of current practice and future implications. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Linn, R. L. (1994). Fair test use: Research and policy. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Linn, R. L. (2002). Constructs and values in standards-based assessment. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Linn, R. L., Werts, C. E., Ironson, G. H., & Subkoviak, M. J. (1996). Bias. Lanham, MD, England: University Press of America.
  • Long, K. A., & Hamlin, C. M. (1988). Use of the Piersarris Self-Concept Scale with Indian children: Cultural considerations: Nursing Research Vol 37(1) Jan-Feb 1988, 42-46.
  • Long, K. A., & Hamlin, C. M. (1988). Use of the Piers-Harris Self-Concept Scale with Indian children: Cultural considerations: Nursing Research Vol 37(1) Jan-Feb 1988, 42-46.
  • Longford, N. T., Holland, P. W., & Thayer, D. T. (1993). Stability of the MH D-DIF statistics across populations. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Makitalo, A. (1996). Gender differences in performance on the DTM subtest in the Swedish Scholastic Aptitude Test as a function of item position and cognitive demands: Scandinavian Journal of Educational Research Vol 40(3) Sep 1996, 189-201.
  • Malgady, R. G. (1996). The question of cultural bias in assessment and diagnosis of ethnic minority clients: Let's reject the null hypothesis: Professional Psychology: Research and Practice Vol 27(1) Feb 1996, 73-77.
  • Malgady, R. G. (2000). Myths about the null hypothesis and the path to reform. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Maller, S. J. (1994). Validity and item bias of the WISC-III with deaf children. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Maller, S. J. (2000). Item invariance in four subtests of the Universal Nonverbal Intelligence Test (UNIT) across groups of deaf and hearing children: Journal of Psychoeducational Assessment Vol 18(3) Sep 2000, 240-254.
  • Maller, S. J. (2003). Best practices in detecting bias in nonverbal tests. New York, NY: Kluwer Academic/Plenum Publishers.
  • Malpass, R. S., & Poortinga, Y. H. (1986). Strategies for design and analysis. Thousand Oaks, CA: Sage Publications, Inc.
  • Manbeck, M. D. (2005). Initial examination of differential item functioning between genders on the Test of Reactions and Adaptations in College-Revised. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Maraun, M. D. (1997). Exactly what is an artefact? : Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 740-749.
  • Margrain, S. (1985). Bias in the BAS: Bulletin of the British Psychological Society Vol 38 Jun 1985, 176-179.
  • Marks, D. (1990). Cautions in interpreting district-wide standardized mathematics achievement test results: Journal of Educational Research Vol 83(6) Jul-Aug 1990, 349-354.
  • Marquardt, T. P., & Gillam, R. B. (1999). Assessment in communication disorders: Some observations on current issues: Language Testing Vol 16(3) Jul 1999, 249-269.
  • Marsh, H. W., & Roche, L. A. (1997). Making students' evaluations of teaching effectiveness effective: The critical issues of validity, bias, and utility: American Psychologist Vol 52(11) Nov 1997, 1187-1197.
  • Marshall, S. C., Mungas, D., Weldon, M., Reed, B., & Haan, M. (1997). Differential item functioning in the Mini-Mental State Examination in English- and Spanish-speaking older adults: Psychology and Aging Vol 12(4) Dec 1997, 718-725.
  • Martens, B. K., Steele, E. S., Massie, D. R., & Diskin, M. T. (1995). Curriculum bias in standardized tests of reading decoding: Journal of School Psychology Vol 33(4) Win 1995, 287-296.
  • Masling, J. M. (1992). The influence of situational and interpersonal variables in projective testing: Journal of Personality Assessment Vol 59(3) Dec 1992, 616-640.
  • Masse, L. C., & Ross, M. W. (2001). Assessing differential item validity of the AIDS-Related Social Skills Questionnaire among African adolescents: Social Science Research Vol 30(1) Mar 2001, 50-61.
  • Masters, G. N. (1988). Item discrimination: When more is worse: Journal of Educational Measurement Vol 25(1) Spr 1988, 15-29.
  • Matthews, G., & Harley, T. A. (1996). Connectionist models of emotional distress and attentional bias: Cognition & Emotion Vol 10(6) Nov 1996, 561-600.
  • May, K., & Nicewander, W. A. (1998). Measuring change conventionally and adaptively: Educational and Psychological Measurement Vol 58(6) Dec 1998, 882-897.
  • May, K. O. R. (1993). Measuring change conventionally and adaptively: Dissertation Abstracts International.
  • May, R. S. (1986). Overconfidence as a result of incomplete and wrong knowledge. New York, NY: Peter Lang Publishing.
  • Mayer, D. M., & Hanges, P. J. (2003). Understanding the stereotype threat effect with "culture-free" tests: An examination of its mediators and measurement: Human Performance Vol 16(3) 2003, 207-230.
  • Mayfield, J. W., & Reynolds, C. R. (1998). Are ethnic differences in diagnosis of childhood psychopathology an artifact of psychometric methods? An experimental evaluation of Harrington's hypothesis using parent-reported symptomatology: Journal of School Psychology Vol 36(3) Fal 1998, 313-334.
  • McAuliffe, W. E., LaBrie, R., Woodworth, R., & Zhang, C. (2002). Estimates of potential bias in telephone substance abuse surveys due to exclusion of households without telephones: Journal of Drug Issues Vol 32(4) Fal 2002, 1139-1154.
  • McCann, J. T. (1990). Bias and Millon Clinical Multiaxial Inventory (MCMI-II) diagnosis: Journal of Psychopathology and Behavioral Assessment Vol 12(1) Mar 1990, 17-26.
  • McCauley, C. D., & Mendoza, J. (1985). A simulation study of item bias using a two-parameter item response model: Applied Psychological Measurement Vol 9(4) Dec 1985, 389-400.
  • McCornack, R. L., & McLeod, M. M. (1988). Gender bias in the prediction of college course performance: Journal of Educational Measurement Vol 25(4) Win 1988, 321-331.
  • McKay, M. (1996). The Neale Analysis of Reading Ability Revised--systematically biased? : British Journal of Educational Psychology Vol 66(2) Jun 1996, 259-266.
  • McLarty, J. R., Noble, A. C., & Huntley, R. M. (1989). Effects of item wording on sex bias: Journal of Educational Measurement Vol 26(3) Fal 1989, 285-293.
  • McLaughlin, M. E., & Drasgow, F. (1987). Lord's chi-square test of item bias with estimated and with known person parameters: Applied Psychological Measurement Vol 11(2) Jun 1987, 161-173.
  • Mellenbergh, G. J. (1985). Overview on item bias: Definition, detection and examination: Nederlands Tijdschrift voor de Psychologie en haar Grensgebieden Vol 40(7) Oct 1985, 425-435.
  • Mellenbergh, G. J., & Kok, F. G. (1991). Finding the biasing trait(s). New York, NY: Kluwer Academic/Plenum Publishers.
  • Meredith, W. (1993). Measurement invariance, factor analysis and factorial invariance: Psychometrika Vol 58(4) Dec 1993, 525-543.
  • Meredith, W., & Millsap, R. E. (1992). On the misuse of manifest variables in the detection of measurement bias: Psychometrika Vol 57(2) Jun 1992, 289-311.
  • Meredith, W., & Teresi, J. A. (2006). An Essay on Measurement and Factorial Invariance: Medical Care Vol 44(11, Suppl 3) Nov 2006, S69-S77.
  • Meunier, S. A. (2005). Fear- and disgust-based expectancy and covariation biases in blood-injection-injury Phobia. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Meyer, G. J. (2002). Exploring possible ethnic differences and bias in the Rorschach Comprehensive System: Journal of Personality Assessment Vol 78(1) Feb 2002, 104-129.
  • Miller, M. D., & Oshima, T. C. (1992). Effect of sample size, number of biased items, and magnitude of bias on a two-stage item bias estimation method: Applied Psychological Measurement Vol 16(4) Dec 1992, 381-388.
  • Millsap, R. E. (1995). Measurement invariance, predictive invariance, and the duality paradox: Multivariate Behavioral Research Vol 30(4) 1995, 577-605.
  • Millsap, R. E. (1997). Invariance in measurement and prediction: Their relationship in the single-factor case: Psychological Methods Vol 2(3) Sep 1997, 248-260.
  • Millsap, R. E. (1997). The investigation of Spearman's hypothesis and the failure to understand factor analysis: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 750-757.
  • Millsap, R. E. (2006). Comments on Methods for the Investigation of Measurement Bias in the Mini-Mental State Examination: Medical Care Vol 44(11, Suppl 3) Nov 2006, S171-S175.
  • Millsap, R. E., & Everson, H. T. (1993). Methodology review: Statistical approaches for assessing measurement bias: Applied Psychological Measurement Vol 17(4) Dec 1993, 297-334.
  • Millsap, R. E., & Meredith, W. (1992). Inferential conditions in the statistical detection of measurement bias: Applied Psychological Measurement Vol 16(4) Dec 1992, 389-402.
  • Moinpour, C. M., Lyons, B., Schmidt, S. P., Chansky, K., & Patchell, R. A. (2000). Substituting proxy ratings for patient ratings in cancer clinical trials: An analysis based on a Southwest Oncology Group trial in patients with brain metastases: Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care & Rehabilitation Vol 9(2) Mar 2000, 219-231.
  • Monaco, L. G. (1985). A comparison of three methods of detecting test item bias: Dissertation Abstracts International.
  • Morales Ortiz, M., Jurado Muriel, T., & Lopez Dominguez, M. L. (2000). Multivariate exploratory data analysis: Bias assessment: Psicothema Vol 12(Suppl2) 2000, 393-395.
  • Mouches, A. (2003). Analysis of errors: Interest and limits of a clinical questionnaire: Journal de Therapie Comportementale et Cognitive Vol 13(4) Dec 2003, 175-181.
  • Muijtjens, A. M. M., Schuwirth, L. W. T., Cohen-Schotanus, J., & van der Vleuten, C. P. M. (2007). Origin bias of test items compromises the validity and fairness of curriculum comparisons: Medical Education Vol 41(12) Dec 2007, 1217-1223.
  • Murdick, N. L., Gartin, B. C., & Arnold, M. B. (1994). A method for the reduction of bias in educational assessment: Journal of Instructional Psychology Vol 21(1) Mar 1994, 83-89.
  • Murphy, P. (1991). Assessment and gender: Cambridge Journal of Education Vol 21(2) 1991, 203-214.
  • Muthen, B. O. (1989). Using item-specific instructional information in achievement modeling: Psychometrika Vol 54(3) Sep 1989, 385-396.
  • Nagoshi, C. T. (1997). g -loadings and the nature and salience of intelligence in the Hawaii family study of cognition: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 758-761.
  • Nandakumar, R. (1993). Simultaneous DIF amplification and cancellation: Shealy-Stout's test for DIF: Journal of Educational Measurement Vol 30(4) Win 1993, 293-311.
  • Nandakumar, R., Glutting, J. J., & Oakland, T. (1993). Mantel-Haenszel methodology for detecting item bias: An introduction and example using the guide to the assessment of test session behavior: Journal of Psychoeducational Assessment Vol 11(2) Jun 1993, 108-119.
  • Narayanan, P., & Swaminathan, H. (1996). Identification of items that show nonuniform DIF: Applied Psychological Measurement Vol 20(3) Sep 1996, 257-274.
  • Navas-Ara, M. J., & Gomez-Benito, J. (2002). Effects of ability scale purification on the identification of dif: European Journal of Psychological Assessment Vol 18(1) 2002, 9-15.
  • Nichols, D. S. (2006). Correction to: "The Trials of Separating Bath Water From Baby: A Review and Critique of the MMPI-2 Restructured Clinical Scales." Journal of Personality Assessment Vol 87(3) 2006, 358.
  • Nichols, D. S. (2006). The Trials of Separating Bath Water From Baby: A Review and Critique of the MMPI-2 Restructured Clinical Scales: Journal of Personality Assessment Vol 87(2) Oct 2006, 121-138.
  • No authorship, i. (1985). Review of Assessing Sex Bias in Testing: PsycCRITIQUES Vol 30 (9), Sep, 1985.
  • No authorship, i. (1987). Review of Using Standardized Tests in Education (4th ed.): PsycCRITIQUES Vol 32 (11), Nov, 1987.
  • No authorship, i. (1988). Bias issues in test development. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Nolan, R. F., Watlington, D. K., & Willson, V. L. (1989). Gifted and nongifted race and gender effects on item functioning on the Kaufman Assessment Battery for Children: Journal of Clinical Psychology Vol 45(4) Jul 1989, 645-650.
  • Norborg, J. M. (1984). A warning regarding the simplified approach to the evaluation of test fairness in employee selection procedures: Personnel Psychology Vol 37(3) Fal 1984, 483-486.
  • Nyborg, H., & Jensen, A. R. (2000). Black-white differences on various psychometric tests: Spearman's hypothesis tested on American armed services veterans: Personality and Individual Differences Vol 28(3) Mar 2000, 593-599.
  • Oakland, T., & Hatzichristou, C. (2003). Issues to consider when adapting tests: Psychology: The Journal of the Hellenic Psychological Society Vol 10(4) Dec 2003, 437-448.
  • O'Bryant, S. E., Hilsabeck, R. C., McCaffrey, R. J., & Gouvier, W. D. (2003). The Recognition Memory Test: Examination of ethnic differences and norm validity: Archives of Clinical Neuropsychology Vol 18(2) Mar 2003, 135-143.
  • Ogasawara, H. (2005). Bias reduction of estimated standard errors in factor analysis: Behaviormetrika Vol 32(1) Jan 2005, 9-28.
  • Olshausen, B. A., & Field, D. J. (2005). How Close Are We to Understanding V1? : Neural Computation Vol 17(8) Aug 2005, 1665-1699.
  • O'Neill, K. A., & McPeek, W. M. (1993). Item and test characteristics that are associated with differential item functioning. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Oort, F. J. (1992). Using restricted factor analysis to detect item bias: Methodika Vol 6(2) 1992, 150-166.
  • Oosterhof, A. C., Atash, M. N., & Lassiter, K. L. (1984). Facilitating identification of item bias through use of Delta plots: Educational and Psychological Measurement Vol 44(3) Fal 1984, 619-627.
  • Orhede, E., & Kreiner, S. (2000). Item bias in indices measuring psychosocial work environment and health: Scandinavian Journal of Work, Environment & Health Vol 26(3) Jun 2000, 263-272.
  • Oshima, T. (1990). The effect of multidimensionality on item bias detection based on item response theory: Dissertation Abstracts International.
  • Oshima, T. C., & Miller, M. D. (1992). Multidimensionality and item bias in item response theory: Applied Psychological Measurement Vol 16(3) Sep 1992, 237-248.
  • Ottosson, H., Grann, M., & Kullgren, G. (2000). Test-retest reliability of a self-report questionnaire for DSM-IV and ICD-10 personality disorders: European Journal of Psychological Assessment Vol 16(1) 2000, 53-58.
  • Padilla Garcia, J. L., Perez Melendez, C., & Gonzalez Gomez, A. (1998). The influence of instructional experience on achievement item bias: Psicothema Vol 10(2) Jul 1998, 481-490.
  • Pae, T.-I., & Park, G.-P. (2006). Examining the relationship between differential item functioning and differential test functioning: Language Testing Vol 23(4) Oct 2006, 475-496.
  • Paolo, A. M., Ryan, J. J., Ward, L. C., & Hilmer, C. D. (1996). Different WAIS-R short forms and their relation to ethnicity: Personality and Individual Differences Vol 21(6) Dec 1996, 851-856.
  • Park, D.-g. (1989). Investigations of item response theory item bias detection: Dissertation Abstracts International.
  • Park, D.-g., & Lautenschlager, G. J. (1990). Improving IRT item bias detection with iterative linking and ability scale purification: Applied Psychological Measurement Vol 14(2) Jun 1990, 163-173.
  • Parmar, R. S. (1989). Cross-cultural transfer of non-verbal intelligence tests: An (in)validation study: British Journal of Educational Psychology Vol 59(3) Nov 1989, 378-388.
  • Parshall, C. G., & Miller, T. R. (1995). Exact versus asymptotic Mantel-Haenszel DIF statistics: A comparison of performance under small-sample conditions: Journal of Educational Measurement Vol 32(3) Fal 1995, 302-316.
  • Perloff, J. M., & Persons, J. B. (1988). Biases resulting from the use of indexes: An application to attributional style and depression: Psychological Bulletin Vol 103(1) Jan 1988, 95-104.
  • Petroski, G. F. (2006). Statistical tests in the DFIT framework: A Monte Carlo evaluation of conventional methods and a bootstrap alternative. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Pincus, T., & Callahan, L. F. (1993). Depression scales in rheumatoid arthritis: Criterion contamination in interpretation of patient responses: Patient Education and Counseling Vol 20(2-3) May 1993, 133-143.
  • Pinto-Meza, A., Serrano-Bianco, A., Penarrubia, M. T., Blanco, E., & Haro, J. M. (2005). Assessing Depression in Primary Care with the PHQ-9: Can It Be Carried Out over the Telephone? : Journal of General Internal Medicine Vol 20(8) Aug 2005, 738-742.
  • Piotrowski, M. J., Barnes-Farrell, J. L., & Esrig, F. H. (1989). Behaviorally anchored bias: A replication and extension of Murphy and Constans: Journal of Applied Psychology Vol 74(5) Oct 1989, 823-826.
  • Poortinga, Y. H. (1991). Conceptual implications of item bias. New York, NY: Kluwer Academic/Plenum Publishers.
  • Poortinga, Y. H., & van der Flier, H. (1988). The meaning of item bias in ability tests. New York, NY: Cambridge University Press.
  • Posserud, M.-B., Lundervold, A. J., & Gillberg, C. (2006). Autistic features in a total population of 7-9-year-old children assessed by the ASSQ (Autism Spectrum Screening Questionnaire): Journal of Child Psychology and Psychiatry Vol 47(2) Feb 2006, 167-175.
  • Pratt, S. I., & Moreland, K. L. (1998). Individuals with other characteristics. Washington, DC: American Psychological Association.
  • Price, L. R. (1997). Differential Item Functioning and language translation: A cross-national study with a test developed for certification. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Prince, J. P. (1997). Assessment bias affecting lesbian, gay male and bisexual individuals: Measurement and Evaluation in Counseling and Development Vol 30(2) Jul 1997, 82-87.
  • Raju, N. S. (1990). Determining the significance of estimated signed and unsigned areas between two item response functions: Applied Psychological Measurement Vol 14(2) Jun 1990, 197-207.
  • Raju, N. S. (1991). "Determining the significance of estimated signed and unsigned areas between two item response functions": Correction: Applied Psychological Measurement Vol 15(4) Dec 1991, 352.
  • Raju, N. S., Drasgow, F., & Slinde, J. A. (1993). An empirical comparison of the area methods, Lord's Chi-Square Test, and the Mantel-Haenszel Technique for assessing differential item functioning: Educational and Psychological Measurement Vol 53(2) Sum 1993, 301-314.
  • Raju, N. S., & Ellis, B. B. (2002). Differential item and test functioning. San Francisco, CA: Jossey-Bass.
  • Raju, N. S., & Normand, J. (1985). The regression bias method: A unified approach for detecting item bias and selection bias: Educational and Psychological Measurement Vol 45(1) Spr 1985, 37-54.
  • Ramirez, M., Teresi, J. A., Holmes, D., Gurland, B., & Lantigua, R. (2006). Differential Item Functioning (DIF) and the Mini-Mental State Examination (MMSE): Overview, Sample, and Issues of Translation: Medical Care Vol 44(11, Suppl 3) Nov 2006, S95-S106.
  • Ramirez, M., Teresi, J. A., Silver, S., Holmes, D., Gurland, B., & Lantigua, R. (2002). Cognitive assessment among minority elderly: Possible test bias. New York, NY: Springer Publishing Co.
  • Ramsay, J. O. (1993). Comments on the Monte Carlo study of Donoghue, Holland, and Thayer. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Ramsey, P. A. (1993). Sensitivity review: The ETS experience as a case study. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Raote, R. G. (1992). Study of item bias in Scales F, Depression and Schizophrenia of the Minnesota Multiphasic Personality Inventory using item response theory: Dissertation Abstracts International.
  • Rebell, M. A. (1988). Legal issues concerning bias in testing. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Redmayne, D. A. (2002). The expanded trail making test: Psychometric properties and ethnological implications. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Rees, J. (2003). A Crisis Over Consensus: Standardized Testing in American History and Student Learning: Radical Pedagogy Vol 5(2) Fal 2003, No Pagination Specified.
  • Rehnman, J., & Herlitz, A. (2007). Women remember more faces than men do: Acta Psychologica Vol 124(3) Mar 2007, 344-355.
  • Reips, U.-D. (2002). Context effects in web surveys. Ashland, OH: Hogrefe & Huber Publishers.
  • Rengel, E. K. (1987). The effects of deletion of biased items on test reliability and validity: Dissertation Abstracts International.
  • Reuterberg, S.-E. (1998). On differential selection in the Swedish Scholastic Aptitude Test: Scandinavian Journal of Educational Research Vol 42(1) Mar 1998, 81-97.
  • Reynolds, C. R. (1995). Test bias and the assessment of intelligence and personality. New York, NY: Plenum Press.
  • Reynolds, C. R., & Kaiser, S. M. (1990). Test bias in psychological assessment. Oxford, England: John Wiley & Sons.
  • Reynolds, C. R., & Kaiser, S. M. (2003). Bias in assessment of aptitude. New York, NY: Guilford Press.
  • Reynolds, C. R., & Ramsay, M. C. (2003). Bias in psychological assessment: An empirical review and recommendations. Hoboken, NJ: John Wiley & Sons Inc.
  • Richards, P. S. (1989). The relation between principled moral reasoning and conservative religious ideology: A critical reevaluation and investigation of test item bias in the Defining Issues Test: Dissertation Abstracts International.
  • Richards, P. S., & Davison, M. L. (1992). Religious bias in moral development research: A psychometric investigation: Journal for the Scientific Study of Religion Vol 31(4) Dec 1992, 467-485.
  • Roche, L. A., & Marsh, H. W. (1998). Workload, grades, and students' evaluations of teaching: Clear understanding sometimes requires more patient explanations: American Psychologist Vol 53(11) Nov 1998, 1230-1231.
  • Rodevich, M. A. (1993). The moderating effect of spinal cord injury on t score elevations on the Minnesota Multiphasic Personality Inventory 2 (MMPI-2): A clinically derived t score correction procedure: Dissertation Abstracts International.
  • Rogers, H. J., & Hambleton, R. K. (1989). Evaluation of computer simulated baseline statistics for use in item bias studies: Educational and Psychological Measurement Vol 49(2) Sum 1989, 355-369.
  • Rogler, L. H. (1999). Methodological sources of cultural insensitivity in mental health research: American Psychologist Vol 54(6) Jun 1999, 424-433.
  • Rosenbaum, D. J. (1997). Depression: Its confounding effect on neuropsychological test scores in the adult learning disabled population. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Rosenstein, R., & Glickman, A. S. (1994). Type size and performance of the elderly on the Wonderlic Personnel Test: Journal of Applied Gerontology Vol 13(2) Jun 1994, 185-192.
  • Ross, S. J., & Okabe, J. (2006). The Subjective and Objective Interface of Bias Detection on Language Tests: International Journal of Testing Vol 6(3) 2006, 229-253.
  • Rosselli, M., Ardila, A., & Rosas, P. (1990). Neuropsychological assessment in illiterates: II. Language and praxic abilities: Brain and Cognition Vol 12(2) Mar 1990, 281-296.
  • Roth, P. L., Bobko, P., & Switzer, F. S., III. (2006). Modeling the Behavior of the 4/5ths Rule for Determining Adverse Impact: Reasons for Caution: Journal of Applied Psychology Vol 91(3) May 2006, 507-522.
  • Roznowski, M. (1987). Use of tests manifesting sex differences as measures of intelligence: Implications for measurement bias: Journal of Applied Psychology Vol 72(3) Aug 1987, 480-483.
  • Roznowski, M., & Reith, J. (1999). Examining the measurement quality of tests containing differentially functioning items: Do biased items result in poor measurement? : Educational and Psychological Measurement Vol 59(2) Apr 1999, 248-269.
  • Russell, M., Chan, A. W. K., & Mudar, P. (1997). Gender and screening for alcohol-related problems. Piscataway, NJ: Rutgers Center of Alcohol Studies.
  • Ryan, J. M., & DeMark, S. (2002). Variation in achievement scores related to gender, item format, and content area tested. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Saad, S. O. A. (2000). Investigating differential prediction bias by gender in employment-oriented personality measures. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Sabine, D. M. (1994). The use of the MMPI-2 in the low socioeconomic status group. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Saccuzzo, D. P., & Johnson, N. E. (1995). Traditional psychometric tests and proportionate representation: An intervention and program evaluation study: Psychological Assessment Vol 7(2) Jun 1995, 183-194.
  • Salminen, S. (1988). Two psychometric problems of the FIRO-B questionnaire: Psychological Reports Vol 63(2) Oct 1988, 423-426.
  • Sandoval, J., & Duran, R. P. (1998). Language. Washington, DC: American Psychological Association.
  • Sandoval, J. H., Frisby, C. L., Geisinger, K. F., Scheuneman, J. D., & Grenier, J. R. (1998). Test interpretation and diversity: Achieving equity in assessment. Washington, DC: American Psychological Association.
  • Santor, D. A., Ramsay, J. O., & Zuroff, D. C. (1994). Nonparametric item analyses of the Beck Depression Inventory: Evaluating gender item bias and response option weights: Psychological Assessment Vol 6(3) Sep 1994, 255-270.
  • Scheffner-Hammer, C., Pennock-Roman, M., Rzasa, S., & Tomblin, J. B. (2002). An analysis of the Test of Language Development--Primary for item bias: American Journal of Speech-Language Pathology Vol 11(3) Aug 2002, 274-284.
  • Scheunemam, J. D. (1984). A theoretical framework for the exploration of causes and effects of bias in testing: Educational Psychologist Vol 19(4) Fal 1984, 219-225.
  • Scheuneman, J. D. (1987). An experimental, exploratory study of causes of bias in test items: Journal of Educational Measurement Vol 24(2) Sum 1987, 97-118.
  • Scheuneman, J. D. (1991). Item bias and individual differences. New York, NY: Kluwer Academic/Plenum Publishers.
  • Scheuneman, J. D., & Oakland, T. (1998). High-stakes testing in education. Washington, DC: American Psychological Association.
  • Schinka, J. A., LaLone, L., & Greene, R. L. (1998). Effects of psychopathology and demographic characteristics on MMPI-2 scale scores: Journal of Personality Assessment Vol 70(2) Apr 1998, 197-211.
  • Schmand, B., Lindeboom, J., Hooijer, C., & Jonker, C. (1995). Relation between education and dementia: The role of test bias revisited: Journal of Neurology, Neurosurgery & Psychiatry Vol 59(2) Aug 1995, 170-174.
  • Schmitt, A. P., Holland, P. W., & Dorans, N. J. (1993). Evaluating hypotheses about differential item functioning. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Schmitt, N., Hattrup, K., & Landis, R. S. (1993). Item bias indices based on total test score and job performance estimates of ability: Personnel Psychology Vol 46(3) Fal 1993, 593-611.
  • Schoener, J. E. (1985). A comparison of statistical and judgmental methods for identifying item bias: Dissertation Abstracts International.
  • Schonemann, P. H. (1997). The rise and fall of Spearman's hypothesis: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 788:812.
  • Schonemann, P. H., & Thompson, W. W. (1996). Hit-rate bias in mental testing: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 15(1) Feb 1996, 3-28.
  • Schotte, C. K. W., Maes, M., Cluydts, R., & Cosyns, P. (1996). Effects of affective-semantic mode of item presentation in balanced self-report scales: Biased construct validity of the Zung Self-rating Depression Scale: Psychological Medicine Vol 26(6) Nov 1996, 1161-1168.
  • Schultz, M. T. (1992). A comparison of some recently proposed procedures for detecting the presence of biased test items: Dissertation Abstracts International.
  • Schwarz, N. (1999). Self-reports: How the questions shape the answers: American Psychologist Vol 54(2) Feb 1999, 93-105.
  • Scruggs, T. E., & Lifson, S. A. (1985). Current conceptions of test-wiseness: Myths and realities: School Psychology Review Vol 14(3) 1985, 339-350.
  • Shealy, R., & Stout, W. (1993). A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF: Psychometrika Vol 58(2) Jun 1993, 159-194.
  • Shealy, R. T., & Stout, W. F. (1993). An item response theory model for test bias and differential test functioning. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Sheehan, K. R. (1990). The relationship of gender bias and standardized tests to the mathematics competency of university men and women: Dissertation Abstracts International.
  • Shepard, L. A., Camilli, G., & Williams, D. M. (1985). Validity of approximation techniques for detecting item bias: Journal of Educational Measurement Vol 22(2) Sum 1985, 77-105.
  • Sheppard, R. L., Jr. (1999). Differential item functioning in the Hogan Personality Inventory. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Shibutani, H. (1995). Item bias detection in very small samples: A comparison of three indices based on transformed item difficulty. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Simms, L. J. (2006). Bridging the Divide: Comments on the Restructured Clinical Scales of the MMPI-2: Journal of Personality Assessment Vol 87(2) Oct 2006, 211-216.
  • Sireci, S. G., & Geisinger, K. F. (1998). Equity issues in employment testing. Washington, DC: American Psychological Association.
  • Skaggs, G., & Lissitz, R. W. (1992). The consistency of detecting item bias across different test administrations: Implications of another failure: Journal of Educational Measurement Vol 29(3) Fal 1992, 227-242.
  • Smith, J. B. (1994). A study of item bias in the Maine Educational Assessment Test. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Smith, L. L. (2002). On the usefulness of item bias analysis to personality psychology: Personality and Social Psychology Bulletin Vol 28(6) Jun 2002, 754-763.
  • Smith, R. M. (1994). Detecting item bias in the Rasch rating scale model: Educational and Psychological Measurement Vol 54(4) Win 1994, 886-896.
  • Smith, R. M. (1996). A comparison of the Rasch separate calibration and between-fit methods of detecting item bias: Educational and Psychological Measurement Vol 56(3) Jun 1996, 403-418.
  • Smith, R. M. (2004). Detecting item bias with the Rasch model: Journal of Applied Measurement Vol 5(4) 2004, 430-449.
  • Smits, C. H. M., de Vries, W. M., & Beekman, A. T. F. (2005). The CIDI as an instrument for diagnosing depression in older Turkish and Moroccan labour migrants: An exploratory study into equivalence: International Journal of Geriatric Psychiatry Vol 20(5) May 2005, 436-445.
  • Song, W.-z., Cui, Q.-g., Cheung, F. M., & Kong, Y.-y. (1987). Comparison of personality characteristics of university students in Beijing and Hong Kong: Analysis of item endorsement discrepancies on the MMPI: Acta Psychologica Sinica Vol 19(3) 1987, 263-269.
  • Spencer, M. S., Fitch, D., Grogan-Kaylor, A., & McBeath, B. (2005). The Equivalence of the Behavior Problem Index Across U.S. Ethnic Groups: Journal of Cross-Cultural Psychology Vol 36(5) Sep 2005, 573-589.
  • Spitzenstetter, F. (2003). Optimistic bias and measurement bias: Relative or absolute valuation of the personal risk: Cahiers Internationaux de Psychologie Sociale No 58 Apr-Jun 2003, 19-27.
  • St George, R. (1987). Applying an internal criterion of test bias to progressive achievement tests and the Test of Scholastic Abilities: Itemxgroup interactions: New Zealand Journal of Educational Studies Vol 22(1) 1987, 117-119.
  • St George, R. (1987). Itemxgroup interactions over time on three progressive achievement tests: New Zealand Journal of Educational Studies Vol 22(1) 1987, 113-115.
  • Stattler, J. M. (1981). Intelligence Tests on Trial: Another Judiciary Opinion: Professional Psychology Vol 12(2) Apr 1981, 197-198.
  • Steiger, J. H. (1997). Alternate models and the evaluation of social issues: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 762-768.
  • Stein, S. V. (1985). An investigation of item characteristics which are predictive of item bias: Dissertation Abstracts International.
  • Sternberg, R. J., & Grigorenko, E. L. (1997). Infamous artifacts in the study of intelligence: Why there is so much support for so many hypotheses: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 769-778.
  • Stommel, M., Given, B. A., Given, C. W., Kalaian, H. A., & et al. (1993). Gender bias in the measurement properties of the Center for Epidemiologic Studies Depression Scale (CES-D): Psychiatry Research Vol 49(3) Dec 1993, 239-250.
  • Stone, B. J. (1989). An investigation of test bias of a kindergarten screening battery in predicting achievement and educational placement for American Indians and Caucasians: Dissertation Abstracts International.
  • Stone, B. J., & Gridley, B. E. (1991). Test bias of a kindergarten screening battery: Predicting achievement for White and Native American elementary students: School Psychology Review Vol 20(1) 1991, 132-139.
  • Stout, W., Li, H.-H., Nandakumar, R., & Bolt, D. (1997). MULTISIB: A procedure to investigate DIF when a test is intentionally two-dimensional: Applied Psychological Measurement Vol 21(3) Sep 1997, 195-213.
  • Stricker, L. J., & Emmerich, W. (1999). Possible determinants of differential item functioning: Familiarity, interest and emotional reaction: Journal of Educational Measurement Vol 36(4) Win 1999, 347-366.
  • Strommen, E. F., & Smith, J. K. (1987). Internal consistency and bias considerations of the Goodenough-Harris Draw-A-Person Test: Educational and Psychological Measurement Vol 47(3) Fal 1987, 731-736.
  • Sudweeks, R. R., & Tolman, R. R. (1993). Empirical versus subjective procedures for identifying gender differences in science test items: Journal of Research in Science Teaching Vol 30(1) Jan 1993, 3-19.
  • Switzer, D. M. (1993). Differential item functioning and opportunity to learn: Adjusting the Mantel-Haenszel chi-square procedure: Dissertation Abstracts International.
  • Takala, S., & Kaftandjieva, F. (2000). Test fairness: A DIF analysis of an L2 vocabulary test: Language Testing Vol 17(3) Jul 2000, 323-340.
  • Talbott, M. M. (1989). Age bias in the Beck Depression Inventory: A proposed modification for use with older women: Clinical Gerontologist Vol 9(2) 1989, 23-35.
  • Tamayo, J. M. (1985). Frequency of use as a measure of word difficulty in bilingual vocabulary test construction and translation: Dissertation Abstracts International.
  • Tamkin, A. S., & Hyer, L. A. (1984). Testing for cognitive dysfunction in the aging psychiatric patient: Military Medicine Vol 149(7) Jul 1984, 397-399.
  • Tatsuoka, K. K., Linn, R. L., Tatsuoka, M. M., & Yamamoto, K. (1988). Differential item functioning resulting from the use of different solution strategies: Journal of Educational Measurement Vol 25(4) Win 1988, 301-319.
  • Taylor, J. M., & Radford, E. J. (1986). Psychometric testing as an unfair labour practice: South African Journal of Psychology Vol 16(3) Sep 1986, 79-86.
  • Taylor, R. L. (1990). The Larry P. decision a decade later: Problems and future directions: Mental Retardation Vol 28(1) Feb 1990, iii-vi.
  • Taylor, R. L., Ziegler, E. W., & Partenio, I. (1984). An investigation of WISC--R verbal-performance differences as a function of ethnic status: Psychology in the Schools Vol 21(4) Oct 1984, 437-441.
  • Taylor, T. R. (1987). Test bias: The roles and responsibilities of test user and test publisher. Pretoria, South Africa: Human Sciences Research Council.
  • Taylor, T. R., & Boeyens, J. C. (1991). The comparability of the scores of Blacks and Whites on the South African Personality Questionnaire: An exploratory study: South African Journal of Psychology Vol 21(1) Mar 1991, 1-11.
  • Tellegen, A., Ben-Porath, Y. S., Sellbom, M., Arbisi, P. A., McNulty, J. L., & Graham, J. R. (2006). Further Evidence on the Validity of the MMPI-2 Restructured Clinical (RC) Scales: Addressing Questions Raised by Rogers, Sewell, Harrison, and Jordan and Nichols: Journal of Personality Assessment Vol 87(2) Oct 2006, 148-171.
  • Tench, S. L. (1985). A comparison of three intelligence tests as non-biased measures of academic potential: Dissertation Abstracts International.
  • Tepper, B. J., & Tepper, K. (1993). The effects of method variance within measures: Journal of Psychology: Interdisciplinary and Applied Vol 127(3) May 1993, 293-302.
  • Teresi, J. A., Cross, P. S., & Golden, R. R. (1989). Some applications of latent trait analysis to the measurement of ADL: Journals of Gerontology Vol 44(5) Sep 1989, S196-S204.
  • Teresi, J. A., Holmes, D., Ramirez, M., Gurland, B. J., & Lantigua, R. (2002). Performance of cognitive tests among different racial/ethnic and education groups: Findings of differential item functioning and possible item bias. New York, NY: Springer Publishing Co.
  • Thissen, D., Steinberg, L., & Gerrard, M. (1986). Beyond group-mean differences: The concept of item bias: Psychological Bulletin Vol 99(1) Jan 1986, 118-128.
  • Thissen, D., Steinberg, L., & Wainer, H. (1993). Detection of differential item functioning using the parameters of item response models. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Thomas, J., Turkheimer, E., & Oltmanns, T. F. (2000). Psychometric analysis of racial differences on the Maudsley Obsessional Compulsive Inventory: Assessment Vol 7(3) Sep 2000, 247-258.
  • Touma, S. G. (2004). Psychological testing and psychotherapy: Annals of the American Psychotherapy Assn Vol 7(4) Win 2004, 24-28.
  • Toyoda, H., Kawahashi, I., & Nakamura, K. (2007). Scaling risk-taking tendencies using item response theory in the context of prospect theory: Japanese Journal of Educational Psychology Vol 55(2) Jun 2007, 161-169.
  • Turkheimer, E. (1997). The search for a psychometric left: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 779-783.
  • Tyerman, M. J. (1986). Gifted children and their identification: Learning ability not intelligence: Gifted Education International Vol 4(2) 1986, 81-84.
  • Tziner, A., Prince, J. B., & Murphy, K. R. (1997). PCPAQ - The questionnaire for measuring perceived political considerations in performance appraisal: Some new evidence regarding its psychometric qualities: Journal of Social Behavior & Personality Vol 12(1) Mar 1997, 189-199.
  • Uiterwijk, H., & Vallen, T. (1997). Test bias and item bias for ethnic minority students in the Final Primary School Tests of the Dutch National Institute for Educational Measurement: Pedagogische Studien Vol 74(1) 1997, 21-32.
  • Uiterwijk, H., & Vallen, T. (2005). Linguistic sources of item bias for second generation immigrants in Dutch tests: Language Testing Vol 22(2) Apr 2005, 211-234.
  • Ukrainetz, T. A., Harpell, S., Walsh, C., & Coyle, C. (2000). A preliminary investigation of dynamic assessment with Native American kindergartners: Language, Speech, and Hearing Services in Schools Vol 31(2) Apr 2000, 142-154.
  • Valdes, G., & Figueroa, R. A. (1994). Bilingualism and testing: A special case of bias. Westport, CT: Ablex Publishing.
  • Valencia, R. R., & Rankin, R. J. (1986). Factor analysis of the K-ABC for groups of Anglo and Mexican American children: Journal of Educational Measurement Vol 23(3) Fal 1986, 209-219.
  • Valencia, R. R., & Rankin, R. J. (1988). Evidence of bias in predictive validity on the Kaufman Assessment Battery for Children in samples of Anglo and Mexican American children: Psychology in the Schools Vol 25(3) Jul 1988, 257-263.
  • Van de Vijver, F. (2000). The nature of bias. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • van de Vijver, F. J. R. (1991). Group differences in structured tests. New York, NY: Kluwer Academic/Plenum Publishers.
  • van de Vijver, F. J. R. (1994). Bias: Where psychology and methodology meet. Lisse, Netherlands: Swets & Zeitlinger Publishers.
  • van de Vijver, F. J. R. (2000). Bias and Equivalence. Kazdin, Alan E (Ed). (2000). Encyclopedia of psychology, Vol. 1. (pp. 408-410). Washington, DC ; New York, NY: American Psychological Association; Oxford University Press.
  • Van der Flier, H., Mellenbergh, G. J., & Ader, H. J. (1984). The effectiveness of an iterative item bias detection method applied to groups differing in mean ability: Tijdschrift voor Onderwijsresearch Vol 9(2) Mar 1984, 61-70.
  • Vinchur, A. J. (1987). An empirical comparison of methods for detecting test bias: Dissertation Abstracts International.
  • Voskuil, S. L. (1986). Examiner disability, examiner gender and examinee gender as potential sources of bias in the administration of selected subtests of the WAIS--R: Dissertation Abstracts International.
  • Wahlsten, D. (1997). Studied ignorance weakens human behavior genetics: Cahiers de Psychologie Cognitive/Current Psychology of Cognition Vol 16(6) Dec 1997, 784-787.
  • Wainer, H. (1993). Model-based standardized measurement of an item's differential impact. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Waller, N. G., Thompson, J. S., & Wenk, E. (2000). Using IRT to separate measurement bias from true group differences on homogeneous and heterogeneous scales: An illustration with the MMPI: Psychological Methods Vol 5(1) Mar 2000, 125-146.
  • Walter, M. I., Stone, W. F., & Bourgeois, D. Y. (1996). Authoritarianism and response style: New results on an old question: Psicologia Politica No 13 Nov 1996, 17-27.
  • Waltman, M. S., & Burleson, B. R. (1997). Explaining bias in teacher ratings of behavior alteration techniques: An experimental test of the heuristic processing account: Communication Education Vol 46(2) Apr 1997, 75-94.
  • Wang, N., & Lane, S. (1996). Detection of gender-related differential item functioning in a mathematics performance assessment: Applied Measurement in Education Vol 9(2) 1996, 175-199.
  • Ware, M. D. (1998). The MMPI-2, racial differences, and associated empirical correlates. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Watkins, M. W., & Hetrick, C. J. (1999). MacPotthoff: Automated calculation of the Potthoff regression bias procedure: Behavior Research Methods, Instruments & Computers Vol 31(4) Nov 1999, 710-711.
  • Watkins, M. W., Kush, J. C., & Glutting, J. J. (1997). Dsicriminant and predictive validity of the WISC-III ACID profile among children with learning disabilities: Psychology in the Schools Vol 34(4) Oct 1997, 309-319.
  • Weierbach, J. L. (1987). Performances of deaf and hearing children on the Leiter International Performance Scale: An investigtion of psychometric bias: Dissertation Abstracts International.
  • Weijerman, E. A., & Born, M. P. (1995). The relationship between gender and assessment center scores: Gedrag en Organisatie Vol 8(5) Oct 1995, 284-291.
  • Weinstein, N. D. (2007). Misleading Tests of Health Behavior Theories: Annals of Behavioral Medicine Vol 33(1) Feb 2007, 1-10.
  • Weiss, D. J., & McBride, J. R. (1984). Bias and information of Bayesian adaptive testing: Applied Psychological Measurement Vol 8(3) Sum 1984, 273-285.
  • Weiss, L. G., & Prifitera, A. (1995). An evaluation of differential prediction of WIAT achievement scores from WISC-III FSIQ across ethnic and gender groups: Journal of School Psychology Vol 33(4) Win 1995, 297-304.
  • Weiss, L. G., Prifitera, A., & Roid, G. H. (1993). The WISC-III and the fairness of predicting achievement across ethnic and gender groups. Brandon, VT: Clinical Psychology Publishing Co.
  • Weitzman, B. C., Guttmacher, S., Weinberg, S., & Kapadia, F. (2003). Low response rate schools in surveys of adolescent risk taking behaviours: Possible biases, possible solutions: Journal of Epidemiology & Community Health Vol 57(1) Jan 2003, 63-67.
  • Welch, C., & Hoover, H. D. (1993). Procedures for extending item bias detection techniques to polytomously scored items: Applied Measurement in Education Vol 6(1) 1993, 1-19.
  • Wheeler, P. T. (1987). A study of the effect of a child's physical attractiveness upon verbal scoring of the Wechsler Intelligence Scale for Children (Revised) and upon personality attributions: Dissertation Abstracts International.
  • Widerstrom, A. H., Miller, L. J., & Marzano, R. J. (1986). Sex and race differences in the identification of communicative disorders in preschool children as measured by the Miller Assessment for Preschoolers: Journal of Communication Disorders Vol 19(3) Jun 1986, 219-226.
  • Wiig, E. H. (2000). Authentic and other assessments of language disabilities: When is fair fair? : Reading & Writing Quarterly: Overcoming Learning Difficulties Vol 16(3) Jul-Sep 2000, 179-210.
  • Wilcox, R. R. (1984). A note on measuring item bias: Journal of Experimental Education Vol 53(2) Win 1984-1985, 114-116.
  • Wiley, L., & Jenkins, W. S. (1963). Method for measuring bias in raters who estimate job qualifications: Journal of Industrial Psychology 1(1) 1963, 16-22.
  • Wilkie, P. C. (2002). Are curriculum-based reading probes sex or SES biased? criterion-related validity in an elementary-aged sample. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Williams, D. M., & Dunsiger, S. (2007). Suggestions for testing health behavior theories: Implications for mediator analysis: Annals of Behavioral Medicine Vol 34(2) Sep-Oct 2007, 223.
  • Williams, V. S. L. (1997). The "unbaised" anchor: Bridging the gap between DIF and item bias: Applied Measurement in Education Vol 10(3) 1997, 253-267.
  • Willingham, W. W. (2002). Seeking fair alternatives in construct design. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Willingham, W. W., & Cole, N. S. (1997). Gender and fair assessment. Mahwah, NJ: Lawrence Erlbaum Associates Publishers.
  • Willson, V. L., Nolan, R. F., Reynolds, C. R., & Kamphaus, R. W. (1989). Race and gender effects on item functioning on the Kaufman Assessment Battery for Children: Journal of School Psychology Vol 27(3) Fal 1989, 289-296.
  • Wilson, M. J., & Bullock, L. M. (1989). Psychometric characteristics of behavior rating scales: Definitions, problems, and solutions: Behavioral Disorders Vol 14(3) May 1989, 186-200.
  • Wohlgemuth, E. A. (1997). Walking the fine line between parsimony and oversimplification: Attempting to decrease bias in assessment: Measurement and Evaluation in Counseling and Development Vol 30(2) Jul 1997, 77-81.
  • Wong, J. L., & Besett, T. M. (1999). Sex differences on the MMPI-2 Substance Abuse Scales in psychiatric inpatients: Psychological Reports Vol 84(2) Apr 1999, 582-584.
  • Wood, D. S. (2006). A comparison of Hopi Head Start students and the normative sample on the Merrill-Palmer-Revised. Dissertation Abstracts International Section A: Humanities and Social Sciences.
  • Woodard, J. L., Auchus, A. P., Godsall, R. E., & Green, R. C. (1998). An analysis of test bias and differential item functioning due to race on the Mattis Dementia Rating Scale: Journals of Gerontology: Series B: Psychological Sciences and Social Sciences Vol 53B(6) Nov 1998, P370-374.
  • Wyche, L. G., & Novick, M. R. (1985). Standards for educational and psychological testing: The issue of testing bias from the perspective of school psychology and psychometrics: Journal of Black Psychology Vol 11(2) Feb 1985, 43-48.
  • Wyche, L. G., & Novick, M. R. (1992). Standards for educational and psychological testing: The issue of testing bias from the perspective of school psychology and psychometrics. Thousand Oaks, CA: Sage Publications, Inc.
  • Yarnitsky, D., Sprecher, E., Zaslansky, R., & Hemli, J. A. (1996). Multiple session experimental pain measurement: Pain Vol 67(2-3) Oct 1996, 327-333.
  • Young, K. I. (1998). Objective and projective measures: Assessment of depression in adolescents: A convergent validity study. Dissertation Abstracts International: Section B: The Sciences and Engineering.
  • Younkin, W. F. (1987). Speededness as a source of test bias for non-native English speakers in the College Level Academic Skills Test: Dissertation Abstracts International.
  • Yura, C. A. (1986). An investigation of Black college students and White college students on the Strong-Campbell Interest Inventory: Dissertation Abstracts International.
  • Zeidner, M. (1986). Sex differences in scholastic aptitude: The Israeli scene: Personality and Individual Differences Vol 7(6) 1986, 847-852.
  • Zeidner, M. (1987). Age bias in the predictive validity of Scholastic Aptitude Tests: Some Israeli data: Educational and Psychological Measurement Vol 47(4) Win 1987, 1037-1047.
  • Zeidner, M. (1987). A cross-cultural test of sex bias in the predictive validity of scholastic aptitude examinations: Some Israeli findings: Evaluation and Program Planning Vol 10(3) 1987, 289-295.
  • Zieky, M. (1993). Practical questions in the use of DIF statistics in test development. Hillsdale, NJ, England: Lawrence Erlbaum Associates, Inc.
  • Zook, A., & Sipps, G. J. (1986). Reliability data and sex differences with a gender-free Mach IV: Journal of Social Psychology Vol 126(1) Feb 1986, 131-132.
  • Zucker, K. J. (1996). Sexism and heterosexism in the Diagnostic Interview for Borderline Patients? : American Journal of Psychiatry Vol 153(7) Jul 1996, 966.