RATERS’ BIAS, BACKGROUND AND PERCEPTION IN AWARDING SCORE OF WRITING PERFORMANCE

Endah Yulia Rahayu(1*),

(1) Postgraduate in ELT of Universitas Negeri Malang, East Java Indonesia English Department of Teacher Training Faculty Universitas PGRI Adi buana Surabaya, East Java, Indonesia
(*) Corresponding Author


Abstract


Assessing writing performance commits bias due to interaction between raters and criteria because raters can score more consistently or harshly on  some criterions. Therefore I explored how the seven raters assessed three essays in order to seek their bias in their rating task, how their background effect (having teaching writing experience & length of teaching writing) their scoring, and how their perception understanding the scoring rubric. The instruments were three essays, analytical writing rubric, questionairres of raters’ background and perception. I applied Two-Way Anova, One-Way Anova and Hoyt’s Anova to measure the raters’ bias, background and perception in awarding score of writing performance. The raters’ scoring criteria of  Content, Organization and Vocabulay  (0.195, 0.511, 0.545 )  were respectively found bias. Based on the raters’ background of having experience of teaching writing, the scoring criteria of Mechanics was bias (0.026  0.050). But the length of teaching writing experience did not affect   the scoring criteria of Content, Organization, Vocabulary, Language Use and Mechanics, in term of no bias (0.705, 0.663, 0.171, 0.206, 0.090 ≥ 0.050). Based on the raters’ perception questionnaire, they were familiar with the instrument of writing rubric prior to this reseach and agreed that the rubric help them to discriminate among the different score level. They also considered that the rangefinders in the rubric were usefull tools to asign score, and the writing rubric measured some essential elements for effectively teaching and learning writing. They assumed  the rubric could be used as a professional development tool to support teaching and learning writing, and finally they  were confident in their ability to score using the rubric.


Keywords


bias, background, perception, writing rubric

Full Text:

PDF

References


Amin, I. A.-R., Aly, M. A.-S., & Amin, M. M. (2011). A Correlation Study between EFL Strategic Listening and Listening Comprehension Skills among Secondary School Studnets. Benha, Egypt: Benha University.

Baak, E. (1997). Portfolio Development: An Introduction. Forum, 35(2), 38.

Bacha, N. (2001). Writing evaluation: what can analytic versus holistic essay scoring tell us? System, 371-383.

Bachman, L. F. (2014). Statistical analyses for language assessment. Cambridge: Cambridge University Press.

Basir, A. (2014). Autistic Students’ Learning Strategies in Writing English Texts and Their Impacts on The Teaching and Learning Process. Surakarta: Sebelas Maret University.

Bill & Melinda Gates Foundation. (2012). Gathering feed- back for teaching: Combining high-quality observations with student surveys and achievement gains. Measures of Effective Teaching (MET). Seattle, WA: Author.

Bozorgian, H., & Pillay, H. (2013). Enhancing Foreign Language Learning through Listening Strategies Delivered in L1: An Experimental Study. International Journal of Instruction, 6(1), 105-122.

Brown, A. (1995). The effect of rater variables in the development of an occupation-specifc language performance test. Language Testing, 12, 1-15.

Brown, H. D. (2006). Teaching by Principles: An Interactive Approach to Language Pedagogy. New Jersey: Prentice Hall Regents.

Cabaysa, C. C., & Baetiong, L. R. (2010). Language Learning Strategies of Students at Different Levels of Speaking Proficiency. Education Quarterly, 61(8), 16-35.

Cahyono, B. Y. (2000, August). The Overall Proficiency in English Composition of Indonesian: University Students of EFL. TEFLIN Journal, 11(1), 78-87.

Carey, M. D., Mannell, R. H., & Dunn, P. K. (2011). Does a rater’s familiarity with a candidate’s pronunciation affect the rating in oral profciency interviews? Language Testing, 28, 201–219.

Celce-Murcia, M. (2001). Teaching English as a Second or Foreign Language. Boston: Heinle & Heinle Publishers.

Chang, C. Y., Liu, S., & Lee, Y. (2007). A study of language learning strategies used by college EFL learners in Taiwan. Language Learning, 3, 235-262.

Clark, K. (1999, November). Test Realibility. The Mathematics Teacher, 92(8), 719-723.

Cohen, D. (1998). Strategies in Learning and using a Second Language. London: Longman.

Congdon, P. J., & McQueen, J. (2000). The stability of rater severity in large-scale assessment programs. Journal of Educational Measurement, 37, 163–178.

Coskun, A. (2010). The Effect of Metacognitive Strategy Training on the Listening Performance of Beginner Students. Novitas-ROYAL (Research on Youth and Language), 4(1), 35-50.

Crusan, D. (2013, November 14). Designing Writing Assessment and Rubrics LARC/CALPER Testing & Assessment Webinar. Dayton, OH, USA.

DeCarlo, L. T. (2005). A model of rater behavior in essay grading based on signal detection theory. Journal of Educational Measurement, 42(1), 53–76.

Diederich, P.D., French, J.W., Carlton, S.T. (1961). Factors in Judgements of Writing Ability. Princeton, New Jersey: Educational Testing Service.

Eckes, T. (2005). Examining Rater Effects in TestDaf Writing and Speaking Performance Assessment. Language Assessment Quarterly, 2(3), 197–221.

Eckes, T. (2008). Rater types in writing performance assessments: A classifcation approach to rater. Language Testing, 25(2), 155-185.

Eckes, T. (2012). Operational Rater Types in Writing Assessment: Linking Rater Cognition to Rater Behavior. Language Assessment Quarterly, 9, 270-292.

Eckes, T. (2012). Operational Rater Types in Writing Assessment: Linking Rater Cognition to Rater Behavior. Language Assessment Quarterly, 9, 270–292.

Ellis, R. (1994). The Study of Second Language Acquisition. New York: Oxford University Press.

Farlex. (2007). Retrieved January 20, 2016, from The Free Dictionary: http://thefreedictionary.com

Gestanti, R. A. (2015). Students’Learning Strategies and Their Accomplishment in Speaking English. Surakarta: Sebelas Maret University.

Ghaderpanahi, L. (2012). Using Authentic Aural Materials to Develop Listening Comprehension in the EFL Classroom. English Language Teaching, 5(6), 146-153.

Ghanbari, B., Barati, H., Moinzadeh, A. (2012). Rating Scales Revisited: EFL Writing Assessment Context of Iran under Scrutiny. Language Testing in Asia, 2(1), 83-100.

Gilakjani, A. P., & M. R. (2011). A study of Factors Affecting EFL Learners’ English Listening Comprehension and The Strategies for Improvement. Journal of Language Teaching and Research, 2(5), 977-988.

Gilakjani, A. P., & Sabouri, N. B. (2016). Learners’ Listening Comprehension Difficulties in English Language. English Language Teaching, 9(6), 123-133.

Hammond, L.D. (2010). Evaluating Teacher Effectiveness How Teacher Performance Assessments Can Measure and Improve Teaching. Washington, DC: Center for American Progress.

Harmer, J. (2001). The Practice of English Language Teaching. New York: Longman.

Harmer, J. (2007). How to Teach English: New Edition. London: Pearson Education Limited.

Haswell, R. H. (2007, January 15). Researching Teacher Evaluation of Second Language Writing via Prototype Theory. Corpus Christi, Texas, USA.

H-R Guide. (2015, May 12). Chapter 3: Understanding Test Quality-Concepts of Reliability and Validity. Retrieved 2017, from Human Resources: http://www.hr-guide.com/data/G362.htm

Huang, Y. F. (2009). The Relationship between College Students’ Learning Strategies and Their English Speaking Proficiency. Ming Chuan, : Ming Chuan Univ Press.

Huda, N. (1998). Relationship between Speaking Proficiency, Reflectivity-impulsivity, and L2 Learning Strategies. Learners and Language Learning. RELC Anthology series, 39, 40-45. (W. Renandya, & G. M. Jacobs, Eds.) Singapore: SEAMEO Regional Language Centre.

Huy, L. H. (2015). An Investigation into Listening Strategies of EFL Students. Asian Journal of Educational Research, 3(4), 21-34.

Ivarsson, E., & Palm, M. (2013). Listening Strategies in the L2 Classroom. Malmö högskola.

Jacobs, H.L., Zinkgraf S.A., Wormuth D.R., Hartfiel V.F., Hughey J.B. (1981). Testing ESL Composition: a practical approach. Rowley, Massachusetts: Newbury House.

Janssen, F. J. (2015, January). Research on rater bias in classroom observation. Retrieved March 23, 2017, from http://janbri.nl/?page_id=103

Jonathan Trace, G. J. (2017). Measuring the impact of rater negotiation in writing performance assessment. Language Testing, 34(1), 3-22.

Jonsson, A., Svingby, G. (2007). The use of scoring rubrics: Reliability, validity and educational consequences. Educational Research Review, 130–144.

Jou, Y.-J. (2009). A Study of English Listening Strategies Applied by. Cheng Shiu: Cheng Shiu University.

Kane, M. (2013). Validating the interpretations and uses of test scores. Journal of Educational Masurement, 1, 1-73.

Kassem, H. M. (2015). The Relation between Listening Strategies Used by Egyptian EFL College Sophomores and Their Listening Comprehension and Self-Efficacy. English Language Teaching Journal, 8(2), 153-169.

Khamdani, A. K. (2014). Learning Strategies Applied by Students of Nursing Academy in Listening. Surakarta: Sebelas Maret University.

Klein, C. R. (1987). The value of assignment-specific writing scales for ESL composition. Ames, Iowa, USA.

Knoch, U., Fairbairn, J., Huisman, A. (2016). An evaluation of an online rater training program for the speaking and writing sub-tests of the Aptis test. Papers in Language Testing and Assessment , 5(1), 90-106.

Knoch, U., Read, J., von Randow, J. (2007). Re-training writing raters online: How does it compare with face-to-face training? Assessing writing, 12, 26-43.

Koretz, D. (2008). Measuring up: What educational testing really tell us. Massachusetts/London, England: Harvard University Press.

Liang, T. (2009). Language Learning Strategies- The Theoretical Framework and Some Suggestions for Learner Training Practice. English Language Teaching Journa, 2(4), 199-206.

Lundstrom, K., Diekema, A. R., Leary, H., Haderlie, S., Holliday. W. (2015). Teaching and learning information synthesis. Communications in Information Literacy, 9(1), 60-82.

McNamara, T. F. (1996). Measuring second language performance. London: Longman.

McNamara, T., Roever, C. (2006). Language Testing: The Social Dimension. Oxford: Blackwell Publishing.

Meier, V. (n.d.). Evaluating rater and rubric performance on writng placement exam. University of Hawai‘i at Mānoa.

Milles, M. B., & Huberman, A. M. (1984). Qualitative Data Analysis. A Sourcebook of New Methods. California: SAGE Publication, Inc.

Nakamura, Y. (2004). A comparison of holistic and analytic scoring methods in the assessment of writing. Proceedings of the 3rd Annual JALT Pan-SIG Conference (pp. 45-52). Tokyo: 2004 Pan SIG.

Nakanishi, C. (2005). What Influences the Quality of Japanese College Students’ Writing in English as a Foreign Language? The Journal of Asia TEFL , 2(1), 155-180.

Nation, I., & Newton, J. (2009). Teaching ESL/EFL Listening and Speaking. New York: Routledge.

NC Departement of Public Instruction. (2015). North Carolina Teacher Evaluation Process. Raleigh: Public School of North Carolina.

O’Malley, J. M. (1990). Learning Strategies in Second Language Acquisition. Cambridge: Cambridge University Press.

Oxford, R. L. (1990). Language Learning Strategies: What Every Teacher Should Know. Boston: Heinle & Heinle Publishers .

Oxford, R. L. (2003). Language Learning Styles and Strategies: An Overview. London: GALA.

Park, Y.S., Chen, J., Holtzman, S.L. (2014). Evaluating Efforts to Minimize Rater Bias in Scoring Classroom Observations. In T. K. Kane, DESIGNING TEACHER EVALUATION SYSTEMS (p. 384). San Francisco: Wiley.

Penulis, T. (2015). Panduan Akademik 2015/2016. Ponorogo: UMP Press.

Quintero, E.F.G., Guzmán, N.P.T, Guzmán, R.R. (2017). Assessing EFL University Students’ Writing: A Study of Score Reliability. Revista Electrónica de Investigación Educativa, 9(2).

Razawi, N. A. (2011). Students’ Diverse Learning Styles in Learning English as a Second Language. International Journal of Bussiness and Social Science, 2(19), 179-186.

Rezaei, A.R., Lovorn, M. (2010). Reliability and validity of rubrics for assessment through writing. Assessing Writing, 15, 18-39.

Richards, J. C. (2002). Methodology in Language Teaching. An Anthology of Current Practice. New York: Cambridge University Press.

Riduwan. (2004). Metode dan Teknik Menyusun Thesis. Bandung: Alfabeta.

Rost, M. (1994). Introducing Listening. London: Penguin Group.

Saeidi, M., Yousefi, M., Baghayei, P. (2013). Rater Bias in Assessing Iranian EFL Learners’ Writing Performance. Iranian Journal of Applied Linguistics (IJAL), 16(1), 145-175.

Schaefer, E. . (2008). Rater bias patterns in an EFL writing assessment. Language Testing, 25, 465-493.

Shi, C. (2011). A Study of the Relationship between Cognitive Styles and. Higher Education Studies, 1(1), 20-26.

Shutler, J. (2002, August 15). One way ANOVA - Analysis of variance. Retrieved April 21, 2017

Sokolov, C. (2014). Self-evaluation of rater bias in written composition assessment. Linguistica, 54(1), 261-275.

Stuart, I., Halmilton. (2007). Dictionary of Psychological Testing, Assessment and Treatment (second ed.). London and Philadelphia: Jessica Kingsley .

Sudweeks, R. R., Reeve, S., & Bradshaw, W. (2005). A comparison of generalizability theory and Many-Facet Rasch Measurement in an analysis of college sophomore writing. Assessing Writing, 239-261.

Underwood, M. (1989). Teaching Listening. London: Longman.

Ur, P. (1996). A Course in Language Teaching Practice and Theory. Melbourne: Cambridge University Press.

Watthajarukiat, T. E. (2011). An Investigation of English Listening Strategies Used by Thai Undergraduate Students in Public Universities in the South Thailand. Journal of Art, 15(4), 1-17.

Weir, J. C. (1998). Communicative Language Testing. New Jersey: Prentice Hall Europe.

Wenden, A. &. (1987). Learner Strategies in Language Learning. New Jersey: Prentice Hall.

Wigglesworth, G. . (1993). Exploring bias analysis as a tool for improving rater consistency in assessing oral interaction. Language Testing, 10, 305-335.

Zare, P. (2012). Language Learning Strategies among EFL/ESL Learners: A Review of Literature. International Journal of Humanities and Social Science, 2(5), 162-169.

Zhang, W.-S. (2007). Teach More Strategiesin EFL CollegeListening Classroom. US-China Education Review, 4(3), 71-76.




DOI: http://dx.doi.org/10.24127/pj.v6i2.1022

Refbacks

  • There are currently no refbacks.


Copyright (c) 2017 Endah Yulia Rahayu



Published by Universitas Muhammadiyah Metro 

Scientific Publication Unit (UPI)

Gd. HI, Lt1 Kampus 1 Universitas Muhammadiyah Metro 

Jl. Ki Hajar Dewantara 116 A 

Kota Metro Lampung 34145  Indonesia 

Email  : help.upi@ummetro.ac.id

Phone : +62-725-42445 

Fax     : +62-725-42454

Mobile : +62-8570914-1060

Certificate of Accreditation (Volume 11 No 1, 2022-Volume 15 No 2, 2026

 

 

 

 

 

 

Publisher

Universitas Muhammadiyah Metro

Unit Publikasi Ilmiah (Scientific Publication Unit)

Address:

Gedung HI Lt 1, Ruang UPT Publikasi Ilmiah Universitas Muhammadiyah Metro

Jl. Ki Hajar Dewantara No.116, Iringmulyo, Metro Timur, Kota Metro, Lampung 34111
Phone/WA: +6285709141060

Email:upi@ummetro.ac.id 

======================

e-ISSN-2442-482x  p-ISSN-2089-3345

Download Premise Official Template  June -October 2023

Certificate