国产日韩欧美一区二区三区三州_亚洲少妇熟女av_久久久久亚洲av国产精品_波多野结衣网站一区二区_亚洲欧美色片在线91_国产亚洲精品精品国产优播av_日本一区二区三区波多野结衣 _久久国产av不卡

?

Reliability, practicality, and washback of choice format in language testing

2017-07-29 19:06:55XuemeiWu
關(guān)鍵詞:中圖標(biāo)識(shí)碼分類號(hào)

Xuemei+Wu

【Abstract】 This paper aims to discuss reliability, practicality, and washback of MC format and show how effective and applicable it is in language testing. It mainly includes an evaluation of MC format reliability, and practicality, and a discussion about the washback MC format has on the teaching and learning process. This paper concludes by suggesting that although MC formats reliability and validity are still questionable, it is still quite applicable in certain circumstances.

【Key Words】 MC format; language testing; reliability; practicality ; washback

【中圖分類號(hào)】 G64.33 【文獻(xiàn)標(biāo)識(shí)碼】 A 【文章編號(hào)】 2095-3089(2017)16-00-01

1.Introduction

Multiple-choice (MC) has been used widely as an instrument for language testing. In spite of the controversies on it in recent years, it still plays an very important role for many language tests, either regionally or globally. Therefore, it is crucial to study the effectiveness of MC format in language testing. To evaluate whether a language test instrument is effective or not, we have to consider many factors. These mainly include reliability, practicality and washback.

2. Evaluation

2.1 Reliability

"The reliability of a test concerns with its precision as a measuring instrument. Reliability asks whether an assessment instrument administered to the same respondents a second time would yield the same results" (Cohen, 1994:36)

MC format is said to have high rater-reliability, one of the important components of test reliability. Given that no rating carelessness occurs, the score of MC choice will always to be same. That is to say, a scorer will give the same score on different occasions; a different scorer will give the same score to a certain candidate. Objective scoring makes the MC format reliable. As Cohen(1994) pointed out, the more objective the scoring is, the higher the rater reliability a test has. In addition, MC format is likely to include many different items. The test takers will only have to make a mark on their answer sheet, so unlike essay writing, which can only test one specific topic, MC format can cover a wide variety of language applications. Hughes (2003) concluded that the more items a test includes, the more reliable the test will be.

However, guessing is a crucial factor reducing the reliability of MC format. It may have an unknown effect on test scores. A student knowing nothing about the test content may get as high as 30 to 40 out of 100 in a test, just because of luck. Zhong, a teacher of mine once had a small experience on her three-year old daughter, who was asked to circle a multiple-choices answer sheet randomly. The teacher then compared the score of her daughter with that of the students and found that her daughter could get even higher scores than some of her students. This example might be too extreme, but it really makes me doubt the reliability of MC format while it is related to guessing. What is more, the responses of MC format are very simple, the candidate will only need to write down or circle a,b,c,d. This will facilitate the communication among test takers. Cheating is very likely to occur in this case, which will certainly affect the reliability of the test.

2.2 Practicality

According to Bachman and Palmer (1996:36), practicality is "the relationship between resources that will be required in the design development and use of the test and the resources that will be available for these activities."

MC format for language testing is often said to be practical due to the following factors. Firstly, MC format can be applied to large groups of people compared to those tests that can only be used with individuals, such as face-to-face interviews. It is much less time-consuming to administers (Genesee & Upshur, 1996). Secondly, we do not need extra procedures or facilities to implement MC format testing. For some of the tests, special training has to be provided to examiners, which is the case with the IELTS speaking interview. Special facilities and equipment such as recorders are also needed for this testing procedure. Inevitably, cost of the test will increase. With MC tests, however, such problems do not exist. Lastly, it is also indisputable that scoring of MC tests is quick and economical. With the help of scoring machine, the answer sheets of hundreds of test takers can be scored in a few minutes. Further analysis of test taker's responses is also available with computer software ( Oosterholf, 1996).

On the other hand, the effort and time involved in constructing MC items pose problems with its practicality. Successful MC items are very difficult to write. Considerable time is needed to write a good item with plausible and effective alternatives. Professional test writers usually write more MC items than actually needed, and apply them in a real test only after trialing and statistical analysis of performance on the items (Hughes, 2003). This procedure will inevitably increase the time and effort for the test preparation.

2.3 Washback

According to Brown and Hudson (1998, p667), washback is "the effect of testing and assessment on the language teaching curriculum that is related to it."

The washback for MC format can be harmful due to the fact that it might be the easier instrument to find corresponding decoding test-taking strategies. Instead of studying hard, students are likely to focus on the test -taking strategies, considering them as a short cut. There is a danger that some teachers may also change their teaching goal to cater for the need of the students. They pay their attention to the studying and teaching strategies solely for the sake of coping with various tests. This is quite common for teachers in private language school in China. They know that students have strong desire to pass exams such as TOEFL and IELTS. As thus, they have worked out a set of strategies to "decode" MC format. Special effort has been made to improve the educated guessing. Some of the teachers could even make 30 correct choices without listening to the materials in a TOEFL listening test. I doubt students will try hard to improve their English when they know they can easily get high mark in this way.

3. Conclusion

MC format may still have important status in the language testing due to its versatility, practicality and objectivity. However, it is not problematic. As suggested in the above discussions, some aspects of its reliability and practicality are still questionable. It is further degraded by the fact that it has negative washback for our learning and teaching. To fully evaluate the candidate's language ability, we need to use a combination of various test instruments. The test instrument itself is of value, if employed appropriately; every single instrument can play a positive role in testing candidate's real language ability.

References:

Bachman, L.& Palmer, A., 1996, Language Testing in Practice. Oxford University Press. Oxford.

Brown, J & Hudson, T., 1998, The Alternatives in Language Assessment, TESOL quarterly, Vol.32, No.4, P653-675

Cohen, A.D., 1994, Assessing language ability in the classroom (2nd edition), Heinle & Heinle Publishers.

Genesee, F & Upshur, J,A., 1996, Classroom-based Evaluation in Second Language Education, Cambridge University Press.

Hughes, A., 2003, Testing for Language Teachers ( second edition), Cambridge University Press.

Oosterhof, A., 1996, Developing and Using Classroom Assessments, Prentice-Hall, Inc.

Weir, c.j., 1990, Communicative Language Testing. Prentice Hall. New York.

猜你喜歡
中圖標(biāo)識(shí)碼分類號(hào)
The Tragic Color of the Old Man and the Sea
Connection of Learning and Teaching from Junior to Senior
English Language Teaching in Yunann Province: Opportunities & Challenges
A Study of Chinese College Athletes’ English Learning
A Study on the Change and Developmentof English Vocabulary
Translation on Deixis in English and Chinese
Process Mineralogy of a Low Grade Ag-Pb-Zn-CaF2 Sulphide Ore and Its Implications for Mineral Processing
Study on the Degradation and Synergistic/antagonistic Antioxidizing Mechanism of Phenolic/aminic Antioxidants and Their Combinations
潤(rùn)滑油(2014年3期)2014-11-07 14:30:02
A Comparative Study of HER2 Detection in Gastroscopic and Surgical Specimens of Gastric Carcinoma
The law of exercise applies on individual behavior change development
平塘县| 安丘市| 临泽县| 梁平县| 海盐县| 咸宁市| 北宁市| 通渭县| 波密县| 潮州市| 手机| 湖南省| 河北区| 内江市| 深水埗区| 永兴县| 固安县| 台前县| 西城区| 威海市| 东明县| 竹溪县| 通州市| 松原市| 山西省| 迁西县| 黔南| 玉林市| 罗江县| 三门县| 五河县| 阳高县| 田东县| 富川| 沈丘县| 武乡县| 含山县| 邵武市| 临泽县| 哈密市| 惠安县|