The Evaluation on the Reliability and Validity of CET-SET 6

来源 :校园英语·下旬 | 被引量 : 0次 | 上传用户:ZZ2077
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  【Abstract】The College English Test-Spoken English Test (CET-SET) is a nationwide spoken English test including two different bands – CET-SET 4 and CET-SET 6. It has been designed to test the oral communicative ability of university or college students in China. This article intends to evaluate CET-SET 6 in details as there are limited studies on it. With the general description of CET-SET, this article will evaluate the test in terms of its reliability and validity and give suggestions to modify CET-SET 6.
  【Key words】CET-SET 6; reliability; validity; suggestions
  【作者簡介】Han Yiqiong, Guangzhou University Sontan College.
  1. Introduction
  The College English Test-Spoken English Test (CET-SET) is affiliated to the College English Test (CET) - a criterion-related norm-referenced test in China. As one of the large-scale and high-stake standardized language test in China, CET has been more frequently researched over the past ten years (Cheng, 2008). Researchers have studied CET from different aspects: Ren (2011) used questionnaire and interview to find out the effects of CET on English education in universities of Tianjin City; He and Dai (2006) studied CET-SET group discussion based on a corpus of test performance and found the candidates’ interaction is at a low degree in the group discussion; Li (2009) studied CET writing and found the teachers did not teach to the test. However, few researchers evaluate the reliability and validity of CET-SET 6 in detail, which this paper intends to do.
  2. The Description of CET-SET 6
  Authorized by Ministry of Education, the National College English Testing Committee (NCETC) has administered CET since 1987. CET aims to assess the fulfillment of College English Teaching Syllabus in college, measure the English communication ability of non-English majors at the tertiary level and provide feedback to teachers and students (Wang, Yan, and Liu, 2014). It has been said to be a high-stake test because CET-4 certificate is a prerequisite for graduation or bachelor’s degree in many universities in China (He and Dai, 2006; Ren, 2011; Wang, Yan, and Liu, 2014:). CET is comprised two levels of tests: CET-4 is to check the basic requirements in College English Curriculum Requirement (CECR); CET-6 is to check the intermediate requirements (Du, 2012). Before 2016, the total score was shown on CET report with three sub-scores from writing and translation, listening, and reading. Speaking was absent from the test report because speaking ability was ignored. In 1999, NCETC started CET-SET which tests English communication ability of students from higher education in China. From 2016, candidates can take CET-SET 6 without any limitation. Since the interview test was replaced by the Internet-based one, this article will only focus on the Internet-based CET-SET 6.   The Internet-based CET-SET 6 is conducted in a small group including one virtual examiner who in fact works as an interlocutor and two candidates. The oral test consists of three parts. In the first part, candidates take turns to give a self-introduction in 20 seconds, and then they answer one question asked by the interlocutor simultaneously in 30 seconds. The question would be related to the topic of the test. The second part is individual presentation and group discussion. In this part, every candidate take turns to make a 90-second presentation with a given a visual prompt (sentences or pictures) on the screen after 60-second preparation. The information given to the candidates in the same group are about the same topic. After the presentations, candidates are instructed to take part in the group discussion on the given topic and try to reach an agreement in the end. In the third section, two candidates simultaneously answer a question about the given topic from the interlocutor in 45 seconds. The total length of the test is around 18 minutes.
  3. The Evaluation of Reliability in CET-SET 6
  If the scores would have been more similar, the test is said to be more reliable (Hughes, 2003). In other words, reliability means the consistency in scores regardless of when and how many times a particular test is taken. The reliability of CET-SET 6 is achieved by the practices of the standardized administration procedures, the format of the interlocutor’s engagement, and the standardized rating procedures (Zhang and Elder, 2009). The procedures of the testing are regulated and organized for the interlocutor and candidates so it makes no difference on the test score whether the time or the location the candidates take the test. The interviewer variability is also confirmed to have influence on the test and scores (Van Moere, 2006). However, in CET-SET 6, the interviewer’s engagement in CET-SET 6 will not affect the test result, since the interviewer simply reads the instructions and questions in the test.
  Besides, the standardized scoring procedures can minimize variations in the process of scoring and the scoring criterion (Zhang and Elder, 2009). The candidate’s performance in CET-SET 6 is scored by the authorized and trained raters with the formal rating scale designed on the requirements of CECR. The scoring criteria are also irreplaceable in the score reliability which is an essential component of the test reliability (Bachman and Palmer, 1996). In CET-SET 6, the criteria are specified into three aspects: 1) accuracy and range; 2) length and coherence; 3) flexibility and appropriateness. The candidate’s performance is scored on a scale from 1 to 5 based on the criteria. With the scoring criteria and the standard samples for reference, rater can make a proper judgement on candidates’ performances. This analytical scoring can make an oral test achieve a higher score reliability (Li, 2011). Moreover, test score in interview is inevitably subjective, which causes the inconsistency in the ratings (Bachman, 1990). However, the potential sources of inconsistencies cannot be diminished entirely (Bachman and Palmer, 1996).   4. The Evaluation of Validity in CET-SET 6
  Validity means whether a test measures what it is intended to measure (Hughes, 2003). It is the most important consideration in the test development, interpretation and use (Bachman, 1990). Among the different types of test validity, the most common ones are content validity, criterion-related validity, face validity and construct validity (Li, 2011).
  Content validity means the content of the test should contain a representative sample of a language skill, structures, etc. (Hughes, 2003). A speaking test has content validity only if it tests a proper sample of the related structure such as dialogue or discussion etc., which is easy to identify from the description of CET-SET 6. Besides, the three parts of CET-SET 6 actually test a variety of contents because candidates complete the tasks via description, negotiation, persuasion, debate, argumentation etc. which are required in CECR (Yang, 2003). Above all, CET-SET6 has high content validity.
  Criterion-related validity is divided into concurrent validity and predictive validity. To get concurrent validity, testers need to compare the performance of a randomly sample of students with that of all students’. The similarity between the two groups shows the concurrent validity of the test: the more similar they are, the higher the concurrent validity is. Predictive validity measures the degree to which a test can predict interviewees’ future performance. There is little research on the criterion-related validity of CET-SET.
  Face validity means the test looks as if it measures what it is supposed to measure (Hughes, 2003). Face validity can be gained in a speaking test if the testing uses direct method such as dialogue, group discussion, role play etc., by copying a similar context for the use of target language (Li, 2011). CET-SET 6 tries to provide a real-life interactive context for achieving the face validity. However, it is difficult to judge whether it has high face validity. Since the test interview is conducted in a small group, each candidate has his different attitude towards the test.
  Construct validity refers to the extent of the consistency between the performance on test and test purpose (Bachman, 1990). If the test measures the ability which it is intended to measure, it is said to have construct validity. As the performance does not fit the types of test task and result, the construct validity of CET-SET 6 is considered relatively low (Jing and Ma, 2012).   5. The suggestions to CET-SET 6
  With the above discussion, there are suggestions to modify CET-SET 6. Firstly, Ma (2014) concluded that the Spoken English test should be included as one inseparable part of the entire College English Test for the sake of validity of a test. This is also one of my suggestions to CET-SET 6. Secondly, CET-SET 6 needs to involve more question-and-answer interaction in the third part, which is also proposed by Lei (2019) in his study on IELTS speaking test and CET-SET 4/6. Since daily communication seldom stops with only a question and answer, the third part should have more interaction. Thirdly, CET-SET 6 needs to enrich its content by referring to the content of IELTS Speaking Test (Lei, 2019). CET-SET 6 should choose hot issues of daily life to allow candidates to express their ideas and interact with each other so as to better assess candidates’ communicative ability. This will enhance the reliability and validity of CET-SET 6 and achieve the test aim as well. Lastly, raters can work with the computer-automated scoring system because the differences between raters’ rating are unavoidable. It may make the rating more complicated, but it helps to ensure the reliability and validity of the test score.
  References:
  [1]Bachman, L. Fundamental considerations in language testing[M]. Oxford: Oxford University Press, 1990.
  [2]Bachman, L.
其他文献
【摘要】有效的课堂提问能助力学生思维品质的培养。教师应找准提问角度,拓宽提问广度,升华提问高度,优化课堂提问方式,激活学生思维,发展和提升学生的思维能力。  【关键词】 英语课程;课堂提问;思维品质  【作者简介】林晓梁,福建省福清市滨江小学。  《义务教育英语课程标准(2011年版)》(教育部,2012)指出,语言既是交流的工具,又是思维的工具。英语课程承担着培养学生英语素养和发展学生思维能力的
【摘要】英语是极为重要的语言学科之一,在该课程中学生能够学习到英语知识,提升自身的语言能力。在如今核心素养的要求之下,高中英语教师进行英语阅读教学时,必须积极引导学生逐渐形成较为强大的语言能力,形成良好的思维品质,学生也能够在提高自身能力的基础上,拥有相关的文化品格。因从,教师必须要对高中英语阅读教学措施进行升级,努力为学生创建具有形象性和生动性的英语教学课堂,在帮助学生取得更好成长的基础上,让学
【摘要】分析现阶段小学英语教学情况发现,小学英语写作教学活动中应用思维导图针对提高学生写作能力而言,发挥着重要作用,除可帮助学生理清写作思路外,还可提高学生英语语言表达能力。因此,本文首先对小学英语写作教学中应用思维导图的意义加以阐述,其次,针对思维导图在小学英语写作教学中的应用策略提出几点建议,望借此可切实提高学生英语写作水平。  【关键词】思维导图;小学英语;英语写作  【作者简介】孙亚萍,江
最近在我市组织的高中历史优质课评比活动中,我抽到的课题是人教版必修3《建国以来的重大科技成就》。拿到课题后我感觉这一课的内容较为简单,基本上就是罗列有关的史实,似乎没有多少知识点可以深入讲解和探究,要想讲课“出彩”就必须找新的切入点和鲜明的主题。在仔细看了本课的课程标准内容后,我被其中的“体会科技工作者的爱国热情和艰苦创业、自主创新的精神”所吸引和打动,决定把三维目标中的情感态度与价值观予以充分的
【摘要】在新形势下的英语教学中,一线英语教师要扭转传统的教学思维,打破应试教育的桎梏,多重视听、说、读、写等多种英语能力的培养。作为艺术类初中生,不仅要学习英语基础知识,还要掌握英语语言的表达技巧,进而将其在课堂上所学的知识运用到日常生活中去,才能适应日益增长的社会人才需求。在新形势下的艺术类初中英语听力教学,英语教师更要不断地改良自己的教学方法,突破教学瓶颈,极力提升自身的听力教学质量,从而帮助
【摘要】随着科学技术的发展,当今世界已然成了一个整体,而英语作为世界各国之间沟通交流的官方语言,也成了我国大中小学必学的科目之一,而在英语的教学中,我国英语教学方式以老师讲解为主,理论知识的学习成为学生学习英语的重点,而学生语言能力的培养被忽视,以至于学生不能做到学以致用。本文通过分析当下英语语言文学的教学现状,英语语言文学对学生语言能力的培养作用,致力于提高学生的主动学习能力和英语语言能力,让英
【摘要】随着新课标改革的不断深入,各学科的核心素养发展也变得越来越重要。在英语教学当中,由于与现代国际接轨,其学科本身就对学生来说非常的重要,因此英语学科核心素养的培养也就显得更重要。本文将针对基于学生核心素养发展的高中阅读教学策略进行探究,希望能够对高中英语教学发展作出微薄的贡献。  【关键词】核心素养;高中英语;阅读教学  【作者简介】陈燕梅,福建省莆田第十二中学。  2017年所颁布的《普通
【摘要】思维是人脑的机能,是对外部现实的反映,而语言是思维的要素,亦是实现思维、巩固和传达思维成果的工具,因之,语言学习需以思维启迪为前提,高质、高效的语言学习又需以“问题求解思维、决策思维、批判性思维与创新性思维”等高阶思维为基础。而课堂提问作为对学生进行思维启迪的主要方式,对其精心设计便成为小学英语教研的核心重点。本文即是就此为重点进行分析,从问题选择、问题提出、问题处理这三大方面来详细阐述。
【摘要】随着社会的不断发展以及教育制度的不断改革,对小学的英语教学提出了更高的要求。而加强在小学英语教学中运用立体化教学模式,不仅可以有效的提高课堂的教学质量,而且还能在一定程度上不断的激发学生的学习兴趣,进而有效的提高学生的英语应用能力。本文就针对立体化教学模式在小学英语教学中的应用展开具体的分析与讨论。  【关键词】立体化;教学模式;小学课堂;英语教学  立体化教学模式主要就是以学生为主,进而
【摘要】对于小学生而言,迅速准确读出单词是有难度的,更不用说长段落和故事阅读了。自然拼读可以帮助学生形成语音意识,获得单词拼读拼写的能力,发展见词能念,听音能辨的学习技能,使单词学习轻松有效,有助于学生的词汇积累,基于此,阅读教学才能顺利进行。  【关键词】自然拼读;语音意识;词汇积累  【作者简介】费玲娜,渤海大学教育与体育学院,太仓市双凤镇新湖小学。  在中国,英语作为第二语言,学习英语的主要