The CAA International EALTS and the VRIP principlesFollowing best practice in English language examinations for speakers of other languages, the development and production of the CAAi EALTS has been underpinned by a consideration of the four essential examination qualities of validity, reliability, impact and practicality, referred to collectively as the VRIP principles. 1 Validity is generally considered to be the most important examination quality; it concerns the appropriateness and meaningfulness of an examination in a specific context and the specific inferences made from examination results and refers to the extent to which an examination measures what it is intended to measure and can be shown to produce scores or ratings which are an accurate reflection of the candidate’s actual level of language ability in the skills assessed as set against the criteria by which they are assessed. The validity of an examination is often described in relation to a number of related concepts for which evidence of validity can be provided:
In language testing, validity is one of the ways in which the overall fairness of an examination can be interpreted, with validity being seen as the degree an examination measures what it is supposed to measure. Aviation language tests have high stakes. It is important for safety and for the integrity of the industry, particularly the operators and for test-takers themselves, that language tests be fair and accurate. 2 Reliability is a key concept in any form of measurement and contributes to overall validity. In language assessment, reliability concerns to the overall stability of an examination - the extent to which examination results are fair, consistent, accurate and free from errors of measurement, both in relation to a single candidate and across candidate populations, and therefore the extent to which the results can be depended on for making decisions about a candidate. For tests of speaking where the assessment of candidate performance involves judgement by examiners or assessors, it is important to consider reliability in terms of the accuracy and consistency of the assessments which are made. The reliability of subjective assessments (i.e. examiner judgements) is usually estimated using correlations (e.g. intra- and inter-rater reliability correlations). With validity, reliability is the second of the two ways in which the overall fairness of a test can be interpreted, with reliability being seen as the degree that the test produces consistent and fair results. 3 Impact refers to the effects, educational, professional, social, economic or political which an examination may have on the candidates and other test users. The effects of this impact may be positive, negative or neutral. It is recognized that examinations can have a major impact on language training processes and on society in general. This impact operates on at least two levels:
Positive impact on teaching and learning is an important aspect of impact which operates on both levels (macro and micro). It is in this context that the notions of "face validity" (or test appeal) and wash-back are considered. In the context of English for Aviation language proficiency testing for ICAO compliance, the overall impact of any assessment means should lead to improved operational practice in aeronautical communications and enhanced aviation safety. 4 Practicality is an integral part of the concept of examination usefulness and affects many different aspects of an examination. It refers to the extent to which an examination is practicable and sustainable in terms of the balance of resources required to produce, administer and support it (including the funds, time and expertise) in its intended context of use with the resources available to do so. The practicality of any examination, involves two factors:
A practical examination is one that does not place an unreasonable demand on available resources. It is the consideration given to these four essential assessment criteria that assures the quality of the CAAi English for Aviation Language Testing System. 5 1 c.f. www.ALTE.org for content and further details. Also ICAO Cir 318-AN/180 Ch.1 Fundamental Principles of Language Testing 3.3 2 c.f. Ch.2 Test Validity and Reliability 2.1. Statements of evidence for test validity in plain, layperson language accessible to test stake-holders are available in the CAA International EALTS Test Specification 2009. 3 c.f. ICAO Cir 318-AN/180 Ch.2 Test Validity and Reliability 2.1 Statements of evidence for test reliability in plain, layperson language accessible to test stake-holders are available in the CAA International EALTS Test Specification 2009. 4 c.f. ICAO Cir 318-AN/180 Ch.1 Introduction to Language Testing 3.4. Further details of test impact in the form of a Test Wash-back statement are provided in the section EALTS Test Wash-back in this Handbook. 5 c.f. ICAO Cir 318-AN/180 Ch.1 Introduction to Language Testing. Further details in relation to test administration and delivery are provided in the section Test Administration in this Handbook. |