Is Your Language Assessment Data Accurate and Reliable?

The value of language skills is increasing.

Both learners and educators are discovering that individuals who can demonstrate proficiency in more than one language improve their chances to earn college admission, secure a good job, and increase their earning potential. Assessment is the most efficient means of determining proficiency.

When you rely on a language proficiency assessment, how do you know its results are accurate and reliable? It turns out, not all assessments are created equal.

Why language assessment accuracy and reliability matter.

Assessment data and proficiency outcomes are often the basis for:

Language program quality ratings
Decisions about program funding
Staff hiring and promotions
Credentials such as State and Global Seals of Biliteracy
College credit
Progress of individual learners

Regardless of what assessment is used, it is essential for language learners and their trusted programs to be confident that the scores they receive are accurate and reliable. When various assessments are all testing the same skills, what makes them different? Or, what makes one better than the other?

Common practices within a program, or even within the language teaching field, may agree upon an assessment practice and find its results suitable. However, the assessments may not be meeting certain rating accuracy and reliability requirements. If an inaccurate thermometer indicates you have a fever but you do not, you may end up taking medication for the wrong diagnosis.

Accuracy and reliability matter when they can be decisive in awarding a credential for language skills, a company’s decision to hire, or whether a program gets funded or not.

How can you tell if scores are accurate and reliable?

Recent Avant research on the rating of the Writing and Speaking sections of the Avant STAMP assessments demonstrates how Avant applies rigorous standards and rating quality checks to achieve a high degree of accuracy and reliability across all of the 40+ languages that Avant tests. The research examined the following components:

Rater training
The rating process, using human raters and the procedures when two raters disagree on a rating
How the final score is determined
The following statistical measurements:
1. Exact Agreement
2. Exact + Adjacent Agreement
3. Quadratic weighted kapp (QWK)
4. Standardized Mean Difference (SMD)
5. Spearman’s Rank-Order Correlation (p)
6. 2 STAMP Levels Apart (a measure of non-adjacent agreement)

These measures can be triangulated to ensure the highest possible degree of accuracy and reliability in the Avant STAMP results.

Results show that across all levels, the rating of Avant STAMP 4S and STAMP WS Writing and Speaking responses is highly consistent. The American Council on Education (ACE) has conducted an extensive review of Avant’s rating processes, accuracy and reliability. Based on their review, ACE recommends Avant STAMP for college credit. For more statistical detail about the accuracy and reliability of Avant’s rating of Speaking and Writing responses, read the full white paper on the accuracy and reliability of Avant’s rating of STAMP Speaking and Writing responses.

Verifying the accuracy and reliability of rating in a language proficiency test is of critical importance when evaluating whether the test is appropriate for your program. As the stakes rise for testing and documenting language skills the question is: can you afford not to?

Articles you may also like:

Discover the Potential of Your Language Programs with Avant STAMP Data Data is a Key to Language Proficiency Test Reliability and Validity

Post

How the Avant STAMP Test Made Facilitated Interdependent Language Learning (FILL) Possible

Published: Jan 21, 2026 Updated: Feb 19, 2026

My name is J. Ryan Allen, and I teach World Languages at Delmar High School, a small but ambitious school district in southern Delaware. For many years, Delmar offered only…

Utah State Board of Education Adopts Avant STAMP for DLI Programs

Published: Aug 12, 2025

The Utah State Board of Education (USBE) announced the adoption of Avant STAMP for all Utah DLI schools starting in the 2025-26 school year. The decision follows a thorough evaluation of proficiency data and assessment platforms, with Avant STAMP emerging as the ideal solution to meet the needs of students, teachers, and program leaders. This is the third state, after New Mexico and Delaware, to select Avant as the vendor for assessing learners in their Dual Language Immersion programs.

one young african american male and one young caucasian male student with headsets at their computers

Post

No One-Size-Fits-All: Tailored Spanish Language Solutions

Published: Jun 4, 2025 Updated: Oct 5, 2025

Did you know that Spanish is spoken by over 43 million people at home across America? That’s about 14% of our population, making Spanish the most common non-English language in…

Ensuring Excellence: How Avant STAMP Sets the Standard for Reliable Language Testing

Published: Aug 28, 2024 Updated: Oct 7, 2025

As the first online computer-adaptive language assessment, Avant STAMP is that original, forward-thinking bridge that stands the test of time but also evolves, adapts, and anticipates the needs of educators and learners. Avant focuses on pushing the boundaries to make language testing more effective, secure, and responsive to the real-world challenges faced by educators and next-generation learners. This ongoing commitment ensures that the bridge we build together is not just a structure of the past, but a pathway to the future of language education.

Post

Cybersecurity Excellence in Action: Avant Signs CISA Pledge

Published: Jul 17, 2024 Updated: Aug 12, 2024

At Avant, we are pleased to announce our latest initiative to enhance our commitment to cybersecurity excellence. We have officially signed the Cybersecurity & Infrastructure Security Agency (CISA) Secure by…