Lexical Diversity — marinfanjoylabrenz

MARIN FANJOY-LABRENZ

design & research

PAST & FUTURE
CONTACT

Project

Lexical Diversity

Client

English Literacy Platform

Role

Instructional Designer

Overview

During this project, I conducted research into computational metrics for evaluating lexical diversity and sentence complexity. New scoring categories, along with the introduction of subscores, enhanced our scoring accuracy and allowed us to provide students with more meaningful insights into their English proficiency.

Year

2023

PROBLEM

In our assessments, students write paragraphs of ±30-60 words in response to a prompt. Initially, we scored writing samples on a 0-20 scale in five categories:

Holistic Quality
Genre Elements
Correct Word Sequence
Readability
Complex Words

While categories 1-3 were accurately scored by human raters, categories 4-5 used computational methods that introduced significant inaccuracies. Readability calculations using the Flesch-Kincaid formula and complex word percentages produced unreliable results for short text samples.

Our research aimed to identify and integrate alternative metrics to assess the readability and sophistication of a writing sample. We also researched options for scoring scales that would allow students to see more growth on their assessment reports.

RESEARCH

Our objective for this research was to find metrics to assess a student's

Lexical Profile: Vocabulary usage, variety, and accuracy
Sentence Variety and Accuracy: The ability to construct clear, varied sentence structures

Additionally, metrics must be:

Easy and quick to determine using rubrics or computational methods
Less susceptible to innacuracies due to short text length

LEXICAL PROFILE (LP)

After extensive research into lexical diversity measures, I selected the Measure of Textual Lexical Diversity (MTLD) for its resistance to text length variations as one half of the LP score. For lexical sophistication, I averaged two distinct metrics, the Academic Word List (AWL) and the English Vocabulary Profile (EVP).

Lexical Profile = Lexical Diversity + Lexical Sophistication

= MTLD + AVG(AWL, EVP)

LEXICAL DIVERSITY

Chosen metric:

Measure of Textual Lexical Diversity (MTLD)

MTLD Characteristics

Typically ranges from 0-100, where lower numbers indicate less lexical diversity
Evaluates texts sequentially, measures average words read before type-token ratio falls below 0.72

Less sensitive to text length because it maintains consistent factor size

Retrieved using Text Inspector
Rounded to nearest whole number
Translated to 0-9 scale using a piecewise function

LEXICAL SOPHISTICATION

Chosen metrics:

Academic Word List (AWL) & English Vocabulary Profile (EVP)

AWL Characteristics

Percentage of words from Averil Coxhead's academic word families
Retrieved using Text Inspector
Rounded to nearest tenth
Typical range: 0-10%
Translated to 0-9 scale using a piecewise function

EVP Characteristics

Percentage of unique words at B1 CEFR level
Uses UK vocabulary list
Retrieved using Text Inspector
Rounded to nearest tenth
Typical range: 0-20%
Translated to 0-9 scale using a piecewise function

SENTENCE VARIETY & ACCURACY (SVA)

Colleagues simultaneously developed a 0-9 rubric that evaluated:

Clarity of sentence structures
Variety of clause arrangements
Effectiveness of sentence structures in conveying meaning
Sentence complexity and intentional communication

IMPACT ON DESIGN

We implemented several key changes to make our scoring scale and reports more student-friendly:

Scoring Scale:

Version 1: Categories scored 0-4, totaling 20 possible points
Version 2: Categories scored 0-18, totaling 90 possible points

Contextualization:

Version 1: Scores directly compared to CEFR
Version 2: Replaced direct CEFR correlation with standardized cateogires specific to the platform's curriculum

Subscores:

Version 1: Feedback based on total score
Version 2: Three subscores displayed numerically, allowing performance-based customization

RESULTS

By implementing advanced linguistic analysis techniques like the Measure of Textual Lexical Diversity (MTLD) and developing a comprehensive rubric, we significantly improved the precision of our English proficiency assessments. The new scoring system allowed us to provide students with more detailed feedback about their writing skills and linguistic development.

RESEARCH: Automated Scoring System