Date of Graduation

5-2025

Document Type

Dissertation

Degree Name

Doctor of Philosophy in Educational Statistics and Research Methods (PhD)

Degree Level

Graduate

Department

Counseling, Leadership, and Research Methods

Advisor/Mentor

Ames, Allison

Committee Member

Turner, Ronna C.

Second Committee Member

Treadwell, Kelli

Third Committee Member

Liang, Xinya

Fourth Committee Member

Clauser, Jerome

Keywords

Educational testing; Psychometrics; Test security

Abstract

Educational assessment policymakers and stakeholders have recently called for assessments that incorporate instruction and learning into the assessment process (e.g., through-course assessment and longitudinal certification assessments). A feature of these assessments is repeated measurements of examinee ability intended to inform learning interventions. Though these longitudinally-administered assessments are often designed to be more flexible for examinees, the increased flexibility presents testing practitioners with unique challenges in maintaining the integrity of exam scores. These assessments, often characterized by on-demand or continuous administration, necessitate repeated use of items over time. Repeating items increases the risk of compromised items that may lead to examinees with item preknowledge, thereby compromising the validity of score interpretations.

Test security, and more specifically, item preknowledge, has received much attention among researchers. Given the recency of longitudinally-administered assessments, however, test security statistics applied to longitudinally-collected assessment data have received little or no attention. Beyond increased security threats, these assessments introduce challenges to item preknowledge detection because of systematic (e.g., change in examinee ability) and unsystematic (e.g., unreliability in proficiency estimation) influences. Nonetheless, longitudinally-collected assessment data may provide additional possibilities for detecting aberrant response behavior not possible in point-in-time assessment contexts.

The purpose of this dissertation was to develop and examine methods for identifying examinees exhibiting item preknowledge in a longitudinal assessment framework. This dissertation comprises three studies that may help testing organizations administering longitudinally-administered assessments develop a suite of test security statistics. By leveraging the information from longitudinal data, the first study extends preexisting test security statistics developed for point-in-time assessments to evaluate for changes in examinee response behavior. The second study extends test security statistics specifically developed to detect sudden changes in examinee response behavior to detect whether and when an examinee begins exhibiting item preknowledge throughout a longitudinal assessment. The third study examines if test security programs should adopt specialized longitudinal statistics, such as those from studies one and two, or if conventional point-in-time methods are sufficient. The three studies comprise simulation studies to assess the test security statistics' performance in realistic testing conditions and applied examples to showcase their practical use and limitations with real data.

Available for download on Thursday, June 17, 2027

Share

COinS