Journal of Applied Measurement

Xiaofeng Steven Liu
Item difficulty and discrimination index are often used to evaluate test items and diagnose possible issues in true score theory. The two statistics are more related than the literature suggests. In particular, the discrimination index can be mathematically determined by the item difficulty and the correlation between the item performance and the total test score.
2019: Journal of Applied Measurement
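As a rough illustration of the two quantities the Liu abstract names, the sketch below computes item difficulty (proportion correct) and the correlation between item performance and total test score for a small, made-up response matrix. The data and function names are hypothetical, and the sketch does not reproduce the paper's formula for the discrimination index, which the abstract does not state.

```python
import math

def item_difficulty(responses, item):
    """Proportion of examinees answering the given item correctly."""
    scores = [row[item] for row in responses]
    return sum(scores) / len(scores)

def item_total_correlation(responses, item):
    """Pearson correlation between a 0/1 item score and the total test score."""
    x = [row[item] for row in responses]
    t = [sum(row) for row in responses]
    mean_x = sum(x) / len(x)
    mean_t = sum(t) / len(t)
    cov = sum((xi - mean_x) * (ti - mean_t) for xi, ti in zip(x, t))
    var_x = sum((xi - mean_x) ** 2 for xi in x)
    var_t = sum((ti - mean_t) ** 2 for ti in t)
    return cov / math.sqrt(var_x * var_t)

# Hypothetical data: 5 examinees (rows) by 3 dichotomous items (columns).
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 0],
]

p = item_difficulty(responses, 0)
r = item_total_correlation(responses, 0)
print(f"difficulty={p:.2f}, item-total correlation={r:.3f}")
```

Note that the total score here includes the item itself; a corrected item-total correlation would exclude it.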
Sarah Bansilal, Caroline Long, Andrea Juan
The use of multiple-choice items in assessments in the interest of increased efficiency brings associated challenges, notably the phenomenon of guessing. The purpose of this study is to use Rasch measurement theory to investigate the extent of guessing in a sample of responses taken from the Trends in International Mathematics and Science Study (TIMSS) 2015. A method of checking the extent of the guessing in test data, a tailored analysis, is applied to the data from a sample of 2188 learners on a subset of items...
2019: Journal of Applied Measurement
Song Boon Khing, Tay Eng Guan
The main objective of this study is to develop and validate a sources of mathematics self-efficacy (SMSE) scale to be used in a polytechnic adopting Problem Based Learning (PBL) as its main instructional strategy. Based on a socio-constructivist learning approach, PBL emphasizes collaborative and self-directed learning. A non-experimental cross-sectional design using a questionnaire was employed in this study. The validation process was conducted over three phases. Phase 1 was the initial development stage to generate a pool of items in the questionnaire...
2019: Journal of Applied Measurement
Sijia Zhang, Stefanie A Wind
School leadership influences school conditions and organizational climate; these conditions in turn impact student outcomes. Accordingly, examining differences in principals' perceptions of leadership activities within and across countries may provide insight into achievement differences. The major purpose of this study was to explore differences in the relative difficulty of principals' leadership activities across four countries that reflect Asian and North American national contexts: (1) Hong Kong SAR, (2) Chinese Taipei, (3) the United States, and (4) Canada...
2019: Journal of Applied Measurement
Glenn Thomas Waterbury
This simulation study explores the effects of missing data mechanisms, proportions of missing data, sample size, and test length on the biases and standard errors of item parameters using the Rasch measurement model. When responses were missing completely at random (MCAR) or missing at random (MAR), item parameters were unbiased. When responses were missing not at random (MNAR), item parameters were severely biased, especially when the proportion of missing responses was high. Standard errors were primarily affected by sample size, with larger samples associated with smaller standard errors...
2019: Journal of Applied Measurement
Eivind Kaspersen
There are many sources of evidence for a well-functioning rating-scale. Two of these sources are analyses of measure-to-category and category-to-measure statistics. An absolute cut-value of 40% for these statistics has been suggested. However, no evidence exists in the literature that this value is appropriate. Thus, this paper discusses the results of simulation studies that examined the expected values in different contexts. The study concludes that a static cut-value of 40% should be replaced with expected values for measure-to-category and category-to-measure analyses...
2019: Journal of Applied Measurement
Rianne Janssen, Jorge Gonzalez, Ernesto San Martin
An examinee- and an item-centered procedure are proposed to set cut scores for counts data. Both procedures assume that the counts data are modelled according to the Rasch Poisson counts model (RPCM). The examinee-centered method is based on Longford's (1996) approach and links contrasting-groups judgements to the RPCM ability scale using a random logistic regression model. In the item-centered method, the judges are asked to describe the minimum performance level of the minimally competent student by giving the minimum number of correct responses (or, equivalently, the maximum number of admissible errors)...
2019: Journal of Applied Measurement
Mark H Stone, A Jackson Stenner
Loevinger's specifications for a unidimensional test are discussed. The implications are reviewed using commentary from Guttman's and Rasch's specification for specific objectivity. A large population is sampled to evaluate the implications of this approach in light of Wright's early presentation regarding data analysis. The results of this analysis show the sample follows the specifications of Loevinger and those of Rasch for a unidimensional test.
2019: Journal of Applied Measurement
Tracy Kline, Corina Owens, Courtney Peasant Bonner, Tara Carney, Felicia A Browne, Wendee M Wechsberg
Hazardous drinking is a risk factor associated with sexual risk, gender-based violence, and HIV transmission in South Africa. Consequently, sound and appropriate measurement of drinking behavior is critical to determining what constitutes hazardous drinking. Many research studies use internal consistency estimates as the determining factor in psychometric assessment; however, deeper assessments are needed to best define a measurement tool. Rasch methodology was used to evaluate a shorter version of the Alcohol Use Disorders Identification Test, the AUDIT-C, in a sample of adolescent girls and young women (AGYW) who use alcohol and other drugs in South Africa (n = 100)...
2019: Journal of Applied Measurement
Victoria T Tanaka, George Engelhard, Matthew P Rabbitt
The Household Food Security Survey Module (HFSSM) is a scale used by the U.S. Department of Agriculture to measure the severity of food insecurity experienced by U.S. households. In this study, measurement invariance of the HFSSM is examined across households based on participation in the Supplemental Nutrition Assistance Program (SNAP). Households with children who responded to the HFSSM in 2015 and 2016 (N = 3,931) are examined. The Rasch model is used to analyze differential item functioning (DIF) related to SNAP participation...
2019: Journal of Applied Measurement
Pey Shin Ooi, George Engelhard
The fairness of raters in music performance assessment has become an important concern in the field of music. The assessment of students' music performance depends in a fundamental way on rater judgements. The quality of rater judgements is crucial to provide fair, meaningful and informative assessments of music performance. There are many external factors that can influence the quality of rater judgements. Previous research has used different measurement models to examine the quality of rater judgements (e...
2019: Journal of Applied Measurement
Trenton J Combs, Kyle W English, Barbara G Dodd, Hyeon-Ah Kang
Computerized adaptive testing (CAT) is an attractive alternative to traditional paper-and-pencil testing because it can provide accurate trait estimates while administering fewer items than a linear test form. A stopping rule is an important factor in determining an assessment's efficiency. This simulation compares three variable-length stopping rules, namely standard error (SE) of .3, minimum information (MI) of .7, and change in trait (CT) of .02, with and without a maximum number of items (20) imposed. We use fixed-length criteria of 10 and 20 items as a comparison for two versions of a linear assessment...
2019: Journal of Applied Measurement
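The three variable-length stopping rules named in the Combs et al. abstract can be sketched as a single check, shown here only as an illustration. The thresholds (.3, .7, .02) and the 20-item cap come from the abstract; the function name, argument names, and the direction and strictness of each comparison are assumptions, since the abstract does not spell them out.

```python
def should_stop(rule, n_administered, se=None, max_item_info=None,
                theta_change=None, max_items=None):
    """Decide whether a CAT session should end (illustrative sketch only).

    rule           -- "SE", "MI", or "CT"
    n_administered -- number of items given so far
    se             -- current standard error of the trait estimate
    max_item_info  -- largest information offered by any remaining item
    theta_change   -- change in the trait estimate after the last item
    max_items      -- optional cap on test length (20 in the abstract)
    """
    # Optional cap: stop once the maximum test length is reached.
    if max_items is not None and n_administered >= max_items:
        return True
    if rule == "SE":
        # Assumed reading: stop once the estimate is precise enough.
        return se <= 0.3
    if rule == "MI":
        # Assumed reading: stop when no remaining item is informative enough.
        return max_item_info < 0.7
    if rule == "CT":
        # Assumed reading: stop when the estimate has stabilized.
        return abs(theta_change) < 0.02
    raise ValueError(f"unknown stopping rule: {rule}")

print(should_stop("SE", 8, se=0.28))                # precise enough
print(should_stop("SE", 20, se=0.35, max_items=20)) # cap reached
```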
Stefanie A Wind, Behzad Mansouri, Parvaney Yaghoubi Jami
Isolated and integrated grammar instruction are two approaches to grammar teaching that can be implemented within a form-focused instruction (FFI) framework. In both approaches, instructors primarily concentrate on meaning, and the difference is in the timing of instruction on specific language forms. In previous studies, researchers have observed that the match between teachers' and learners' beliefs related to the effectiveness of instructional approaches is an important component in predicting the success of grammar instruction...
2019: Journal of Applied Measurement
Ki Cole
The WINSTEPS software is widely used for Rasch model calibrations. Recently, SAS/STAT released the PROC IRT procedure for IRT analysis, including Rasch. The purpose of the study is to compare the performance of the PROC IRT procedure with WINSTEPS in calibrating dichotomous and polytomous Rasch models, in order to evaluate PROC IRT as a viable alternative. A simulation study was used to compare the two programs in terms of the convergence rate, run time, item parameter estimates, and ability estimates with different test lengths and sample sizes...
2019: Journal of Applied Measurement
W Holmes Finch, Brian F French, Maria E Hernandez Finch
An important aspect of educational and psychological measurement and evaluation of individuals is the selection of scales with appropriate evidence of reliability and validity for inferences and uses of the scores for the population of interest. One aspect of validity is the degree to which a scale fairly assesses the construct(s) of interest for members of different subgroups within the population. Typically, this issue is addressed statistically through assessment of differential item functioning (DIF) of individual items, or differential bundle functioning (DBF) of sets of items...
2019: Journal of Applied Measurement
Glenn Thomas Waterbury, Christine E DeMars
This paper investigates a strategy for accounting for correct guessing with the Rasch model, which we call the Guessing Adjustment. This strategy involves the identification of all person/item encounters where the probability of a correct response is below a specified threshold. These responses are converted to missing data and the calibration is conducted a second time. This simulation study focuses on the effects of different probability thresholds across varying conditions of sample size, amount of correct guessing, and item difficulty...
2019: Journal of Applied Measurement
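The masking step of the Guessing Adjustment described in the Waterbury and DeMars abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the ability and difficulty values stand in for estimates from a first calibration, the 0.25 threshold is an arbitrary assumption, and the second calibration itself is not shown.

```python
import math

def rasch_probability(theta, b):
    """Probability of a correct response under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def mask_likely_guesses(responses, abilities, difficulties, threshold=0.25):
    """Return a copy of responses with low-probability encounters set to None.

    Every person/item encounter whose model probability of success is below
    the threshold is treated as missing, whether the response was 0 or 1.
    """
    masked = []
    for person_row, theta in zip(responses, abilities):
        new_row = []
        for x, b in zip(person_row, difficulties):
            p = rasch_probability(theta, b)
            new_row.append(None if p < threshold else x)
        masked.append(new_row)
    return masked

# Hypothetical first-pass estimates (logits) and observed responses.
abilities = [-2.0, 0.0, 2.0]
difficulties = [-1.0, 0.0, 1.5]
responses = [
    [1, 0, 1],
    [1, 1, 0],
    [1, 1, 1],
]

masked = mask_likely_guesses(responses, abilities, difficulties, threshold=0.25)
print(masked)
```

The masked matrix would then be passed back to the calibration software, with `None` standing for missing data.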
Ngoc Chu Nguyen Mong, Trong Hoang
This study examined a measurement model for the construct of consumer patriotism in the context of city-based consumers in Vietnam, a developing country, and the linkage of consumer patriotism with consumer ethnocentrism. Exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were conducted to assess the measurement model. A mediator effect test was utilised to test the hypothesis of the model, using a multiple regression procedure. Two studies were carried out, the first a preliminary study with a convenience sample of 230 people and the second a full study with a probability sample of 300 people...
2018: Journal of Applied Measurement
Vernon Mogol, Yan Chen, Marcus Henning, Andy Wearn, Jennifer Weller, Jill Yielder, Warwick Bagg
The Revised Two-Factor Study Process Questionnaire (R-SPQ-2F) was developed in 1998 using the true score theory to measure students' deep approaches (DA) and surface approaches (SA) to learning. Using Rasch analyses, this study aimed to 1) validate the R-SPQ-2F's two-factor structure, and 2) explore whether the full scale (FS), after reverse scoring responses to SA items, could measure learning approach as a uni-dimensional construct. University students (N = 327) completed an online version of the R-SPQ-2F...
2018: Journal of Applied Measurement
Meng-Ting Lo, Ssu-Kuang Chen, Ann A O'Connell
The present study used the Rasch rating scale model (RSM) to reassess the psychometric properties of the Chinese version of the Rosenberg Self-Esteem Scale (RSES) among 501 Grade 10 students in Taiwan. The reliability, dimensionality, and differential item functioning were examined. The dimensionality assumption was met after excluding item 8 ("I wish I could have more respect for myself."). The successive response categories for item 7 ("I feel that I am a person of worth, at least on an equal plane with others...
2018: Journal of Applied Measurement
Fereshteh Zeynivandnezhad
The advent of new technologies has replaced physical manipulatives, which are physical models of equivalent representations of concepts, with virtual manipulatives, which many learners and teachers find useful in the mathematics classroom. The current study investigated students' motivation to engage with virtual manipulatives as a tool in mathematics education. A multicomponential survey of virtual manipulatives in education, based on activity theory, was administered to 442 Iranian high school students with the aim of examining students' perception of various aspects of the manipulatives...
2018: Journal of Applied Measurement