• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Applying the Rasch Model: Fundamental Measurement in the Human Sciences, Second Edition. (2007)

by T G Bond, C M Fox
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 319
Next 10 →

Scaling of Theory-of-Mind Tasks

by Henry M Wellman , David Liu , 1995
"... ..."
Abstract - Cited by 67 (4 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...outfit is a mean of 0 and standard deviation of 1.0; fit statistics42.0 indicate misfit. 534 Wellman and Liu person’s measurement level (Linacre & Wright, 1994; Wright & Masters, 1982). Standardized infit and outfit statistics for individual items have an expected value of 0. Positive values greater than 2.0 indicate greater unpredictable variation than expected. Negative values suggest the scale is more deterministic than expected because Rasch models are probabilistic. Therefore, negative values are acceptable for our comparison with the Guttman scale because they actually indicate overfit (Bond & Fox, 2001). Therefore, we consider standardized fit statistics for individual items greater than 2.0 as indicating misfit (Wright & Masters, 1982). As shown in Table 5, all five items’ standardized infit and outfit statistics fall well short of 2.0, and mean fit statistics are near the expected value of 0. Mean standardized infit and outfit statistics for person ability, which indicate overall fit of individual persons to the scale, also fall well short of 2.0 and are near their expected value of 0. Therefore, these five items fit the Rasch model well. Seven-item Rasch model. A problematic outcome of a ...

Comparing the aberrant response detection performance of thirty-six person-fit statistics

by George Karabatsos - Applied Measurement in Education , 2003
"... The accurate measurement of examinee test performance is critical to educational decision-making, and inaccurate measurement can lead to negative consequences for examinees. Person-fit statistics are important in a psychometric analysis for de-tecting examinees with aberrant response patterns that l ..."
Abstract - Cited by 28 (0 self) - Add to MetaCart
The accurate measurement of examinee test performance is critical to educational decision-making, and inaccurate measurement can lead to negative consequences for examinees. Person-fit statistics are important in a psychometric analysis for de-tecting examinees with aberrant response patterns that lead to inaccurate measure-ment. Unfortunately, although a large number of person-fit statistics is available, there is little consensus as to which ones are most useful. The purpose of this study was to compare 36 person-fit indices, under different testing conditions, to obtain a better consensus as to their relative merits. The results of these comparisons, and their implications, are discussed. Sound decisions in educational settings hinge largely on accurate measurement of student characteristics. Such measurements can help identify those individuals who are qualified enough to enter a particular school, or receive a particular edu-cational degree. Also, these measurements can be used to monitor students ’ learn-ing progress. This may, for example, enable educators to productively tailor their curriculum, or help policy makers decide on important educational issues. In contrast, the inaccurate measurement of test performance can lead to nega-tive consequences. On the one hand, spuriously high test scores can lead to un-qualified individuals being enrolled into an educational program (e.g., undergrad-uate, graduate, or professional), or being awarded an educational degree. On the other hand, qualified individuals with spuriously low test scores may be unfairly excluded from academic programs, or unfairly denied a degree. Furthermore, the inaccurate measurement of test performance undermines the assessment of stu-dents ’ learning progress, and curriculum planning efforts.
(Show Context)

Citation Context

...97), and also, its parametric simplicity enables a straightforward investigation of the fit statistics. Furthermore, the Rasch model is frequently used in educational and psychological testing (e.g., =-=Bond & Fox, 2001-=-; Fisher & Wright, 1997; Wilson & Englehard, 2000).sThe Rasch model postulates that when examinee n with ability θn encounters item j with difficulty δj, the probability of a correct response Pnj1 dep...

Developing Citizens: The Impact of Civic Learning Opportunities on Students’ Commitment to Civic Participation

by Joseph E. Kahne, Susan E. Sporte - American Educational Research Journal , 2008
"... This study of 4,057 students from 52 high schools in Chicago finds that a set of specific kinds of civic learning opportunities fosters notable improvements in students ’ commitments to civic participation. The study controls for demo-graphic factors, preexisting civic commitments, and academic test ..."
Abstract - Cited by 28 (1 self) - Add to MetaCart
This study of 4,057 students from 52 high schools in Chicago finds that a set of specific kinds of civic learning opportunities fosters notable improvements in students ’ commitments to civic participation. The study controls for demo-graphic factors, preexisting civic commitments, and academic test scores. Prior large-scale studies that found limited impact from school-based civic education often did not focus on the content and style of the curriculum and instruction. Discussing civic and political issues with one’s parents, extracurricular activi-ties other than sports, and living in a civically responsive neighborhood also appear to meaningfully support this goal. Other school characteristics appear less influential.
(Show Context)

Citation Context

...espondents and puts all respondent scores on the same scale based on the likelihood that the respondent endorses each item in the suite of items (for an introductory discussion of Rasch modeling, see =-=Bond & Fox, 2001-=-). Rasch measures are scaled in logits; we transformed them to a 10-point scale for ease of explanation. This approach permits the creation of a latent variable such as “commitment to civic participat...

A stage is a stage is a stage: A direct comparison of two scoring systems

by Theo L. Dawson - Journal of Genetic Psychology , 2003
"... ABSTRACT. L. Kohlberg (1969) argued that his moral stages captured a developmental sequence specific to the moral domain. To explore this contention, the author compared stage assignments obtained with the Standard Issue Scoring System (A. Colby & L. Kohlberg, 1987a, 1987b) and those obtained wi ..."
Abstract - Cited by 13 (0 self) - Add to MetaCart
ABSTRACT. L. Kohlberg (1969) argued that his moral stages captured a developmental sequence specific to the moral domain. To explore this contention, the author compared stage assignments obtained with the Standard Issue Scoring System (A. Colby & L. Kohlberg, 1987a, 1987b) and those obtained with a generalized content-independent stage-scoring system called the Hierarchical Complexity Scoring System (T. Dawson, 2002a), on 637 moral judgment interviews (participants ’ ages ranged from 5 to 86 years). The cor-relation between stage scores produced with the 2 systems was.88. Although standard issue scoring and hierarchical complexity scoring often awarded different scores up to Kohlberg’s Moral Stage 2/3, from his Moral Stage 3 onward, scores awarded with the two systems predominantly agreed. The author explores the implications for developmental research. Key words: cognitive development, developmental assessment, developmental stages, life-span development, moral development I EXPLORED KOHLBERG’S contention that moral stages represent a unique cognitive structure, along with broader questions about the nature of develop-ment, by comparing the functioning of two developmental stage-scoring sys-tems—Kohlberg’s domain-specific Standard Issue Scoring System (Colby & Kohlberg, 1987b), and the domain-general Hierarchical Complexity Scoring Sys-tem (Dawson, 2002a). Developmental stages, also referred to in this article as orders of hierarchi-cal complexity (complexity orders), are conceived of as a series of hierarchical integrations of knowledge structures. Most developmental stage theories use the notion of hierarchical complexity. In the Piagetian model (Piaget, 1977), for The author thanks the Murray Research Center, Cheryl Armon, Marvin Berkowitz, and Larry Walker for the use of their moral judgment interview data. This project was funded by a grant from the Spencer Foundation. The data presented, the statements made, and the views expressed are the responsibility solely of the author.

The Rainbow Project: Enhancing the SAT through assessments of analytical, practical, and creative skills

by Robert J. Sternberg , 2006
"... This article describes the formulation and execution of the Rainbow Project, Phase I, funded by the College Board. Past data suggest that the SAT is a good predictor of performance in college. But in terms of the amount of variance explained by the SAT, there is room for improvement, as there would ..."
Abstract - Cited by 11 (0 self) - Add to MetaCart
This article describes the formulation and execution of the Rainbow Project, Phase I, funded by the College Board. Past data suggest that the SAT is a good predictor of performance in college. But in terms of the amount of variance explained by the SAT, there is room for improvement, as there would be for virtually any single test battery. Phase I of the Rainbow Project, described here, uses Sternberg's triarchic theory of successful intelligence as a basis to provide a supplementary assessment of analytical skills, as well as tests of practical and creative skills, to augment the SAT in predicting college performance. This assessment is delivered through a modification of the Sternberg Triarchic Abilities Test (STAT) and the development of new assessment devices. Results from Phase I of the Rainbow Project support the construct validity of the theory of successful intelligence and suggest its potential for use in college admissions as an enhancement to the SAT. In particular, the results indicated that the triarchically based Rainbow measures enhanced predictive validity for college GPA relative to high school grade point average (GPA) and the SAT and also reduced ethnic group differences. The data suggest that measures such as these potentially could increase diversity and equity in the admissions process.
(Show Context)

Citation Context

...ent sample. This approach served to increase the precision of the estimates, but did not alter in a substantive way the difference between scores of participants as the IRT estimates are sample-free (=-=Bond & Fox, 2001-=-; Wright & Stone, 1979). The details of these analyses, based on the combined high school and college sample (N=1013), are provided in the technical report prepared independently by the Jefferson Psyc...

On the form and function of forgiving: modeling the timeforgiveness relationship and testing the valuable relationships hypothesis

by Michael E. Mccullough, Lindsey Root Luna, Jack W. Berry, Benjamin A. Tabak, Giacomo Bono - Emotion , 2010
"... In two studies, the authors sought to identify the mathematical function underlying the temporal course of forgiveness. A logarithmic model outperformed linear, exponential, power, hyperbolic, and exponential-power models. The logarithmic function implies a psychological process yielding diminish-in ..."
Abstract - Cited by 11 (5 self) - Add to MetaCart
In two studies, the authors sought to identify the mathematical function underlying the temporal course of forgiveness. A logarithmic model outperformed linear, exponential, power, hyperbolic, and exponential-power models. The logarithmic function implies a psychological process yielding diminish-ing returns, corresponds to the Weber-Fechner law, and is functionally similar to the power law underlying the psychophysical function (Stevens, 1971) and the forgetting function (Wixted & Ebbesen, 1997). By 3 months after their transgressions, the typical participant’s forgiveness had increased by two log-odds units. Individual differences in rates of change were correlated with robust predictors of forgiveness. Consistent with evolutionary theorizing (McCullough, 2008), Study 2 revealed that forgive-ness was uniquely associated with participants ’ perceptions that their relationships with their offenders retained value.
(Show Context)

Citation Context

...n & Reise, 2000). Such floor and ceiling effects in previous research might have limited the ability to depict change faithfully. More advanced psychometric scaling procedures, such as Rasch scaling (=-=Bond & Fox, 2001-=-), can be used to convert ordinal raw scores to intervallevel measures that are theoretically unbounded at the extremes, thereby removing such floor and ceiling effects. In Rasch scaling, person laten...

Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis

by Thomas Eckes - Language Assessment Quarterly , 2005
"... I studied rater effects in the writing and speaking sections of the Test of German as a Foreign Language (TestDaF). Building on the many-facet Rasch measurement meth-odology, the focus was on rater main effects as well as 2- and 3-way interactions be-tween raters and the other facets involved, that ..."
Abstract - Cited by 9 (2 self) - Add to MetaCart
I studied rater effects in the writing and speaking sections of the Test of German as a Foreign Language (TestDaF). Building on the many-facet Rasch measurement meth-odology, the focus was on rater main effects as well as 2- and 3-way interactions be-tween raters and the other facets involved, that is, examinees, rating criteria (in the writing section), and tasks (in the speaking section). Another goal was to investigate differential rater functioning related to examinee gender. Results showed that raters (a) differed strongly in the severity with which they rated examinees; (b) were fairly consistent in their overall ratings; (c) were substantially less consistent in relation to rating criteria (or speaking tasks, respectively) than in relation to examinees; and (d) as a group, were not subject to gender bias. These findings have implications for con-trolling and assuring the psychometric quality of the TestDaF rater-mediated assess-ment system. Rater effects such as severity or leniency, halo, or central tendency are commonly viewed as a source of method variance, that is, as a source of systematic variance in observed ratings that is associated with the raters and not with the ratees (Cron-
(Show Context)

Citation Context

...it for infit and outfit mean-square statistics.8 Others researchers suggested to use a narrower range defined by a lower control limit of 0.70 (or 0.75) and an upper control limit of 1.30 (see, e.g., =-=Bond & Fox, 2001-=-; McNamara, 1996). Table 4 presents percentages of rater fit values falling into the overfit, acceptable fit, or misfit categories, using either a narrow or a wide range of upper and lower control lim...

A twin-family study of general IQ

by Marieke Van Leeuwen, Stéphanie M. Van Den Berg, Dorret I. Boomsma - Learning and Individual Differences , 2008
"... This article was published in an Elsevier journal. The attached copy is furnished to the author for non-commercial research and education use, including for instruction at the author’s institution, sharing with colleagues and providing to institution administration. Other uses, including reproductio ..."
Abstract - Cited by 9 (6 self) - Add to MetaCart
This article was published in an Elsevier journal. The attached copy is furnished to the author for non-commercial research and education use, including for instruction at the author’s institution, sharing with colleagues and providing to institution administration. Other uses, including reproduction and distribution, or selling or licensing copies, or posting to personal, institutional or third party websites are prohibited. In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier’s archiving and manuscript policies are encouraged to visit: http://www.elsevier.com/copyright Author's personal copy Available online at www.sciencedirect.com

WHERE DOES GENDER FIT IN THE MEASUREMENT OF SELF-CONTROL?

by Chris L. Gibson, Jeffrey T. Ward, John Paul Wright, Kevin M. Beaver, Matt Delisi, Chris L. Gibson, Jeffrey T. Ward, John Paul Wright, Kevin M. Beaver , 2010
"... The online version of this article can be found at: DOI: 10.1177/0093854810369082 ..."
Abstract - Cited by 9 (2 self) - Add to MetaCart
The online version of this article can be found at: DOI: 10.1177/0093854810369082
(Show Context)

Citation Context

...difficulty of items that compose the measurement instrument. For dichotomous models, item difficulty is defined as the ability levelsin which a person has a 50% chance of answering an item correctly (=-=Bond & Fox, 2001-=-;sB. D. Wright & Masters, 1982). Person ability and item difficulty are placed on the sameslinear scale consisting of logit units that range from ± ∞ (Bond & Fox, 2001; B. D.sWright & Masters, 1982). ...

Using Rasch scaled stage scores to validate orders of hierarchical complexity of balance beam task sequences

by Eric Andrew Goodheart, Alexander Pekker, Theo Linda Dawson, Karen Draney, Kathryn Marie Adams - In Rasch measurement: Advanced and specialized , 2007
"... These studies examine the relationship between the analytic basis underlying the hierarchies produced by the Model of Hierarchical Complexity and the probabilistic Rasch scales that places both partici-pants and problems along a single hierarchically ordered dimension. A Rasch analysis was performed ..."
Abstract - Cited by 8 (0 self) - Add to MetaCart
These studies examine the relationship between the analytic basis underlying the hierarchies produced by the Model of Hierarchical Complexity and the probabilistic Rasch scales that places both partici-pants and problems along a single hierarchically ordered dimension. A Rasch analysis was performed on data from the balance-beam task series. This yielded scaled stage of performance for each of the items. The items formed a series of clusters along this same dimension, according to their order of hierarchical complexity. We sought to ascertain whether there was a significant relationship between the order of hierarchical complexity (a task property variable) of the tasks and the corresponding Rasch scaled difficulty of those same items (a performance variable). It was found that The Model of Hierarchical Complexity was highly accurate in predicting the Rasch Stage scores of the performed tasks, therefore providing an analytic and developmental basis for the Rasch scaled stages.
(Show Context)

Citation Context

...lts is due to thesfact that the resulting scaling of item difficultysobtained with a Rasch analysis is independent ofsthe distribution of the particular participants usedswhen the data fit the model (=-=Bond and Fox, 2001-=-).sThe Rasch model enables us to estimate subjectsability and item difficulty independently of eachsother (Andrich, 1988). Among the participantssthere needs to be a range of performance so thatssome ...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University