首页> 外文期刊>Frontiers in Psychology >A Comparison of IRT Observed Score Kernel Equating and Several Equating Methods
【24h】

A Comparison of IRT Observed Score Kernel Equating and Several Equating Methods

机译:IRT观察评分核等式的比较和几种等式方法

获取原文
           

摘要

Item response theory (IRT) observed score kernel equating was evaluated and compared with equipercentile equating, IRT observed score equating, and kernel equating methods by varying the sample size and test length. Considering that IRT data simulation might unequally favor IRT equating methods, pseudo tests and pseudo groups were also constructed to make equating results comparable with those from the IRT data simulation. Identity equating and the large sample single group rule were both set as criterion equating (or true equating) on which local and global indices were based. Results show that in random equivalent groups design, IRT observed score kernel equating is more accurate and stable than others. In non-equivalent groups with anchor test design, IRT observed score equating shows lowest systematic and random errors among equating methods. Those errors decrease as a shorter test and a larger sample are used in equating; nevertheless, effect of the latter one is ignorable. No clear preference for data simulation method is found, though still affecting equating results. Preferences for true equating are spotted in random Equivalent Groups design. Finally, recommendations and further improvements are discussed.
机译:物品响应理论(IRT)观察到的评分核等式被评估并与Equipercent Sequent,IRT观察到的评分等式进行比较,通过改变样品大小和测试长度来进行核等式方法。考虑到IRT数据仿真可能不平等地利用IRT等式方法,还构造了伪测试和伪组以使等同于与IRT数据模拟的结果相当的结果。标识等式和大型样本单组规则都被设置为标准等式(或真正的等式),其中本地和全局指数是基于的。结果表明,在随机等效组设计中,IRT观察到的核心等于更准确且稳定。在具有锚定测试设计的非等效组中,IRT观察到的评分等方案显示等式方法中的最低系统和随机误差。这些误差随着较短的测试而减少,并且在等式中使用较大的样本;然而,后者的效果是无知的。没有发现对数据仿真方法的清晰偏好,尽管仍然影响等同于等同的结果。真正等式的偏好在随机等效组设计中被发现。最后,讨论了建议和进一步改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号