Comparing the Effectiveness of Two Methods for Detecting Measurement Invariance at the Test Level "Dift and Sibtest" in Light of Differences in Ability Distribution and Sample Size

النفيعي، عبدالرحمن عبدالله أحمد

Comparing the Effectiveness of Two Methods for Detecting Measurement Invariance at the Test Level "Dift and Sibtest" in Light of Differences in Ability Distribution and Sample Size

العنوان بلغة أخرى:	مقارنة فعالية طريقتين لبيان ثبات القياس على مستوى الاختبار "اختبار الانحراف والاختبار البعدي" في ضوء الاختلافات في توزيع القدرة وحجم العينة
المصدر:	مجلة الجامعة الإسلامية للعلوم التربوية والاجتماعية
الناشر:	الجامعة الإسلامية بالمدينة المنورة
المؤلف الرئيسي:	النفيعي، عبدالرحمن عبدالله أحمد (مؤلف)
المؤلف الرئيسي (الإنجليزية):	Alnofei, Abdulrahman A.
المجلد/العدد:	ع19
محكمة:	نعم
الدولة:	السعودية
التاريخ الميلادي:	2024
التاريخ الهجري:	1446
الشهر:	سبتمبر
الصفحات:	365 - 406
DOI:	10.36046/2162-000-019-019
ISSN:	1658-8509
رقم MD:	1494080
نوع المحتوى:	بحوث ومقالات
اللغة:	الإنجليزية
قواعد المعلومات:	EduSearch
مواضيع:	النظريات الرياضية \| الاختبارات الإحصائية \| الأداء التفاضلي \| المعدلات القياسية
كلمات المؤلف المفتاحية:	تكافؤ القياس \| الأداء التفاضلي للاختبارات \| نظرية الاستجابة للمفردة \| طريقة DIFT \| طريقة SIBTEST \| Measurement Parity \| Differential Performance of Tests \| Test Single Response Theory \| DIFT Method \| SIBTEST Method
رابط المحتوى:	PDF (صورة)

عدد مرات التحميل

5

المستخلص:

هدفت الدراسة إلى مقارنة فاعلية أداء طريقتي DIFT وSIBTEST في الكشف عن تكافؤ القياس للاختبارات وفقا لمتغيري حجم العينة وفرق توزيع القدرة، وباستخدام تصميم بحثي عاملي يتم من خلاله دراسة أثر التفاعل بين طريقة الكشف وحجم العينة وفوارق توزيع القدرة، وذلك من خلال فحص معدلات الخطأ من النوع الأول، وقوة الاختبار. ولتحقيق هذا الهدف تم إجراء دراستين: الأولى لدراسة معدلات الخطأ من النوع الأول، والثانية لدراسة معدلات القوة للاختبار الإحصائي، وذلك عند ضبط فوارق توزيع القدرة وحجم العينة، باستخدام التصميم التجريبي للقياسات المتكررة لثلاثة عوامل تجريبية، وقد تم تحليل البيانات باستخدام الأسلوب الإحصائي لكل طريقة من طريقتي الكشف، وذلك لاختبار الفرضية الصفرية التي تنص على عدم وجود أداء تفاضلي للفقرة، والحصول على معدلات الخطأ من النوع الأول ومعدلات القوة. وتم معالجة البيانات باستخدام الأسلوب الإحصائي تحليل التباين المختلط. وبناء على نتائج التحليلات الإحصائية تم التوصل إلى الاستنتاجات التالية: -تميز طريقتي SIBTEST وDIFT بالفاعلية في الكشف عن الأداء التفاضلي للاختبار بصفة عامة. -طريقة الأداء التفاضلي لحزم الفقرات والاختبار أكثر فاعلية من طريقة اختبار تحيز الفقرة المتزامن عند أخذ حجم العينة في الاعتبار عند استخدام حجم العينة الكبير (١٠٠٠/١٠٠٠) أو أكثر. -طريقة اختبار تحيز الفقرة المتزامن أكثر فاعلية في الكشف عن الأداء التفاضلي لحزم الفقرات والاختبار في حالة عدم وجود فرق في توزيع القدرة، وفي حالة وجود فرق في توزيع القدرة فكلا الطريقتين غير فاعلة، حيث تعاني طريقة DIFT من ضعف قوة الاختبار الإحصائي، كما تعاني طريقة SIBTEST من تضخم الخطأ من النوع الأول، لذا يوصى باستخدام الطريقتين معا للكشف عن الأداء التفاضلي للاختبارات في حالة وجود فرق في توزيع القدرة بين المجموعتين.

The aim of the current study was to compare the effectiveness of the DIFT and SIBTEST methods in detecting measurement invariance for tests according to sample size and differences in ability distribution. A factorial experimental design was used to look at how the detection method, sample size, and differences in ability distribution all affect each other. Examining type I error rates and test power served to accomplish this. Two studies were conducted, the first to examine Type I error rates and the second to examine test power while controlling for ability distribution differences and sample size. Data were analyzed using statistical methods for each detection method to test the null hypothesis of no differential performance and obtain Type I error rates and test power. The data were processed using mixed-variance analysis. Based on the results of the statistical analysis, a number of important findings were obtained, including: both the SIBTEST and DIFT methods were effective in detecting differential performance of the test in general; the differential item functioning (DIF) method was more effective than the simultaneous item bias test (SIBTEST) when considering sample sizes of 1000 or more. And the differential item bias test was more effective in detecting differential performance of items and tests in the absence of ability distribution differences. However, in the presence of ability distribution differences, both methods were ineffective, as DIFT suffered from low statistical power and SIBTEST suffered from inflated Type I error rates. Therefore, the study recommends using both methods together to detect differential test performance in the presence of ability distribution differences between groups.

ISSN:

1658-8509

عناصر مشابهة

VIOLATION OF STATISTICAL ASSUMPTIONS IN THE TWO SAMPLE T-TEST
بواسطة: Ibrahim, Abrahim Hassan منشور: (1992)
Comparing Different Estimators for Two Parameters Weibull Distribution under censored sample Type I
بواسطة: Al-Sultany, Shurooq A. K. منشور: (2019)
Conducting Multivariate One- and Two-Sample Statistical Tests Using SPSS Syntax
بواسطة: بوتاني، دلشاد شاكر إسماعيل منشور: (2024)
comparing Two Numerical & Optimal Methods of Their Error
بواسطة: Hassan, Samy Refahy Mahmoud منشور: (1996)
Two-Sample Multivariate Test of Homogeneity Using Weighted Nearest Neighbours
بواسطة: بركات، أريج علي سعيد منشور: (2012)

Comparing the Effectiveness of Two Methods for Detecting Measurement Invariance at the Test Level "Dift and Sibtest" in Light of Differences in Ability Distribution and Sample Size

عدد مرات التحميل

5

عناصر مشابهة

دليل المستخدم

دليل الفيديو