Automated Essay Scoring Validity for Native Arabic Speaking Learners of English

Elgindy, Mervat Mohamed; Badawy, Amany; Kamel, Salwa

Automated Essay Scoring Validity for Native Arabic Speaking Learners of English

المصدر:	مجلة كلية الآداب
الناشر:	جامعة القاهرة - كلية الآداب
المؤلف الرئيسي:	Elgindy, Mervat Mohamed (Author)
مؤلفين آخرين:	Badawy, Amany (Advisor) , Kamel, Salwa (Advisor)
المجلد/العدد:	مج77, ج2
محكمة:	نعم
الدولة:	مصر
التاريخ الميلادي:	2017
الشهر:	يناير
الصفحات:	49 - 68
ISSN:	1012-6015
رقم MD:	846286
نوع المحتوى:	بحوث ومقالات
اللغة:	الإنجليزية
قواعد المعلومات:	AraBase
مواضيع:	نظم التقييم الآلي \| اللغة الإنجليزية \| طلبة الجامعات
رابط المحتوى:	PDF (صورة) PDF (نص) HTML

عدد مرات التحميل

12

LEADER	05592nam a22002417a 4500
001	1601981
041		\|a eng
044		\|b مصر
100		\|9 454580 \|a Elgindy, Mervat Mohamed \|e Author
245		\|a Automated Essay Scoring Validity for Native Arabic Speaking Learners of English
260		\|b جامعة القاهرة - كلية الآداب \|c 2017 \|g يناير
300		\|a 49 - 68
336		\|a بحوث ومقالات \|b Article
520		\|a تهدف الدراسة إلى قياس صحة استخدام نظم التقييم الآلي في تقييم كتابة المقال باللغة الإنجليزية لطلاب الجامعيات العربية. ومن ثمَ سيقارن أداء إحدى أنظمة التقييم الآلي لموضوعات المقال بأداء محكمين بشريين. والبرنامج المستخدم في الدراسة هو ماي أكسسMy Access ويدعم هذا البرنامج نظام IntelliMetric للتصحيح الآلي. تم الحصول على البيانات من خلال استخدام نظام IntelliMetric للتصحيح الآلي لتصحيح 55 مقالة كتبها طلبة العينة بالإضافة إلى ثلاثة من المحكمين البشريين ذوي الخبرة من أهل اللغة الإنجليزية يقومون بتصحيح نفس المقالات. وتم تقييم موضوعات المقال في برنامج ماى أكسسMy Access بناء على مقاييس منها ما هو تحليلي ومنها ما هو كلي وكل مقياس مقسم إلى أربع نقاط. ويشتمل المقياس التحليلي على خمسة معايير.تم تجميع درجات كل من البرنامج والمحكمين البشريين وتم حساب المتوسطات والإنحرفات المعيارية للدرجات بالإضافة إلى حساب معامل الإرتباط بطريقة برسون. وكذلك استخدام تحليل التباين أحادي الإتجاة لدراسة الفروق بين المتوسطات. و لمعرفة اتجاة ودلالة هذة الفروق تم اجراء المقارنات المتعددة بين المتوسطات بطريقة .LSD من خلال المعايير الخمسة أظهرت النتائج ان هناك ارتباط ضعيف ومتوسط بين المحكمين البشريين ونظام Access My يتراوح بين 0.308 و0.435، ومعامل الإرتباط بين My Access والمحكم الأول في التصحيح الكلي هو 0.278 و 0.288 مع المحكم الثاني، في حين ان معامل الإرتباط بين My Access والمحكم الثالث غير دال إحصائياً، أي انة لا توجد علاقة بين My Access والمحكم الثالث. وقد اثبت تحليل التباين أحادي الإتجاة انة توجد فروق بين متوسطات برنامج My Access والمحكمين البشريين. وبعد اجراء المقارنات المتعددة بطريقة LSD تبين انة ليس هناك فروق بين متوسطات درجات برنامج My Access والمحكم الثالث في ثلاثة معايير وايضاَ التصحيح الكلي.
520		\|b This study aimed to investigate the validity of using Automated Essay Scoring (AES) systems to score essays written by nonnative university female students of English whose native language was Arabic. For this purpose, the performance of the AES program, My Access which was supported by IntelliMetric scoring system, was compared with that of human raters in assigning scores. The data had been obtained by using the IntelliMetric scoring system to score 55 essays and by asking three qualified experienced human raters to score the same essays. The human raters were native English speakers. Four- point informative analytic and holistic rubrics had been used. The analytic rubric included five traits: focus and meaning, content and development, organization, language use, voice and style, and mechanics and conventions. The scores were then accumulated. Descriptive statistics, mean differences and Pearson Correlation Coefficient were calculated. The results showed that across the five traits the correlations between the human raters and IntelliMetric scores were weak and moderate, ranging from 0.308 to 0.435. The correlation between IntelliMetric and the first human rater (H1) on holistic scoring was 0.278 and 0.288 with the second human rater (H2). There was no significance correlation between IntelliMetric and the third human rater (H3) on holistic scoring. Across the five traits the results of One-Way Analysis of Variance (ANOVA) indicated that there was a statistically significant difference in the mean of IntelliMetric, H1, H2, and H3. Least Significant Difference (LSD) test showed that IntelliMetric and H3 were not statistically different on three traits besides holistic scoring: focus and meaning, content and development and mechanics and conventions. Regarding organization trait, IntelliMetric and H1 were not statistically different.
653		\|a نظم التقييم الآلي \|a اللغة الإنجليزية \|a طلبة الجامعات
700		\|9 454584 \|a Badawy, Amany \|e Advisor
700		\|a Kamel, Salwa \|e Advisor \|9 454582
773		\|4 الادب \|6 Literature \|c 004 \|e Bulletin of the Faculty of Arts \|f Maǧallaẗ Kulliyyaẗ al-ādāb \|l 002 \|m مج77, ج2 \|o 0415 \|s مجلة كلية الآداب \|v 077 \|x 1012-6015
856		\|u 0415-077-002-004.pdf
930		\|d y \|p y \|q y
995		\|a AraBase
999		\|c 846286 \|d 846286

عناصر مشابهة

My Access Feedback Validity for Native Arabic Speaking Learners of English
بواسطة: Elgindy, Mervat Mohammed منشور: (2016)
Error Analysis of the Written English Essays by Libyan EFL Learners: Case Study of Alasmarya University EFL Students
بواسطة: Abied, Ahmed Abdussalam منشور: (2022)
The Difficulties in Understanding Strong and Weak Forms, in Listening to Native Speakers in English Language
بواسطة: Yagoob, Adam Mohamed Ishag منشور: (2022)
Automatic Scoring Approach for Arabic Short Answers Essay Questions
بواسطة: Al Rababah, Hebah Hasan منشور: (2017)
The Frequency of the English Conjunctions Use in Argumentative Essay by Fell Learners
بواسطة: Boubakar, Fatma J. منشور: (2021)

Automated Essay Scoring Validity for Native Arabic Speaking Learners of English

عدد مرات التحميل

12

عناصر مشابهة

دليل المستخدم

دليل الفيديو