Identifying Informative Coronavirus Tweets Using Recurrent Neural Network Document Embedding

يوسف، رامي نعيم محمد

Identifying Informative Coronavirus Tweets Using Recurrent Neural Network Document Embedding

العنوان بلغة أخرى:	تصنيف الأخبار المسهبة حول وباء الكورونا المستجد عبر تويتر باستخدام تضمين الملفات النصية بواسطة خوارزمية الشبكات العصبية المتكررة
المصدر:	مجلة جامعة فلسطين التقنية للأبحاث
الناشر:	جامعة فلسطين التقنية خضوري - عمادة البحث العلمي
المؤلف الرئيسي:	يوسف، رامي نعيم محمد (مؤلف)
المؤلف الرئيسي (الإنجليزية):	Yousuf, Rami Naim Mohammed
المجلد/العدد:	مج10, ع1
محكمة:	نعم
الدولة:	فلسطين
التاريخ الميلادي:	2022
الشهر:	مايو
الصفحات:	93 - 102
ISSN:	2307-8081
رقم MD:	1423982
نوع المحتوى:	بحوث ومقالات
اللغة:	الإنجليزية
قواعد المعلومات:	EduSearch, HumanIndex
مواضيع:	جائحة كورونا "كوفيد19" \| العلامات التجارية \| النصوص المركبة \| مكافحة الوباء
كلمات المؤلف المفتاحية:	فيروس كورونا \| تغريدات مسهبة إعلاميا \| تضمين النصوص \| الشبكة العصبية المتكررة \| بيانات مشوشة من انشاء المستخدم \| Coronavirus \| Informative Tweets \| Document Embedding \| Recurrent Neural Network \| Wnut-2020
رابط المحتوى:	PDF (صورة)

عدد مرات التحميل

2

المستخلص:

أدى ظهور جائحة كورونا إلى انتشار عدد هائل من أخبار كاذبة ومعلومات مضللة عبر التغريدات. ومن ثم فان هناك حاجة مهمه تتمثل في تصنيف التغريدات إلى تغريدات مسهبة إعلاميا وغير مسهبة إعلاميا وهذا الأمر حفز الباحثين على استخدام تقنيات التعلم الآلي لمعالجة هذا الموضوع. وقد أظهرت الدراسات الحديثة اعتمادا كبيرا على خوارزميات الشبكة العصبية المتحورة. ومع ذلك فهي تعاني من مشكلة تدعى "النسيان الكارثي" حيث تفقد معلومات سياقية خلال عملية التدريب. لذلك هذه الدراسة تهدف إلى اقتراح آلية لتضمين النصوص بناءا على الشبكة العصبية المتكررة. أخيرا، تم استخدام ثلاثة مصنفات من LR وSVM وMLP لتصنيف المستندات إلى مسهبة إعلاميا وغير مسهبة إعلاميا باستخدام مجموعة البيانات المعيارية لـ WNUT-2020 في المهمة 2، حصل مصنف LR على أعلى مقياس يبلغ 0.91. توضح هذه النتيجة فعالية RNN في توليد آليات تضمين نصوص مركبة.

The coronavirus pandemic has led to the spread of tremendous fake news and misleading information through tweets. Hence, an interesting task of classifying tweets into informative and uninformative has motivated researchers to employ machine learning techniques. The state-of-the-art studies showed high dependency on transformers architecture. However, the transformers architecture suffers from the catastrophic forgetting problem where important contextual information is being forgotten by the gradients. Therefore, this paper proposes a document embedding using Recurrent Neural Network. Lastly, three classifiers of LR, SVM and MLP have been used to classify documents into Informative and Uninformative. Using the benchmark dataset of WNUT-2020 at Task 2, LR classifier obtained the highest f-measure of 0.91. This result demonstrates the efficacy of RNN to generate sophisticated document embedding.

ISSN:

2307-8081

عناصر مشابهة

Using Neural Networks to Classify COVID-19 Death Rates Cases in Iraq
بواسطة: غافل، منى طاهر منشور: (2024)
Coronavirus Pandemic and the Social Ties
بواسطة: Dechira, Mohammed Amine منشور: (2022)
COVID-19 Prediction for Jordan Using Neural Networks
بواسطة: عبيدات، أمل محمد منشور: (2022)
EFL Curriculum Design in the Post-Coronavirus Era
بواسطة: Attelisi, Adulhameed A. منشور: (2022)
A Pragma-Discourse Study of the CNN Article’s "Omicron is now the Dominant Strain of Coronavirus in the US, According to the CDC"
بواسطة: عبدالله، سارة موفق منشور: (2022)

Identifying Informative Coronavirus Tweets Using Recurrent Neural Network Document Embedding

عدد مرات التحميل

2

عناصر مشابهة

دليل المستخدم

دليل الفيديو