استفاده از تكنيك راي‌گيري اكثريت در طبقه‌بندي اخبار جعلي از واقعي با الگوريتم‌هاي درخت ‌تصميم،رگرسيون لجستيك و كا-نزديك ترين همسايگي

عنوان به زبان ديگر

Using the majority voting technique in classifying fake news from real news with decision tree, logistic regression and K-nearest neighbor algorithms.

پديدآورندگان

وظيفه‌آبان هادي hadi.vazifehaban@ihu.ac.ir دانشگاه جامع امام حسين(ع) , حسني آهنگر محمدرضا Mrhasani@ihu.ac.ir دانشگاه جامع امام حسين(ع)

تعداد صفحه

كليدواژه

طبقه‌بندي متن , اخبار جعلي , پردازش زبان طبيعي , متن انگليسي , راي‌گيري اكثريت

سال انتشار

1402

عنوان كنفرانس

ششمين همايش ملي فناوريهاي نوين در مهندسي برق، كامپيوتر و مكانيك ايران

زبان مدرك

فارسي

چكيده فارسي

براي طبقه‌بندي متن از يادگيري‌ ماشين و پردازش زبان طبيعي به طور گسترده‌اي استفاده مي‌شود. به طور كلي هدف يك طبقه‌بند متن، طبقه‌بندي سندها در قالب تعداد مشخصي از دسته‌هاي از پيش تعيين شده مي‌باشد. در اين پژوهش، اخبار پخش شده در فضاي مجازي مورد بررسي قرار گرفته است تا به اين سوال پاسخ داده ‌شود كه اين اخبار در كدام يك از دسته‌ي جعلي يا واقعي قرار مي‌گيرد. روش پيشنهادي با در نظر داشتن ساختاركلمات انگليسي، مدل tf-idf و الگوريتم‌هاي درخت ‌تصميم،رگرسيون لجستيك و كا-نزديك ترين همسايگي، اخبار جعلي را از واقعي بااستفاده از تكنيك راي‌گيري اكثريت طبقه‌بندي مي‌كند. در اين پژوهش سه الگوريتم يادگيري مذكور مورد استفاده قرار گرفت تا يك مدل تركيبي براي دسته‌بندي اخبار جعلي از واقعي ايجاد شود. مطابق نتايج و براساس شاخص‌هاي مختلف ازريابي دقت، مدل پيشنهادي دقتي در حدود 88 درصد داشت.

چكيده لاتين

Machine learning and natural language processing are widely used for text classification. In general, the purpose of a text classifier is to classify documents in the form of a certain number of predetermined categories. In this research, the news spread in cyberspace has been examined to answer the question of which category of fake or real this news is. Considering the structure of English words, tf-idf model and decision tree, logistic regression and k-nearest neighbor algorithms, the proposed method classifies fake news from real news using majority voting technique. In this research, the three mentioned learning algorithms were used to create a hybrid model for classifying fake news from real news. According to the results and based on various indicators of accuracy, the proposed model had an accuracy of about 88%.

كشور

ايران

لينک به اين مدرک

https://search.isc.ac/dl/search/defaultta.aspx?DTC=36&DC=333232