An analysis of the effectiveness of semantic tagging in resolving the sense ambiguity of specialized homographs, in terms of precision in retrieving scientific texts

Title in another language

Investigating the Effectiveness of Semantic Tagging in Sense Disambiguation of Specialized Homographs from the Perspective of Precision in Retrieving Scientific Texts

Authors

Rezaei Dinaei, Mina, Alzahra University - Faculty of Educational Sciences and Psychology, Tehran, Iran , Karbala Aghaei Kamran, Masoumeh, Alzahra University - Faculty of Educational Sciences and Psychology - Department of Information Science and Knowledge Studies, Tehran, Iran , Mirzaeian, Vahidreza, Alzahra University - Faculty of Literature - Department of English Language and Literature, Tehran, Iran

Keywords

Specialized homograph , Information retrieval , Information organization , Tagging , Text corpus

Persian abstract

Objective: To explain how text corpus tagging can be applied to the sense disambiguation of specialized homographs, in terms of precision in retrieving scientific texts that contain such homographs. Method: This research is applied in purpose and was conducted experimentally; with respect to sense disambiguation, it counts as a supervised method. The research population consisted of 442 scientific articles in two groups, control and experimental. The control group comprised 221 untagged full-text articles and the experimental group comprised the same 221 articles, this time tagged; both were tested in an information retrieval system to measure how effective the tags are in the sense disambiguation of specialized homographs. Findings: The significance level of the Wilcoxon signed-rank test (P = 0.0001, Z = -5.909) shows that the precision of retrieval results for specialized homographs after using the tagged specialized corpus in the information retrieval system differs significantly from the precision before. Examination of the negative and positive ranks shows that precision after using the tagged specialized corpus increased significantly and reached its maximum value of 1. Conclusion: If retrieval system designers focus on optimizing retrieval formulas and enable retrieval systems to search for relevant documents, researchers with any physiological, experiential, and knowledge characteristics will be able to reach documents relevant to their information needs in little time. This study revealed the value of the text corpus, as a rich knowledge-based repository, in distinguishing the semantic roles of specialized homographs.

English abstract

Objective: This study aimed to explain how text corpus tagging can be applied to the sense disambiguation of specialized homographs and to increasing the retrieval precision of scientific texts containing such homographs. Methodology: The research was conducted experimentally and uses a supervised method, one of the three methods of word sense disambiguation. The research sample consisted of 442 scientific articles in two groups, an experimental group and a control group. The control group comprised 221 full-text articles without tags and the experimental group comprised the same 221 articles with tags; both were tested in the information retrieval system to measure the effectiveness of tagging in the sense disambiguation of specialized homographs. Findings: Retrieval in the control group suffers from false drops and reduced precision because of the sense ambiguity of specialized homographs, whereas tagging specialized homographs in the full text of the articles in the experimental group has a direct effect on their sense disambiguation. With tags, the occurrences of a specialized homograph related to each tag can be retrieved separately, which is not possible in the control group. The Wilcoxon signed-rank test (P = 0.0001, Z = -5.909) shows that the precision of retrieval results for specialized homographs after using the tagged text corpus in the information retrieval system differs significantly from the precision before. Examination of the negative and positive ranks shows that precision after using the tagged text corpus increased significantly and reached its maximum value of 1. Conclusion: The precision achieved in retrieving scientific texts is evidence that tagging is acceptably effective in the sense disambiguation of specialized homographs and plays an effective role in optimizing the information retrieval system. If retrieval system designers focus on optimizing retrieval formulas for searching specialized homographs and enable retrieval systems to find relevant documents, researchers with any physiological, experiential, and knowledge characteristics will be able to access documents relevant to their information needs in a short time. This study revealed the value of the text corpus as a rich knowledge-based resource for information retrieval systems in distinguishing the semantic roles of specialized homographs. Although the research was conducted on a limited corpus, the researcher believes that, because this corpus was designed in a principled way and its texts were deliberately selected, the findings can be generalized to all scientific texts in various fields.
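
The contrast the abstract draws between untagged and tagged retrieval can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and none of it comes from the study: the homograph "cell", the four toy documents, the `term<sense>` tag syntax, and the helper names `retrieve_untagged`, `retrieve_tagged`, and `precision` are all hypothetical. Tags are applied from the gold-standard sense, which loosely mimics supervised tagging.

```python
from dataclasses import dataclass


@dataclass
class Doc:
    doc_id: str
    text: str    # full text; tagged occurrences look like "term<sense>"
    sense: str   # gold-standard sense of the homograph in this document


def retrieve_untagged(docs, term):
    """Plain string match: every document containing the term is returned."""
    return [d for d in docs if term in d.text]


def retrieve_tagged(docs, term, sense):
    """Return only documents where the term carries the requested sense tag."""
    return [d for d in docs if f"{term}<{sense}>" in d.text]


def precision(retrieved, wanted_sense):
    """Fraction of retrieved documents whose gold sense is the wanted one."""
    if not retrieved:
        return 0.0
    relevant = sum(1 for d in retrieved if d.sense == wanted_sense)
    return relevant / len(retrieved)


# Hypothetical control corpus: the homograph "cell" appears without tags.
control = [
    Doc("a1", "the cell membrane regulates transport", "biology"),
    Doc("a2", "each cell tower covers one sector", "telecom"),
    Doc("a3", "stem cell differentiation", "biology"),
    Doc("a4", "handover between cell sites", "telecom"),
]

# Hypothetical experimental corpus: the same texts, with each occurrence
# of the homograph tagged with its gold-standard sense.
experimental = [
    Doc(d.doc_id, d.text.replace("cell", f"cell<{d.sense}>"), d.sense)
    for d in control
]

before = precision(retrieve_untagged(control, "cell"), "biology")
after = precision(retrieve_tagged(experimental, "cell", "biology"), "biology")
print(before, after)  # 0.5 1.0
```

In this toy setup the untagged query returns documents for both senses (a false drop for half of them), while the tagged query returns only documents with the requested sense, so precision rises to 1. The study's own corpus, tag set, and retrieval system may differ substantially from this sketch.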

Publication year

1400

Journal title

كتابداري و اطلاع رساني (Library and Information Science)

PDF file

8602008

Link to this record

https://search.isc.ac/dl/search/defaultta.aspx?DTC=8&DC=1272492