32nd National and 10th International Iranian Conference on Biomedical Engineering
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
Authors:
Mehrdad Ghassabi (1)
Pedram Rostami (2)
Hamidreza Baradaran Kashani (3)
Amirhossein Poursina (4)
Zahra Kazemi (5)
Milad Tavakoli (6)
1- University of Isfahan
2- University of Tehran
3- University of Isfahan
4- Isfahan University of Medical Sciences
5- University of Isfahan
6- University of Isfahan
Keywords:
Persian medical question answering, small language model, medical language models, data crawling
Abstract:
The rapid advancement of language models has demonstrated the potential of artificial intelligence in the healthcare industry. However, small language models struggle with specialized domains in low-resource languages like Persian. While numerous medical-domain websites exist in Persian, no curated dataset or corpus has been available, making ours the first of its kind. This study explores enhancing the medical knowledge of a small language model by leveraging accessible online data, including a corpus crawled from medical magazines and a dataset of real doctor-patient Q&A pairs. We fine-tuned a baseline model on our curated data to improve its medical knowledge. Benchmark evaluations show that the fine-tuned model achieves higher accuracy in medical question answering and provides better responses than its baseline. This work highlights the potential of leveraging open-access online data to enrich small language models in medical fields, providing a novel solution for Persian medical AI applications suited to resource-constrained environments.