Abstract
Generative artificial intelligence (genAI) models are rapidly being adopted for health information delivery. Nevertheless, systematic cross-linguistic evaluations of their clinical reliability—particularly in high-burden conditions such as asthma, allergy, and respiratory tract infections (RTIs)—remain limited. The aim of this study was to compare the English and Arabic performance of ChatGPT‑4o, Gemini, and DeepSeek in responding to common asthma, allergy, and RTI queries using a validated clinical assessment framework. A bilingual evaluation was conducted using 30 frequently asked questions (FAQs) related to asthma, allergy, and RTIs. Each question was submitted in English and Arabic to ChatGPT‑4o, Gemini, and DeepSeek. Responses were evaluated independently by three bilingual clinical experts using the CLEAR framework for Completeness, Accuracy, and Relevance of the generated content. Inter-rater reliability was assessed using intraclass correlation coefficients (ICCs). Language and model comparisons were analyzed using non-parametric Kruskal-Wallis and Mann-Whitney U tests. The study followed the METRICS reporting guideline for genAI in healthcare. ChatGPT‑4o consistently outperformed Gemini and DeepSeek across all CLEAR dimensions and the two languages. In English, the mean CLEAR scores were: ChatGPT‑4o: 3.90, Gemini: 2.50, DeepSeek: 2.09. In Arabic, ChatGPT‑4o again scored highest (3.63), followed by Gemini (2.38) and DeepSeek (1.84). All inter-model differences were statistically significant (p < 0.001). Inter-rater reliability was excellent across dimensions: ICC for completeness = 0.858, accuracy = 0.917, relevance = 0.950 (all p < 0.001), confirming strong consistency and validity in scoring. Within each genAI model, English outputs significantly outperformed Arabic in completeness, accuracy, relevance, and the overall CLEAR score. Domain-wise, asthma queries achieved the highest performance across models and languages, while allergy queries showed the lowest accuracy. ChatGPT‑4o demonstrated superior bilingual performance, while Gemini and DeepSeek exhibited significant limitations, particularly in Arabic. These findings highlight persistent language-based disparities in genAI health outputs. Rigorous cross-linguistic evaluation and domain-specific fine-tuning are essential to ensure safe and equitable deployment of genAI tools in global health communication.
References
Cline RJ, Haynes KM. Consumer health information seeking on the Internet: The state of the art. Health education research. 2001;16(6):671-692.
Lemire M, Paré G, Sicotte C, Harvey C. Determinants of Internet use as a preferred source of information on personal health. International journal of medical informatics. 2008;77(11):723-734.
Tonsaker T, Bartlett G, Trpkov C. Health information on the Internet: gold mine or minefield? Canadian Family Physician. 2014;60(5):407-408.
Andreassen HK, Bujnowska-Fedak MM, Chronaki CE, Dumitru RC, Pudule I, Santana S, et al. European citizens' use of E-health services: a study of seven countries. BMC public health. 2007;7(1):53.
Kummervold P, Chronaki C, Lausen B, Prokosch H-U, Rasmussen J, Santana S, et al. eHealth trends in Europe 2005-2007: a population-based survey. Journal of Medical Internet Research. 2008;10(4):e42.
Frishauf P. Medscape–The first 5 years. Medscape General Medicine. 2005;7(2):5.
National Health Service (NHS). NHS website for England: Find information and services to help you manage your health [Internet]. Leeds, England: National Health Service; 2025. Available from: https://www.nhs.uk/.
Smith CA, Wicks PJ. PatientsLikeMe: Consumer health vocabulary as a folksonomy. AMIA annual symposium proceedings. 2008;2008:682.
Arruda RMd, Ayoub IA, Nunes R, Azevedo Neto RSd, Nunes MdPT. Consulting “Dr. Google”: how the digital search for internet health information influences doctor-patient relationship. Cadernos de Saúde Pública. 2025;41:e00153623.
Cacciamani GE, Dell’Oglio P, Cocci A, Russo GI, Abreu ADC, Gill IS, et al. Asking “Dr. Google” for a second opinion: the devil is in the details. European urology focus. 2021;7(2):479-481.
Fox S. Pew Research Center. Online Health Search 2006: Part 1. 113 Million Internet Users Seek Health Information Online [Internet]. Washington, DC: Pew Research Center; 2006. Available from: https://www.pewresearch.org/internet/2006/10/29/part-1-113-million-internet-users-seek-health-information-online/.
Wac K. Smartphone as a personal, pervasive health informatics services platform: literature review. Yearbook of medical informatics. 2012;21(01):83-93.
Zawati MnH, Lang M. Does an app a day keep the doctor away? AI symptom checker applications, entrenched bias, and professional responsibility. Journal of Medical Internet Research. 2024;26:e50344.
Ozdalga E, Ozdalga A, Ahuja N. The smartphone in medicine: a review of current and potential use among physicians and students. Journal of Medical Internet Research. 2012;14(5):e128.
Shen Y-T, Chen L, Yue W-W, Xu H-X. Digital technology-based telemedicine for the COVID-19 pandemic. Frontiers in medicine. 2021;8:646506.
Rouvinen H, Turunen H, Lindfors P, Kinnunen JM, Rimpelä A, Koivusilta L, et al. Online health information-seeking behaviour and mental well-being among Finnish higher education students during COVID-19. Health promotion international. 2023;38(6):daad143.
Almalki M, Azeez F. Health chatbots for fighting COVID-19: a scoping review. Acta Informatica Medica. 2020;28(4):241-247.
Sallam M. ChatGPT utility in healthcare education, research, and practice: systematic review on the promising perspectives and valid concerns. Healthcare. 2023;11(6):887.
Alanzi TM. Impact of ChatGPT on teleconsultants in healthcare: perceptions of healthcare experts in Saudi Arabia. Journal of multidisciplinary healthcare. 2023:2309-2321.
Alanezi F. Factors influencing patients’ engagement with ChatGPT for accessing health-related information. Critical Public Health. 2024;34(1):1-20.
Walker HL, Ghani S, Kuemmerli C, Nebiker CA, Müller BP, Raptis DA, et al. Reliability of medical information provided by ChatGPT: assessment against clinical guidelines and patient information quality instrument. Journal of Medical Internet Research. 2023;25:e47479.
Sallam M, Al-Mahzoum K, Alshuaib O, Alhajri H, Alotaibi F, Alkhurainej D, et al. Language discrepancies in the performance of generative artificial intelligence models: an examination of infectious disease queries in English and Arabic. BMC Infectious Diseases. 2024;24(1):799.
Rebitschek FG, Carella A, Kohlrausch-Pazin S, Zitzmann M, Steckelberg A, Wilhelm C. Evaluating evidence-based health information from generative AI using a cross-sectional study with laypeople seeking screening information. npj Digital Medicine. 2025;8(1):343.
Madanian S, Nakarada-Kordic I, Reay S, Chetty Th. Patients' perspectives on digital health tools. PEC innovation. 2023;2:100171.
Esmaeilzadeh P, Maddah M, Mirzaei T. Using AI chatbots (eg, CHATGPT) in seeking health-related information online: The case of a common ailment. Computers in Human Behavior: Artificial Humans. 2025;3:100127.
Sallam M, Al-Mahzoum K, Sallam M, Mijwil MM. DeepSeek: Is it the end of generative AI monopoly or the mark of the impending doomsday? Mesopotamian Journal of Big Data. 2025;2025:26-34.
Chow JC, Li K. Large language models in medical chatbots: opportunities, challenges, and the need to address AI risks. Information. 2025;16(7):549.
Xu R, Wang Z. Generative artificial intelligence in healthcare from the perspective of digital media: Applications, opportunities and challenges. Heliyon. 2024;10(12):e32364.
Goktas P, Damadoglu E. Future of allergy and immunology: is artificial intelligence the key in the digital era? Annals of Allergy, Asthma & Immunology. 2025;134(4):396-407.e392.
González-Díaz SN, Morais-Almeida M, Ansotegui IJ, Macouzet-Sánchez C, Ordóñez-Azuara YG, Camarena-Galván J, et al. Artificial intelligence in allergy practice: Digital transformation and the future of clinical care. World Allergy Organization Journal. 2025;18(8):101078.
Tan LD, Nguyen N, Lopez E, Peverini D, Shedd M, Alismail A, et al. Artificial intelligence in the management of asthma: a review of a new frontier in patient care. Journal of Asthma and Allergy. 2025:1179-1191.
Pawankar R. Allergic diseases and asthma: a global public health concern and a call to action. World Allergy Organization Journal. 2014;7(1):12.
Oh J, Kim S, Kim MS, Abate YH, Abd ElHafeez S, Abdelkader A, et al. Global, regional, and national burden of asthma and atopic dermatitis, 1990–2021, and projections to 2050: a systematic analysis of the Global Burden of Disease Study 2021. The lancet respiratory medicine. 2025;13(5):425-446.
Sirota SB, Doxey MC, Dominguez R-MV, Bender RG, Vongpradith A, Albertson SB, et al. Global, regional, and national burden of upper respiratory infections and otitis media, 1990–2021: a systematic analysis from the Global Burden of Disease Study 2021. The Lancet Infectious Diseases. 2025;25(1):36-51.
Bender RG, Sirota SB, Swetschinski LR, Dominguez R-MV, Novotney A, Wool EE, et al. Global, regional, and national incidence and mortality burden of non-COVID-19 lower respiratory infections and aetiologies, 1990–2021: a systematic analysis from the Global Burden of Disease Study 2021. The Lancet Infectious Diseases. 2024;24(9):974-1002.
Gohal G, Moni SS, Bakkari MA, Elmobark ME. A review on asthma and allergy: current understanding on molecular perspectives. Journal of Clinical Medicine. 2024;13(19):5775.
Monk AS, Worden CP, Benaim EH, Klatt-Cromwell C, Thorp BD, Ebert Jr CS, et al. The impact of occupational exposures on chronic rhinosinusitis: a scoping review. Exploration of Asthma & Allergy. 2024;2(4):301-308.
Ibekwe PU, Ekop E, Otu T, Bassi P, Ukonu BA. Atopic dermatitis in adults: prevalence, clinical pattern, and contact sensitization. Exploration of Asthma & Allergy. 2024;2(5):450-460.
Skolnik N, Yawn BP, Correia de Sousa J, Vázquez MMM, Barnard A, Wright WL, et al. Best practice advice for asthma exacerbation prevention and management in primary care: an international expert consensus. NPJ Primary Care Respiratory Medicine. 2024;34(1):39.
Chamola V. Generative AI for Transformative Healthcare: A Comprehensive Study of Emerging Models, Applications, Case Studies, and Limitations. Ieee Access. 2024;12:31078-31106.
Maleki Varnosfaderani S, Forouzanfar M. The role of AI in hospitals and clinics: transforming healthcare in the 21st century. Bioengineering. 2024;11(4):337.
Babel A, Taneja R, Mondello Malvestiti F, Monaco A, Donde S. Artificial intelligence solutions to increase medication adherence in patients with non-communicable diseases. Frontiers in Digital Health. 2021;3:669869.
Drummond D, Adejumo I, Hansen K, Poberezhets V, Slabaugh G, Hui CY. Artificial intelligence in respiratory care: perspectives on critical opportunities and challenges. Breathe. 2024;20(3):230189.
Gori A, Zicari A, Barreto M, Della Giustina A, Sfika I, Pattini S, et al. Artificial Intelligence-Driven Innovations in Allergy. Italian Journal of Pediatric Allergy and Immunology. 2025;39(1):22-25.
van Breugel M, Fehrmann RS, Bügel M, Rezwan FI, Holloway JW, Nawijn MC, et al. Current state and prospects of artificial intelligence in allergy. Allergy. 2023;78(10):2623-2643.
Tu T, Schaekermann M, Palepu A, Saab K, Freyberg J, Tanno R, et al. Towards conversational diagnostic artificial intelligence. Nature. 2025:442-450.
Fu B, Hadid A, Damer N. Generative AI in the context of assistive technologies: Trends, limitations and future directions. Image and Vision Computing. 2025;154:105347.
Deniz-Garcia A, Fabelo H, Rodriguez-Almeida AJ, Zamora-Zamorano G, Castro-Fernandez M, Alberiche Ruano MdP, et al. Quality, usability, and effectiveness of mHealth apps and the role of artificial intelligence: current scenario and challenges. Journal of Medical Internet Research. 2023;25:e44030.
Mohamed YA, Khanan A, Bashir M, Mohamed AHH, Adiel MA, Elsadig MA. The impact of artificial intelligence on language translation: a review. Ieee Access. 2024;12:25553-25579.
Witkowski K, Dougherty RB, Neely SR. Public perceptions of artificial intelligence in healthcare: ethical concerns and opportunities for patient-centered care. BMC Medical Ethics. 2024;25(1):74.
Sun Y, Sheng D, Zhou Z, Wu Y. AI hallucination: towards a comprehensive classification of distorted information in artificial intelligence-generated content. Humanities and Social Sciences Communications. 2024;11(1):1278.
Braido F. Failure in asthma control: reasons and consequences. Scientifica. 2013;2013(1):549252.
Pouessel G, Morisset M, Schoder G, Santos C, Villard-Truc F, Just J, et al. Update on the emergency action plan for allergic reactions in children and adolescents. Position of the “Allergy at school” and “Food allergy” working groups of the French Allergology Society. Revue Française d'Allergologie. 2020;60(2):83-89.
Cross JL, Choma MA, Onofrey JA. Bias in medical AI: Implications for clinical decision-making. PLOS Digital Health. 2024;3(11):e0000651.
Shekar S, Pataranutaporn P, Sarabu C, Cecchi GA, Maes P. People over trust AI-generated medical responses and view them to be as valid as doctors, despite low accuracy. arXiv. 2024. DOI: 10.48550/arXiv.2408.15266.
Bélisle-Pipon J-C. Why we need to be careful with LLMs in medicine. Frontiers in medicine. 2024;11:1495582.
Yim D, Khuntia J, Parameswaran V, Meyers A. Preliminary evidence of the use of generative AI in health care clinical services: systematic narrative review. JMIR Medical Informatics. 2024;12(1):e52073.
Singhal K, Tu T, Gottweis J, Sayres R, Wulczyn E, Amin M, et al. Toward expert-level medical question answering with large language models. Nature Medicine. 2025;31(3):943-950.
Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepaño C, et al. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLOS Digital Health. 2023;2(2):e0000198.
Sallam M, Al-Salahat K, Eid H, Egger J, Puladi B. Human versus artificial intelligence: ChatGPT-4 outperforming Bing, bard, ChatGPT-3.5 and humans in clinical chemistry Multiple-Choice questions. Advances in Medical Education and Practice. 2024:857-871.
Mennella C, Maniscalco U, De Pietro G, Esposito M. Ethical and regulatory challenges of AI technologies in healthcare: A narrative review. Heliyon. 2024;10(4):e26297.
Li J. Security implications of AI chatbots in health care. Journal of Medical Internet Research. 2023;25:e47551.
Sallam M, Al-Mahzoum K, Sallam M. Generative Artificial Intelligence and Cybersecurity Risks: Implications for Healthcare Security Based on Real-life Incidents. Mesopotamian Journal of Artificial Intelligence in Healthcare. 2024;2024:184-203.
Habli I, Lawton T, Porter Z. Artificial intelligence in health care: accountability and safety. Bulletin of the World Health Organization. 2020;98(4):251-256.
Wei X, Kumar N, Zhang H. Addressing Bias in Generative Ai: Challenges and Research Opportunities in Information Management. Information & Management. 2025;62(2):104103.
Schut L, Gal Y, Farquhar S. Do Multilingual LLMs Think In English? arXiv. 2025. DOI: 10.48550/arXiv.2502.15603.
Myung J, Lee N, Zhou Y, Jin J, Putri R, Antypas D, et al. BLEnD: A benchmark for LLMs on everyday knowledge in diverse cultures and languages. arXiv. 2024. DOI: 10.48550/arXiv.2406.09948.
Nacar O, Sibaee ST, Ahmed S, Atitallah SB, Ammar A, Alhabashi Y, et al., editors. Towards inclusive Arabic LLMs: A culturally aligned benchmark in Arabic large language model evaluation. Proceedings of the First Workshop on Language Models for Low-Resource Languages; 2025 January 20; Abu Dhabi, United Arab Emirates. Stroudsburg, PA: Association for Computational Linguistics.
Taira K, Itaya T, Hanada A. Performance of the large language model ChatGPT on the national nurse examinations in Japan: evaluation study. JMIR nursing. 2023;6:e47305.
Guigue PA, Meyer R, Thivolle‐Lioux G, Brezinov Y, Levin G. Performance of ChatGPT in French language Parcours d'Accès Spécifique Santé test and in OBGYN. International Journal of Gynecology & Obstetrics. 2024;164(3):959-963.
Morreel S, Mathysen D, Verhoeven V. Aye, AI! ChatGPT passes multiple-choice family medicine exam. Medical Teacher. 2023;45(6):665-666.
Liu X, Wu J, Shao A, Shen W, Ye P, Wang Y, et al. Uncovering language disparity of ChatGPT on retinal vascular disease classification: cross-sectional study. Journal of Medical Internet Research. 2024;26:e51926.
Tarraf H, Aydin O, Mungan D, Albader M, Mahboub B, Doble A, et al. Prevalence of asthma among the adult general population of five Middle Eastern countries: results of the SNAPSHOT program. BMC pulmonary medicine. 2018;18(1):68.
Al-Digheari A, Mahboub B, Tarraf H, Yucel T, Annesi-Maesano I, Doble A, et al. The clinical burden of allergic rhinitis in five Middle Eastern countries: results of the SNAPSHOT program. Allergy, Asthma & Clinical Immunology. 2018;14(1):63.
Sallam M, Mousa D. Evaluating ChatGPT performance in Arabic dialects: A comparative study showing defects in responding to Jordanian and Tunisian general health prompts. Mesopotamian Journal of Artificial Intelligence in Healthcare. 2024;2024:1-7.
Samaan JS, Yeo YH, Ng WH, Ting P-S, Trivedi H, Vipani A, et al. ChatGPT’s ability to comprehend and answer cirrhosis related questions in Arabic. Arab Journal of Gastroenterology. 2023;24(3):145-148.
Sallam M, Al-Mahzoum K, Almutawaa RA, Alhashash JA, Dashti RA, AlSafy DR, et al. The performance of OpenAI ChatGPT-4 and Google Gemini in virology multiple-choice questions: a comparative analysis of English and Arabic responses. BMC Research Notes. 2024;17(1):247.
Sallam M, Alasfoor IM, Khalid SW, Al-Mulla RI, Al-Farajat A, Mijwil MM, et al. Chinese generative AI models (DeepSeek and Qwen) rival ChatGPT-4 in ophthalmology queries with excellent performance in Arabic and English. Narra J. 2025;5(1):e2371.
Sallam M, Barakat M, Sallam M. A Preliminary Checklist (METRICS) to standardize the design and reporting of studies on generative artificial intelligence–based models in health care education and practice: development study involving a literature review. Interactive journal of medical research. 2024;13(1):e54704.
Sallam M, Barakat M, Sallam M. Pilot testing of a tool to standardize the assessment of the quality of health information generated by artificial intelligence-based models. Cureus. 2023;15(11):e49373.
Dhand NK, MS K. Statulator: An online statistical calculator. Sample Size Calculator for Comparing Two Paired Means [Internet]. Statulator; 2025. Available from: http://statulator.com/SampleSize/ss2PM.html.
Global Initiative for Asthma – GINA. FAQs: Answers to frequently asked questions about asthma [Internet]. Fontana, WI: Global Initiative for Asthma – GINA; 2025. Available from: https://ginasthma.org/about-us/faqs/.
American Academy of Allergy AI. Overview of asthma symptoms, asthma diagnosis, asthma treatment and asthma [Internet]. Milwaukee, WI: American Academy of Allergy AI; 2025. Available from: https://www.aaaai.org/conditions-treatments/asthma.
WebMD Editorial Contributors. Allergies: Your Top Questions Answered. WebMD [Internet]. WebMD Editorial Contributors; 2025. Available from: https://www.webmd.com/allergies/allergies-faq.
American Academy of Allergy AI. Learn about symptoms, diagnosis, treatment and management for these allergies [Internet]. Milwaukee, WI: American Academy of Allergy AI; 2025. Available from: https://www.aaaai.org/conditions-treatments/allergies.
WHO. Q&As on COVID-19 and related health topics [Internet]. Geneva: WHO; 2025. Available from: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/question-and-answers-hub.
NHS Borders. RSV - Frequently Asked Questions [Internet]. Scottish: NHS Borders; 2025. Available from: https://www.nhsborders.scot.nhs.uk/patients-and-visitors/respiratory-syncytial-virus-(rsv)/frequently-asked-questions/.
WHO. Regional Office for the Eastern Mediterranean. Influenza (seasonal): Influenza Q&As [Internet]. WHO2025. Available from: https://www.emro.who.int/health-topics/influenza/questions-and-answers.html#influenza.
OpenAI. ChatGPT-4o [Internets]. OpenAI; 2025. Available from: https://chatgpt.com/?model=gpt-4o.
DeepSeek. DeepSeek-V3 [Internet]. DeepSeek; 2025. Available from: https://chat.deepseek.com/.
Google. Gemini Flash 2.5 (version 2.5) [Internet]. Google; 2025. Available from: https://gemini.google.com/app.
Muluk SY, Olcucu N. The role of artificial intelligence in the primary prevention of common musculoskeletal diseases. Cureus. 2024;16(7):e65372.
Alnsour MM, Alenezi R, Barakat M, Al-Omiri MK. Assessing ChatGPT’s suitability in responding to the public’s inquires on the effects of smoking on oral health. BMC Oral Health. 2025;25(1):1207.
Incerti Parenti S, Bartolucci ML, Biondi E, Maglioni A, Corazza G, Gracco A, et al. Online patient education in obstructive sleep apnea: ChatGPT versus Google Search. Healthcare. 2024;12(17):1781.
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of chiropractic medicine. 2016;15(2):155-163.
Sallam M, Irshaid A, Snygg J, Albadri R, Sallam M. Rapid Evolution of Large Language Models in Medical Education: Comparative Performance of ChatGPT-3.5, ChatGPT-5, and DeepSeek on Medical Microbiology MCQs. Contemporary Education and Teaching Research. 2025;6:295-309.
Yuan M, Bao P, Yuan J, Shen Y, Chen Z, Xie Y, et al. Large language models illuminate a progressive pathway to artificial intelligent healthcare assistant. Medicine Plus. 2024;1(2):100030.
Peng W, Feng Y, Yao C, Zhang S, Zhuo H, Qiu T, et al. Evaluating AI in medicine: a comparative analysis of expert and ChatGPT responses to colorectal cancer questions. Scientific reports. 2024;14(1):2840.
Checcucci E, Rodler S, Piazza P, Porpiglia F, Cacciamani GE. Transitioning from “Dr. Google” to “Dr. ChatGPT”: the advent of artificial intelligence chatbots. Translational Andrology and Urology. 2024;13(6):1067-1070.
Cazzamatta R, Sarısakaloğlu A. AI-generated misinformation: A case study on emerging trends in fact-checking practices across Brazil, Germany, and the United Kingdom. Emerging Media. 2025;3:214-251.
Guo Y, Guo M, Su J, Yang Z, Zhu M, Li H, et al. Bias in large language models: Origin, evaluation, and mitigation. arXiv. 2024. DOI: 10.48550/arXiv.2411.10915.
Seo J, Kim J, Byun S, Shin H. How does a Language-Specific Tokenizer affect LLMs? arXiv. 2025. DOI: 10.48550/arXiv.2502.12560.
Saadi N, Raha T, Christophe C, Pimentel MA, Rajan R, Kanithi PK. Bridging Language Barriers in Healthcare: A Study on Arabic LLMs. arXiv. 2025. DOI: 10.48550/arXiv.2501.09825.
Mohammad R, Alkhnbashi OS, Hammoudeh M. Optimizing Large Language Models for Arabic Healthcare Communication: A Focus on Patient-Centered NLP Applications. Big Data and Cognitive Computing. 2024;8(11):157.
Ibrahim A, Hosseini A, Helmy H, Lakhdhar W, Serag A, editors. Bridging Dialectal Gaps in Arabic Medical LLMs through Model Merging. Proceedings of The Third Arabic Natural Language Processing Conference; 2025 November 8-9; Suzhou, China. Stroudsburg, PA: Association for Computational Linguistics.
Qin L, Chen Q, Zhou Y, Chen Z, Li Y, Liao L, et al. A survey of multilingual large language models. Patterns. 2025;6(1):101118.
Wang S, Zhang S, Zhang J, Hu R, Li X, Zhang T, et al. Reinforcement learning enhanced llms: A survey. arXiv. 2024. DOI: 10.48550/arXiv.2412.10400.
Alshehhi M, Sharshar A, Guizani M. Towards Inclusive NLP: Assessing Compressed Multilingual Transformers Across Diverse Language Benchmarks. arXiv. 2025. DOI: 10.48550/arXiv.2507.19699.
Dankwa-Mullan I. Health equity and ethical considerations in using artificial intelligence in public health and medicine. Preventing chronic disease. 2024;21:E64.
Weiner EB, Dankwa-Mullan I, Nelson WA, Hassanpour S. Ethical challenges and evolving strategies in the integration of artificial intelligence into clinical practice. PLOS Digital Health. 2025;4(4):e0000810.
Schlicht IB, Sayin B, Zhao Z, Labonté FM, Barbera C, Viviani M, et al. Disparities in Multilingual LLM-Based Healthcare Q&A. arXiv. 2025. DOI: 10.48550/arXiv.2510.17476.
Al Shamsi H, Almutairi AG, Al Mashrafi S, Al Kalbani T. Implications of language barriers for healthcare: a systematic review. Oman medical journal. 2020;35(2):e122.
Aljamani S, Hassona Y, Fansa HA, Saadeh HM, Jamani KD. Evaluating Large Language Models in Addressing Patient Questions on Endodontic Pain: A Comparative Analysis of Accessible Chatbots. Journal of Endodontics. 2025;51:1617-1624.
Islam R, Moushi OM. Gpt-4o: The cutting-edge advancement in multimodal llm. TechRxiv. 2025. DOI: 10.36227/techrxiv.171986596.65533294/v1.
ŞAHİN EG. Comparative performance of ChatGPT, Gemini, and DeepSeek on endodontic exam questions in Turkish and English. 2025. DOI: 10.21203/rs.3.rs-6738945/v1.
Palmer A, Gorman S. Misinformation, Trust, and Health: The Case for Information Environment as a Major Independent Social Determinant of Health. Social Science & Medicine. 2025:118272.
Sallam M, Khalil R, Sallam M. Benchmarking generative AI: A call for establishing a comprehensive framework and a generative AIQ test. Mesopotamian Journal of Artificial Intelligence in Healthcare. 2024;2024:69-75.
Bonini S, Rasi G, Torre A, D'Amato M, Matricardi PM. The heterogeneity of allergic phenotypes: genetic. Ann Allergy Asthma Immunol. 2001;87:48-51.
Wang J, Zhou Y, Zhang H, Hu L, Liu J, Wang L, et al. Pathogenesis of allergic diseases and implications for therapeutic interventions. Signal transduction and targeted therapy. 2023;8(1):138.
Stukus DR. Tackling medical misinformation in allergy and immunology practice. Expert Review of Clinical Immunology. 2022;18(10):995-996.
Verdi M, Candido D, Madan J, Bernstein JA, Bukstein D, Anagnostou A, et al. Addressing Anxiety and Depression in the Allergy Clinic Through Motivational Interviewing, Brief Cognitive Behavioral Therapy, and Curious Questions. The Journal of Allergy and Clinical Immunology: In Practice. 2025;13:1960-1969.e1962.
Anagnostou A. Addressing common misconceptions in food allergy: a review. Children. 2021;8(6):497.
Fitzpatrick PJ. Improving health literacy using the power of digital communications to achieve better health outcomes for patients and practitioners. Frontiers in Digital Health. 2023;5:1264780.
Khan R, Khan S, Almohaimeed HM, Almars AI, Pari B. Utilization, challenges, and training needs of digital health technologies: perspectives from healthcare professionals. International journal of medical informatics. 2025;197:105833.
Badr J, Motulsky A, Denis J-L. Digital health technologies and inequalities: a scoping review of potential impacts and policy recommendations. Health Policy. 2024;146:105122.
Ning Y, Teixayavong S, Shang Y, Savulescu J, Nagaraj V, Miao D, et al. Generative artificial intelligence and ethical considerations in health care: a scoping review and ethics checklist. The Lancet Digital Health. 2024;6(11):e848-e856.
Reddy S. Generative AI in healthcare: an implementation science informed translational path on application, integration and governance. Implementation Science. 2024;19(1):27.
Sallam M. Bibliometric top ten healthcare-related ChatGPT publications in the first ChatGPT anniversary. Narra J. 2024;4(2):e917.
Chen B, Zhang Z, Langrené N, Zhu S. Unleashing the potential of prompt engineering for large language models. Patterns. 2025;6:101260.

This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright (c) 2026 Mohammed Sallam, Adrian Stanley, Johan Snygg, Hasanain Al-Shakerchi, Omar Al Atragchi, Rania Abusamra, Malik Sallam (Author)

