Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation Extraction
Objective To develop soft prompt-based learning algorithms for large language
models (LLMs) and to examine prompt shape, prompt-tuning with frozen and
unfrozen LLMs, transfer learning, and few-shot learning ability.
Methods We developed a soft prompt-based LLM model and compared 4 training
strategies including (1) fine-tuning without prompts; (2) hard-prompt with
unfrozen LLMs; (3) soft-prompt with unfrozen LLMs; and (4) soft-prompt with
frozen LLMs. We evaluated 7 pretrained LLMs using the 4 training strategies for
clinical concept and relation extraction on two benchmark datasets. We
evaluated the transfer learning ability of the prompt-based learning algorithms
in a cross-institution setting. We also assessed the few-shot learning ability.
Results and Conclusion When LLMs are unfrozen, GatorTron-3.9B with soft
prompting achieves the best strict F1-scores of 0.9118 and 0.8604 for concept
extraction, outperforming the traditional fine-tuning and hard prompt-based
models by 0.6~3.1% and 1.2~2.9%, respectively; GatorTron-345M with soft
prompting achieves the best F1-scores of 0.8332 and 0.7488 for end-to-end
relation extraction, outperforming the other two models by 0.2~2% and
0.6~11.7%, respectively. When LLMs are frozen, small LLMs (i.e., 345 million
parameters) fall well short of unfrozen models; scaling LLMs up to billions of
parameters makes frozen LLMs competitive with unfrozen LLMs. For
cross-institution evaluation, soft prompting with a frozen GatorTron-8.9B model
achieved the best performance. This study demonstrates that (1) machines can
learn soft prompts better than humans, (2) frozen LLMs have better few-shot
learning and transfer learning ability to facilitate multi-institution
applications, and (3) frozen LLMs require large models to be competitive.
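As a rough illustration of the soft-prompt strategies compared above, the sketch below (Python, PyTorch plus Hugging Face transformers) prepends trainable prompt embeddings to a frozen encoder, i.e., strategy (4); the base model, prompt length, and label count are illustrative assumptions, not the paper's configuration.

    import torch
    import torch.nn as nn
    from transformers import AutoModel

    class SoftPromptTagger(nn.Module):
        def __init__(self, base="bert-base-uncased", n_prompt=20, n_labels=9):
            super().__init__()
            self.encoder = AutoModel.from_pretrained(base)
            for p in self.encoder.parameters():
                p.requires_grad = False  # freeze the LLM; only prompt and head train
            hidden = self.encoder.config.hidden_size
            self.prompt = nn.Parameter(torch.randn(n_prompt, hidden) * 0.02)
            self.head = nn.Linear(hidden, n_labels)

        def forward(self, input_ids, attention_mask):
            tok = self.encoder.get_input_embeddings()(input_ids)  # (B, T, H)
            soft = self.prompt.unsqueeze(0).expand(tok.size(0), -1, -1)
            embeds = torch.cat([soft, tok], dim=1)                # (B, P+T, H)
            mask = torch.cat([attention_mask.new_ones(tok.size(0), soft.size(1)),
                              attention_mask], dim=1)
            out = self.encoder(inputs_embeds=embeds, attention_mask=mask)
            # drop the prompt positions before per-token classification
            return self.head(out.last_hidden_state[:, soft.size(1):])

Unfreezing the encoder parameters turns the same sketch into strategy (3).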
On the Impact of Cross-Domain Data on German Language Models
Traditionally, large language models have been trained either on general web
crawls or on domain-specific data. However, the recent successes of generative
large language models have shed light on the benefits of cross-domain datasets. To
examine the significance of prioritizing data diversity over quality, we
present a German dataset comprising texts from five domains, along with another
dataset aimed at containing high-quality data. Through training a series of
models ranging between 122M and 750M parameters on both datasets, we conduct a
comprehensive benchmark on multiple downstream tasks. Our findings demonstrate
that the models trained on the cross-domain dataset outperform those trained on
quality data alone, leading to improvements over the previous
state-of-the-art. The models are available at
https://huggingface.co/ikim-uk-essen
Comment: 13 pages, 1 figure, accepted at Findings of the Association for
Computational Linguistics: EMNLP 2023
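A hypothetical usage sketch for the released checkpoints follows; the repository id is a placeholder, not a real model name, so substitute one from the organization page above.

    from transformers import AutoModelForMaskedLM, AutoTokenizer

    model_id = "ikim-uk-essen/MODEL-NAME"  # placeholder id, not a real checkpoint
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMaskedLM.from_pretrained(model_id)
    inputs = tokenizer("Berlin ist die Hauptstadt von [MASK].", return_tensors="pt")
    logits = model(**inputs).logits  # mask-filling scores over the vocabulary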
Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding
Recent advances in natural language processing (NLP) can be largely
attributed to the advent of pre-trained language models such as BERT and
RoBERTa. While these models demonstrate remarkable performance on general
datasets, they can struggle in specialized domains such as medicine, where
unique domain-specific terminologies, domain-specific abbreviations, and
varying document structures are common. This paper explores strategies for
adapting these models to domain-specific requirements, primarily through
continuous pre-training on domain-specific data. We pre-trained several German
medical language models on 2.4B tokens derived from translated public English
medical data and 3B tokens of German clinical data. The resulting models were
evaluated on various German downstream tasks, including named entity
recognition (NER), multi-label classification, and extractive question
answering. Our results suggest that models augmented by clinical and
translation-based pre-training typically outperform general-domain models in
medical contexts. We conclude that continuous pre-training can match or even
exceed the performance of clinical models trained from scratch. Furthermore,
pre-training on clinical data and leveraging translated texts have proven to be
reliable methods for domain adaptation in medical NLP tasks.
Comment: Accepted at LREC-COLING 2024
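A minimal sketch of the continuous pre-training recipe described above, i.e., continuing masked language modeling on domain text with Hugging Face transformers and datasets; the base model, corpus file, and hyperparameters are assumptions for illustration.

    from datasets import load_dataset
    from transformers import (AutoModelForMaskedLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer,
                              TrainingArguments)

    base = "bert-base-german-cased"  # assumed general-domain starting point
    tokenizer = AutoTokenizer.from_pretrained(base)
    model = AutoModelForMaskedLM.from_pretrained(base)

    # clinical_corpus.txt is a placeholder: one de-identified document per line
    raw = load_dataset("text", data_files={"train": "clinical_corpus.txt"})
    tokenized = raw["train"].map(
        lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
        batched=True, remove_columns=["text"])

    collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)
    args = TrainingArguments(output_dir="german-medical-bert",
                             per_device_train_batch_size=16,
                             num_train_epochs=1, learning_rate=5e-5)
    Trainer(model=model, args=args, train_dataset=tokenized,
            data_collator=collator).train()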
A Study of Generative Large Language Model for Medical Research and Healthcare
There is enormous enthusiasm, as well as concern, about using large language models
(LLMs) in healthcare, yet current assumptions are all based on general-purpose
LLMs such as ChatGPT. This study develops a clinical generative LLM,
GatorTronGPT, using 277 billion words of mixed clinical and English text with a
GPT-3 architecture of 20 billion parameters. GatorTronGPT improves biomedical
natural language processing for medical research. NLP models trained on
synthetic text generated by GatorTronGPT outperform NLP models trained on
real-world clinical text. A physician Turing test using a 1 (worst) to 9 (best)
scale shows that there is no significant difference in linguistic readability
(p = 0.22; 6.57 for GatorTronGPT compared with 6.93 for human) or clinical
relevance (p = 0.91; 7.0 for GatorTronGPT compared with 6.97 for human), and
that physicians cannot differentiate them (p < 0.001). This study provides
insights into the opportunities and challenges of LLMs for medical research and
healthcare.
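For intuition, a rank-based comparison like the one below could sit behind the reported readability and relevance p-values; the rating arrays and the choice of test are invented for illustration and are not the study's data or analysis plan.

    from scipy.stats import mannwhitneyu

    synthetic = [7, 6, 8, 6, 7, 7, 5, 8]  # hypothetical 1-9 readability ratings
    human = [7, 7, 8, 6, 6, 7, 7, 8]      # hypothetical ratings of real notes
    stat, p = mannwhitneyu(synthetic, human, alternative="two-sided")
    print(f"U = {stat:.1f}, p = {p:.3f}")  # large p -> no detectable difference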
Global, regional, and national burden of osteoarthritis, 1990–2020 and projections to 2050: a systematic analysis for the Global Burden of Disease Study 2021
Background
Osteoarthritis is the most common form of arthritis in adults, characterised by chronic pain and loss of mobility. Osteoarthritis most frequently occurs after age 40 years and prevalence increases steeply with age. WHO has designated 2021–30 the decade of healthy ageing, which highlights the need to address diseases such as osteoarthritis, which strongly affect functional ability and quality of life. Osteoarthritis can coexist with, and negatively affect, other chronic conditions. Here we estimate the burden of hand, hip, knee, and other sites of osteoarthritis across geographies, age, sex, and time, with forecasts of prevalence to 2050.
Methods
In this systematic analysis for the Global Burden of Disease Study, osteoarthritis prevalence in 204 countries and territories from 1990 to 2020 was estimated using data from population-based surveys from 26 countries for knee osteoarthritis, 23 countries for hip osteoarthritis, 42 countries for hand osteoarthritis, and US insurance claims for all of the osteoarthritis sites, including the other types of osteoarthritis category. The reference case definition was symptomatic, radiographically confirmed osteoarthritis. Studies using alternative definitions from the reference case definition (for example self-reported osteoarthritis) were adjusted to reference using regression models. Osteoarthritis severity distribution was obtained from a pooled meta-analysis of sources using the Western Ontario and McMaster Universities Arthritis Index. Final prevalence estimates were multiplied by disability weights to calculate years lived with disability (YLDs). Prevalence was forecast to 2050 using a mixed-effects model.
Findings
Globally, 595 million (95% uncertainty interval 535–656) people had osteoarthritis in 2020, equal to 7·6% (95% UI 6·8–8·4) of the global population, and an increase of 132·2% (130·3–134·1) in total cases since 1990. Compared with 2020, cases of osteoarthritis are projected to increase 74·9% (59·4–89·9) for knee, 48·6% (35·9–67·1) for hand, 78·6% (57·7–105·3) for hip, and 95·1% (68·1–135·0) for other types of osteoarthritis by 2050. The global age-standardised rate of YLDs for total osteoarthritis was 255·0 YLDs (119·7–557·2) per 100 000 in 2020, a 9·5% (8·6–10·1) increase from 1990 (233·0 YLDs per 100 000, 109·3–510·8). For adults aged 70 years and older, osteoarthritis was the seventh-ranked cause of YLDs. Age-standardised prevalence in 2020 was more than 5·5% in all world regions, ranging from 5677·4 (5029·8–6318·1) per 100 000 in southeast Asia to 8632·7 (7852·0–9469·1) per 100 000 in high-income Asia Pacific. Knee was the most common site for osteoarthritis. High BMI contributed to 20·4% (95% UI –1·7 to 36·6) of osteoarthritis. Potentially modifiable risk factors for osteoarthritis such as recreational injury and occupational hazards have not yet been explored in GBD modelling.
Interpretation
Age-standardised YLDs attributable to osteoarthritis are continuing to rise and will lead to substantial increases in case numbers because of population growth and ageing, and because there is no effective cure for osteoarthritis. The demand on health systems for care of patients with osteoarthritis, including joint replacements, which are highly effective for late-stage osteoarthritis in hips and knees, will rise in all regions, but might be out of reach and lead to further health inequity for individuals and countries unable to afford them. Much more can and should be done to prevent people from reaching that late stage.
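A toy version of the YLD calculation from the Methods (prevalence multiplied by a disability weight, expressed per 100 000); both numbers are illustrative, not GBD estimates.

    prevalence_per_100k = 7000  # hypothetical osteoarthritis prevalence
    disability_weight = 0.03    # hypothetical severity-weighted average
    ylds_per_100k = prevalence_per_100k * disability_weight
    print(f"{ylds_per_100k:.1f} YLDs per 100 000")  # 210.0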
Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021
Background
Diabetes is one of the leading causes of death and disability worldwide, and affects people regardless of country, age group, or sex. Using the most recent evidentiary and analytical framework from the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD), we produced location-specific, age-specific, and sex-specific estimates of diabetes prevalence and burden from 1990 to 2021, the proportion of type 1 and type 2 diabetes in 2021, the proportion of the type 2 diabetes burden attributable to selected risk factors, and projections of diabetes prevalence through 2050.
Methods
Estimates of diabetes prevalence and burden were computed in 204 countries and territories, across 25 age groups, for males and females separately and combined; these estimates comprised lost years of healthy life, measured in disability-adjusted life-years (DALYs; defined as the sum of years of life lost [YLLs] and years lived with disability [YLDs]). We used the Cause of Death Ensemble model (CODEm) approach to estimate deaths due to diabetes, incorporating 25 666 location-years of data from vital registration and verbal autopsy reports in separate total (including both type 1 and type 2 diabetes) and type-specific models. Other forms of diabetes, including gestational and monogenic diabetes, were not explicitly modelled. Total and type 1 diabetes prevalence was estimated by use of a Bayesian meta-regression modelling tool, DisMod-MR 2.1, to analyse 1527 location-years of data from the scientific literature, survey microdata, and insurance claims; type 2 diabetes estimates were computed by subtracting type 1 diabetes from total estimates. Mortality and prevalence estimates, along with standard life expectancy and disability weights, were used to calculate YLLs, YLDs, and DALYs. When appropriate, we extrapolated estimates to a hypothetical population with a standardised age structure to allow comparison in populations with different age structures. We used the comparative risk assessment framework to estimate the risk-attributable type 2 diabetes burden for 16 risk factors falling under risk categories including environmental and occupational factors, tobacco use, high alcohol use, high body-mass index (BMI), dietary factors, and low physical activity. Using a regression framework, we forecast type 1 and type 2 diabetes prevalence through 2050 with Socio-demographic Index (SDI) and high BMI as predictors, respectively.
Findings
In 2021, there were 529 million (95% uncertainty interval [UI] 500–564) people living with diabetes worldwide, and the global age-standardised total diabetes prevalence was 6·1% (5·8–6·5). At the super-region level, the highest age-standardised rates were observed in north Africa and the Middle East (9·3% [8·7–9·9]) and, at the regional level, in Oceania (12·3% [11·5–13·0]). Nationally, Qatar had the world’s highest age-specific prevalence of diabetes, at 76·1% (73·1–79·5) in individuals aged 75–79 years. Total diabetes prevalence—especially among older adults—primarily reflects type 2 diabetes, which in 2021 accounted for 96·0% (95·1–96·8) of diabetes cases and 95·4% (94·9–95·9) of diabetes DALYs worldwide. In 2021, 52·2% (25·5–71·8) of global type 2 diabetes DALYs were attributable to high BMI. The contribution of high BMI to type 2 diabetes DALYs rose by 24·3% (18·5–30·4) worldwide between 1990 and 2021.
By 2050, more than 1·31 billion (1·22–1·39) people are projected to have diabetes, with expected age-standardised total diabetes prevalence rates greater than 10% in two super-regions: 16·8% (16·1–17·6) in north Africa and the Middle East and 11·3% (10·8–11·9) in Latin America and Caribbean. By 2050, 89 (43·6%) of 204 countries and territories will have an age-standardised rate greater than 10%.
Interpretation
Diabetes remains a substantial public health issue. Type 2 diabetes, which makes up the bulk of diabetes cases, is largely preventable and, in some cases, potentially reversible if identified and managed early in the disease course. However, all evidence indicates that diabetes prevalence is increasing worldwide, primarily due to a rise in obesity caused by multiple factors. Preventing and controlling type 2 diabetes remains an ongoing challenge. It is essential to better understand disparities in risk factor profiles and diabetes burden across populations, to inform strategies to successfully control diabetes risk factors within the context of multiple and complex drivers.
Funding
Bill & Melinda Gates Foundation
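A toy version of the DALY definition used in the Methods (DALYs are the sum of YLLs and YLDs); the values are made up for illustration.

    ylls_per_100k = 450.0  # hypothetical years of life lost
    ylds_per_100k = 300.0  # hypothetical years lived with disability
    dalys_per_100k = ylls_per_100k + ylds_per_100k
    print(f"{dalys_per_100k:.0f} DALYs per 100 000")  # 750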
Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning
The progress in natural language processing (NLP) using large language models
(LLMs) has greatly improved patient information extraction from clinical
narratives. However, most methods based on the fine-tuning strategy have
limited transfer learning ability for cross-domain applications. This study
proposed a novel approach that employs a soft prompt-based learning
architecture, which introduces trainable prompts to guide LLMs toward desired
outputs. We examined two types of LLM architectures, including encoder-only
GatorTron and decoder-only GatorTronGPT, and evaluated their performance for
the extraction of social determinants of health (SDoH) using a
cross-institution dataset from the 2022 n2c2 challenge and a cross-disease
dataset from the University of Florida (UF) Health. The results show that
decoder-only LLMs with prompt tuning achieved better performance in
cross-domain applications. GatorTronGPT achieved the best F1 scores for both
datasets, outperforming traditional fine-tuned GatorTron by 8.9% and 21.8% in a
cross-institution setting, and 5.5% and 14.5% in a cross-disease setting.
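A minimal sketch of prompt-tuning a decoder-only model with the Hugging Face peft library, the general technique described above; the base model (a small GPT-2 stand-in, not GatorTronGPT) and the prompt length are assumptions.

    from peft import PromptTuningConfig, TaskType, get_peft_model
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in decoder-only LLM
    config = PromptTuningConfig(task_type=TaskType.CAUSAL_LM,
                                num_virtual_tokens=20)    # trainable soft prompt
    peft_model = get_peft_model(model, config)
    peft_model.print_trainable_parameters()  # only prompt embeddings are trainable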
A large language model for electronic health records
There is increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, and the largest one trained on clinical text is comparatively small at 110 million parameters (compared with billions of parameters in the general domain). It is not clear how large clinical language models with billions of parameters can help medical AI systems utilize unstructured EHRs. In this study, we develop from scratch a large clinical language model—GatorTron—using >90 billion words of text (including >82 billion words of de-identified clinical text) and systematically evaluate it on five clinical NLP tasks including clinical concept extraction, medical relation extraction, semantic textual similarity, natural language inference (NLI), and medical question answering (MQA). We examine how (1) scaling up the number of parameters and (2) scaling up the size of the training data could benefit these NLP tasks. GatorTron models scale up the clinical language model from 110 million to 8.9 billion parameters and improve five clinical NLP tasks (e.g., 9.6% and 9.5% improvement in accuracy for NLI and MQA); these models can be applied to medical AI systems to improve healthcare delivery. The GatorTron models are publicly available at: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/clara/models/gatortron_og