Search CORE

66 research outputs found

The Benefits of a Concise Chain of Thought on Problem-Solving in Large Language Models

Author: Guven Erhan
Renze Matthew
Publication venue
Publication date: 10/01/2024
Field of study

In this paper, we introduce Concise Chain-of-Thought (CCoT) prompting. We compared standard CoT and CCoT prompts to see how conciseness impacts response length and correct-answer accuracy. We evaluated this using GPT-3.5 and GPT-4 with a multiple-choice question-and-answer (MCQA) benchmark. CCoT reduced average response length by 48.70% for both GPT-3.5 and GPT-4 while having a negligible impact on problem-solving performance. However, on math problems, GPT-3.5 with CCoT incurs a performance penalty of 27.69%. Overall, CCoT leads to an average per-token cost reduction of 22.67%. These results have practical implications for AI systems engineers using LLMs to solve real-world problems with CoT prompt-engineering techniques. In addition, these results provide more general insight for AI researchers studying the emergent behavior of step-by-step reasoning in LLMs.Comment: All code, data, and supplemental materials are available on GitHub at https://github.com/matthewrenze/jhu-concise-co

arXiv.org e-Print Archive

Correction for Johansson et al., An open challenge to advance probabilistic forecasting for dengue epidemics.

Author: Ackley Sarah
Apfeldorf Karyn M
Asher Jason
Babin Steven M
Bagley Thomas
Bailey Trevor C
Barker Christopher M
Baugher Benjamin
Bell Jesse E
Biggerstaff Matthew
Britog Humberto
Brooks Logan C
Brown Alexandria C
Buczak Anna L
Carvalhou Marilia Sa
Chretien Jean-Paul
Clapham Hannah E
Clay Matt
Cohen Jeremy M
Colwell Rita R
Convertino Matteo
Cummings Derek AT
Devita Jason
Dobson Scott
Farrow David C
Forshey Brett M
Garcia-Diez Markel
George Dylan
Gramacy Robert B
Guven Erhan
Hebbeler Andrew M
Hyun Sangwon
Jiang Gao
Johansson Michael A
Johnson Leah R
Juarrero Alicia
Jutla Antarpreet
Khan Rakibul
Lane Aaron
Lauer Stephen A
Lessler Justin
Liu Fengchen
Liu Yang
Lothian Nick
Lowe Rachel
Manheim David
Margolis Harold S
Meng Xi
Mier-y-Teran-Romero Luis
Moniz Linda J
Moore Melinda
Moore Sean M
Mordecai Erin A
Moschou Terry
Murdock Courtney C
Ortiz Eloy
Osborne Grant
Osobaa Osonde
Paul Richard
Porco Travis C
Poultney Marissa
Rao Dhananjai M
Ray Evan L
Reddy Abraham
Reich Nicholas G
Rivera-Garcia Brenda
Rivero Jorge
Rodo Xavier
Rohr Jason R
Rosenfeld Roni
Ryan Sadie J
Sakrejda Krzysztof
Sardar Tridip
Shaman Jeffrey
Stewart-Ibarra Annam
Swerdlow David
Tibshirani Ryan J
Trtanj Juli
Vardavas Raffaele
Weikel Daniel P
Worden Lee
Yamana Teresa K
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 17/12/2019
Field of study

Correction for “An open challenge to advance probabilistic forecasting for dengue epidemics,” by Michael A. Johansson, Karyn M. Apfeldorf, Scott Dobson, Jason Devita, Anna L. Buczak, Benjamin Baugher, Linda J. Moniz, Thomas Bagley, Steven M. Babin, Erhan Guven, Teresa K. Yamana, Jeffrey Shaman, Terry Moschou, Nick Lothian, Aaron Lane, Grant Osborne, Gao Jiang, Logan C. Brooks, David C. Farrow, Sangwon Hyun, Ryan J. Tibshirani, Roni Rosenfeld, Justin Lessler, Nicholas G. Reich, Derek A. T. Cummings, Stephen A. Lauer, Sean M. Moore, Hannah E. Clapham, Rachel Lowe, Trevor C. Bailey, Markel García-Díez, Marilia Sá Carvalho, Xavier Rodó, Tridip Sardar, Richard Paul, Evan L. Ray, Krzysztof Sakrejda, Alexandria C. Brown, Xi Meng, Osonde Osoba, Raffaele Vardavas, David Manheim, Melinda Moore, Dhananjai M. Rao, Travis C. Porco, Sarah Ackley, Fengchen Liu, Lee Worden, Matteo Convertino, Yang Liu, Abraham Reddy, Eloy Ortiz, Jorge Rivero, Humberto Brito, Alicia Juarrero, Leah R. Johnson, Robert B. Gramacy, Jeremy M. Cohen, Erin A. Mordecai, Courtney C. Murdock, Jason R. Rohr, Sadie J. Ryan, Anna M. Stewart-Ibarra, Daniel P. Weikel, Antarpreet Jutla, Rakibul Khan, Marissa Poultney, Rita R. Colwell, Brenda Rivera-García, Christopher M. Barker, Jesse E. Bell, Matthew Biggerstaff, David Swerdlow, Luis Mier-y-Teran-Romero, Brett M. Forshey, Juli Trtanj, Jason Asher, Matt Clay, Harold S. Margolis, Andrew M. Hebbeler, Dylan George, and Jean-Paul Chretien, which was first published November 11, 2019; 10.1073/pnas.1909865116. The authors note that the affiliation for Xavier Rodó should instead appear as Catalan Institution for Research and Advanced Studies (ICREA) and Climate and Health Program, Barcelona Institute for Global Health (ISGlobal). The corrected author and affiliation lines appear below. The online version has been corrected

LSHTM Research Online

eScholarship - University of California

An open challenge to advance probabilistic forecasting for dengue epidemics.

Author: Ackley Sarah
Apfeldorf Karyn M
Asher Jason
Babin Steven M
Bagley Thomas
Bailey Trevor C
Barker Christopher M
Baugher Benjamin
Bell Jesse E
Biggerstaff Matthew
Brito Humberto
Brooks Logan C
Brown Alexandria C
Buczak Anna L
Carvalho Marilia Sá
Chretien Jean-Paul
Clapham Hannah E
Clay Matt
Cohen Jeremy M
Colwell Rita R
Convertino Matteo
Cummings Derek AT
Devita Jason
Dobson Scott
Farrow David C
Forshey Brett M
García-Díez Markel
George Dylan
Gramacy Robert B
Guven Erhan
Hebbeler Andrew M
Hyun Sangwon
Jiang Gao
Johansson Michael A
Johnson Leah R
Juarrero Alicia
Jutla Antarpreet
Khan Rakibul
Lane Aaron
Lauer Stephen A
Lessler Justin
Liu Fengchen
Liu Yang
Lothian Nick
Lowe Rachel
Manheim David
Margolis Harold S
Meng Xi
Mier-Y-Teran-Romero Luis
Moniz Linda J
Moore Melinda
Moore Sean M
Mordecai Erin A
Moschou Terry
Murdock Courtney C
Ortiz Eloy
Osborne Grant
Osoba Osonde
Paul Richard
Porco Travis C
Poultney Marissa
Rao Dhananjai M
Ray Evan L
Reddy Abraham
Reich Nicholas G
Rivera-García Brenda
Rivero Jorge
Rodó Xavier
Rohr Jason R
Rosenfeld Roni
Ryan Sadie J
Sakrejda Krzysztof
Sardar Tridip
Shaman Jeffrey
Stewart-Ibarra Anna M
Swerdlow David
Tibshirani Ryan J
Trtanj Juli
Vardavas Raffaele
Weikel Daniel P
Worden Lee
Yamana Teresa K
Publication venue: 'Proceedings of the National Academy of Sciences'
Publication date: 01/11/2019
Field of study

A wide range of research has promised new tools for forecasting infectious disease dynamics, but little of that research is currently being applied in practice, because tools do not address key public health needs, do not produce probabilistic forecasts, have not been evaluated on external data, or do not provide sufficient forecast skill to be useful. We developed an open collaborative forecasting challenge to assess probabilistic forecasts for seasonal epidemics of dengue, a major global public health problem. Sixteen teams used a variety of methods and data to generate forecasts for 3 epidemiological targets (peak incidence, the week of the peak, and total incidence) over 8 dengue seasons in Iquitos, Peru and San Juan, Puerto Rico. Forecast skill was highly variable across teams and targets. While numerous forecasts showed high skill for midseason situational awareness, early season skill was low, and skill was generally lowest for high incidence seasons, those for which forecasts would be most valuable. A comparison of modeling approaches revealed that average forecast skill was lower for models including biologically meaningful data and mechanisms and that both multimodel and multiteam ensemble forecasts consistently outperformed individual model forecasts. Leveraging these insights, data, and the forecasting framework will be critical to improve forecast skill and the application of forecasts in real time for epidemic preparedness and response. Moreover, key components of this project-integration with public health needs, a common forecasting framework, shared and standardized data, and open participation-can help advance infectious disease forecasting beyond dengue

LSHTM Research Online

eScholarship - University of California

HAL: Hyper Article en Ligne

HAL-Pasteur

HAL-Rennes 1

Fuzzy association rule mining and classification for the prediction of malaria in South Korea

Author: A Benali
A Buczak
A Buczak
Anna L. Buczak
B Liu
Benjamin Baugher
C Chatfield
C Corley
Climate and Global Dynamics Section
CM Kuok
DMW Powers
EB Wilson
Erhan Guven
GC Cawley
Global Change Master Directory
Global Health Group and the Ministry of Public Health in the Democratic People’s Republic of Korea
H Lodhi
H-I Ree
H-W Gao
IH Witten
J Sachs
Johns Hopkins University Applied Physics Laboratory
JR Quinlan
JR Quinlan
JR Quinlan
K Linthicum
K Zinszer
L Breiman
L Garcia
L Robert
Liane C. Ramac-Thomas
M Fukuda
M Sinka
N Ferreira
O Briet
P Martens
R Agrawal
S-H Cho
S-Y Yim
Sheri H. Lewis
Steven M. Babin
T Abeku
T Nkya
The Global Fund to Fight AIDS Tuberculosis, and Malaria
U Kitron
U Kitron
US Centers for Disease Control and Prevention
US Geological Survey
US National Aeronautics and Space Administration (NASA) Goddard Earth Sciences Data and Information Services Center
US National Oceanic and Atmospheric Administration
V Machault
Yevgeniy Elbert
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Robust Classification of Emotion in Human Speech Using Spectrogram Features

Author: Guven Erhan
Publication venue: The George Washington University
Publication date: 01/01/2012
Field of study

The recognition of emotions, such as anger, anxiety, joy, etc . from tonal variations in human speech is an important task for research and applications in human computer interaction. The objective of this research is to design, implement and test a Speech Emotion Classification (SEC) engine that can extract useful features and accurately classify emotions in human speech in the presence of speaker-dependent characteristics variations and noise. Current approaches extract several standard global values from the temporal sequence of power spectra, such as pitch, formants, energy, and values from the time signal, such as attack and decay rates. In this work, the frequency dimension of the spectrogram is quantized to simulate the Bark scale in the human audition system, the time dimension of the spectrogram is quantized in units starting from 50 ms, and the linear regression coefficients of the surface of each spectrogram segment are combined into a feature vector. In this way, complete local features are extracted to establish a larger sample. The accumulated feature vectors for each category of emotion provide a robust training basis for a state of the art classifier, such as an SVM. In order to further improve the performance of the SEC engine and to demonstrate the flexibility and benefit of local features, a backward context scheme is introduced. A series of experiments have been designed and conducted using the EMO-DB and LDC-DB speech emotion databases to measure the performance of the SEC engine. First, the accuracy and the precision of the performance are measured in terms of seven to fifteen emotion categories when trained on the speech utterances by random sampling. Next, the generalization performance is measured through a speaker cross-validation scheme. Third, the generalization and robust performance of the SEC engine is measured by performing gender, language and speaker classification with the SEC engine, hence measuring the discrimination power of the engine related to the speaker characteristics variations. Finally, the robust performance of the SEC engine is measured when the SNR is varied between 10 and 50 dB

ProQuest OAI Repository

A Survey of Data Mining and Machine Learning Methods for Cyber Security Intrusion Detection

Author: Anna L. Buczak
Erhan Guven
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Amniotic Fluid Ischemia Modified Albumin as a Novel Prenatal Diagnostic Marker for Down Syndrome: A Prospective Case-Control Study

Author: Bulent Demir
Deniz Karcaaltincaba
Emine Seda Guvendag Guven
Erhan Huseyin Comert
Suleyman Guven*
Publication venue: 'Heighten Science Publications Corporation'
Publication date: 21/06/2023
Field of study

Aims: There is no study in the literature about ischemia-modified albumin (IMA) and hepatocyte growth factor (HGF) levels in amniotic fluid for Down syndrome cases. The aim of this study was to investigate the changes of IMA and HGF in Down syndrome cases at 16-20 weeks of gestation compared to normal fetuses.Methods: For this prospective case-control study, following reaching the number of 20 women (study group) who had the prenatal diagnosis of Down syndrome, maternal and gestational age-matched pregnant women with normal constitutional karyotype were selected for the control group (n = 74) from the stored amniotic fluid samples. Results: Mean women and gestational ages were comparable between the two groups. Amniotic fluid IMA (1.32 ± 0.13 vs. 1.11 ± 0.11 ABSU, respectively, p < 0.001) and HGF (2743.53 ± 1389.28 vs. 2160.12 ± 654.63 pg/mL, respectively, p = 0.008). Levels were significantly higher in pregnant women having Down syndrome fetuses compared with women having normal fetuses. The amniotic fluid IMA levels for the diagnosis of Down syndrome, and the sensitivity and specificity were calculated as 95.0% and 71.6% for the limit value 1.171 cm3, respectively. Conclusion: In cases with suspected Down syndrome, the diagnosis of Down Syndrome may be made in approximately 1 hour with high sensitivity and specificity by measuring the IMA level in the amniotic fluid sample taken for fetal karyotyping

Heighten Science Publications Inc., USA

Analytic Biosurveillance Methods for Resource-Limited Settings

Author: Burkom Howard
Coberly Jacqueline
Elbert Yevgeniy
Guven Erhan
Publication venue: 'University of Illinois Libraries'
Publication date: 03/03/2014
Field of study

The authors describe the challenges of disease surveillance in settings lacking infrastructure and access to medical care. They address the role of analytic methods and evaluate open-source temporal alerting algorithms chosen for the Suite for Automated Global Electronic bioSurveillance (SAGES), collection of modular, freely-available software tools to enable electronic surveillance in these settings. An algorithm test-bed is described and used to compare algorithm alerting performance for both daily and weekly data streams. Multiple detection performance measures are defined, and a practical means of combining them is applied to recommend preferred alerting methods for common scenarios

University of Illinois at Chicago: Journals@UIC

Crossref

PubMed Central