Search CORE

29 research outputs found

Leveraging Uncertainty Estimates To Improve Classifier Performance

Author: Arora Gundeep
Merugu Srujana
Rastogi Rajeev
Saladi Anoop
Publication venue
Publication date: 20/11/2023
Field of study

Binary classification involves predicting the label of an instance based on whether the model score for the positive class exceeds a threshold chosen based on the application requirements (e.g., maximizing recall for a precision bound). However, model scores are often not aligned with the true positivity rate. This is especially true when the training involves a differential sampling across classes or there is distributional drift between train and test settings. In this paper, we provide theoretical analysis and empirical evidence of the dependence of model score estimation bias on both uncertainty and score itself. Further, we formulate the decision boundary selection in terms of both model score and uncertainty, prove that it is NP-hard, and present algorithms based on dynamic programming and isotonic regression. Evaluation of the proposed algorithms on three real-world datasets yield 25%-40% gain in recall at high precision bounds over the traditional approach of using model score alone, highlighting the benefits of leveraging uncertainty

arXiv.org e-Print Archive

A privacy-sensitive approach to distributed clustering

Author: Azoury
Chan
Cover
Dempster
Ghosh
Joydeep Ghosh
Papoulis
Pinkas
Srujana Merugu
Strehl
Yamanishi
Zhong
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States

Author: Abbott Sam
Abernethy Neil F.
Adee Madeline
Adhikari Bijaya
Arik Sercan O.
Asplund John
Ayer Turgay
Baccam Prasith
Baek Jackie
Baer Thomas M.
Ban Xuegang
Bannur Nayana
Barber Ryan
Baxter Arden
Ben-Nun Michal
Bennouna Mohammed Amine
Bertsimas Dimitris
Bian Jiang
Biegel Hannah
Bien Jacob
Biggerstaff Matthew
Bosse Nikos I.
Bracher Johannes
Brennen Andrea
Brooks Logan
Burant John C.
Cao Wei
Castro Rivadeneira Alvaro J.
Castro Lauren
Cavany Sean
Cegan Jeffrey C.
Celi Leo A.
Chen Jinghui
Chen Samuel
Chen YangQuan
Chhatwal Jagpreet
Chinazzi Matteo
Corsetti Sabrina M.
Cramer Estee Y.
Cui Jiaming
Dahan Maytal
Dalgic Ozden O.
Davis Jessica T.
Della Penna Nicolas
Dent Juan
DesRoches David
Dettwiller Ian D.
Deva Ayush
Drake John M.
Dusenberry Mike
Eisenberg Marisa C.
England William P.
Epshteyn Arkady
España Guido
Fairchild Geoffrey
Falb Karl
Faraone Stephen V.
Farias Vivek
Farthing Matthew W.
Forli Pedro
Fox Spencer
Funk Sebastian
Gaither Kelly
Gakidou Emmanuela
Gao Lei
Gao Liyao
Gao Zhifeng
Gardner Lauren
George Glover E.
Georgescu Andreea
Gerding Aaron
Gibson Graham Casey
Gneiting Tilmann
Grantz Kyra H.
Green Alden
Gu Quanquan
Gu Youyang
Gu Zhiling
Guertin Stephanie L.
Guo Lihong
Gurung Heidi L.
Hamory Bruce
Hay Simon I.
Hellewell Joel
Hess Jonathan
Hill Alison L.
Ho Lam Si Tung
Hong Qi-Jun
House Katie H.
Hu Addison J.
Huang Yitao
Huang Yuxin
Hulme-Lowe Christopher
Hunter Robert H.
Huynh Huong
Jadbabaie Ali
Jahja Maria
Jain Sansiddh
Jayawardena Dasuni
Jin Xiaoyong
Johansson Michael A.
Kalantari Rahi
Kaminsky Joshua
Kaminsky Kathryn
Kanal Elli
Kanji Abdul H.
Karlen Dean
Keegan Lindsay T.
Keskinocak Pinar
Khandelwal Ayush
Kim Myungjin
Kinsey Matt
Kong Stanley
Koyluoglu Ugur
Kraus Andrea
Kraus David
Kulkarni Mihir
Kyriakides Christina
Lachmann Michael
Ladd Mary A.
Lafferty Brandon
Lauer Stephen A.
Lavista Ferres Juan
Le Khoa
Le Long T.
Lee Elizabeth C.
Lega Joceline
Leis Helen
Lemaitre Joseph C.
Lessler Justin
Levi Retsef
Li Chaozhuo
Li Chun-Liang
Li Michael L.
Li Xinyi
Lim Steve
Linas Benjamin P.
Linkov Igor
Liu Tie-Yan
Lopez Velma K.
Ma Yian
Marshall Maximilian
Martin Emily T.
Mayo Michael L.
McCauley Ella
McConnell Steve
McDonald Daniel
Meakin Sophie R.
Meredith Hannah R.
Merugu Srujana
Meyers Lauren Ancel
Michaud Isaac
Milliken John
Moloney Michael
Moore Sean
Morgan James
Morley Christopher P.
Mu Kunpeng
Mueller Peter
Mullany Luke C.
Murray Chris
Myers Robert L.
Mühlemann Anja
Nagraj V. P.
Narasimhan Balasubramanian
Niemi Jarad
Nirgudkar Ninad
Nixon Kristen
Nze-Ndong David
Oidtman Rachel
Oruc Buse Eylul
Osthus Dave
Ozcan Gokce
O’Dea Eamon B.
Pagano Robert
Parno Matthew D.
Pastore y Piontti Ana
Pei Sen
Perakis Georgia
Perez-Saez Javier
Perkins Alex
Pfister Tomas
Pigott David
Piwonka Noah
Politsch Collin
Prakash B. Aditya
Rainwater-Lovett Kaitlin
Rajanala Samyak
Raval Alpan
Ravi Matt
Ray Evan L.
Reich Nicholas G.
Reiner Robert C.
Riley Pete
Riley Steven
Rodríguez Alexander
Rowland Michael A.
Rumack Aaron
Salekin Asif
Sarker Arnab
Sava Dario
Schrader Chris
Schwarz Tom
Scott James G.
Serban Nicoleta
Shah Apurv
Shah Devavrat
Shah Sam
Shakhnovich Elizabeth
Shaman Jeffrey
Sheldon Daniel
Sherratt Katharine
Shi Yunfeng
Shin Lauren
Shingi Siddhant
Siegel Daniel
Simon Noah
Singhvi Divya
Sinha Deeksha
Sinha Rajarishi
Skali Lami Omar
Slayton Rachel B.
Smith Claire P.
Soni Saksham
Spantidakis Ioannis
Spatz Ryan
Srivastava Ajitesh
Stage Steven A.
Stark Ariane
Stiefeling Chris
Suchoski Bradley T.
Sundar Saketh
Tabassum Anika
Tallaksen Katharine
Tazi Bouardi Hamza
Tec Mauricio
Thayaparan Leann
Tibshirani Rob
Tibshirani Ryan J.
Tiwari Avtansh
Tran Quoc T.
Truelove Shaun A.
Trump Benjamin D.
Tsai Thomas
Tsiourvas Asterios
Turner Stephen D.
Turtle James A.
van de Walle Axel
Ventura Valerie
Vespignani Alessandro
Walker Jo W.
Walraven Robert
Wang Dongliang
Wang Guannan
Wang Lily
Wang Lingxiao
Wang Qinxia
Wang Yijin
Wang Yu-Xiang
Wang Yuanjia
Wang Yueying
Wasserman Larry
Wattanachit Nutcha
White Jerome
Wilde Joshua
Wilkinson Barrie
Wills Josh
Wilson Shelby
Wolfinger Russ
Wong Alexander
Woody Spencer
Wu Dongxia
Xiao Jade
Xie Jiajia
Xie Shanghong
Xie Xing
Xiong Xinyue
Xu Pan
Yamana Teresa K.
Yan Xifeng
Yoder Nate
Yoon Jinsung
Yu Rose
Yu Shan
Zeng Donglin
Zhang Leyou
Zhang Shun
Zhang Weitong
Zhang-James Yanli
Zhao Yanting
Zheng Andrew
Zheng Shun
Zhou Mingyuan
Zorn Martha W.
Zou Difan
Publication venue: National Academy of Sciences
Publication date: 04/05/2022
Field of study

Short-term probabilistic forecasts of the trajectory of the COVID-19 pandemic in the United States have served as a visible and important communication channel between the scientific modeling community and both the general public and decision-makers. Forecasting models provide specific, quantitative, and evaluable predictions that inform short-term decisions such as healthcare staffing needs, school closures, and allocation of medical supplies. Starting in April 2020, the US COVID-19 Forecast Hub (https://covid19forecasthub.org/) collected, disseminated, and synthesized tens of millions of specific predictions from more than 90 different academic, industry, and independent research groups. A multimodel ensemble forecast that combined predictions from dozens of groups every week provided the most consistently accurate probabilistic forecasts of incident deaths due to COVID-19 at the state and national level from April 2020 through October 2021. The performance of 27 individual models that submitted complete forecasts of COVID-19 deaths consistently throughout this year showed high variability in forecast skill across time, geospatial units, and forecast horizons. Two-thirds of the models evaluated showed better accuracy than a naïve baseline model. Forecast accuracy degraded as models made predictions further into the future, with probabilistic error at a 20-wk horizon three to five times larger than when predicting at a 1-wk horizon. This project underscores the role that collaboration and active coordination between governmental public-health agencies, academic modeling teams, and industry partners can play in developing modern modeling capabilities to support local, state, and federal response to outbreaks

KITopen

The United States COVID-19 Forecast Hub dataset

Author: Abbott Sam
Abu-Mostafa Yaser
Adee Madeline
Adhikari Bijaya
Adiga Aniruddha
Arik Sercan O.
Asplund John
Ayer Turgay
Baccam Prasith
Baek Jackie
Baer Thomas M.
Ban Xuegang
Bannur Nayana
Barber Ryan
Bathwal Rahil
Baxter Arden
Bejar Benjamín
Belov Artur A.
Ben-Nun Michal
Bennouna Amine
Berlin Abraham
Bertsimas Dimitris
Bhatia Sangeeta
Bian Jiang
Biegel Hannah
Bien Jacob
Biggerstaff Matthew
Bosch Jurgen
Bosse Nikos I.
Bouardi Hamza Tazi
Bracher Johannes
Brennen Andrea
Brenner Michael
Brooks Logan
Budzinski Jozef
Burant John C.
Cao Duy
Cao Wei
Castro Lauren
Cavany Sean
Cegan Jeffrey C.
Celi Leo A.
Chang Nicholas A.
Chattopadhyay Ishanu
Chen Jinghui
Chen Samuel
Chen YangQuan
Chen Ye
Chen Yixian
Chhatwal Jagpreet
Chiang Wen-Hao
Chinazzi Matteo
Chintanippu Krishna
Chitta Pavan
Cho Jae H.
Choirat Christine
Chow Carson C.
Coram Marc
Cornell Matthew
Corsetti Sabrina M.
Cramer Estee Y.
Cui Jiaming
Dahan Maytal
Dalgic Ozden O.
Davis Jessica T.
DesRoches David
Dettwiller Ian D.
Deva Ayush
Drake John M.
Dusenberry Mike
Edwards Jessie K.
Eisenberg Marisa C.
England William P.
Epshteyn Arkady
Erickson Anne
España Guido
Fairchild Geoffrey
Falb Karl
Faraone Stephen V.
Farias Vivek
Farthing Matthew W.
Ferres Juan Lavista
Flahault Antoine
Fong Chung-Yan
Forli Pedro
Fox Spencer
Funk Sebastian
Gaikedu Emmanuela
Gaither Kelly
Galasso Joseph
Gandhi Parth D.
Gao Junyi
Gao Lei
Gao Liyao
Gao Zhifeng
Gardner Lauren
George Glover E.
Georgescu Andreea
Gerding Aaron
Gerkin Richard C.
Gibson Graham Casey
Glass Lucas
Gneiting Tilmann
Goel Sumit
Gowda Jethin
Grantz Kyra H.
Green Alden
Gu Quanquan
Gu Youyang
Gu Zhiling
Guertin Stephanie L.
Guo Lihong
Gurung Heidi L.
Hamory Bruce
Hay Simon
Hellewell Joel
Hess Jonathan
Hill Alison L.
Hlavacek William
Ho Lam
Hong Qi-Jun
House Katie
Hu Addison J.
Huang Yi
Huang Yitao
Huang Yuxin
Hulme-Lowe Christopher
Hulse Juan Dent
Hunter Robert H.
Hurt Benjamin
Hussain Fazle
Huynh Huong
Ibrahim Mark
Ivy Julie S.
Jadbabaie Ali
Jahja Maria
Jain Chaman
Jain Chandini
Jain Sansiddh
Jayawardena Dasuni
Jin Qixuan
Jin Xiaoyong
Jivane Viresh
Jo Areum
Jo HyeongChan
Johansson Michael A.
Joshi Keya
Kalantari Rahi
Kaminsky Joshua
Kaminsky Kathryn
Kanal Elli
Kanji Abdul Hannan
Karimzadeh Morteza
Karlen Dean
Keegan Lindsay T.
Keskinocak Pinar
Khan Zeina
Khandelwal Ayush
Khurana Ankita
Kim Juhyun
Kim Myungjin
Kinsey Matt
Klein Ellen
Koyluoglu Ugur
Kraus Andrea
Kraus David
Krymova Ekaterina
Kulkarni Mihir
Kulkarni Pranav
Kumar Ajay
Kyriakides Christina
Lachmann Michael
Lacroix Timothee
Ladd Mary A.
Lafferty Brandon
Lakhani Anshul
Lami Omar Skali
Lauer Stephen A.
Le Khoa
Le Long T.
Le Matthew
Lee Elizabeth C.
Lee Gavin
Lega Joceline
Leis Helen
Lemaitre Joseph C.
Lessler Justin
Levi Retsef
Lewis Bryan
Li Chaozhuo
Li Chun-Liang
Li Michael L.
Li Xinyi
Liao Jason
Lim Steve
Lin Yen Ting
Linas Benjamin P.
Linkov Igor
Liu Tie-Yan
Lopez Velma K.
Lu Guoqing
Lucas Benjamin
Lushtak Samuel M.
Ma Yian
Mallela Abhishek
Manetti Elisa
Mann Ethan
Marathe Madhav
Marshall Maximilian
Martin Emily T.
Mayo Michael L.
Mayorga Maria E.
McAndrew Thomas
McCauley Ella
McConnell Steve
McDonald Daniel
Meakin Sophie R.
Mehrotra Prakhar
Mele Jessica
Meredith Hannah R.
Merugu Srujana
Meyers Lauren Ancel
Michaud Isaac
Miller Ely
Milliken John
Mody Vidhi
Mody Vrushti
Mohler George
Moloney Michael
Moore Sean
Morgan James
Morley Christopher P.
Mu Kunpeng
Mueller Peter
Mullany Luke C.
Murray Chris
Myers Robert L.
Mühlemann Anja
Nagraj V. P.
Namigai Kristen
Narasimhan Balasubramanian
Ndong David Nze
Neumann Jacob
Ngo Thoai
Nickel Maximilian
Niemi Jarad
Nirgudkar Ninad
Nixon Kristen
Nouvellet Pierre
Obozinski Guillaume
Oidtman Rachel
Oruc Buse Eylul
Osthus Dave
Ozcan Gokce
O’Dea Eamon B.
Pagano Robert
Panaggio Mark J.
Parno Matthew D.
Pasumarty Sujitha
Peddireddy Akhil Sai
Penna Nicolas D.
Perakis Georgia
Perez-Saez Javier
Perkins Alex
Pfeiffer Ruth
Pfister Tomas
Pigott David
Piontti Ana Pastore y
Piriya Matthew
Piwonka Noah
Politsch Collin
Popken Max
Porebski Przemyslaw
Posner Richard
Prakash B. Aditya
Qian Cheng
Rainwater-Lovett Kaitlin
Rajanala Samyak
Raval Alpan
Ravi Matt
Ray Evan L.
Reich Nicholas G.
Reich Nicholas G.
Reiner Robert C.
Riley Pete
Riley Steven
Rivadeneira Alvaro J. Castro
Rodríguez Alexander
Romberg Justin
Rosenstrom Erik T.
Rowland Michael A.
Rumack Aaron
Sagun Levent
Salekin Asif
Sarker Arnab
Schrader Chris
Schwarz Tom
Scott James G.
Sen Pei
Serban Nicoleta
Shah Apurv
Shah Devavrat
Shah Sam
Shakhnovich Elizabeth
Shaman Jeffrey
Sharma Rakshith
Sheldon Daniel
Sherratt Katharine
Shi Yunfeng
Shin Lauren
Shingi Siddhant
Shrivastav Monika
Siegel Daniel
Simon Noah
Singhvi Divya
Sinha Deeksha
Sinha Rajarishi
Slayton Rachel B.
Smith Claire P.
Soni Saksham
Soohoo Connor
Spaeder Jeffrey
Spantidakis Ioannis
Spatz Ryan
Srivastava Ajitesh
Stage Steven A.
Stark Ariane
Stiefeling Chris
Suchoski Bradley T.
Sumner Timothy
Sun Jimeng
Sun Tao
Sundar Saketh
Swann Julie L.
Tabassum Anika
Tallaksen Katharine
Tec Mauricio
Thanou Dorina
Thayaparan Leann
Tibshirani Rob
Tibshirani Ryan J.
Tirumala Kushal
Tiwari Avtansh
Tomar Vishal
Tran Quoc
Truelove Shaun A.
Trump Benjamin D.
Tsai Thomas
Tseng Albert
Tsiourvas Asterios
Turner Stephen D.
Turtle James
US COVID-19 Forecast Hub Consortium
Vahedi Behzad
Van Bussel Frank
van de Walle Axel
Varadarajan Vignesh
Venkatramanan Srinivasan
Ventura Valerie
Vespignani Alessandro
Vytheeswaran Jagath
Walker Jo W.
Walraven Robert
Wang Christopher
Wang Dongdong
Wang Dongliang
Wang Guannan
Wang Lijing
Wang Lily
Wang Lingxiao
Wang Liqiang
Wang Qinxia
Wang Yijin
Wang Yu-Xiang
Wang Yuanjia
Wang Yueying
Wang Zhongying
Wasserman Larry
Wattanchit Nutcha
Weisberg Shane
White Jerome
Wilde Joshua
Wilkinson Barrie
Wills Josh
Wilson Austin
Wilson Daniel
Wilson Shelby
Wolffram Daniel
Wolfinger Russ
Wong Alexander
Woody Spencer
Wu Dongxia
Xiao Cao
Xiao Jade
Xie Jiajia
Xie Shanghong
Xie Xing
Xiong Xinyue
Xu Pan
Xu Tianjian
Yamana Teresa K.
Yan Xifeng
Yeluri Akshay
Yeung Dit-Yan
Yoder Nate
Yogurtcu Osman N.
Yoon Jinsung
You Jialu
Yu Rose
Yu Shan
Yurk Dominic
Zeng Donglin
Zhang Leyou
Zhang Michael
Zhang Shun
Zhang Shunpu
Zhang Weitong
Zhang-James Yanli
Zhao Yanting
Zheng Andrew
Zheng Shun
Zhou Mingyuan
Zimmerman Peter
Zlokapa Alexander
Zoraghein Hamidreza
Zorn Martha W.
Zou Difan
Zou Zihang
Publication venue: Nature Research
Publication date: 17/08/2022
Field of study

Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at county, state, and national, levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages

KITopen

Recommended from our members

Distributed learning using generative models

Author: Merugu Srujana
Publication venue
Publication date: 01/01/2006
Field of study

textElectrical and Computer Engineerin

Texas ScholarWorks

A Distributed Learning Framework for Heterogeneous Data Sources

Author: Joydeep Ghosh
Srujana Merugu
Publication venue
Publication date: 01/01/2005
Field of study

We present a probabilistic model-based framework for distributed learning that takes into account privacy restrictions and is applicable to scenarios where the different sites have diverse, possibly overlapping subsets of features. Our framework decouples data privacy issues from knowledge integration issues by requiring the individual sites to share only privacy-safe probabilistic models of the local data, which are then integrated to obtain a global probabilistic model based on the union of the features available at all the sites. We provide a mathematical formulation of the model integration problem using the maximum likelihood and maximum entropy principles and describe iterative algorithms that are guaranteed to converge to the optimal solution. For certain commonly occurring special cases involving hierarchically ordered feature sets or conditional independence, we obtain closed form solutions and use these to propose an efficient alternative scheme by recursive decomposition of the model integration problem. To address interpretability concerns, we also present a modified formulation where the global model is assumed to belong to a specified parametric family. Finally, to highlight the generality of our framework, we provide empirical results for various learning tasks such as clustering and classification on different kinds of datasets consisting of continuous vector, categorical and directional attributes. The results show that high quality global models can be obtained without much loss of privacy

CiteSeerX