Search CORE

27 research outputs found

GoSum: Extractive Summarization of Long Documents by Reinforcement Learning and Graph Organized discourse state

Author: Bian Junyi
Huang Xiaodi
Zhou Hong
Zhu Shanfeng
Publication venue
Publication date: 20/01/2023
Field of study

Extracting summaries from long documents can be regarded as sentence classification using the structural information of the documents. How to use such structural information to summarize a document is challenging. In this paper, we propose GoSum, a novel graph and reinforcement learning based extractive model for long-paper summarization. In particular, GoSum encodes sentence states in reinforcement learning by building a heterogeneous graph for each input document at different discourse levels. An edge in the graph reflects the discourse hierarchy of a document for restraining the semantic drifts across section boundaries. We evaluate GoSum on two datasets of scientific articles summarization: PubMed and arXiv. The experimental results have demonstrated that GoSum achieve state-of-the-art results compared with strong baselines of both extractive and abstractive models. The ablation studies further validate that the performance of our GoSum benefits from the use of discourse information

arXiv.org e-Print Archive

HELLaMA: LLaMA-based Table to Text Generation by Highlighting the Important Evidence

Author: Bian Junyi
Huang Mengzuo
Qin Xiaolei
Zhang Weidong
Zou Wuhe
Publication venue
Publication date: 15/11/2023
Field of study

Large models have demonstrated significant progress across various domains, particularly in tasks related to text generation. In the domain of Table to Text, many Large Language Model (LLM)-based methods currently resort to modifying prompts to invoke public APIs, incurring potential costs and information leaks. With the advent of open-source large models, fine-tuning LLMs has become feasible. In this study, we conducted parameter-efficient fine-tuning on the LLaMA2 model. Distinguishing itself from previous fine-tuning-based table-to-text methods, our approach involves injecting reasoning information into the input by emphasizing table-specific row data. Our model consists of two modules: 1) a table reasoner that identifies relevant row evidence, and 2) a table summarizer that generates sentences based on the highlighted table. To facilitate this, we propose a search strategy to construct reasoning labels for training the table reasoner. On both the FetaQA and QTSumm datasets, our approach achieved state-of-the-art results. Additionally, we observed that highlighting input tables significantly enhances the model's performance and provides valuable interpretability

arXiv.org e-Print Archive

Biomedical Entity Recognition by Detection and Matching

Author: Bian Junyi
Huang Tianyang
Jiang Rongze
Zhai Weiqi
Zhou Hong
Zhu Shanfeng
Publication venue
Publication date: 27/06/2023
Field of study

Biomedical named entity recognition (BNER) serves as the foundation for numerous biomedical text mining tasks. Unlike general NER, BNER require a comprehensive grasp of the domain, and incorporating external knowledge beyond training data poses a significant challenge. In this study, we propose a novel BNER framework called DMNER. By leveraging existing entity representation models SAPBERT, we tackle BNER as a two-step process: entity boundary detection and biomedical entity matching. DMNER exhibits applicability across multiple NER scenarios: 1) In supervised NER, we observe that DMNER effectively rectifies the output of baseline NER models, thereby further enhancing performance. 2) In distantly supervised NER, combining MRC and AutoNER as span boundary detectors enables DMNER to achieve satisfactory results. 3) For training NER by merging multiple datasets, we adopt a framework similar to DS-NER but additionally leverage ChatGPT to obtain high-quality phrases in the training. Through extensive experiments conducted on 10 benchmark datasets, we demonstrate the versatility and effectiveness of DMNER.Comment: 9 pages content, 2 pages appendi

arXiv.org e-Print Archive

Proteomics study of changes in soybean lines resistant and sensitive to Phytophthora sojae

Author: Bian XiaoChun
Gai JunYi
Shen Qi
Xiang Yang
Xing Han
Zhang YuMei
Zhao JinMing
Zuo QiaoMei
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background <it>Phytophthora sojae </it>causes soybean root and stem rot, resulting in an annual loss of 1-2 billion US dollars in soybean production worldwide. A proteomic technique was used to determine the effects on soybean hypocotyls of infection with <it>P. sojae</it>. Results In the present study, 46 differentially expressed proteins were identified in soybean hypocotyls infected with <it>P. sojae</it>, using two-dimensional electrophoresis and matrix-assisted laser desorption/ionization tandem time of flight (MALDI-TOF/TOF). The expression levels of 26 proteins were significantly affected at various time points in the tolerant soybean line, Yudou25, (12 up-regulated and 14 down-regulated). In contrast, in the sensitive soybean line, NG6255, only 20 proteins were significantly affected (11 up-regulated and 9 down-regulated). Among these proteins, 26% were related to energy regulation, 15% to protein destination and storage, 11% to defense against disease, 11% to metabolism, 9% to protein synthesis, 4% to secondary metabolism, and 24% were of unknown function. Conclusion Our study provides important information on the use of proteomic methods for studying protein regulation during plant-oomycete interactions.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Self-organized Voids Revisited: Experimental Verification of the Formation Mechanism*

Author: Song Juan
Ye Junyi
Qian Mengdi
Luo Fangfang
Li Xian
Bian Huadong
Dai Ye
Ma Guo-hong
Chen Qingxi
Jiang Yan
Zhao Quanzhong
Qiu Jianrong
Publication venue
Publication date: 01/01/2004
Field of study

In this paper, several experiments were conducted to further clarify the formation mechanism of self organized void array induced by a single laser beam, including energy-related experiments, refractive-index-contrast-related experiments, depth-related experiments and effective-numerical-aperture experiment. These experiments indicate that the interface spherical aberration is indeed responsible for the formation of void arrays

arXiv.org e-Print Archive

Shanghai Institute of Optics and Fine Mechanics,Chinese Academy of Sciences

CiteSeerX

Crossref

UvA-DARE

International Migration, Integration and Social Cohesion online publications

The Tarenaya hassleriana

Author: Amey S. Bhide
Andrea Bräutigam
Andreas P.M. Weber
Annette Becker
Canan Kuelahoglu
Chao Bian
Chengcheng Shi
Erik van den Bergh
Gengyun Zhang
Guangyi Fan
Hongfeng Zou
Jiajia Xu
Jing Chen
Jocelyn C. Hall
Johannes Hofberger
Julian M. Hibberd
Junyi Wang
Kerstin Kaufmann
M. Eric Schranz
Mingju Lv
Peng Zeng
Shifeng Cheng
Suzanne de Bruijn
Wujiao Li
Xiao Zhong
Xin Liu
Xin-Guang Zhu
Xun Xu
Yimin Tao
Zhijun Zheng
Zhiwu Quan
Publication venue: 'American Society of Plant Biologists (ASPB)'
Publication date
Field of study

Crossref

The United States COVID-19 Forecast Hub dataset

Author: Abbott Sam
Abu-Mostafa Yaser
Adee Madeline
Adhikari Bijaya
Adiga Aniruddha
Arik Sercan O.
Asplund John
Ayer Turgay
Baccam Prasith
Baek Jackie
Baer Thomas M.
Ban Xuegang
Bannur Nayana
Barber Ryan
Bathwal Rahil
Baxter Arden
Bejar Benjamín
Belov Artur A.
Ben-Nun Michal
Bennouna Amine
Berlin Abraham
Bertsimas Dimitris
Bhatia Sangeeta
Bian Jiang
Biegel Hannah
Bien Jacob
Biggerstaff Matthew
Bosch Jurgen
Bosse Nikos I.
Bouardi Hamza Tazi
Bracher Johannes
Brennen Andrea
Brenner Michael
Brooks Logan
Budzinski Jozef
Burant John C.
Cao Duy
Cao Wei
Castro Lauren
Cavany Sean
Cegan Jeffrey C.
Celi Leo A.
Chang Nicholas A.
Chattopadhyay Ishanu
Chen Jinghui
Chen Samuel
Chen YangQuan
Chen Ye
Chen Yixian
Chhatwal Jagpreet
Chiang Wen-Hao
Chinazzi Matteo
Chintanippu Krishna
Chitta Pavan
Cho Jae H.
Choirat Christine
Chow Carson C.
Coram Marc
Cornell Matthew
Corsetti Sabrina M.
Cramer Estee Y.
Cui Jiaming
Dahan Maytal
Dalgic Ozden O.
Davis Jessica T.
DesRoches David
Dettwiller Ian D.
Deva Ayush
Drake John M.
Dusenberry Mike
Edwards Jessie K.
Eisenberg Marisa C.
England William P.
Epshteyn Arkady
Erickson Anne
España Guido
Fairchild Geoffrey
Falb Karl
Faraone Stephen V.
Farias Vivek
Farthing Matthew W.
Ferres Juan Lavista
Flahault Antoine
Fong Chung-Yan
Forli Pedro
Fox Spencer
Funk Sebastian
Gaikedu Emmanuela
Gaither Kelly
Galasso Joseph
Gandhi Parth D.
Gao Junyi
Gao Lei
Gao Liyao
Gao Zhifeng
Gardner Lauren
George Glover E.
Georgescu Andreea
Gerding Aaron
Gerkin Richard C.
Gibson Graham Casey
Glass Lucas
Gneiting Tilmann
Goel Sumit
Gowda Jethin
Grantz Kyra H.
Green Alden
Gu Quanquan
Gu Youyang
Gu Zhiling
Guertin Stephanie L.
Guo Lihong
Gurung Heidi L.
Hamory Bruce
Hay Simon
Hellewell Joel
Hess Jonathan
Hill Alison L.
Hlavacek William
Ho Lam
Hong Qi-Jun
House Katie
Hu Addison J.
Huang Yi
Huang Yitao
Huang Yuxin
Hulme-Lowe Christopher
Hulse Juan Dent
Hunter Robert H.
Hurt Benjamin
Hussain Fazle
Huynh Huong
Ibrahim Mark
Ivy Julie S.
Jadbabaie Ali
Jahja Maria
Jain Chaman
Jain Chandini
Jain Sansiddh
Jayawardena Dasuni
Jin Qixuan
Jin Xiaoyong
Jivane Viresh
Jo Areum
Jo HyeongChan
Johansson Michael A.
Joshi Keya
Kalantari Rahi
Kaminsky Joshua
Kaminsky Kathryn
Kanal Elli
Kanji Abdul Hannan
Karimzadeh Morteza
Karlen Dean
Keegan Lindsay T.
Keskinocak Pinar
Khan Zeina
Khandelwal Ayush
Khurana Ankita
Kim Juhyun
Kim Myungjin
Kinsey Matt
Klein Ellen
Koyluoglu Ugur
Kraus Andrea
Kraus David
Krymova Ekaterina
Kulkarni Mihir
Kulkarni Pranav
Kumar Ajay
Kyriakides Christina
Lachmann Michael
Lacroix Timothee
Ladd Mary A.
Lafferty Brandon
Lakhani Anshul
Lami Omar Skali
Lauer Stephen A.
Le Khoa
Le Long T.
Le Matthew
Lee Elizabeth C.
Lee Gavin
Lega Joceline
Leis Helen
Lemaitre Joseph C.
Lessler Justin
Levi Retsef
Lewis Bryan
Li Chaozhuo
Li Chun-Liang
Li Michael L.
Li Xinyi
Liao Jason
Lim Steve
Lin Yen Ting
Linas Benjamin P.
Linkov Igor
Liu Tie-Yan
Lopez Velma K.
Lu Guoqing
Lucas Benjamin
Lushtak Samuel M.
Ma Yian
Mallela Abhishek
Manetti Elisa
Mann Ethan
Marathe Madhav
Marshall Maximilian
Martin Emily T.
Mayo Michael L.
Mayorga Maria E.
McAndrew Thomas
McCauley Ella
McConnell Steve
McDonald Daniel
Meakin Sophie R.
Mehrotra Prakhar
Mele Jessica
Meredith Hannah R.
Merugu Srujana
Meyers Lauren Ancel
Michaud Isaac
Miller Ely
Milliken John
Mody Vidhi
Mody Vrushti
Mohler George
Moloney Michael
Moore Sean
Morgan James
Morley Christopher P.
Mu Kunpeng
Mueller Peter
Mullany Luke C.
Murray Chris
Myers Robert L.
Mühlemann Anja
Nagraj V. P.
Namigai Kristen
Narasimhan Balasubramanian
Ndong David Nze
Neumann Jacob
Ngo Thoai
Nickel Maximilian
Niemi Jarad
Nirgudkar Ninad
Nixon Kristen
Nouvellet Pierre
Obozinski Guillaume
Oidtman Rachel
Oruc Buse Eylul
Osthus Dave
Ozcan Gokce
O’Dea Eamon B.
Pagano Robert
Panaggio Mark J.
Parno Matthew D.
Pasumarty Sujitha
Peddireddy Akhil Sai
Penna Nicolas D.
Perakis Georgia
Perez-Saez Javier
Perkins Alex
Pfeiffer Ruth
Pfister Tomas
Pigott David
Piontti Ana Pastore y
Piriya Matthew
Piwonka Noah
Politsch Collin
Popken Max
Porebski Przemyslaw
Posner Richard
Prakash B. Aditya
Qian Cheng
Rainwater-Lovett Kaitlin
Rajanala Samyak
Raval Alpan
Ravi Matt
Ray Evan L.
Reich Nicholas G.
Reich Nicholas G.
Reiner Robert C.
Riley Pete
Riley Steven
Rivadeneira Alvaro J. Castro
Rodríguez Alexander
Romberg Justin
Rosenstrom Erik T.
Rowland Michael A.
Rumack Aaron
Sagun Levent
Salekin Asif
Sarker Arnab
Schrader Chris
Schwarz Tom
Scott James G.
Sen Pei
Serban Nicoleta
Shah Apurv
Shah Devavrat
Shah Sam
Shakhnovich Elizabeth
Shaman Jeffrey
Sharma Rakshith
Sheldon Daniel
Sherratt Katharine
Shi Yunfeng
Shin Lauren
Shingi Siddhant
Shrivastav Monika
Siegel Daniel
Simon Noah
Singhvi Divya
Sinha Deeksha
Sinha Rajarishi
Slayton Rachel B.
Smith Claire P.
Soni Saksham
Soohoo Connor
Spaeder Jeffrey
Spantidakis Ioannis
Spatz Ryan
Srivastava Ajitesh
Stage Steven A.
Stark Ariane
Stiefeling Chris
Suchoski Bradley T.
Sumner Timothy
Sun Jimeng
Sun Tao
Sundar Saketh
Swann Julie L.
Tabassum Anika
Tallaksen Katharine
Tec Mauricio
Thanou Dorina
Thayaparan Leann
Tibshirani Rob
Tibshirani Ryan J.
Tirumala Kushal
Tiwari Avtansh
Tomar Vishal
Tran Quoc
Truelove Shaun A.
Trump Benjamin D.
Tsai Thomas
Tseng Albert
Tsiourvas Asterios
Turner Stephen D.
Turtle James
US COVID-19 Forecast Hub Consortium
Vahedi Behzad
Van Bussel Frank
van de Walle Axel
Varadarajan Vignesh
Venkatramanan Srinivasan
Ventura Valerie
Vespignani Alessandro
Vytheeswaran Jagath
Walker Jo W.
Walraven Robert
Wang Christopher
Wang Dongdong
Wang Dongliang
Wang Guannan
Wang Lijing
Wang Lily
Wang Lingxiao
Wang Liqiang
Wang Qinxia
Wang Yijin
Wang Yu-Xiang
Wang Yuanjia
Wang Yueying
Wang Zhongying
Wasserman Larry
Wattanchit Nutcha
Weisberg Shane
White Jerome
Wilde Joshua
Wilkinson Barrie
Wills Josh
Wilson Austin
Wilson Daniel
Wilson Shelby
Wolffram Daniel
Wolfinger Russ
Wong Alexander
Woody Spencer
Wu Dongxia
Xiao Cao
Xiao Jade
Xie Jiajia
Xie Shanghong
Xie Xing
Xiong Xinyue
Xu Pan
Xu Tianjian
Yamana Teresa K.
Yan Xifeng
Yeluri Akshay
Yeung Dit-Yan
Yoder Nate
Yogurtcu Osman N.
Yoon Jinsung
You Jialu
Yu Rose
Yu Shan
Yurk Dominic
Zeng Donglin
Zhang Leyou
Zhang Michael
Zhang Shun
Zhang Shunpu
Zhang Weitong
Zhang-James Yanli
Zhao Yanting
Zheng Andrew
Zheng Shun
Zhou Mingyuan
Zimmerman Peter
Zlokapa Alexander
Zoraghein Hamidreza
Zorn Martha W.
Zou Difan
Zou Zihang
Publication venue: Nature Research
Publication date: 17/08/2022
Field of study

Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at county, state, and national, levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages

KITopen