Search CORE

83 research outputs found

Far-field Super-resolution Chemical Microscopy

Author: Cheng Ji-Xin
Han Yubing
Jia Danchen
Tang Mingwei
Yang Qing
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/05/2023
Field of study

Far-field chemical microscopy providing molecular electronic or vibrational fingerprint information opens a new window for the study of three-dimensional biological, material, and chemical systems. Chemical microscopy provides a nondestructive way of chemical identification without exterior labels. However, the diffraction limit of optics hindered it from discovering more details under the resolution limit. Recent development of super-resolution techniques gives enlightenment to open this door behind far-field chemical microscopy. Here, we review recent advances that have pushed the boundary of far-field chemical microscopy in terms of spatial resolution. We further highlight applications in biomedical research, material characterization, environmental study, cultural heritage conservation, and integrated chip inspection.Comment: 34 pages, 8 figures,1 tabl

arXiv.org e-Print Archive

Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation

Author: Liu Junwei
Liu Mingwei
Lou Yiling
Peng Xin
Yuan Zhiqiang
Zi Qiancheng
Publication venue
Publication date: 02/08/2023
Field of study

In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension and generation tasks. We have the following main findings. First, for the zero-shot setting, instructed LLMs are very competitive on code comprehension and generation tasks and sometimes even better than small SOTA models specifically fine-tuned on each downstream task. We also find that larger instructed LLMs are not always better on code-related tasks. Second, for the few-shot setting, we find that adding demonstration examples substantially helps instructed LLMs perform better on most code comprehension and generation tasks; however, the examples would sometimes induce unstable or even worse performance. Furthermore, we find widely-used BM25-based shot selection strategy significantly outperforms the basic random selection or fixed selection only on generation problems. Third, for the fine-tuning setting, we find that fine-tuning could further improve the model performance on downstream code comprehension and generation tasks compared to the zero-shot/one-shot performance. In addition, after being fine-tuned on the same downstream task dataset, instructed LLMs outperform both the small SOTA models and similar-scaled LLMs without instruction tuning. Based on our findings, we further present practical implications on model and usage recommendation, performance and cost trade-offs, and future direction

arXiv.org e-Print Archive

Recommending Analogical APIs via Knowledge Graph Embedding

Author: Du Xueying
Liu Mingwei
Lou Yiling
Peng Xin
Yang Tianyong
Yang Yanjun
Zhou Zhong
Publication venue
Publication date: 22/08/2023
Field of study

Library migration, which re-implements the same software behavior by using a different library instead of using the current one, has been widely observed in software evolution. One essential part of library migration is to find an analogical API that could provide the same functionality as current ones. However, given the large number of libraries/APIs, manually finding an analogical API could be very time-consuming and error-prone. Researchers have developed multiple automated analogical API recommendation techniques. Documentation-based methods have particularly attracted significant interest. Despite their potential, these methods have limitations, such as a lack of comprehensive semantic understanding in documentation and scalability challenges. In this work, we propose KGE4AR, a novel documentation-based approach that leverages knowledge graph (KG) embedding to recommend analogical APIs during library migration. Specifically, KGE4AR proposes a novel unified API KG to comprehensively and structurally represent three types of knowledge in documentation, which can better capture the high-level semantics. Moreover, KGE4AR then proposes to embed the unified API KG into vectors, enabling more effective and scalable similarity calculation. We build KGE4AR' s unified API KG for 35,773 Java libraries and assess it in two API recommendation scenarios: with and without target libraries. Our results show that KGE4AR substantially outperforms state-of-the-art documentation-based techniques in both evaluation scenarios in terms of all metrics (e.g., 47.1%-143.0% and 11.7%-80.6% MRR improvements in each scenario). Additionally, we explore KGE4AR' s scalability, confirming its effective scaling with the growing number of libraries.Comment: Accepted by FSE 202

arXiv.org e-Print Archive

ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation

Author: Chen Yixuan
Du Xueying
Feng Jiayi
Liu Junwei
Liu Mingwei
Lou Yiling
Peng Xin
Sha Chaofeng
Wang Hanlin
Wang Kaixin
Publication venue
Publication date: 14/08/2023
Field of study

In this work, we make the first attempt to evaluate LLMs in a more challenging code generation scenario, i.e. class-level code generation. We first manually construct the first class-level code generation benchmark ClassEval of 100 class-level Python code generation tasks with approximately 500 person-hours. Based on it, we then perform the first study of 11 state-of-the-art LLMs on class-level code generation. Based on our results, we have the following main findings. First, we find that all existing LLMs show much worse performance on class-level code generation compared to on standalone method-level code generation benchmarks like HumanEval; and the method-level coding ability cannot equivalently reflect the class-level coding ability among LLMs. Second, we find that GPT-4 and GPT-3.5 still exhibit dominate superior than other LLMs on class-level code generation, and the second-tier models includes Instruct-Starcoder, Instruct-Codegen, and Wizardcoder with very similar performance. Third, we find that generating the entire class all at once (i.e. holistic generation strategy) is the best generation strategy only for GPT-4 and GPT-3.5, while method-by-method generation (i.e. incremental and compositional) is better strategies for the other models with limited ability of understanding long instructions and utilizing the middle information. Lastly, we find the limited model ability of generating method-dependent code and discuss the frequent error types in generated classes. Our benchmark is available at https://github.com/FudanSELab/ClassEval

arXiv.org e-Print Archive

Mechanism of biomass activation and ammonia modification for nitrogen-doped porous carbon materials

Author: Chen Hanping
Chen Wei
Chen Yingquan
Li Kaixu
Tu Xin
Xia Mingwei
Xia Sunwen
Yang Haiping
Publication venue: 'Elsevier BV'
Publication date: 01/05/2019
Field of study

University of Liverpool Repository

Bamboo wastes catalytic pyrolysis with N-doped biochar catalyst for phenols products

Author: Chen Hanping
Chen Wei
Chen Yingquan
Chen Zhiqun
Fang Yang
Gong Meng
Li Kaixu
Tu Xin
Xia Mingwei
Yang Haiping
Publication venue: 'Elsevier BV'
Publication date: 01/02/2020
Field of study

University of Liverpool Repository

Recommended from our members

Anthropogenic heat release due to energy consumption exacerbates European summer extreme high temperature

Author: Chen Bing
Liu Xiaohong
Lu Mingwei
Lu Zheng
Luo Tao
Song Xin
Wu Chenglai
Wu Xue
Yang Huiyi
Zhao Xiaoqing
Zheng Yuhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/04/2023
Field of study

Anthropogenic heat release (AHR) is the release of heat generated by anthropogenic energy consumption. The global mean flux of AHR is 0.03 W m−2, while AHR is geographically concentrated and fundamentally correlates with economic activ- ity; furthermore, AHR can reach a level sufficient for impacting regional even large-scale climate. In this study, the impacts of AHR on the summer European heatwaves (EHWs) are examined by using the Community Earth System Model version 1 (CESM1). The results show that in Europe, AHR increases the summer mean 2-m temperature by 0.26 °C and the surface minimum and maximum temperatures by 0.14 °C and 0.41 °C, respectively. AHR exacerbates the extreme high temperatures in the summer in Europe, increasing EHW days by 1–2 days in central and eastern Europe in the summer annually from 1992 to 2013. AHR strengthens the surface wind that flows from the ocean to the land in Europe by increasing the land surface temperatures. AHR decreases the lower-troposphere stability (LTS) and reduces the low-cloud amounts in Europe, which leads to more solar shortwave radiation reaching the surface. AHR affects water vapor and the surface energy balance in Europe, which impacts on European summer heatwaves further. AHR acts as a non-negligible factor for summer extreme high temperature in Europe and a potential factor impacting EHW days

Greenwich Academic Literature Archive

Effectiveness of social distancing measures and lockdowns for reducing transmission of COVID-19 in non-healthcare, community-based settings

Author: Bhatt Samir
Chen Dongxuan
Cheung Justin K.
Cowling Benjamin J.
Donnelly Christl A.
Gould Susan
Li Mingwei
Lim Way Wen
Mills Cathal
Murphy Caitriona
Wong Jessica Y.
Xie Yanmy
Xin Hualei
Publication venue: The Royal Society
Publication date: 23/08/2023
Field of study

Social distancing measures (SDMs) are community-level interventions that aim to reduce person-to-person contacts in the community. SDMs were a major part of the responses first to contain, then to mitigate, the spread of SARS-CoV-2 in the community. Common SDMs included limiting the size of gatherings, closing schools and/or workplaces, implementing work-from-home arrangements, or more stringent restrictions such as lockdowns. This systematic review summarized the evidence for the effectiveness of nine SDMs. Almost all of the studies included were observational in nature, which meant that there were intrinsic risks of bias that could have been avoided were conditions randomly assigned to study participants. There were no instances where only one form of SDM had been in place in a particular setting during the study period, making it challenging to estimate the separate effect of each intervention. The more stringent SDMs such as stay-at-home orders, restrictions on mass gatherings and closures were estimated to be most effective at reducing SARS-CoV-2 transmission. Most studies included in this review suggested that combinations of SDMs successfully slowed or even stopped SARS-CoV-2 transmission in the community. However, individual effects and optimal combinations of interventions, as well as the optimal timing for particular measures, require further investigation. This article is part of the theme issue 'The effectiveness of non-pharmaceutical interventions on the COVID-19 pandemic: the evidence'

LSTM Online Archive

Oxford University Research Archive

Ultrafast Radiographic Imaging and Tracking: An overview of instruments, methods, data, and applications

Ultrafast radiographic imaging and tracking (U-RadIT) use state-of-the-art ionizing particle and light sources to experimentally study sub-nanosecond dynamic processes in physics, chemistry, biology, geology, materials science and other fields. These processes, fundamental to nuclear fusion energy, advanced manufacturing, green transportation and others, often involve one mole or more atoms, and thus are challenging to compute by using the first principles of quantum physics or other forward models. One of the central problems in U-RadIT is to optimize information yield through, e.g. high-luminosity X-ray and particle sources, efficient imaging and tracking detectors, novel methods to collect data, and large-bandwidth online and offline data processing, regulated by the underlying physics, statistics, and computing power. We review and highlight recent progress in: a.) Detectors; b.) U-RadIT modalities; c.) Data and algorithms; and d.) Applications. Hardware-centric approaches to U-RadIT optimization are constrained by detector material properties, low signal-to-noise ratio, high cost and long development cycles of critical hardware components such as ASICs. Interpretation of experimental data, including comparisons with forward models, is frequently hindered by sparse measurements, model and measurement uncertainties, and noise. Alternatively, U-RadIT makes increasing use of data science and machine learning algorithms, including experimental implementations of compressed sensing. Machine learning and artificial intelligence approaches, refined by physics and materials information, may also contribute significantly to data interpretation, uncertainty quantification and U-RadIT optimization.Comment: 51 pages, 31 figures; Overview of ultrafast radiographic imaging and tracking as a part of ULITIMA 2023 conference, Mar. 13-16,2023, Menlo Park, CA, US

arXiv.org e-Print Archive

CERN Document Server

Role of Scrib and Dlg in anterior-posterior patterning of the follicular epithelium during Drosophila oogenesis

Abstract Background Proper patterning of the follicle cell epithelium over the egg chamber is essential for the <it>Drosophila </it>egg development. Differentiation of the epithelium into several distinct cell types along the anterior-posterior axis requires coordinated activities of multiple signaling pathways. Previously, we reported that <it>lethal(2)giant larvae </it>(<it>lgl</it>), a <it>Drosophila </it>tumor suppressor gene, is required in the follicle cells for the posterior follicle cell (PFC) fate induction at mid-oogenesis. Here we explore the role of another two tumor suppressor genes, <it>scribble </it>(<it>scrib</it>) and <it>discs large </it>(<it>dlg</it>), in the epithelial patterning. Results We found that removal of <it>scrib </it>or <it>dlg </it>function from the follicle cells at posterior terminal of the egg chamber causes a complete loss of the PFC fate. Aberrant specification and differentiation of the PFCs in the mosaic clones can be ascribed to defects in coordinated activation of the EGFR, JAK and Notch signaling pathways in the multilayered cells. Meanwhile, the clonal analysis revealed that loss-of-function mutations in <it>scrib/dlg </it>at the anterior domains result in a partially penetrant phenotype of defective induction of the stretched and centripetal cell fate, whereas specification of the border cell fate can still occur in the most anterior region of the mutant clones. Further, we showed that <it>scrib </it>genetically interacts with <it>dlg </it>in regulating posterior patterning of the epithelium. Conclusion In this study we provide evidence that <it>scrib </it>and <it>dlg </it>function differentially in anterior and posterior patterning of the follicular epithelium at oogenesis. Further genetic analysis indicates that <it>scrib </it>and <it>dlg </it>act in a common pathway to regulate PFC fate induction. This study may open another window for elucidating role of <it>scrib/dlg </it>in controlling epithelial polarity and cell proliferation during development.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central