Search CORE

193 research outputs found

Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge

Author: Jiang Bowen
Taylor Camillo Jose
Zhuang Zhijun
Publication venue
Publication date: 21/11/2023
Field of study

This work presents an enhanced approach to generating scene graphs by incorporating a relationship hierarchy and commonsense knowledge. Specifically, we propose a Bayesian classification head that exploits an informative hierarchical structure. It jointly predicts the super-category or type of relationship between the two objects, along with the detailed relationship under each super-category. We design a commonsense validation pipeline that uses a large language model to critique the results from the scene graph prediction system and then use that feedback to enhance the model performance. The system requires no external large language model assistance at test time, making it more convenient for practical applications. Experiments on the Visual Genome and the OpenImage V6 datasets demonstrate that harnessing hierarchical relationships enhances the model performance by a large margin. The proposed Bayesian head can also be incorporated as a portable module in existing scene graph generation algorithms to improve their results. In addition, the commonsense validation enables the model to generate an extensive set of reasonable predictions beyond dataset annotations

arXiv.org e-Print Archive

Discontinuous transition to shear flow turbulence

Author: Hof Björn
Marensi Elena
Mukund Vasudevan
Yang Bowen
Zhuang Yi
Publication venue
Publication date: 19/11/2023
Field of study

Depending on the type of flow the transition to turbulence can take one of two forms, either turbulence arises from a sequence of instabilities, or from the spatial proliferation of transiently chaotic domains, a process analogous to directed percolation. Both scenarios are inherently continuous and hence the transformation from ordered laminar to fully turbulent fluid motion is only accomplished gradually with flow speed. Here we show that these established transition types do not account for the more general setting of shear flows subject to body forces. By attenuating spatial coupling and energy transfer, spatio-temporal intermittency is suppressed and with forcing amplitude the transition becomes increasingly sharp and eventually discontinuous. We argue that the suppression of the continuous range and the approach towards a first order, discontinuous scenario applies to a wide range of situations where in addition to shear, flows are subject to e.g. gravitational, centrifugal or electromagnetic forces

arXiv.org e-Print Archive

Event-based Dynamic Graph Representation Learning for Patent Application Trend Prediction

Author: Du Bowen
Sun Leilei
Wang Deqing
Yu Le
Zhuang Fuzhen
Zou Tao
Publication venue
Publication date: 04/09/2023
Field of study

Accurate prediction of what types of patents that companies will apply for in the next period of time can figure out their development strategies and help them discover potential partners or competitors in advance. Although important, this problem has been rarely studied in previous research due to the challenges in modelling companies' continuously evolving preferences and capturing the semantic correlations of classification codes. To fill in this gap, we propose an event-based dynamic graph learning framework for patent application trend prediction. In particular, our method is founded on the memorable representations of both companies and patent classification codes. When a new patent is observed, the representations of the related companies and classification codes are updated according to the historical memories and the currently encoded messages. Moreover, a hierarchical message passing mechanism is provided to capture the semantic proximities of patent classification codes by updating their representations along the hierarchical taxonomy. Finally, the patent application trend is predicted by aggregating the representations of the target company and classification codes from static, dynamic, and hierarchical perspectives. Experiments on real-world data demonstrate the effectiveness of our approach under various experimental conditions, and also reveal the abilities of our method in learning semantics of classification codes and tracking technology developing trajectories of companies.Comment: Accepted by the TKDE journa

arXiv.org e-Print Archive

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

Author: Chen Jindong
Hu Zhiting
Liu Lijuan
Tan Bowen
Wang Hongyi
Xing Eric
Zhu Yun
Zhuang Yonghao
Publication venue
Publication date: 25/10/2023
Field of study

The recent progress of AI can be largely attributed to large language models (LLMs). However, their escalating memory requirements introduce challenges for machine learning (ML) researchers and engineers. Addressing this requires developers to partition a large model to distribute it across multiple GPUs or TPUs. This necessitates considerable coding and intricate configuration efforts with existing model parallel tools, such as Megatron-LM, DeepSpeed, and Alpa. These tools require users' expertise in machine learning systems (MLSys), creating a bottleneck in LLM development, particularly for developers without MLSys background. In this work, we present Redco, a lightweight and user-friendly tool crafted to automate distributed training and inference for LLMs, as well as to simplify ML pipeline development. The design of Redco emphasizes two key aspects. Firstly, to automate model parallism, our study identifies two straightforward rules to generate tensor parallel strategies for any given LLM. Integrating these rules into Redco facilitates effortless distributed LLM training and inference, eliminating the need of additional coding or complex configurations. We demonstrate the effectiveness by applying Redco on a set of LLM architectures, such as GPT-J, LLaMA, T5, and OPT, up to the size of 66B. Secondly, we propose a mechanism that allows for the customization of diverse ML pipelines through the definition of merely three functions, eliminating redundant and formulaic code like multi-host related processing. This mechanism proves adaptable across a spectrum of ML algorithms, from foundational language modeling to complex algorithms like meta-learning and reinforcement learning. Consequently, Redco implementations exhibit much fewer code lines compared to their official counterparts.Comment: Released under Apache License 2.0 at https://github.com/tanyuqian/redc

arXiv.org e-Print Archive

Modeling the Field Value Variations and Field Interactions Simultaneously for Fraud Detection

Author: Chen Shuai
He Qing
Qi Yuan
Song Bowen
Xi Dongbo
Zhang Tianyi
Zhu Yongchun
Zhuang Fuzhen
Publication venue
Publication date: 08/08/2020
Field of study

With the explosive growth of e-commerce, online transaction fraud has become one of the biggest challenges for e-commerce platforms. The historical behaviors of users provide rich information for digging into the users' fraud risk. While considerable efforts have been made in this direction, a long-standing challenge is how to effectively exploit internal user information and provide explainable prediction results. In fact, the value variations of same field from different events and the interactions of different fields inside one event have proven to be strong indicators for fraudulent behaviors. In this paper, we propose the Dual Importance-aware Factorization Machines (DIFM), which exploits the internal field information among users' behavior sequence from dual perspectives, i.e., field value variations and field interactions simultaneously for fraud detection. The proposed model is deployed in the risk management system of one of the world's largest e-commerce platforms, which utilize it to provide real-time transaction fraud detection. Experimental results on real industrial data from different regions in the platform clearly demonstrate that our model achieves significant improvements compared with various state-of-the-art baseline models. Moreover, the DIFM could also give an insight into the explanation of the prediction results from dual perspectives.Comment: 11 pages, 4 figure

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Evaluation of branched GDGTs and leaf wax n-alkane δ2H as (paleo) environmental proxies in East Africa

The role of mountain evolution on local climate is poorly understood and potentially underestimated in climate models. One prominent example is East Africa, which underwent major geodynamic changes with the onset of the East African Rift System (EARS) more than 250 Myr ago. This study explores, at the regional East African scale, a molecular approach for terrestrially-based paleo-climatic reconstructions that takes into account both changes in temperature and in altitude, potentially leading to an improved concept in paleo-climatic reconstructions. Using surface soils collected along pronounced altitudinal gradients in Mt. Rungwe (n=40; Southwest Tanzania) and Mt. Kenya (n=20; Central Kenya), we investigate the combination of 2 terrestrial proxies, leaf wax n-alkane δ2H (δ2Hwax) and branched glycerol dialkyl glycerol tetraether (br GDGT) membrane lipids, as (paleo) elevation and (paleo) temperature proxies, respectively. At the mountain scale, a weak link between δ2Hwax and altitude (R2 = 0.33) is observed at Mt. Kenya, but no relationship is observed at Mt. Rungwe. It is likely that additional parameters, such as decreasing relative humidity (RH) or vegetation changes with altitude, are outcompeting the expected 2H-depletion trend along Mt. Rungwe. In contrast, br GDGT-derived absolute mean annual air temperature (MAAT) and temperature lapse rate (0.65 °C/100 m) for both mountains are in good agreement with direct field measurements, further supporting the robustness of this molecular proxy for (paleo) temperature reconstructions. At the regional scale, estimated and observed δ2H data in precipitation along 3 mountains in East Africa (Mts. Rungwe, Kenya and Kilimanjaro) highlight a strong spatial heterogeneity, preventing the establishment of a regional based calibration of δ2Hwax for paeloaltitudinal reconstructions. Different from that, an improved regional soil calibration is developed between br GDGT distribution and MAAT by combining the data from this study (Mts. Rungwe and Kenya) with previous results from East African surface soils along Mts. Kilimanjaro (Tanzania) and Rwenzori (Uganda). This new regional calibration, based on 105 samples, improves both the R2 (0.77) and RMSE (root mean square error; 2.4 °C) of br GDGT-derived MAAT over the global soil calibrations previously established (R2 = 0.56; RMSE = 4.2 °C) and leads to more accurate (paleo) temperature reconstructions in the region

University of East Anglia digital repository

Hal-Diderot

HAL UVSQ

HAL-Polytechnique

Diet of the endangered big-headed turtle \u3cem\u3ePlatysternon megacephalum\u3c/em\u3e

Author: AFCD
Bates
Bearhop
Bonin
Bowen
Braun
Cagle
Caputo
Chen
Cheung
Clarke
Cobo
Corlett
Corlett
Demuth
Elsey
Ernst
Hendrie
Hong Kong Observatory
Horne
Kimmons
Lau
Legler
Mace
Magurran
Moll
Parmenter
Pielou
Pipatsawasdikul
Platt
R Development Core Team
Rust
Sargeant
Shelmidine
Shen
Shi
Spencer
Sterrett
Stuart
Sung
Sung
Sung
Sung
Tana
Traveset
Wan
Wei
Zhuang
Zuur
Publication venue: DigitalCommons@URI
Publication date: 01/01/2016
Field of study

Populations of the big-headed turtle Platysternon megacephalum are declining at unprecedented rates across most of its distribution in Southeast Asia owing to unsustainable harvest for pet, food, and Chinese medicine markets. Research on Asian freshwater turtles becomes more challenging as populations decline and basic ecological information is needed to inform conservation efforts. We examined fecal samples collected from P. megacephalum in five streams in Hong Kong to quantify the diet, and we compared the germination success of ingested and uningested seeds. Fruits, primarily of Machilus spp., were most frequently consumed, followed by insects, plant matter, crabs and mollusks. The niche breadth of adults was wider than that of juveniles. Diet composition differed between sites, which may be attributable to the history of illegal trapping at some sites, which reduced the proportion of larger and older individuals. Digestion of Machilus spp. fruits by P. megacephalum enhanced germination success of seeds by about 30%. However, most digested seeds are likely defecated in water in this highly aquatic species, which limits the potential benefit to dispersal. The results of our study can be used by conservation-related captive breeding programs to ensure a more optimal diet is provided to captive P. megacephalum

Crossref

Directory of Open Access Journals

PubMed Central

HKU Scholars Hub

DigitalCommons@URI

Experimental and Numerical Analysis of High-Resolution Injection Technique for Capillary Electrophoresis Microchip

Author: Barz
Blas
Blas
Bowen
Cao
Chang
Chin-Lung Chang
Chuang
Dossi
Fernandes
Fu
Fu
Fu
Fu
Fu
Guo
Hao
Hong
Hou
Hupert
Jik-Chang Leong
Johns
Lee
Leong
Li
Lin
Lin
Liu
Lung-Ming Fu
Luo
Mao
Nieuwstadt
Oh
Rosenauer
Sun
Ting-Fu Hong
Tofteberg
Tsai
Tsai
Tsai
Tsai
Wang
Wang
Wen
Wenclawiak
Wu
Wu
Xuan
Yang
Yang
Yao-Nan Wang
Zeng
Zhuang
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/01/2011
Field of study

This study presents an experimental and numerical investigation on the use of high-resolution injection techniques to deliver sample plugs within a capillary electrophoresis (CE) microchip. The CE microfluidic device was integrated into a U-shaped injection system and an expansion chamber located at the inlet of the separation channel, which can miniize the sample leakage effect and deliver a high-quality sample plug into the separation channel so that the detection performance of the device is enhanced. The proposed 45° U-shaped injection system was investigated using a sample of Rhodamine B dye. Meanwhile, the analysis of the current CE microfluidic chip was studied by considering the separation of Hae III digested ϕx-174 DNA samples. The experimental and numerical results indicate that the included 45° U-shaped injector completely eliminates the sample leakage and an expansion separation channel with an expansion ratio of 2.5 delivers a sample plug with a perfect detection shape and highest concentration intensity, hence enabling an optimal injection and separation performance

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

A review into the use of ceramics in microbial fuel cells

Author: Abbasi
Ajayi
Allen
Baur
Behera
Behera
Behera
Bond
Bowen
Chatterjee
Chatterjee
Cheng
Degrenne
Dzyazko
Gajda
Gajda
Gajda
Gajda
Gajda
Ghadge
Ghadge
Ghadge
Ghadge
Ghosh Ray
Gringer
Habermann
Harnisch
He
Hernández-Fernández
Hidalgo
Ieropoulos
Ieropoulos
Ieropoulos
Ieropoulos
Ioannis Ieropoulos
Irimia-Vladu
Iwona Gajda
Jadhav
Jadhav
Jadhav
Jana
John Greenman
Jonathan Winfield
Kim
Kondaveeti
Kraft
Li
Li
Linkov
Liu
Liu
Logan
Massazza
Minh
Ortiz-Martínez
Park
Pasternak
Rajesh
Reuss
Richerson
Santoro
Schultz
Seo
Sondhi
Thorne
Torres
Vandiver
Walter
Walter
Walter
Walter
Wang
Wei
Winfield
Winfield
Winfield
Winfield
Zhuang
Zuo
Publication venue: 'Elsevier BV'
Publication date: 01/09/2016
Field of study

© 2016 The Authors. Microbial fuel cells (MFCs) offer great promise as a technology that can produce electricity whilst at the same time treat wastewater. Although significant progress has been made in recent years, the requirement for cheaper materials has prevented the technology from wider, out-of-the-lab, implementation. Recently, researchers have started using ceramics with encouraging results, suggesting that this inexpensive material might be the solution for propelling MFC technology towards real world applications. Studies have demonstrated that ceramics can provide stability, improve power and treatment efficiencies, create a better environment for the electro-active bacteria and contribute towards resource recovery. This review discusses progress to date using ceramics as (i) the structural material, (ii) the medium for ion exchange and (iii) the electrode for MFCs

Crossref

Elsevier - Publisher Connector

Southampton (e-Prints Soton)

UWE Bristol Research Repository

Single-molecule approaches to characterizing kinetics of biomolecular interactions

Author: Al-Soufi
Antoine M van Oijen
Axelrod
Bacia
Benitez
Benitez
Blosser
Bowen
Chen
Cisse
Cornish
Edman
Eid
Elenko
Elenko
English
Flomenbom
Greenleaf
Haustein
Haustein
Hell
Hilario
Joo
Kapanidis
Karymov
Kasper
Kuo
Levene
Levene
Lewis
Li
Lu
Lu
Miyake
Moerner
Myong
Okumus
Patterson
Price
Rigler
Rissin
Roy
Schneckenburger
Schutz
Toprak
van Oijen
van Oijen
Wang
Xie
Xie
Xie
Yang
Yang
Zhuang
Zhuang
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref