193 research outputs found

    Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge

    Full text link
    This work presents an enhanced approach to generating scene graphs by incorporating a relationship hierarchy and commonsense knowledge. Specifically, we propose a Bayesian classification head that exploits an informative hierarchical structure. It jointly predicts the super-category or type of relationship between the two objects, along with the detailed relationship under each super-category. We design a commonsense validation pipeline that uses a large language model to critique the results from the scene graph prediction system and then use that feedback to enhance the model performance. The system requires no external large language model assistance at test time, making it more convenient for practical applications. Experiments on the Visual Genome and the OpenImage V6 datasets demonstrate that harnessing hierarchical relationships enhances the model performance by a large margin. The proposed Bayesian head can also be incorporated as a portable module in existing scene graph generation algorithms to improve their results. In addition, the commonsense validation enables the model to generate an extensive set of reasonable predictions beyond dataset annotations

    Discontinuous transition to shear flow turbulence

    Full text link
    Depending on the type of flow the transition to turbulence can take one of two forms, either turbulence arises from a sequence of instabilities, or from the spatial proliferation of transiently chaotic domains, a process analogous to directed percolation. Both scenarios are inherently continuous and hence the transformation from ordered laminar to fully turbulent fluid motion is only accomplished gradually with flow speed. Here we show that these established transition types do not account for the more general setting of shear flows subject to body forces. By attenuating spatial coupling and energy transfer, spatio-temporal intermittency is suppressed and with forcing amplitude the transition becomes increasingly sharp and eventually discontinuous. We argue that the suppression of the continuous range and the approach towards a first order, discontinuous scenario applies to a wide range of situations where in addition to shear, flows are subject to e.g. gravitational, centrifugal or electromagnetic forces

    Event-based Dynamic Graph Representation Learning for Patent Application Trend Prediction

    Full text link
    Accurate prediction of what types of patents that companies will apply for in the next period of time can figure out their development strategies and help them discover potential partners or competitors in advance. Although important, this problem has been rarely studied in previous research due to the challenges in modelling companies' continuously evolving preferences and capturing the semantic correlations of classification codes. To fill in this gap, we propose an event-based dynamic graph learning framework for patent application trend prediction. In particular, our method is founded on the memorable representations of both companies and patent classification codes. When a new patent is observed, the representations of the related companies and classification codes are updated according to the historical memories and the currently encoded messages. Moreover, a hierarchical message passing mechanism is provided to capture the semantic proximities of patent classification codes by updating their representations along the hierarchical taxonomy. Finally, the patent application trend is predicted by aggregating the representations of the target company and classification codes from static, dynamic, and hierarchical perspectives. Experiments on real-world data demonstrate the effectiveness of our approach under various experimental conditions, and also reveal the abilities of our method in learning semantics of classification codes and tracking technology developing trajectories of companies.Comment: Accepted by the TKDE journa

    Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

    Full text link
    The recent progress of AI can be largely attributed to large language models (LLMs). However, their escalating memory requirements introduce challenges for machine learning (ML) researchers and engineers. Addressing this requires developers to partition a large model to distribute it across multiple GPUs or TPUs. This necessitates considerable coding and intricate configuration efforts with existing model parallel tools, such as Megatron-LM, DeepSpeed, and Alpa. These tools require users' expertise in machine learning systems (MLSys), creating a bottleneck in LLM development, particularly for developers without MLSys background. In this work, we present Redco, a lightweight and user-friendly tool crafted to automate distributed training and inference for LLMs, as well as to simplify ML pipeline development. The design of Redco emphasizes two key aspects. Firstly, to automate model parallism, our study identifies two straightforward rules to generate tensor parallel strategies for any given LLM. Integrating these rules into Redco facilitates effortless distributed LLM training and inference, eliminating the need of additional coding or complex configurations. We demonstrate the effectiveness by applying Redco on a set of LLM architectures, such as GPT-J, LLaMA, T5, and OPT, up to the size of 66B. Secondly, we propose a mechanism that allows for the customization of diverse ML pipelines through the definition of merely three functions, eliminating redundant and formulaic code like multi-host related processing. This mechanism proves adaptable across a spectrum of ML algorithms, from foundational language modeling to complex algorithms like meta-learning and reinforcement learning. Consequently, Redco implementations exhibit much fewer code lines compared to their official counterparts.Comment: Released under Apache License 2.0 at https://github.com/tanyuqian/redc

    Modeling the Field Value Variations and Field Interactions Simultaneously for Fraud Detection

    Full text link
    With the explosive growth of e-commerce, online transaction fraud has become one of the biggest challenges for e-commerce platforms. The historical behaviors of users provide rich information for digging into the users' fraud risk. While considerable efforts have been made in this direction, a long-standing challenge is how to effectively exploit internal user information and provide explainable prediction results. In fact, the value variations of same field from different events and the interactions of different fields inside one event have proven to be strong indicators for fraudulent behaviors. In this paper, we propose the Dual Importance-aware Factorization Machines (DIFM), which exploits the internal field information among users' behavior sequence from dual perspectives, i.e., field value variations and field interactions simultaneously for fraud detection. The proposed model is deployed in the risk management system of one of the world's largest e-commerce platforms, which utilize it to provide real-time transaction fraud detection. Experimental results on real industrial data from different regions in the platform clearly demonstrate that our model achieves significant improvements compared with various state-of-the-art baseline models. Moreover, the DIFM could also give an insight into the explanation of the prediction results from dual perspectives.Comment: 11 pages, 4 figure

    Evaluation of branched GDGTs and leaf wax n-alkane δ2H as (paleo) environmental proxies in East Africa

    Get PDF
    The role of mountain evolution on local climate is poorly understood and potentially underestimated in climate models. One prominent example is East Africa, which underwent major geodynamic changes with the onset of the East African Rift System (EARS) more than 250 Myr ago. This study explores, at the regional East African scale, a molecular approach for terrestrially-based paleo-climatic reconstructions that takes into account both changes in temperature and in altitude, potentially leading to an improved concept in paleo-climatic reconstructions. Using surface soils collected along pronounced altitudinal gradients in Mt. Rungwe (n=40; Southwest Tanzania) and Mt. Kenya (n=20; Central Kenya), we investigate the combination of 2 terrestrial proxies, leaf wax n-alkane δ2H (δ2Hwax) and branched glycerol dialkyl glycerol tetraether (br GDGT) membrane lipids, as (paleo) elevation and (paleo) temperature proxies, respectively. At the mountain scale, a weak link between δ2Hwax and altitude (R2 = 0.33) is observed at Mt. Kenya, but no relationship is observed at Mt. Rungwe. It is likely that additional parameters, such as decreasing relative humidity (RH) or vegetation changes with altitude, are outcompeting the expected 2H-depletion trend along Mt. Rungwe. In contrast, br GDGT-derived absolute mean annual air temperature (MAAT) and temperature lapse rate (0.65 °C/100 m) for both mountains are in good agreement with direct field measurements, further supporting the robustness of this molecular proxy for (paleo) temperature reconstructions. At the regional scale, estimated and observed δ2H data in precipitation along 3 mountains in East Africa (Mts. Rungwe, Kenya and Kilimanjaro) highlight a strong spatial heterogeneity, preventing the establishment of a regional based calibration of δ2Hwax for paeloaltitudinal reconstructions. Different from that, an improved regional soil calibration is developed between br GDGT distribution and MAAT by combining the data from this study (Mts. Rungwe and Kenya) with previous results from East African surface soils along Mts. Kilimanjaro (Tanzania) and Rwenzori (Uganda). This new regional calibration, based on 105 samples, improves both the R2 (0.77) and RMSE (root mean square error; 2.4 °C) of br GDGT-derived MAAT over the global soil calibrations previously established (R2 = 0.56; RMSE = 4.2 °C) and leads to more accurate (paleo) temperature reconstructions in the region

    Diet of the endangered big-headed turtle \u3cem\u3ePlatysternon megacephalum\u3c/em\u3e

    Get PDF
    Populations of the big-headed turtle Platysternon megacephalum are declining at unprecedented rates across most of its distribution in Southeast Asia owing to unsustainable harvest for pet, food, and Chinese medicine markets. Research on Asian freshwater turtles becomes more challenging as populations decline and basic ecological information is needed to inform conservation efforts. We examined fecal samples collected from P. megacephalum in five streams in Hong Kong to quantify the diet, and we compared the germination success of ingested and uningested seeds. Fruits, primarily of Machilus spp., were most frequently consumed, followed by insects, plant matter, crabs and mollusks. The niche breadth of adults was wider than that of juveniles. Diet composition differed between sites, which may be attributable to the history of illegal trapping at some sites, which reduced the proportion of larger and older individuals. Digestion of Machilus spp. fruits by P. megacephalum enhanced germination success of seeds by about 30%. However, most digested seeds are likely defecated in water in this highly aquatic species, which limits the potential benefit to dispersal. The results of our study can be used by conservation-related captive breeding programs to ensure a more optimal diet is provided to captive P. megacephalum

    Experimental and Numerical Analysis of High-Resolution Injection Technique for Capillary Electrophoresis Microchip

    Get PDF
    This study presents an experimental and numerical investigation on the use of high-resolution injection techniques to deliver sample plugs within a capillary electrophoresis (CE) microchip. The CE microfluidic device was integrated into a U-shaped injection system and an expansion chamber located at the inlet of the separation channel, which can miniize the sample leakage effect and deliver a high-quality sample plug into the separation channel so that the detection performance of the device is enhanced. The proposed 45° U-shaped injection system was investigated using a sample of Rhodamine B dye. Meanwhile, the analysis of the current CE microfluidic chip was studied by considering the separation of Hae III digested ϕx-174 DNA samples. The experimental and numerical results indicate that the included 45° U-shaped injector completely eliminates the sample leakage and an expansion separation channel with an expansion ratio of 2.5 delivers a sample plug with a perfect detection shape and highest concentration intensity, hence enabling an optimal injection and separation performance

    A review into the use of ceramics in microbial fuel cells

    Get PDF
    © 2016 The Authors. Microbial fuel cells (MFCs) offer great promise as a technology that can produce electricity whilst at the same time treat wastewater. Although significant progress has been made in recent years, the requirement for cheaper materials has prevented the technology from wider, out-of-the-lab, implementation. Recently, researchers have started using ceramics with encouraging results, suggesting that this inexpensive material might be the solution for propelling MFC technology towards real world applications. Studies have demonstrated that ceramics can provide stability, improve power and treatment efficiencies, create a better environment for the electro-active bacteria and contribute towards resource recovery. This review discusses progress to date using ceramics as (i) the structural material, (ii) the medium for ion exchange and (iii) the electrode for MFCs
    corecore