134 research outputs found

NExT-Chat: An LMM for Chat, Detection and Segmentation

Author: Chua Tat-Seng
Ji Wei
Liu Zhiyuan
Yao Yuan
Zhang Ao
Publication date: 18/12/2023

The development of large language models (LLMs) has greatly advanced the field of multimodal understanding, leading to the emergence of large multimodal models (LMMs). To enhance visual comprehension, recent studies have equipped LMMs with region-level understanding capabilities by representing object bounding box coordinates as a series of text sequences (pix2seq). In this paper, we introduce a novel paradigm for object location modeling called the pix2emb method, where we ask the LMM to output location embeddings and then decode them with different decoders. This paradigm allows us to use different location formats (such as bounding boxes and masks) in multimodal conversations. Leveraging the proposed pix2emb method, we train an LMM named NExT-Chat and demonstrate its capability of handling multiple tasks like visual grounding, region captioning, and grounded reasoning. Comprehensive experiments show the effectiveness of our NExT-Chat on various tasks, e.g., NExT-Chat (87.7) vs. Shikra (86.9) on POPE-Random, NExT-Chat (68.9) vs. LISA (67.9) on the referring expression segmentation task, and NExT-Chat (79.6) vs. Kosmos-2 (62.3) on the region caption task. The code and model are released at https://github.com/NExT-ChatV/NExT-Chat.

Comment: Technical Report (https://next-chatv.github.io/)

arXiv.org e-Print Archive

Fine-Grained Scene Graph Generation with Data Transfer

Author: Chen Qianyu
Chua Tat-Seng
Ji Wei
Liu Zhiyuan
Sun Maosong
Yao Yuan
Zhang Ao
Publication date: 20/07/2022

Scene graph generation (SGG) is designed to extract (subject, predicate, object) triplets in images. Recent works have made steady progress on SGG and provide useful tools for high-level vision and language understanding. However, due to data distribution problems including long-tail distribution and semantic ambiguity, the predictions of current SGG models tend to collapse to several frequent but uninformative predicates (e.g., on, at), which limits the practical application of these models in downstream tasks. To deal with the problems above, we propose a novel Internal and External Data Transfer (IETrans) method, which can be applied in a plug-and-play fashion and expanded to large SGG with 1,807 predicate classes. Our IETrans tries to relieve the data distribution problem by automatically creating an enhanced dataset that provides more sufficient and coherent annotations for all predicates. By training on the enhanced dataset, a Neural Motif model doubles the macro performance while maintaining competitive micro performance. The code and data are publicly available at https://github.com/waxnkw/IETrans-SGG.pytorch.

Comment: ECCV 2022 (Oral)

arXiv.org e-Print Archive

Transfer Visual Prompt Generator across LLMs

Author: Chua Tat-Seng
Fei Hao
Ji Wei
Li Li
Liu Zhiyuan
Yao Yuan
Zhang Ao
Publication date: 02/05/2023

While developing a new vision-language LLM (VL-LLM) by pre-training on tremendous image-text pairs from scratch can be exceedingly resource-consuming, connecting an existing LLM with a comparatively lightweight visual prompt generator (VPG) becomes a feasible paradigm. However, further tuning the VPG part of the VL-LLM still incurs unavoidable computational costs, i.e., requiring thousands of GPU hours and millions of training samples. One alternative solution is to transfer an existing VPG from an existing VL-LLM to the target VL-LLM. In this work, we investigate, for the first time, the VPG transferability across LLMs, and explore a solution to reduce the cost of VPG transfer. We first study the VPG transfer across different LLM sizes (e.g., small-to-large), and across different LLM types, through which we diagnose the key factors to maximize the transfer efficiency. Based on our observation, we design a two-stage transfer framework named VPGTrans, which is simple yet highly effective. Through extensive experiments, we demonstrate that VPGTrans helps significantly speed up the transfer learning process without compromising performance. Remarkably, it helps achieve the VPG transfer from BLIP-2 OPT$_\text{2.7B}$ to BLIP-2 OPT$_\text{6.7B}$ with over 10 times speed-up and 10.7% training data compared with connecting a VPG to OPT$_\text{6.7B}$ from scratch. Further, a series of intriguing findings and potential rationales behind them are provided and discussed. Finally, we showcase the practical value of our VPGTrans approach by customizing two novel VL-LLMs, VL-LLaMA and VL-Vicuna, with the recently released LLaMA and Vicuna LLMs.

Comment: Project Website: https://vpgtrans.github.io Code: https://github.com/VPGTrans/VPGTran

arXiv.org e-Print Archive

Visually Grounded Commonsense Knowledge Acquisition

Author: Chua Tat-Seng
Li Mengdi
Liu Zhiyuan
Sun Maosong
Weber Cornelius
Wermter Stefan
Xie Ruobing
Yao Yuan
Yu Tianyu
Zhang Ao
Zheng Haitao
Publication date: 22/11/2022

Large-scale commonsense knowledge bases empower a broad range of AI applications, where the automatic extraction of commonsense knowledge (CKE) is a fundamental and challenging problem. CKE from text is known to suffer from the inherent sparsity and reporting bias of commonsense in text. Visual perception, on the other hand, contains rich commonsense knowledge about real-world entities, e.g., (person, can_hold, bottle), which can serve as a promising source for acquiring grounded commonsense knowledge. In this work, we present CLEVER, which formulates CKE as a distantly supervised multi-instance learning problem, where models learn to summarize commonsense relations from a bag of images about an entity pair without any human annotation on image instances. To address the problem, CLEVER leverages vision-language pre-training models for deep understanding of each image in the bag, and selects informative instances from the bag to summarize commonsense entity relations via a novel contrastive attention mechanism. Comprehensive experimental results in held-out and human evaluation show that CLEVER can extract commonsense knowledge with promising quality, outperforming pre-trained language model-based methods by 3.9 AUC and 6.4 mAUC points. The predicted commonsense scores show strong correlation with human judgment, with a 0.78 Spearman coefficient. Moreover, the extracted commonsense can also be grounded into images with reasonable interpretability. The data and code can be obtained at https://github.com/thunlp/CLEVER.

Comment: Accepted by AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

ADC Histograms from Routine DWI for Longitudinal Studies in Cerebral Small Vessel Disease: A Field Study in CADASIL.

Author: A Nitkunan
A Viswanathan
AO Nusbaum
Bence Gunda
C Tessa
E Jouvent
E Pagani
Eric Jouvent
H Chabriat
H Vrenken
Hugues Chabriat
Jean-Pierre Guichard
Jerome Mawet
M Cercignani
M Dichgans
M Giannelli
M Holtmannspotter
M Mascalchi
M O'Sullivan
M Rovaris
Marco Duering
Martin Dichgans
N Molko
R Della Nave
R Schmidt
RA Charlton
Raphael Porcher
RJ Fox
SC Steens
Sune Nørhøj Jespersen
T Kin
TC Chua
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2014

Diffusion tensor imaging (DTI) histogram metrics are correlated with clinical parameters in cerebral small vessel diseases (cSVD). Whether ADC histogram parameters derived from simple diffusion weighted imaging (DWI) can provide relevant markers for long-term studies of cSVD remains unknown. CADASIL patients were evaluated by DWI and DTI in a large cohort study over a 6-year period. ADC histogram parameters were compared to those derived from mean diffusivity (MD) histograms in 280 patients using intra-class correlation and Bland-Altman plots. The impact of image corrections applied to ADC maps was assessed, and a mixed effect model was used to analyze the effects of scanner upgrades. The results showed that ADC histogram parameters are strongly correlated with MD histogram parameters and that image corrections have only limited influence on these results. Unexpectedly, scanner upgrades were found to have major effects on diffusion measures with DWI or DTI that can be even larger than those related to patients' characteristics. These data support that ADC histograms from routinely used DWI can provide relevant parameters for assessing cSVD, but the variability related to scanner upgrades as regularly performed in clinical centers should be determined precisely for longitudinal and multicentric studies using diffusion MRI in cSVD.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Open Access LMU

PubMed Central

Semmelweis Repository

IL-12 RB1 Genetic Variants Contribute to Human Susceptibility to Severe Acute Respiratory Syndrome Infection among Chinese

Author: AO Chua
C Fieschi
CK Wong
CM Booth
DH Presky
Douglas F. Nixon
DR Nyholt
Fang Tang
Fang Zhang
G Trinchieri
H Zhang
Hinh Ly
Hong Yang
HW Lee
I Caragol
J He
JS Peiris
JW Chan
K Van Reeth
KY Chan
Mao-Ti Wei
MJ Cameron
MK Kennedy
MW Ng
O Staretz-Haham
Pan-He Zhang
R Manetti
RW Chiu
S Itoyama
S.M Leal
SE Dorman
T Sakai
Wei Liu
WK Ip
WP Chong
Wu-Chun Cao
YL Lau
Zhong-Tao Xin
Publication venue: Public Library of Science
Publication date: 14/05/2008

BACKGROUND: Cytokines play important roles in antiviral action. We examined whether polymorphisms of interleukin (IL)-12 receptor B1 (IL-12RB1) affect the susceptibility to and outcome of severe acute respiratory syndrome (SARS). METHODS: A case-control study was carried out in Chinese SARS patients and healthy controls. The genotypes of 4 SNPs on the IL-12 RB1 gene, +705 A/G, +1158 T/C, +1196 G/C and +1664 C/T, were determined by PCR-RFLP. Haplotypes were estimated from the genotype data using the expectation-maximisation algorithm. RESULTS: Comparison between patients and close contacts showed that individuals with the +1664 C/T (CT and TT) genotype had a 2.09-fold (95% confidence interval [CI], 1.90-7.16) and 2.34-fold (95% CI, 1.79-13.37) increased risk of developing SARS, respectively. For the other three polymorphisms, however, no significant difference could be detected in allele or genotype frequencies between patients and controls. Additionally, estimation of the frequencies of multiple-locus haplotypes revealed potential risk haplotypes (GCCT) for SARS infection. CONCLUSIONS: Our data indicate that genetic variants of IL12RB1 confer genetic susceptibility to SARS infection, but are not necessarily associated with the progression of the disease in the Chinese population.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Angiostatin anti-angiogenesis requires IL-12: The innate immune system as a key target

Author: A Albini
Adriana Albini
Agostina Ventura
Alessandra Mancino
Antonio Sica
AO Chua
C Murdoch
Claudio Brigati
D Pereg
DH Presky
Douglas M Noonan
E Pluskota
F Shojaei
Girieca Lorusso
H Ito
JJ Walter
K Aase
KE de Visser
KS Moulton
L Paleari
M Abad
M Morini
Marta Pinter
MH Prandini
ML Wahl
Monica Morini
MS O'Reilly
N Ferrari
O Peyruchaud
R Benelli
R Huegel
SR Perri
T Chavakis
T Matsunaga
T Moser
Y Cao
Y Okamura
Publication venue: BioMed Central
Publication date: 01/01/2009

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

Archivio istituzionale della ricerca - Università dell'Insubria

PubMed Central

Archivio Istituzionale della Ricerca- Università del Piemonte Orientale

Redundant Mechanisms Prevent Mitotic Entry Following Replication Arrest in the Absence of Cdc25 Hyper-Phosphorylation in Fission Yeast

Following replication arrest, the Cdc25 phosphatase is phosphorylated and inhibited by Cds1. It has previously been reported that expressing Cdc25 in which 9 putative amino-terminal Cds1 phosphorylation sites have been substituted with alanine results in bypass of the DNA replication checkpoint. However, these results were acquired by expression of the phosphorylation mutant using a multicopy expression vector in a genetic background where the DNA replication checkpoint is intact. To clarify these results, we constructed a Cdc25(9A)-GFP native promoter integrant and examined its effect on the replication checkpoint at endogenous expression levels. In this strain the replication checkpoint operates normally, conditional on the presence of the Mik1 kinase. In response to replication arrest the Cdc25(9A)-GFP protein is degraded, suggesting the presence of a backup mechanism to eliminate the phosphatase when it cannot be inhibited through phosphorylation.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Prevalence, Distribution and Functional Significance of the −237C to T Polymorphism in the IL-12Rβ2 Promoter in Indian Tuberculosis Patients

Author: A Singhal
AJ Smith
Anand Jaiswal
AO Chua
BS Liu
C Rodriguez-Antona
CK Cheng
CY Wu
Digamber Behera
E Matsui
E Schreiber
FA Letimier
Gobardhan Das
GR Sarma
GT Seah
H Ohyama
Hanumanthappa Krishna Prasad
J Bidwell
JC Knight
JG van Rietschoten
JJ Yim
JP Huber
JW Kim
K Imai
KJ Livak
L Liang
L Rogge
LJ Keen
M Jaberipour
M Rayamajhi
M Zhang
MH Zaki
ML Toh
MW Pfaffl
N Mermod
P Boeuf
PY Mantel
R Yagi
RA Taha
S Bhattacharyya
S Burl
S Hori
S Keerthivasan
Sangeeta Sharma
SD Rosenzweig
Shyam Singh Chauhan
SJ Szabo
SJ van Deventer
SL Ma
V Gupta
V Guyot-Revol
Vibha Taneja
Vikas Kumar Verma
Vishnubhatla Sreenivas
WT Watford
YF Hu
Z Zhao
Publication venue: Public Library of Science
Publication date: 03/04/2012

Cytokine/cytokine receptor gene polymorphisms related to structure/expression could impact immune response. Hence, the −237 polymorphic site in the 5′ promoter region of the IL-12Rβ2 (SNP ID: rs11810249) gene, associated with the AP-4 transcription motif GAGCTG, was examined. Amplicons encompassing the polymorphism were generated from 46 pulmonary tuberculosis patients, 35 family contacts and 28 miscellaneous volunteers and sequenced. The C allele predominated among patients (93.4%, 43/46) and in all volunteers and contacts screened, but the T allele was exclusively limited to patients (6.5%, 3/46). The functional impact of this polymorphism on transcriptional activity was assessed by Luciferase-reporter and electrophoretic mobility shift assays (EMSA). Luciferase-reporter assays showed a significant reduction in transcriptional efficiency with the T allele compared to the C allele. The reductions in transcriptional efficiency with the T allele construct (pGIL-12Rb2-T) in U-87MG, THP-1 and Jurkat cell lines were 53, 37.6, and 49.8% respectively, compared to the C allele construct (pGIL-12Rb2-C). Similarly, densitometric analysis of the EMSA assay showed reduced binding of the AP-4 transcription factor to the T compared to the C nucleotide probe. Reduced mRNA expression was seen in all patients (3/3) harboring the T allele, whereas individuals with the C allele exhibited high mRNA expression (17/25; 68%, p = 0.05). These observations were in agreement with the in vitro assessment of the promoter activity by Luciferase-reporter and EMSA assays. The reduced expression of IL-12Rβ2 transcripts in 8 patients despite having the C allele was attributed to the predominant overexpression of the suppressors (IL-4 and GATA-3) and reduced expression of enhancers (IFN-α) of IL-12Rβ2 transcripts. The 17 high IL-12Rβ2 mRNA expressers had significantly elevated IFN-α mRNA levels compared to low expressers and volunteers.
Notwithstanding the presence of high levels of IL-12Rβ2 mRNA in these patients, elevated IFN-α expression could modulate their immune responses to Mycobacterium tuberculosis.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The significance of the complement system for the pathogenesis of age-related macular degeneration — current evidence and translation into clinical application

Author: A Farwick
A Laude
A Swaroop
A Thakkinstian
A Tortajada
AE Hughes
Age-Related Eye Disease Study Research Group
AL Wang
AO Edwards
AP Ciardella
AP Herbert
AP Sjoberg
AY Lee
B Chua
BJ Wegscheider
C Skerka
CC Klaver
CE McAvoy
CJ Boon
D Nitsch
DA Schaumberg
DD Despriet
DH Anderson
E Cho
E Wagner
EWT Chong
GS Hageman
H Chen
H Hayashi
Hendrik P. N. Scholl
HP Scholl
HPN Scholl
HR Coleman
IM Heiba
J Duvall-Young
J Jakobsdottir
J Yu
J Zhou
JA Fagerness
JB Maller
JG Hollyfield
JJ Wang
JL Haines
JM Seddon
JM Thurman
JR Dunkelberger
JR Yates
JW Crabb
KL Spencer
KP Magnusson
KY Lee
L Luo
LA Hecker
LH Lima
LV Johnson
M Chen
M Laine
MA Brantley Jr
MA Grassi
MJ Walport
ML Klein
MM DeAngelis
MT Andreoli
N Kondo
N Leveziel
N. Victor Chong
P Charbel Issa
Peter Charbel Issa
PF Zipfel
PJ Coffey
PJ Francis
PN Baird
PT Jong de
R Reynolds
RD Jager
RF Mullins
RJ Klein
RJ Ormsby
RK Shuler Jr
RO Schlingemann
S Ennis
S Haddad
S Hakobyan
S Katta
S Wasmuth
SJ Clark
SP Seitsonen
SV Goverdhan
T Sepp
TE Mollnes
U Kelly
W Smith
X Ding
X Feng
X Yuan
YP Conley
Publication venue: Springer-Verlag
Publication date: 01/01/2010

BACKGROUND: Dysregulation of the complement system has been shown to play a major role in the pathogenesis of age-related macular degeneration (AMD). METHODS: The current evidence from human studies derives from immunohistochemical and proteomic studies in donor eyes, genetic association studies, and studies of blood complement protein levels. These lines of evidence are corroborated by in vitro and animal studies. RESULTS: In AMD donor eyes, detection of complement proteins in drusen suggested local inflammatory processes involving the complement system. Moreover, higher levels of complement proteins in the Bruch's membrane/choroid complex could be detected in AMD donor eyes compared to controls. A large number of independent genetic studies have consistently confirmed the association of AMD with risk or protective variants in genes coding for complement proteins, including complement factor H (CFH), CFH-related proteins 1 and 3, factor B/C2, C3 and factor I. Another set of independent studies detected increased levels of complement activation products in plasma of AMD patients, suggesting that AMD may be a systemic disease and the macula a vulnerable anatomic site of minimal resistance to complement activation. Genotype-phenotype correlations, including the impact of genetic variants on disease progression, gene-environment and pharmacogenetic interactions, have been investigated. There is evidence that complement gene variants may be associated with the progression from early to late forms of AMD, whereas they do not appear to play a significant role when late atrophic AMD has already developed. There are indications for an interaction between genetic variants and supplementation and dietary factors. Also, there is some evidence that variants in the CFH gene influence treatment effects in patients with neovascular AMD. 
CONCLUSIONS: Such data suggest that the complement system may have a significant role in developing new prophylactic and therapeutic interventions in AMD. In fact, several compounds acting on the complement pathway are currently in clinical trials. Therapeutics that modulate the complement system need to balance inhibition with preservation of sufficient functional activity in order to maintain adequate immune responses and tissue homeostasis. Specifically, targeting the dysfunction appears more adequate than a global suppression of complement activation in chronic diseases such as AMD.

Crossref

Springer - Publisher Connector

PubMed Central

Oxford University Research Archive