Search CORE

14 research outputs found

Deep learning for reconstructing protein structures from cryo-EM density maps: recent advances and future directions

Author: Cheng Jianlin
Giri Nabin
Roy Raj S.
Publication venue
Publication date: 16/09/2022
Field of study

Cryo-Electron Microscopy (cryo-EM) has emerged as a key technology to determine the structure of proteins, particularly large protein complexes and assemblies in recent years. A key challenge in cryo-EM data analysis is to automatically reconstruct accurate protein structures from cryo-EM density maps. In this review, we briefly overview various deep learning methods for building protein structures from cryo-EM density maps, analyze their impact, and discuss the challenges of preparing high-quality data sets for training deep learning models. Looking into the future, more advanced deep learning models of effectively integrating cryo-EM data with other sources of complementary data such as protein sequences and AlphaFold-predicted structures need to be developed to further advance the field

arXiv.org e-Print Archive

Impact of AlphaFold on Structure Prediction of Protein Complexes: The CASP15-CAPRI Experiment

Author: Anika Jain J
Barradas-Bautista Didier
Bates Paul
Beglov Dmitri
Bonvin Alexandre M.J.J.
Brysbaert Guillaume
Canner Sam
Cao Zhen
Carpio Carlos Del
Cavallo Luigi
Chang Shan
Chawla Mohit
Chen Chen
Chen Xiao
Cheng Jianlin
Cheung Melyssa
Christoffer Charles
Chu Lee-Shin
Cohen Tomer
Czaplewski Cezary
Danielsson Annemarie
Dapkūnas Justas
Duan Rui
Dziadek Lukasz
Fernández-Recio Juan
Fujuta Hayato
Gaardlos Margrethe
Ghani Usman
Giełdoń Artur
Giri Nabin
Giulini Marco
Gray Jeffrey
Guest Johnathan
Guo Shuai
Guo Zhiye
Halfon Matan
Harmalkar Ameya
He Jiahua
He Xiaodong
Honorato Rodrigo Vargas
Hou Chengyu
Huang Shengyou
Ichiishi Eichiro
Jiang Shenda
Jimenez-Garcia Brian
Kagaya Yuki
Kannan Harini
Khan Omeir
Kihara Daisuke
Kiyota Yasuomi
Kobayashi Shinpei
Kong Ren
Kotelnikov Sergei
Kozakov Dima
Krzysztof Bojarski K
Lee Jessica
Lensink Marc
Li Hao
Lin Peicong
Liu Jian
Liwo Jozef
Lu Xufeng
Lubecka Emilia
Luis Rodriguez-Lumbreras A
Ma Xiaoliang
Marcisz Mateusz
Maszota-Zieleniak Martyna
Miyakawa Yuta
Morehead Alex
Nakamura Tsukasa
Noort Charlotte van
Olechnovič Kliment
Oliva Romina
Padhorny Dzmitry
Pierce Brian
Qiu Liming
Quadir Farhan
Raouraoua Nessim
Ricciardelli Tiziana
Roel Jorge
Roy Raj
Samsonov Sergey
Schneidman-Duhovny Dina
Sekijima Masakazu
Shen Yang
Shi Hang
Shor Ben
Shoshana Wodak J
Sieradzan Adam
Slusarz Rafal
Smanta Rituparna
Sun Yuanfei
Surendra Negi S
Takeda-Shitaka Mayuko
Tao Huanyu
Teixeira João
Terashi Genki
Vajda Sandor
Valančauskas Lukas
Velankar Sameer
Venclovas Ceslovas
Verburgt Jacob
Wallner Björn
Wu Tianqi
Xu Xianjin
Yang Lin
Yin Rujie
Yin Rujie
Zhang Yuanyuan
Zhang Zicong
Zhu Shaowen
Zieba Karolina
Zou Xiaoqin
Publication venue: 'Authorea, Inc.'
Publication date: 09/07/2023
Field of study

We present the results for CAPRI Round 54, the 5th joint CASP-CAPRI protein assembly prediction challenge. The Round offered 37 targets, including 14 homo-dimers, 3 homo-trimers, 13 hetero-dimers including 3 antibody-antigen complexes, and 7 large assemblies. On average ~70 CASP and CAPRI predictor groups, including more than 20 automatics servers, submitted models for each target. A total of 21941 models submitted by these groups and by 15 CAPRI scorer groups were evaluated using the CAPRI model quality measures and the DockQ score consolidating these measures. The prediction performance was quantified by a weighted score based on the number of models of acceptable quality or higher submitted by each group among their 5 best models. Results show substantial progress achieved across a significant fraction of the 60+ participating groups. High-quality models were produced for about 40% for the targets compared to 8% two years earlier, a remarkable improvement resulting from the wide use of the AlphaFold2 and AlphaFold-Multimer software. Creative use was made of the deep learning inference engines affording the sampling of a much larger number of models and enriching the multiple sequence alignments with sequences from various sources. Wide use was also made of the AlphaFold confidence metrics to rank models, permitting top performing groups to exceed the results of the public AlphaFold-Multimer version used as a yard stick. This notwithstanding, performance remained poor for complexes with antibodies and nanobodies, where evolutionary relationships between the binding partners are lacking, and for complexes featuring conformational flexibility, clearly indicating that the prediction of protein complexes remains a challenging problem

Utrecht University Repository

Impact of AlphaFold on structure prediction of protein complexes: The CASP15-CAPRI experiment

Author: Barradas-Bautista Didier
Bates Paul A
Beglov Dmitri
Bojarski Krzysztof K
Bonvin Alexandre M J J
Brysbaert Guillaume
Canner Sam
Cao Zhen
Cavallo Luigi
Chang Shan
Chawla Mohit
Chen Chen
Chen Xiao
Cheng Jianlin
Cheung Melyssa
Christoffer Charles W
Chu Lee-Shin
Cohen Tomer
Czaplewski Cezary
Danielsson Annemarie
Dapkunas Justas
Del Carpio Carlos A
Duan Rui
Dziadek Lukasz
Fernandez-Recio Juan
Fujuta Hayato
Gaardlos Margrethe
Ghani Usman
Gieldon Artur
Giri Nabin
Giulini Marco
Gray Jeffrey J
Guest Johnathan D
Guo Shuai
Guo Zhiye
Halfon Matan
Harmalkar Ameya
He Jiahua
He Xiaodong
Honorato Rodrigo V
Hou Chengyu
Huang Sheng-You
Ichiishi Eichiro
Jain Anika J
Jiang Shenda
Jimenez-Garcia Brian
Kagaya Yuki
Kannan Harini
Khan Omeir
Kihara Daisuke
Kiyota Yasuomi
Kobayashi Shinpei
Kong Ren
Kotelnikov Sergei
Kozakov Dima
Lee Jessica
Lensink Marc F
Li Hao
Lin Peicong
Liu Jian
Liwo Adam
Lu Xufeng
Lubecka Emilia A
Ma Xiaoliang
Marcisz Mateusz
Maszota-Zieleniak Martyna
Miyakawa Yuta
Morehead Alex
Nakamura Tsukasa
Negi Surendra S
Olechnovic Kliment
Oliva Romina
Padhorny Dzmitry
Pierce Brian G
Quadir Farhan
Qui Liming
Raouraoua Nessim
Ricciardelli Tiziana
Rodriguez-Lumbreras Luis A
Roel-Touris Jorge
Roy Raj S
Samsonov Sergey A
Schneidman-Duhovny Dina
Sekijima Masakazu
Shen Yang
Shi Hang
Shor Ben
Sieradzan Adam K
Slusarz Rafal
Smanta Rituparna
Sun Yuanfei
Takeda-Shitaka Mayuko
Tao Huanyu
Teixeira Joao M C
Terashi Genki
Vajda Sandor
Valancauskas Lukas
van Noort Charlotte
Velankar Sameer
Venclovas Ceslovas
Verburgt Jacob C
Wallner Bjorn
Wodak Shoshana J
Wu Tianqi
Xu Xianjin
Yang Lin
Yin Rujie
Yin Rujie
Zhang Yuanyuan
Zhang Zicong
Zhu Shaowen
Zieba Karolina
Zou Xiaoqin
Publication venue
Publication date: 01/12/2023
Field of study

We present the results for CAPRI Round 54, the 5th joint CASP-CAPRI protein assembly prediction challenge. The Round offered 37 targets, including 14 homodimers, 3 homo-trimers, 13 heterodimers including 3 antibody-antigen complexes, and 7 large assemblies. On average ~70 CASP and CAPRI predictor groups, including more than 20 automatics servers, submitted models for each target. A total of 21 941 models submitted by these groups and by 15 CAPRI scorer groups were evaluated using the CAPRI model quality measures and the DockQ score consolidating these measures. The prediction performance was quantified by a weighted score based on the number of models of acceptable quality or higher submitted by each group among their five best models. Results show substantial progress achieved across a significant fraction of the 60+ participating groups. High-quality models were produced for about 40% of the targets compared to 8% two years earlier. This remarkable improvement is due to the wide use of the AlphaFold2 and AlphaFold2-Multimer software and the confidence metrics they provide. Notably, expanded sampling of candidate solutions by manipulating these deep learning inference engines, enriching multiple sequence alignments, or integration of advanced modeling tools, enabled top performing groups to exceed the performance of a standard AlphaFold2-Multimer version used as a yard stick. This notwithstanding, performance remained poor for complexes with antibodies and nanobodies, where evolutionary relationships between the binding partners are lacking, and for complexes featuring conformational flexibility, clearly indicating that the prediction of protein complexes remains a challenging problem

Utrecht University Repository

Improving Protein–Ligand Interaction Modeling with cryo-EM Data, Templates, and Deep Learning in 2021 Ligand Model Challenge

Author: Jianlin Cheng
Nabin Giri
Publication venue: 'MDPI AG'
Publication date: 01/01/2023
Field of study

Elucidating protein–ligand interaction is crucial for studying the function of proteins and compounds in an organism and critical for drug discovery and design. The problem of protein–ligand interaction is traditionally tackled by molecular docking and simulation, which is based on physical forces and statistical potentials and cannot effectively leverage cryo-EM data and existing protein structural information in the protein–ligand modeling process. In this work, we developed a deep learning bioinformatics pipeline (DeepProLigand) to predict protein–ligand interactions from cryo-EM density maps of proteins and ligands. DeepProLigand first uses a deep learning method to predict the structure of proteins from cryo-EM maps, which is averaged with a reference (template) structure of the proteins to produce a combined structure to add ligands. The ligands are then identified and added into the structure to generate a protein–ligand complex structure, which is further refined. The method based on the deep learning prediction and template-based modeling was blindly tested in the 2021 EMDataResource Ligand Challenge and was ranked first in fitting ligands to cryo-EM density maps. These results demonstrate that the deep learning bioinformatics approach is a promising direction for modeling protein–ligand interactions on cryo-EM data using prior structural information

Directory of Open Access Journals

Consignment stock policy in an integrated vendor-buyer model for deteriorating item with stock dependent demand under buyer’s space limitation

Author: Bibhas Chandra Giri
Nabin Sen
Sudarshan Bardhan
Publication venue: 'EDP Sciences'
Publication date: 02/03/2021
Field of study

In this paper, a single-vendor single-buyer integrated inventory model for a deteriorating item with consignment stock policy is developed, assuming that the market demand is stock dependent and there is space limitation on the buyer’s storage capacity. Both equal and unequal shipments from the vendor to the buyer are considered. The effects of the buyer’s space capacity on the average cost, shipment size, and production batch are studied through numerical example. It is deduced that production rate is the key factor to determine whether to use equal or unequal shipment strategy. Sensitivity analysis is carried out to establish the robustness of the solutions of the models developed

EDP Sciences OAI-PMH repository (1.2.0)

Ebola paranoia in the age of the internet and social media

Author: BFN
Nabin Shrestha
Ranjan Pathak
Rosenbaum
Smith Giri
Publication venue: 'Medknow'
Publication date: 01/01/2015
Field of study

Crossref

Risk of second primary malignancy (SPM) and survival of adult patients with polycythemia vera (PV): A U.S. population-based study.

Author: Nabin Khanal
Smith Giri
Smrity Upadhyay
Vijaya Raj Bhatt
Publication venue: 'American Society of Clinical Oncology (ASCO)'
Publication date
Field of study

Crossref

A National Cancer Database (NCDB) analysis of factors affecting survival in stage II and III colon cancer.

Author: Nabin Khanal
Peter T. Silberstein
Smith Giri
Smrity Upadhyay
Publication venue: 'American Society of Clinical Oncology (ASCO)'
Publication date
Field of study

Crossref

DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning

Author: Cheng Jianlin
Giri Nabin
Morehead Alex
Quadir Farhan
Roy Raj S.
Soltanikazemi Elham
Publication venue
Publication date: 26/05/2022
Field of study

Predicted inter-chain residue-residue contacts can be used to build the quaternary structure of protein complexes from scratch. However, only a small number of methods have been developed to reconstruct protein quaternary structures using predicted inter-chain contacts. Here, we present an agent-based self-learning method based on deep reinforcement learning (DRLComplex) to build protein complex structures using inter-chain contacts as distance constraints. We rigorously tested DRLComplex on two standard datasets of homodimeric and heterodimeric protein complexes (i.e., the CASP-CAPRI homodimer and Std_32 heterodimer datasets) using both true and predicted interchain contacts as inputs. Utilizing true contacts as input, DRLComplex achieved high average TM-scores of 0.9895 and 0.9881 and a low average interface RMSD (I_RMSD) of 0.2197 and 0.92 on the two datasets, respectively. When predicted contacts are used, the method achieves TM-scores of 0.73 and 0.76 for homodimers and heterodimers, respectively. Our experiments find that the accuracy of reconstructed quaternary structures depends on the accuracy of the contact predictions. Compared to other optimization methods for reconstructing quaternary structures from inter-chain contacts, DRLComplex performs similar to an advanced gradient descent method and better than a Markov Chain Monte Carlo simulation method and a simulated annealing-based method, validating the effectiveness of DRLComplex for quaternary reconstruction of protein complexes.Comment: 20 pages, 8 figures, 12 tables. Under revie

arXiv.org e-Print Archive

Distribution of Microplastic Contamination in Sapta-Gandaki River System, Nepal

Author: Anuradha K. C.
Asmita Karki
Baburam Kandel
Basant Giri
Bhanu Neupane
Hari Paudyal
Khaga Raj Sharma
Nabin Adhikari
Publication venue: Open Science Framework
Publication date: 11/12/2023
Field of study

Microplastic (MP) contamination has been reported in many Rivers worldwide. However, there is an increasing concern regarding data quality, particularly in the studies that do not account for positive and negative controls. Additionally, spatiotemporal distribution of MP in transboundary Himalayan River is underexplored. Here, we report spatiotemporal distribution of MP in the second largest river of Nepal; Sapta-Gandaki River system which is 810 km long starting from Himalayan headstream to the Ganges with a catchment area of 46,300 km^2. A total of 120 integrated water samples were collected in pre and post monsoons from 30 sites (2850-140 masl) along three tributaries of Saptagandaki River. The MP data were corrected for procedural blanks (n=23) and positive controls (n=18). We found that the MPs count (cut off size ≥30μm) in pre (dry) monsoon time was significantly higher (61.2±27.8 MP/L, p<0.01) than in post monsoon (winter) time (24.7±10.8 MP/L). High count was observed in the sites near major cities and highways. A gradual increase in MPs count was observed as the River stretches up to downstream (r=-0.6). The shape, size, and color dominance were fragments>pellets>fibers, 30-100>100-250>250-500>500-5000µm, blue>black>transparent; respectively. Most MP particles consisted of polyethylene terephthalate, cellophane, polyethylene, polyvinyl chloride type material. Annual flux discharge calculation showed that Saptagandaki River discharges 0.7×10^8 MP/s. The findings of this study provide baseline data for MPs contamination in one of the major Himalayan River water systems of Nepal and the data could be useful to identify potential control measures

OSF Preprints