47 research outputs found
Protein-Centric Omics Analysis Reveals Circulating Complements Linked to Non-viral Liver Diseases As Potential Therapeutic Targets
BACKGROUND/AIMS: To evaluate the causal correlation between complement components and non-viral liver diseases and their potential use as druggable targets.
METHODS: We conducted Mendelian randomization (MR) to assess the causal role of circulating complements in the risk of non-viral liver diseases. A complement-centric protein interaction network was constructed to explore biological functions and identify potential therapeutic options.
RESULTS: In the MR analysis, genetically predicted levels of complement C1q C chain (C1QC) were positively associated with the risk of autoimmune hepatitis (odds ratio 1.125, 95% confidence interval 1.018-1.244), while complement factor H-related protein 5 (CFHR5) was positively associated with the risk of primary sclerosing cholangitis (PSC;1.193, 1.048- 1.357). On the other hand, CFHR1 (0.621, 0.497-0.776) and CFHR2 (0.824, 0.703-0.965) were inversely associated with the risk of alcohol-related cirrhosis. There were also significant inverse associations between C8 gamma chain (C8G) and PSC (0.832, 0.707-0.979), as well as the risk of metabolic dysfunction-associated steatotic liver disease (1.167, 1.036-1.314). Additionally, C1S (0.111, 0.018-0.672), C7 (1.631, 1.190-2.236), and CFHR2 (1.279, 1.059-1.546) were significantly associated with the risk of hepatocellular carcinoma. Proteins from the complement regulatory networks and various liver diseaserelated proteins share common biological processes. Furthermore, potential therapeutic drugs for various liver diseases were identified through drug repurposing based on the complement regulatory network.
CONCLUSION: Our study suggests that certain complement components, including C1S, C1QC, CFHR1, CFHR2, CFHR5, C7, and C8G, might play a role in non-viral liver diseases and could be potential targets for drug development
Flash Flood Risk Analysis Based on Machine Learning Techniques in the Yunnan Province, China
Flash flood, one of the most devastating weather-related hazards in the world, has become more and more frequent in past decades. For the purpose of flood mitigation, it is necessary to understand the distribution of flash flood risk. In this study, artificial intelligence (Least squares support vector machine: LSSVM) and classical canonical method (Logistic regression: LR) are used to assess the flash flood risk in the Yunnan Province based on historical flash flood records and 13 meteorological, topographical, hydrological and anthropological factors. Results indicate that: (1) the LSSVM with Radial basis function (RBF) Kernel works the best (Accuracy = 0.79) and the LR is the worst (Accuracy = 0.75) in testing; (2) flash flood risk distribution identified by the LSSVM in Yunnan province is near normal distribution; (3) the high-risk areas are mainly concentrated in the central and southeastern regions, where with a large curve number; and (4) the impact factors contributing the flash flood risk map from higher to low are: Curve number > Digital elevation > Slope > River density > Flash Flood preventions > Topographic Wetness Index > annual maximum 24 h precipitation > annual maximum 3 h precipitation
Microarray scanner calibration curves: characteristics and implications
BACKGROUND: Microarray-based measurement of mRNA abundance assumes a linear relationship between the fluorescence intensity and the dye concentration. In reality, however, the calibration curve can be nonlinear. RESULTS: By scanning a microarray scanner calibration slide containing known concentrations of fluorescent dyes under 18 PMT gains, we were able to evaluate the differences in calibration characteristics of Cy5 and Cy3. First, the calibration curve for the same dye under the same PMT gain is nonlinear at both the high and low intensity ends. Second, the degree of nonlinearity of the calibration curve depends on the PMT gain. Third, the two PMTs (for Cy5 and Cy3) behave differently even under the same gain. Fourth, the background intensity for the Cy3 channel is higher than that for the Cy5 channel. The impact of such characteristics on the accuracy and reproducibility of measured mRNA abundance and the calculated ratios was demonstrated. Combined with simulation results, we provided explanations to the existence of ratio underestimation, intensity-dependence of ratio bias, and anti-correlation of ratios in dye-swap replicates. We further demonstrated that although Lowess normalization effectively eliminates the intensity-dependence of ratio bias, the systematic deviation from true ratios largely remained. A method of calculating ratios based on concentrations estimated from the calibration curves was proposed for correcting ratio bias. CONCLUSION: It is preferable to scan microarray slides at fixed, optimal gain settings under which the linearity between concentration and intensity is maximized. Although normalization methods improve reproducibility of microarray measurements, they appear less effective in improving accuracy
In Situ and Operando Investigation of the Dynamic Morphological and Phase Changes of Selenium-doped Germanium Electrode during (De)Lithiation Processes
To understand the effect of selenium doping on the good cycling performance and rate capability of a Ge0.9Se0.1 electrode, the dynamic morphological and phase changes of the Ge0.9Se0.1 electrode were investigated by synchrotron-based operando transmission X-ray microscopy (TXM) imaging, X-ray diffraction (XRD), and X-ray absorption spectroscopy (XAS). The TXM results show that the Ge0.9Se0.1 particle retains its original shape after a large volume change induced by (de)lithiation and undergoes a more sudden morphological and optical density change than pure Ge. The difference between Ge0.9Se0.1 and Ge is attributed to a super-ionically conductive Li–Se–Ge network formed inside Ge0.9Se0.1 particles, which contributes to fast Li-ion pathways into the particle and nano-structuring of Ge as well as buffering the volume change of Ge. The XRD and XAS results confirm the formation of a Li–Se–Ge network and reveal that the Li–Se–Ge phase forms during the early stages of lithiation and is an inactive phase. The Li–Se–Ge network also can suppress the formation of the crystalline Li15Ge4 phase. These in situ and operando results reveal the effect of the in situ formed, super-ionically conductive, and inactive network on the cycling performance of Li-ion batteries and shed light on the design of high capacity electrode materials
Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples
<p>Abstract</p> <p>Background</p> <p>Genome-wide association studies (GWAS) aim to identify genetic variants (usually single nucleotide polymorphisms [SNPs]) across the entire human genome that are associated with phenotypic traits such as disease status and drug response. Highly accurate and reproducible genotype calling are paramount since errors introduced by calling algorithms can lead to inflation of false associations between genotype and phenotype. Most genotype calling algorithms currently used for GWAS are based on multiple arrays. Because hundreds of gigabytes (GB) of raw data are generated from a GWAS, the samples are typically partitioned into batches containing subsets of the entire dataset for genotype calling. High call rates and accuracies have been achieved. However, the effects of batch size (i.e., number of chips analyzed together) and of batch composition (i.e., the choice of chips in a batch) on call rate and accuracy as well as the propagation of the effects into significantly associated SNPs identified have not been investigated. In this paper, we analyzed both the batch size and batch composition for effects on the genotype calling algorithm BRLMM using raw data of 270 HapMap samples analyzed with the Affymetrix Human Mapping 500 K array set.</p> <p>Results</p> <p>Using data from 270 HapMap samples interrogated with the Affymetrix Human Mapping 500 K array set, three different batch sizes and three different batch compositions were used for genotyping using the BRLMM algorithm. Comparative analysis of the calling results and the corresponding lists of significant SNPs identified through association analysis revealed that both batch size and composition affected genotype calling results and significantly associated SNPs. Batch size and batch composition effects were more severe on samples and SNPs with lower call rates than ones with higher call rates, and on heterozygous genotype calls compared to homozygous genotype calls.</p> <p>Conclusion</p> <p>Batch size and composition affect the genotype calling results in GWAS using BRLMM. The larger the differences in batch sizes, the larger the effect. The more homogenous the samples in the batches, the more consistent the genotype calls. The inconsistency propagates to the lists of significantly associated SNPs identified in downstream association analysis. Thus, uniform and large batch sizes should be used to make genotype calls for GWAS. In addition, samples of high homogeneity should be placed into the same batch.</p
Assessing Reproducibility of Inherited Variants Detected With Short-Read Whole Genome Sequencing
Background: Reproducible detection of inherited variants with whole genome sequencing (WGS) is vital for the implementation of precision medicine and is a complicated process in which each step affects variant call quality. Systematically assessing reproducibility of inherited variants with WGS and impact of each step in the process is needed for understanding and improving quality of inherited variants from WGS.
Results: To dissect the impact of factors involved in detection of inherited variants with WGS, we sequence triplicates of eight DNA samples representing two populations on three short-read sequencing platforms using three library kits in six labs and call variants with 56 combinations of aligners and callers. We find that bioinformatics pipelines (callers and aligners) have a larger impact on variant reproducibility than WGS platform or library preparation. Single-nucleotide variants (SNVs), particularly outside difficult-to-map regions, are more reproducible than small insertions and deletions (indels), which are least reproducible when \u3e 5 bp. Increasing sequencing coverage improves indel reproducibility but has limited impact on SNVs above 30×.
Conclusions: Our findings highlight sources of variability in variant detection and the need for improvement of bioinformatics pipelines in the era of precision medicine with WGS
Assessing reproducibility of inherited variants detected with short-read whole genome sequencing
Background: Reproducible detection of inherited variants with whole genome sequencing (WGS) is vital for the implementation of precision medicine and is a complicated process in which each step affects variant call quality. Systematically assessing reproducibility of inherited variants with WGS and impact of each step in the process is needed for understanding and improving quality of inherited variants from WGS. Results: To dissect the impact of factors involved in detection of inherited variants with WGS, we sequence triplicates of eight DNA samples representing two populations on three short-read sequencing platforms using three library kits in six labs and call variants with 56 combinations of aligners and callers. We find that bioinformatics pipelines (callers and aligners) have a larger impact on variant reproducibility than WGS platform or library preparation. Single-nucleotide variants (SNVs), particularly outside difficult-to-map regions, are more reproducible than small insertions and deletions (indels), which are least reproducible when > 5 bp. Increasing sequencing coverage improves indel reproducibility but has limited impact on SNVs above 30x. Conclusions: Our findings highlight sources of variability in variant detection and the need for improvement of bioinformatics pipelines in the era of precision medicine with WGS.Peer reviewe