630 research outputs found

    DIANet: Dense-and-Implicit Attention Network

    Full text link
    Attention networks have successfully boosted the performance in various vision problems. Previous works lay emphasis on designing a new attention module and individually plug them into the networks. Our paper proposes a novel-and-simple framework that shares an attention module throughout different network layers to encourage the integration of layer-wise information and this parameter-sharing module is referred as Dense-and-Implicit-Attention (DIA) unit. Many choices of modules can be used in the DIA unit. Since Long Short Term Memory (LSTM) has a capacity of capturing long-distance dependency, we focus on the case when the DIA unit is the modified LSTM (refer as DIA-LSTM). Experiments on benchmark datasets show that the DIA-LSTM unit is capable of emphasizing layer-wise feature interrelation and leads to significant improvement of image classification accuracy. We further empirically show that the DIA-LSTM has a strong regularization ability on stabilizing the training of deep networks by the experiments with the removal of skip connections or Batch Normalization in the whole residual network. The code is released at https://github.com/gbup-group/DIANet

    Modeling HIV Drug Resistance

    Get PDF
    Despite the development of antiviral drugs and the optimization of therapies, the emergence of drug resistance remains one of the most challenging issues for successful treatments of HIV-infected patients. The availability of massive HIV drug resistance data provides us not only exciting opportunities for HIV research, but also the curse of high dimensionality. We provide several statistical learning methods in this thesis to analyze sequence data from different perspectives. We propose a hierarchical random graph approach to identify possible covariation among residue-specific mutations. Viral progression pathways were inferred using an EM-like algorithm in literature, and we present a normalization method to improve the accuracy of parameter estimations. To predict the drug resistance from genotypic data, we also build a novel regression model utilizing the information from progression pathways. Finally, we introduce a computational approach to determine viral fitness, for which our initial computational results closely agree with experimental results. Work on two other topics are presented in the Appendices. Latent class models find applications in several areas including social and biological sciences. Finding explicit maximum likelihood estimation has been elusive. We present a positive solution to a conjecture on a special latent class model proposed by Bernd Sturmfels from UC Berkeley. Monomial ideals provide ubiquitous links between combinatorics and commutative algebra. Irreducible decomposition of monomial ideals is a basic computational problem and it finds applications in several areas. We present two algorithms for finding irreducible decomposition of monomial ideals

    APPLICATION OF MACHINE LEARNING TO CHF MODELLING

    Get PDF
    Accurate prediction of CHF is still a challenging issue in the study of boiling heat transfer. Many factors contribute to the occurrence of CHF and the various trigger mechanisms are proposed to unravel physical phenomena behind CHF. However, those mechanisms cannot cover the multiple primary factors simultaneously and even some of them still remain controversially unresolved. In light of the complexity and difficulty of CHF modelling, hereby an ensemble-learning based framework is proposed to model and predict CHF based on the databank of CHF. Some prior trials have been done for three primary aspects of dominant factors, that is, surface morphology, geometrical dimension and operation condition. These three primary constituents are respectively analyzed though three different sub-models of the ensemble framework in Chapter 3, 4 and 5. In Chapter Three, relevant experiments about micro-pillar enhanced CHF are reviewed and the corresponding databank of microstructure enhanced CHF is compiled based on those CHF experiments from published papers. Although the impacts of micro-pillars on CHF are still not clear, through qualitative analyses, the parametrical trends of CHF with respect to geometrical parameters of pillar array can be roughly foreseen. Meanwhile, this study also evaluates performance of prediction accuracy among four current physical models of microstructure-enhanced CHF. Comparative results show that two capillary wicking models have higher prediction accuracy. Particularly, a special terminology, zero-infinity convergence, is introduced to discuss the parametrical trends of CHF and qualitatively assess veracity of two capillary wicking models. Given the drawbacks of current physical models, the DBN is proposed to more accurately predict CHF and study parametric trends of CHF based on the microstructure enhanced CHF databank. Different from the training process of other regression modelling problems, constrained CHF points, which are artificially derived from the training data datasets, are required to be coupled with the raw training datasets for achieving the zero-infinity convergence of the DBN based CHF model, exhibiting accurate parametric trends of CHF and improving the prediction accuracy. This new training technique provides a new reliable solution to the similar constrained machine learning problems. Numerical results demonstrate that DBN can achieve the best performance of CHF prediction in terms of prediction accuracy. Through studying parametrical trends of CHF reveals that micro pillar arrays with the same parameters on heat transfer substrates with different dimensional sizes presents different CHF enhancement profiles. The presented methodology provides new insights for CHF modelling in pool boiling enhanced by other surface modification techniques, including porous layer coating, nanoparticle deposition, textured roughen, and nanowire fabrication. The effects of dimensions and materials of boiling surfaces on CHF are correlated and studied through the GRNN modelling in Chapter Four. Instead of inputting all parameters that indicate the thermal properties of materials into the trained model, the aggregated parameters from the primitive parameters of thermal properties, thermal activity and thermal diffusivity, are utilized as the input parameters of the trained model. This technique not only could capture the effects of thermal properties of materials on CHF effectively but also helps reduce the computational loads. The trained model shows the similar parametric trends of CHF to that of the traditional empirical correlation with respect to the thermal activity. If the thermal activity of heat transfer substrate is beyond a certain value, the corresponding effect of thermal activity will be absent, which somehow implies that the thickness of heat transfer substrate will not impact CHF after the asymptomatic thickness is reached. On the other hand, thermal diffusivity still affects CHF occurrence even if the effect of thermal activity is negligible. When coming to the effect of dimension size on CHF, it was found that when the side length of square heat transfer substrate is 5 times greater than the capillary length of working fluid, the CHF will be independent on the side length. Otherwise, CHF will be affected by the side length, and the influence of side length on CHF reaches ultimate if the side length of square boiling surface is exactly equal to the Raleigh-Plateau instability wavelength. This instability wavelength is only dependent on the thermal properties of working fluids, meaning that the optimal side length for CHF optimization is only related to the thermal properties of working fluid, namely, the surface tension, and the liquid and vapor densities of working fluid. In Chapter Five of this study, n-support vector machine is adopted to explore and study experimental strategies for the data-driven approaches of CHF look-up table construction, on the basis of sparingly-distributed experimental CHF data points. In the virtue of the CHF look-up table of Groeneveld et al (2007), those CHF data was used as the reference data of this research. In this data collection, CHF data of the subcooled flow boiling (Xe \u3c 0) is chosen to concentrate on the PWR steady-state condition because the in the normal operation of PWR, the system is under the subcooled flow boiling. The numerical results have demonstrated that Ī½-SVM trained by well sparsely-distributed training data in the parameter region of interest (pressure and mass flux) can yield a fairly acceptable degree of CHF prediction accuracy. Procuring training data points that can imply the parametric behaviors of CHF with respect to pressure and mass flux for support vector machine is the essential key of machine learning to achieving a high level of CHF prediction accuracy. For capturing the pressure-variant CHF behavior, training data that are in the proximity of the CHF inflection point significantly contribute to the improvement of prediction accuracy. Hence, training data preparation physics-informed with knowledge of CHF inflection points definitely augments the prediction accuracy of CHF. How the parametrical trends of CHF with respect to pressure and mass flux are close to the linear trends determines the level of prediction accuracy when lacking of a good spread of training data points. Besides, it is found that CHF extrapolation to a higher pressure with many data points collected at different low pressures can be effectively achieved by SVM if a few CHF data points are available under the high pressure, especially for PWR pressure of 15.5 MPa. This announces a possibility of strategic integration experiments between high pressure and low pressure, reducing experimental costs associated with the high pressure testing in terms of efforts and money. The proposed methodologies provides engineers and experimentalists with useful strategies to construct the look-up table tabulation of advanced cladding materials of ATFs. It is found out that there are multiple sub-problems that could be divided for CHF prediction and each sub-problem has its individual suitable machine learning model. Those prior work done by this study proves that the data-driven CHF modelling by sub-models can provide accurate CHF prediction under various scenarios and correct parametrical trends with respect to separate variables. Last but not least, another contribution of this thesis to the field of boiling heat transfer is that two databanks of experimental CHF data are compiled for the CHF enhancement by microstructures. The compiled databanks provide useful information and guidelines to the future design of surface structures that will possibly be applied to heat exchanger and nuclear fuel rod

    On de novo Bridging Paired-end RNA-seq Data

    Full text link
    The high-throughput short-reads RNA-seq protocols often produce paired-end reads, with the middle portion of the fragments being unsequenced. We explore if the full-length fragments can be computationally reconstructed from the sequenced two ends in the absence of the reference genome - a problem here we refer to as de novo bridging. Solving this problem provides longer, more informative RNA-seq reads, and benefits downstream RNA-seq analysis such as transcript assembly, expression quantification, and splicing differential analysis. However, de novo bridging is a challenging and complicated task owing to alternative splicing, transcript noises, and sequencing errors. It remains unclear if the data provides sufficient information for accurate bridging, let alone efficient algorithms that determine the true bridges. Methods have been proposed to bridge paired-end reads in the presence of reference genome (called reference-based bridging), but the algorithms are far away from scaling for de novo bridging as the underlying compacted de Bruijn graph(cdBG) used in the latter task often contains millions of vertices and edges. We designed a new truncated Dijkstra's algorithm for this problem, and proposed a novel algorithm that reuses the shortest path tree to avoid running the truncated Dijkstra's algorithm from scratch for all vertices for further speeding up. These innovative techniques result in scalable algorithms that can bridge all paired-end reads in a cdBG with millions of vertices. Our experiments showed that paired-end RNA-seq reads can be accurately bridged to a large extent. The resulting tool is freely available at https://github.com/Shao-Group/rnabridge-denovo.Comment: 10 pages, 4 figure
    • ā€¦
    corecore