323 research outputs found

    A Parallel and Efficient Algorithm for Learning to Match

    Full text link
    Many tasks in data mining and related fields can be formalized as matching between objects in two heterogeneous domains, including collaborative filtering, link prediction, image tagging, and web search. Machine learning techniques, referred to as learning-to-match in this paper, have been successfully applied to the problems. Among them, a class of state-of-the-art methods, named feature-based matrix factorization, formalize the task as an extension to matrix factorization by incorporating auxiliary features into the model. Unfortunately, making those algorithms scale to real world problems is challenging, and simple parallelization strategies fail due to the complex cross talking patterns between sub-tasks. In this paper, we tackle this challenge with a novel parallel and efficient algorithm for feature-based matrix factorization. Our algorithm, based on coordinate descent, can easily handle hundreds of millions of instances and features on a single machine. The key recipe of this algorithm is an iterative relaxation of the objective to facilitate parallel updates of parameters, with guaranteed convergence on minimizing the original objective function. Experimental results demonstrate that the proposed method is effective on a wide range of matching problems, with efficiency significantly improved upon the baselines while accuracy retained unchanged.Comment: 10 pages, short version was published in ICDM 201

    Hardware for Machine Learning: Challenges and Opportunities

    Get PDF
    Machine learning plays a critical role in extracting meaningful information out of the zetabytes of sensor data collected every day. For some applications, the goal is to analyze and understand the data to identify trends (e.g., surveillance, portable/wearable electronics); in other applications, the goal is to take immediate action based the data (e.g., robotics/drones, self-driving cars, smart Internet of Things). For many of these applications, local embedded processing near the sensor is preferred over the cloud due to privacy or latency concerns, or limitations in the communication bandwidth. However, at the sensor there are often stringent constraints on energy consumption and cost in addition to throughput and accuracy requirements. Furthermore, flexibility is often required such that the processing can be adapted for different applications or environments (e.g., update the weights and model in the classifier). In many applications, machine learning often involves transforming the input data into a higher dimensional space, which, along with programmable weights, increases data movement and consequently energy consumption. In this paper, we will discuss how these challenges can be addressed at various levels of hardware design ranging from architecture, hardware-friendly algorithms, mixed-signal circuits, and advanced technologies (including memories and sensors).United States. Defense Advanced Research Projects Agency (DARPA)Texas Instruments IncorporatedIntel Corporatio

    Fractal aggregation kinetics contributions to thermal conductivity of nano-suspensions in unsteady thermal convection

    Get PDF
    Nano-suspensions (NS) exhibit unusual thermophysical behaviors once interparticle aggregations and the shear flows are imposed, which occur ubiquitously in applications but remain poorly understood, because existing theories have not paid these attentions but focused mainly on stationary NS. Here we report the critical role of time-dependent fractal aggregation in the unsteady thermal convection of NS systematically. Interestingly, a time ratio λ = t(p)/t(m) (t(p) is the aggregate characteristic time, t(m) the mean convection time) is introduced to characterize the slow and fast aggregations, which affect distinctly the thermal convection process over time. The increase of fractal dimension reduces both momentum and thermal boundary layers, meanwhile extends the time duration for the full development of thermal convection. We find a nonlinear growth relation of the momentum layer, but a linear one of the thermal layer, with the increase of primary volume fraction of nanoparticles for different fractal dimensions. We present two global fractal scaling formulas to describe these two distinct relations properly, respectively. Our theories and methods in this study provide new evidence for understanding shear-flow and anomalous heat transfer of NS associated non-equilibrium aggregation processes by fractal laws, moreover, applications in modern micro-flow technology in nanodevices

    Causal relationship between iron deficiency anemia and asthma: a Mendelian randomization study

    Get PDF
    BackgroundObservational studies have suggested an association between iron deficiency anemia (IDA) and asthma, which may affect the occurrence of asthma. However, whether IDA is a new management goal for asthma remains to be determined.ObjectiveWe conducted a two-sample Mendelian randomization(MR)analysis to assess the association between IDA and asthma.MethodsWe performed a two-sample MR study to assess a causal relationship between IDA (ncase = 12,434, ncontrol = 59,827) and asthma (ncase = 20,629, ncontrol = 135,449). Inverse variance weighted (IVW) was used as the primary method for the analyses. Furthermore, we used weighted medians and MR-Egger to enhance robustness. Data linking genetic variation to IDA and asthma were combined to assess the impact of IDA on asthma risk.ResultsThere are five single nucleotide polymorphisms (SNPs) were used as genetic tool variables for exposure factors. Genetically determined IDA was significantly associated with an increased risk of asthma (OR = 1.37, 95% CI: 1.09–1.72, p = 0.007). There was little heterogeneity in the MR studies and no evidence of level pleiotropy was found.ConclusionsIn our MR study, our findings emphasize that IDA may be associated with a high risk of asthma, indicating a potential role for IDA in the development of asthma. Future research needs to elucidate its potential mechanisms to pave the way for the prevention and treatment of asthma

    Increasing entropy for colloidal stabilization

    Get PDF
    Stability is of paramount importance in colloidal applications. Attraction between colloidal particles is believed to lead to particle aggregation and phase separation; hence, stability improvement can be achieved through either increasing repulsion or reducing attraction by modifying the fluid medium or by using additives. Two traditional mechanisms for colloidal stability are electrostatic stabilization and steric stabilization. However, stability improvement by mixing attractive and unstable particles has rarely been considered. Here, we emphasize the function of mixing entropy in colloidal stabilization. Dispersion stability improvement is demonstrated by mixing suspensions of attractive nanosized titania spheres and platelets. A three-dimensional phase diagram is proposed to illustrate the collaborative effects of particle mixing and particle attraction on colloidal stability. This discovery provides a novel method for enhancing colloidal stability and opens a novel opportunity for engineering applications

    A Novel Equivalent Agglomeration Model for Heat Conduction Enhancement in Nanofluids

    Get PDF
    We propose a multilevel equivalent agglomeration (MEA) model in which all particles in an irregular cluster are treated as a new particle with equivalent volume, the liquid molecules wrapping the cluster and in the gaps are considered to assemble on the surface of new particle as mixing nanolayer (MNL), the thermal conductivity in MNL is assumed to satisfy exponential distribution. Theoretical predictions for thermal conductivity enhancement are highly in agreement with the classical experimental data. Also, we first try to employ TEM information quantitatively to offer probable reference agglomeration ratio (not necessary a very precise value) to just test rational estimations range by present model. The comparison results indicate the satisfactory priori agglomeration ratio estimations range from renovated model

    Context-Dependent Translation Selection Using Convolutional Neural Network

    Get PDF
    Abstract We propose a novel method for translation selection in statistical machine translation, in which a convolutional neural network is employed to judge the similarity between a phrase pair in two languages. The specifically designed convolutional architecture encodes not only the semantic similarity of the translation pair, but also the context containing the phrase in the source language. Therefore, our approach is able to capture context-dependent semantic similarities of translation pairs. We adopt a curriculum learning strategy to train the model: we classify the training examples into easy, medium, and difficult categories, and gradually build the ability of representing phrases and sentencelevel contexts by using training examples from easy to difficult. Experimental results show that our approach significantly outperforms the baseline system by up to 1.4 BLEU points
    corecore