323 research outputs found
A Parallel and Efficient Algorithm for Learning to Match
Many tasks in data mining and related fields can be formalized as matching
between objects in two heterogeneous domains, including collaborative
filtering, link prediction, image tagging, and web search. Machine learning
techniques, referred to as learning-to-match in this paper, have been
successfully applied to the problems. Among them, a class of state-of-the-art
methods, named feature-based matrix factorization, formalize the task as an
extension to matrix factorization by incorporating auxiliary features into the
model. Unfortunately, making those algorithms scale to real world problems is
challenging, and simple parallelization strategies fail due to the complex
cross talking patterns between sub-tasks. In this paper, we tackle this
challenge with a novel parallel and efficient algorithm for feature-based
matrix factorization. Our algorithm, based on coordinate descent, can easily
handle hundreds of millions of instances and features on a single machine. The
key recipe of this algorithm is an iterative relaxation of the objective to
facilitate parallel updates of parameters, with guaranteed convergence on
minimizing the original objective function. Experimental results demonstrate
that the proposed method is effective on a wide range of matching problems,
with efficiency significantly improved upon the baselines while accuracy
retained unchanged.Comment: 10 pages, short version was published in ICDM 201
Hardware for Machine Learning: Challenges and Opportunities
Machine learning plays a critical role in extracting meaningful information out of the zetabytes of sensor data collected every day. For some applications, the goal is to analyze and understand the data to identify trends (e.g., surveillance, portable/wearable electronics); in other applications, the goal is to take immediate action based the data (e.g., robotics/drones, self-driving cars, smart Internet of Things). For many of these applications, local embedded processing near the sensor is preferred over the cloud due to privacy or latency concerns, or limitations in the communication bandwidth. However, at the sensor there are often stringent constraints on energy consumption and cost in addition to throughput and accuracy requirements. Furthermore, flexibility is often required such that the processing can be adapted for different applications or environments (e.g., update the weights and model in the classifier). In many applications, machine learning often involves transforming the input data into a higher dimensional space, which, along with programmable weights, increases data movement and consequently energy consumption. In this paper, we will discuss how these challenges can be addressed at various levels of hardware design ranging from architecture, hardware-friendly algorithms, mixed-signal circuits, and advanced technologies (including memories and sensors).United States. Defense Advanced Research Projects Agency (DARPA)Texas Instruments IncorporatedIntel Corporatio
Fractal aggregation kinetics contributions to thermal conductivity of nano-suspensions in unsteady thermal convection
Nano-suspensions (NS) exhibit unusual thermophysical behaviors once interparticle aggregations and the shear flows are imposed, which occur ubiquitously in applications but remain poorly understood, because existing theories have not paid these attentions but focused mainly on stationary NS. Here we report the critical role of time-dependent fractal aggregation in the unsteady thermal convection of NS systematically. Interestingly, a time ratio λ = t(p)/t(m) (t(p) is the aggregate characteristic time, t(m) the mean convection time) is introduced to characterize the slow and fast aggregations, which affect distinctly the thermal convection process over time. The increase of fractal dimension reduces both momentum and thermal boundary layers, meanwhile extends the time duration for the full development of thermal convection. We find a nonlinear growth relation of the momentum layer, but a linear one of the thermal layer, with the increase of primary volume fraction of nanoparticles for different fractal dimensions. We present two global fractal scaling formulas to describe these two distinct relations properly, respectively. Our theories and methods in this study provide new evidence for understanding shear-flow and anomalous heat transfer of NS associated non-equilibrium aggregation processes by fractal laws, moreover, applications in modern micro-flow technology in nanodevices
Causal relationship between iron deficiency anemia and asthma: a Mendelian randomization study
BackgroundObservational studies have suggested an association between iron deficiency anemia (IDA) and asthma, which may affect the occurrence of asthma. However, whether IDA is a new management goal for asthma remains to be determined.ObjectiveWe conducted a two-sample Mendelian randomization(MR)analysis to assess the association between IDA and asthma.MethodsWe performed a two-sample MR study to assess a causal relationship between IDA (ncase = 12,434, ncontrol = 59,827) and asthma (ncase = 20,629, ncontrol = 135,449). Inverse variance weighted (IVW) was used as the primary method for the analyses. Furthermore, we used weighted medians and MR-Egger to enhance robustness. Data linking genetic variation to IDA and asthma were combined to assess the impact of IDA on asthma risk.ResultsThere are five single nucleotide polymorphisms (SNPs) were used as genetic tool variables for exposure factors. Genetically determined IDA was significantly associated with an increased risk of asthma (OR = 1.37, 95% CI: 1.09–1.72, p = 0.007). There was little heterogeneity in the MR studies and no evidence of level pleiotropy was found.ConclusionsIn our MR study, our findings emphasize that IDA may be associated with a high risk of asthma, indicating a potential role for IDA in the development of asthma. Future research needs to elucidate its potential mechanisms to pave the way for the prevention and treatment of asthma
Increasing entropy for colloidal stabilization
Stability is of paramount importance in colloidal applications. Attraction between colloidal particles is believed to lead to particle aggregation and phase separation; hence, stability improvement can be achieved through either increasing repulsion or reducing attraction by modifying the fluid medium or by using additives. Two traditional mechanisms for colloidal stability are electrostatic stabilization and steric stabilization. However, stability improvement by mixing attractive and unstable particles has rarely been considered. Here, we emphasize the function of mixing entropy in colloidal stabilization. Dispersion stability improvement is demonstrated by mixing suspensions of attractive nanosized titania spheres and platelets. A three-dimensional phase diagram is proposed to illustrate the collaborative effects of particle mixing and particle attraction on colloidal stability. This discovery provides a novel method for enhancing colloidal stability and opens a novel opportunity for engineering applications
A Novel Equivalent Agglomeration Model for Heat Conduction Enhancement in Nanofluids
We propose a multilevel equivalent agglomeration (MEA) model in which all particles in an irregular cluster are treated as a new particle with equivalent volume, the liquid molecules wrapping the cluster and in the gaps are considered to assemble on the surface of new particle as mixing nanolayer (MNL), the thermal conductivity in MNL is assumed to satisfy exponential distribution. Theoretical predictions for thermal conductivity enhancement are highly in agreement with the classical experimental data. Also, we first try to employ TEM information quantitatively to offer probable reference agglomeration ratio (not necessary a very precise value) to just test rational estimations range by present model. The comparison results indicate the satisfactory priori agglomeration ratio estimations range from renovated model
Context-Dependent Translation Selection Using Convolutional Neural Network
Abstract We propose a novel method for translation selection in statistical machine translation, in which a convolutional neural network is employed to judge the similarity between a phrase pair in two languages. The specifically designed convolutional architecture encodes not only the semantic similarity of the translation pair, but also the context containing the phrase in the source language. Therefore, our approach is able to capture context-dependent semantic similarities of translation pairs. We adopt a curriculum learning strategy to train the model: we classify the training examples into easy, medium, and difficult categories, and gradually build the ability of representing phrases and sentencelevel contexts by using training examples from easy to difficult. Experimental results show that our approach significantly outperforms the baseline system by up to 1.4 BLEU points
- …