85 research outputs found
A high-resolution dynamical view on momentum methods for over-parameterized neural networks
Due to the simplicity and efficiency of the first-order gradient method, it
has been widely used in training neural networks. Although the optimization
problem of the neural network is non-convex, recent research has proved that
the first-order method is capable of attaining a global minimum for training
over-parameterized neural networks, where the number of parameters is
significantly larger than that of training instances. Momentum methods,
including heavy ball method (HB) and Nesterov's accelerated method (NAG), are
the workhorse first-order gradient methods owning to their accelerated
convergence. In practice, NAG often exhibits better performance than HB.
However, current research fails to distinguish their convergence difference in
training neural networks. Motivated by this, we provide convergence analysis of
HB and NAG in training an over-parameterized two-layer neural network with ReLU
activation, through the lens of high-resolution dynamical systems and neural
tangent kernel (NTK) theory. Compared to existing works, our analysis not only
establishes tighter upper bounds of the convergence rate for both HB and NAG,
but also characterizes the effect of the gradient correction term, which leads
to the acceleration of NAG over HB. Finally, we validate our theoretical result
on three benchmark datasets.Comment: 19 page
Active module identification in intracellular networks using a memetic algorithm with a new binary decoding scheme
BACKGROUND: Active modules are connected regions in biological network which show significant changes in expression over particular conditions. The identification of such modules is important since it may reveal the regulatory and signaling mechanisms that associate with a given cellular response. RESULTS: In this paper, we propose a novel active module identification algorithm based on a memetic algorithm. We propose a novel encoding/decoding scheme to ensure the connectedness of the identified active modules. Based on the scheme, we also design and incorporate a local search operator into the memetic algorithm to improve its performance. CONCLUSION: The effectiveness of proposed algorithm is validated on both small and large protein interaction networks
GraphMoco:a Graph Momentum Contrast Model that Using Multimodel Structure Information for Large-scale Binary Function Representation Learning
In the field of cybersecurity, the ability to compute similarity scores at
the function level is import. Considering that a single binary file may contain
an extensive amount of functions, an effective learning framework must exhibit
both high accuracy and efficiency when handling substantial volumes of data.
Nonetheless, conventional methods encounter several limitations. Firstly,
accurately annotating different pairs of functions with appropriate labels
poses a significant challenge, thereby making it difficult to employ supervised
learning methods without risk of overtraining on erroneous labels. Secondly,
while SOTA models often rely on pre-trained encoders or fine-grained graph
comparison techniques, these approaches suffer from drawbacks related to time
and memory consumption. Thirdly, the momentum update algorithm utilized in
graph-based contrastive learning models can result in information leakage.
Surprisingly, none of the existing articles address this issue. This research
focuses on addressing the challenges associated with large-scale BCSD. To
overcome the aforementioned problems, we propose GraphMoco: a graph momentum
contrast model that leverages multimodal structural information for efficient
binary function representation learning on a large scale. Our approach employs
a CNN-based model and departs from the usage of memory-intensive pre-trained
models. We adopt an unsupervised learning strategy that effectively use the
intrinsic structural information present in the binary code. Our approach
eliminates the need for manual labeling of similar or dissimilar
information.Importantly, GraphMoco demonstrates exceptional performance in
terms of both efficiency and accuracy when operating on extensive datasets. Our
experimental results indicate that our method surpasses the current SOTA
approaches in terms of accuracy.Comment: 22 pages,7 figure
A FAST ITERATIVE METHOD FOR SOLVING THE EIKONAL EQUATION ON TRIANGULATED SURFACES
This paper presents an efficient, fine-grained parallel algorithm for solving the Eikonal equation on triangular meshes. The Eikonal equation, and the broader class of Hamilton-Jacobi equations to which it belongs, have a wide range of applications from geometric optics and seismology to biological modeling and analysis of geometry and images. The ability to solve such equations accurately and efficiently provides new capabilities for exploring and visualizing parameter spaces and for solving inverse problems that rely on such equations in the forward model. Efficient solvers on state-of-the-art, parallel architectures require new algorithms that are not, in many cases, optimal, but are better suited to synchronous updates of the solution. In previous work [W. K. Jeong and R. T. Whitaker, SIAM J. Sci. Comput., 30 (2008), pp. 2512-2534], the authors proposed the fast iterative method (FIM) to efficiently solve the Eikonal equation on regular grids. In this paper we extend the fast iterative method to solve Eikonal equations efficiently on triangulated domains on the CPU and on parallel architectures, including graphics processors. We propose a new local update scheme that provides solutions of first-order accuracy for both architectures. We also propose a novel triangle-based update scheme and its corresponding data structure for efficient irregular data mapping to parallel single-instruction multiple-data (SIMD) processors. We provide detailed descriptions of the implementations on a single CPU, a multicore CPU with shared memory, and SIMD architectures with comparative results against state-of-the-art Eikonal solvers.open4
- …