85,696 research outputs found
Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations
Web applications are increasingly becoming the primary platform for AI
service delivery, making in-browser deep learning (DL) inference more
prominent. However, current in-browser inference systems fail to effectively
utilize advanced web programming techniques and customize kernels for various
client devices, leading to suboptimal performance.
To address the issues, this paper presents the first in-browser inference
system, nn-JIT.web, which enables just-in-time (JIT) auto-generation of
optimized kernels for both CPUs and GPUs during inference. The system achieves
this by using two novel web programming techniques that can significantly
reduce kernel generation time, compared to other tensor compilers such as TVM,
while maintaining or even improving performance. The first technique,
Tensor-Web Compiling Co-Design, lowers compiling costs by unifying tensor and
web compiling and eliminating redundant and ineffective compiling passes. The
second technique, Web-Specific Lite Kernel Optimization Space Design, reduces
kernel tuning costs by focusing on web programming requirements and efficient
hardware resource utilization, limiting the optimization space to only dozens.
nn-JIT.web is evaluated for modern transformer models on a range of client
devices, including the mainstream CPUs and GPUs from ARM, Intel, AMD and
Nvidia. Results show that nn-JIT.web can achieve up to 8.2x faster within 30
seconds compared to the baselines across various models
Robust Image Recognition Based on a New Supervised Kernel Subspace Learning Method
Fecha de lectura de Tesis Doctoral: 13 de septiembre 2019Image recognition is a term for computer technologies that can recognize certain people, objects or other targeted subjects through the use of algorithms and machine learning concepts. Face recognition is one of the most popular techniques to achieve the goal of figuring out the identity of a person. This study has been conducted to develop a new non-linear subspace learning method named “supervised kernel locality-based discriminant neighborhood embedding,” which performs data classification by learning an optimum embedded subspace from a principal high dimensional space. In this approach, not only is a nonlinear and complex variation of face images effectively represented using nonlinear kernel mapping, but local structure information of data from the same class and discriminant information from distinct classes are also simultaneously preserved to further improve final classification performance. Moreover, to evaluate the robustness of the proposed method, it was compared with several well-known pattern recognition methods through comprehensive experiments with six publicly accessible datasets. In this research, we particularly focus on face recognition however, two other types of databases rather than face databases are also applied to well investigate the implementation of our algorithm. Experimental results reveal that our method consistently outperforms its competitors across a wide range of dimensionality on all the datasets. SKLDNE method has reached 100 percent of recognition rate for Tn=17 on the Sheffield, 9 on the Yale, 8 on the ORL, 7 on the Finger vein and 11on the Finger Knuckle respectively, while the results are much lower for other methods. This demonstrates the robustness and effectiveness of the proposed method
- …