Introduction
Human is much superior to traditional VLSI processors in cognitive functions since brains can self-learn from samples. Mimicking such self-learning process, several VLSI implementations have been explored for pattern recognition based on the Support Vector Machine (SVM) algorithm [1] , which was originally developed for software programs. The SVM using a Gaussian function (GF) kernel is one of the most powerful classifiers [2] , but GF is very expensive computation in silicon. Therefore, analog circuits were developed to generate GF with high speed and compact area. An analog fully-parallel SVM with very high training speed was proposed in [3] . However, this approach requires a large chip area for the array of GF circuits to carry out kernel computation. Thus, the results were only shown by circuit simulation for four sample vectors of only two dimensions. Another row-parallel structure [4] reduced the chip size greatly but made training slower since it requires multi-iterations of sample-serial learning (the chip was built for only two dimension vectors). Furthermore, in both approaches above, it is very difficult to increase the vector dimension because the error increases multiplicably when the system is extended to high dimension vectors.
A Gaussian function generation circuit extendable to higher dimensions has been developed in this work to build a new type of compact fully-parallel SVM system for recognition of 64-D image vectors. Learning process is accomplished autonomously in a self feed-back configuration. Therefore, no iterative operation is required. In addition, the chip size is much smaller than the approach in [3] . The chip was designed in a 0.18um CMOS process and it is now under fabrication.
System structure .1 SVM algorithm applied in this work
SVM is a binary classifier and the classifying task is calculating the following function for an input vector X :
Where, is the i-th training sample with class label of
b is a constant bias (which is set to 0 since it has a negligible effect on the performance of SVM with Gaussian kernel [5] ). The self-training process is to determine  -values by backward propagation. In this work, the gradient-descent algorithm was chosen using the following updating rule [4] :
where C is the regularization parameter.
System organization
Assuming the total number of training samples N, the system organization is illustrated in Fig. 1(a) along with the proof-of-concept chip layout in Fig. 1(b) . N sets of Euclidean distance calculation circuits were constructed in blockⅠto compute the distances between the vector i X and all other samples in parallel. Each cell in blockⅡ contains a capacitor (as an analog memory) and an exponential generation circuit. The Euclidean distance values are stored in arrayⅡrow by row as voltages. As a result, a fully-parallel array of Gaussian kernels has been implemented in such a small area even for high dimensions (64 dimensions of sample vectors of the presented work). During the training process, the  -values in block Ⅲare fed-back to blockⅡand the learning process proceeds in a fully-parallel manner. Therefore, the training process is accomplished autonomously. This is far faster than the clock-based sample-serial iteration approach in [4] . The circuits of  adjuster are current mirror based adders/subtracters. E x t e n d e d A b s t r a c t s o f t h e 2 0 1 1 I n t e r n a t i o n a l C o n f e r e n c e o n S o l i d S t a t e D e v i c e s a n d M a t e r i a l s , N a g o y a , 2 0 1 1 , p p 1 7 4 -1 7 5 P -5 -8 
Simulation results
The performance-robustness against fabrication process variations and the programmability of our proposed Gaussian circuit is verified by HSPICE simulation as shown in Fig. 3 . We randomly set the parameters (mainly threshold voltages) for all of the devices with variations of  5%; select five elements from 64 dimensions randomly and sweep them independently. According to the simulation results in Fig. 3 , the peak-height values of Gaussian feature can be scaled linearly by changing t I with acceptable fluctuations. The COIL-20 (Columbia Object Image Library) database was used to verify training/classifying performances. The images from database were pre-processed and converted into 64-D feature vectors by the PPED method [6] (Projected Principle Edge Distribution). We selected eight images from the "obj16" category of COIL-20 as training samples of class "a"; other eight images from "obj10" category as samples of class "b". After the training process, eight testing samples are used to verify classification correction. The signal in lower window of Fig. 4 represents the classification results in binary. According to the Nanosim simulation results, our proposed system recognized and classified all of eight testing images correctly.
Conclusions
An analog high-dimensional Gaussian function generation circuit has been developed in this work, which is robust against process variations. Based on this GF circuit, we built a fully-parallel self-training SVM system with high training speed and compact chip size. According to the simulation results, our proposed SVM system recognized and classified all of test images from an actual database correctly. A mm 5 . 2 5 . 2  VLSI chip for this system is under fabrication and the measurement results would be presented at the time of the conference.
