Introduction
Image filtering is an important preliminary stage in many image processing algorithms. The computational image sensor is a promising solution since more transistors can be implemented for each pixel in advanced semiconductor fabrication technologies. For image filtering sensor, the programmability of the filter is a very important factor when adapting such an image sensor to various applications. However, many general purpose image sensors are limited to 3×3 kernel filtering scheme, and the kernel size can only be expanded by repeating these 3×3 kernel operations [1] [2] .
Among various kinds of filters, the Difference of Gaussian (DoG) filter is an indispensable processing employed in the well-known feature description algorithm named Scale-Invariant Feature Transform (SIFT) [3] . Moreover, it is well known in physiology that human retina can be seen as performing a retinal filter kernel [4] , and that the retinal filter kernel has a center-surround organization, where its general model is a DoG. Therefore, DoG filter is important not only in general image processing but also in bio-inspired circuits. To implement the DoG filter on the image sensor faces many challenges. One serious problem is the device variation, because it is rooted in the fundamentals of nature. Image sensors capable of compensating such process-variation-induced noises utilizing the floating-gate memory effect were reported [5] . However, this method needs large power supply and takes a few seconds during the process-variation cancelation, which severely limit the usage of this architecture. In this paper, we present an analog CMOS image sensor architecture equipped with fully programmable Gaussian image smoothing filters and variable scale spatial DoG filters. They are both continuous functions and their kernel size is programmable according to the application need. In the design of a proof-of-concept chip, kernel size was made programmable from 3×3 up to 25×25. Fabrication-process-variations have been alleviated by employing a switched floating-gate MOS (SFMOS) circuitry [6] . The circuit simulation results obtained from a 25×25 pixel array and measurement results of the pixel circuit verified this concept.
Architecture
The organization of the image sensor is shown in Fig. 1 . The core of the system is an array of pixel circuits. The X 2 generator and Y 2 generator produce voltages proportional to X 2 and Y 2 . The shift registers are used for kernel scanning in X and Y directions in combination with the switch array. The region of filtering is also defined by the switch array which is controlled by the data in the shift register. By changing the length of (1, 1, ... , 1) data string, the kernel size can be arbitrarily altered. Since DoG profile is produced by subtracting a wide and short Gaussian function from a slender and tall Gaussian function, realizing programmable Gaussian function is quite essential. Fig. 2 (a) shows the pixel circuit having an exponential weighting function. A two-input SFMOS (νMOS) M 0 is used to memorize the photodiode (PD) current I pixel . This circuit can cancel the process-variation error and, at the same time, multiply an exponential factor . The circuit operates in two steps. In the first step, S 1 . At this moment, both S 3 and S 4 are connected to V DD . Therefore, ϕ F is determined as
A. Exponential Weighting Circuit
where C T = 2C + C O and Q F is the charge on C O . Assuming C O << C, Eq. (1) can be rewritten as
 
In the second step, S 1 is opened and S 3 and S 4 are connected to V x and V y , respectively. Then the current flowing in M 0 changes to I Gauss. , which is given by
(4) Dividing Eq. (4) by Eq. (3), we obtain
It should be noted that I S (I S ) , which mainly causes FPN, has disappeared in Eq. (5). Fig. 1 is an array of square-divider circuits. Fig. 2(b) shows a square-divider circuit, which is composed of a squaring circuit based on the translinear principle (M 1 , M 2 , M 3 , M 4 ) and a PMOS M 5 operating in the linear region.
where, R represents the equivalent resistance of the PMOS. The same circuits are used in Y 2 generator. Therefore, by setting I in = I x in X 2 generator and I in = I y in Y 2 generator, the current flowing in the pixel circuit locating at (x, y) is obtained from Eq. (5) as
Here A = . The Gaussian kernel profile can be arbitrarily programmed by changing I σ and I α as parameters. A DoG filter is produced as the subtraction of two Gaussians of different shapes. Subtractors in Fig. 1 Fig. 3(a) shows the layout of the proof-of-concept chip and Monte Carlo simulation (V TH fluctuates within the ±0.02V range) of Gaussian profile circuit without SFMOS (b) and with SFMOS (c), corresponding relative error is 20.27% and 2.1% respectively. Fig.4 shows the measured waveforms from Gaussian generator which verified the width of Gaussian profile is programmable. Two Gaussian kernel profiles and DoG profile obtained by HSPICE simulation are also shown in this figure as (b), (c), and (d). In Fig. 5 , the HSPICE simulation results of 5×5 and 7×7 DoG filtering are illustrated as 2D maps (Darker color means a smaller value). Horizontal stripe lines are produced by the loss of changes in M0 (Fig. 2(a) ) due to transistor leakage during the kernel scanning. It shows the effect of parallel processing of every five and seven rows for 5×5 and 7×7 kernels, respectively. Characteristics of DoG filters are well observed. Comparison with related work in chip specifications and functional features are presented in Table I . 
Simulation and Measurement results

