The stereo matching is one of the most widely used algorithms in real-time image processing applications such as positioning systems for mobile robots, three-dimensional building mapping and both recognition, detection and three-dimensional reconstruction of objects. In order to improve the runtime, stereo matching algorithms often have been implemented in dedicated hardware such as FPGA or GPU devices. In this article an FPGA stereo matching unit based on fuzzy logic is described. The proposed method consists of three stages: first, three similarity parameters inherent to each pixel contained in the input stereo pair are determined; later, these parameters are submitted to a fuzzy inference system that determines a value of fuzzy-similarity; finally, the disparity value is determined as the index for the maximum value of the fuzzy-similarity values (zero up to d max ). Dense disparity maps are computed at a rate of 76 frames per second for input stereo pairs of 1280x1024 pixel resolution and a maximum expected disparity equal to 15. The developed FPGA architecture provides reduction of the hardware resource demand; up to 67,384, minimum 9,788 for logic units, up to 35,475, minimum 11,766 for bits of memory. Increases the processing speed; up to 78,725,120, minimum 14,417,920 pixels per second and outperforms the accuracy level of most of real-time stereo matching algorithms reported in the literature.
Introduction
Depht values of the points contained in an image is one of the most used tasks of the computer vision systems and has been used in several applications such as positioning systems for mobile robots and both recognition, detection and threedimensional reconstruction of objects [1] [2] [3] [4] [5] [6] [7] . In stereo vision technique the correspondence between stereo pairs and the geometrical configuration of the stereo camera allows to obtain images of depth called disparity maps. In order to determine a disparity map it is necessary to measure the similarity of the points contained in the stereo pair. Techniques to determine these similarities are divided in two categories: area-based algorithms [8, 9] and feature-based algorithms [10, 11] .
Area-based algorithm use the color value of the surrounding pixels to the interest pixel and produce dense disparity maps, i. e., these compute the disparity value for each pixel in the input stereo pair. The main characteristic of area-based algorithms is mathematical simplicity and low runtime. On the other hand, feature-based algorithms are based on specific interest points and are more stable against changes of contrast, enviroment conditions and illumination due to these represent the geometric properties of the input stereo pair. However, these algorithms possess high runtime and high mathematical complexity.
Related works
FPGA devices allow the implementation of dedicated computational architectures that can accelerate algorithms. In previous work there has been several FPGA stereo matching architectures reported in the literature [12, 13] . However, in most of these FPGA architectures the authors have been concentrated in the accuracy level and not in the performance regarding to hardware resource consumption, which is an important parameter in mobile autonomous applications in which the use of small FPGA devices and small power sources are required.
FPGA implementations
The system presented in [14] consists in a 4×4 array of FPGAs connected in mesh type configuration, authors use a maximum total of near 35,000 LUT of 4 inputs, allowing to process 40 frames per second for images of 320×240 pixel resolution. In [15] , a structure based on four FPGAs Virtex 2000E of Xilinx is presented, obtaining dense disparity maps at a speed of 40 frames per second for images of 256×360 pixel resolution. In [16] , the use of a single FPGA is proposed, the developed system processes images at 30 frames per second using images of 640×480 pixel resolution. The architecture developed in [17] , uses a technique based on SAD to calculate the optical flow efficiently, the system generates dense disparity maps at speeds superiors to 800 frames per second for images of 320×240 pixels using a correlation window of 7×7 and a maximum expected disparity equal to 121.
A modification of SAD is shown in [18] , the authors of this work synthesize diverse versions of SAD to determine the needs and the performance of the hardaware resources, by decomposing the correlation window of SAD in rows and columns using buffers a saving of resource of around 50% is reached. Using different forms of windows, the high consumptions of memory decreases without any detriment of the quality. Disparity maps are calculated at speed of 122 frames per second for images of 320×240 pixels and a maximum expected disparity equal to 64.
The architecture in [19] uses four FPGAs to conduct a rectification in real-time, later, a verification of left-right consistency was applied in order to improve the quality of the produced disparity map. Speeds of 30 frames per second are reached for images of 640×480 pixel resolution and a maximum expected disparity equal to 128. In [9] an FPGA correlation-edge distance approach is proposed. Speeds of 76 frames per second are reached for images of 1280×1024 pixel resolution and a maximum expected disparity equal to 15. By using a geometric feature, the euclidean distance between the selected point and the nearest left edge, the developed FPGA architecture provides a improvement over others conventional correlation-based stereo matching algorithms.
Fuzzy logic approaches
In [20] , similarity measure for stereo matching based on fuzzy relations is proposed. The strength of relationship of fuzzy data of two windows in the left image and the right image is determined by applying fuzzy aggregation operators. However, these measures fail to establish correspondence of the occluded pixels. In order to outperform the performance in the occluded pixels an modification of the weighted normalized cross correlation (WNCC) algorithm based on fuzzy relations of fuzzy data is used for the stereo matching process.
Authors of [21] present an modification of the Zitnick and Kanade stereo matching algorithm [22] . The authors propose the use of a balanced correlation window. In addition, the use of a new fuzzy factor in the calculation of initial matching values which expresses the possibility of matching between two pixels and a new iterative function for the refinement of the initial matching values are presented. Experimental results are evaluated on synthetic and real images and a comparison of the results regarding to other algorithms reported in the literature, using the ground truth data supplied by the University of Tsukuba, is presented.
Finally, in [23] an FPGA module suitable for real-time disparity map computation is presented. This enables a hardwarebased fuzzy inference system parallel-pipelined design, for the overall module, implemented on a single FPGA device with a operating frequency of 138 MHz. This provides disparity map computation at a rate of nearly 440 frames per second, for a input stereo pair with a disparity range of 80 pixels and 640×480 pixel resolution. The proposed method allows high speed processing, enabling a suitable module for real-time stereo vision applications.
The proposed method
Stereo matching algorithms suitable for real-time processing have been studied by several authors, section 2.1. However, most of these algorithms possess high mathematical complexity; therefore, large FPGA devices are required in order implement them. On the other hand, although some approaches based on fuzzy logic have been reported in the literature section 2.2, the algorithms described in [20, 22] presented high mathematical complexity and do not allow to be implemented in dedicated hardware for real-time processing such as FPGA devices. In case of the FPGA architecture in [23] possesses high accuracy and high speed processing. However, the developed module maintains high hardware resource consumption.
The main objective in this research is to develop an FPGA stereo matching unit suitable for real-time processing. The disparity maps are computed at a rate of 76 frames per second for input stereo pairs of 1280×1024 pixel resolution and a maximum expected disparity (d max ) equal to 15. In order to reach high accuracy level the use of multiple similarity parameters and a fuzzy approach is proposed. Furthermore, to obtain high speed processing, the proposed method was implemented in an FPGA device. On the other hand, in order to maintain low hardware resources consumption it is proposed the use of new strategies such as: a fuzzy inference system based on point-slope equations and the use of storage vectors. Finally, to obtain an appropriate configuration regarding to the environment setting of the input stereo pair, the developed FPGA architecture allows to set up the exposition of the cameras and allows scalability for different levels of maximum disparity of simple and systematic form. This allows that the resulting hardware can be applied to a wide range of applications of real-time stereo vision.
A general block diagram of the proposed method is shown in the Figure 1 , the data obtained through a stereo camera is processed to the cadence of video and it is simultaneously computed three similarity parameters inherent to each pixel. 1 -correlation obtained via the Sum of absolute differences (SAD) algorithm [24] [25] [26] [27] . 2 -similarity of the distance in pixels to the closer left edge. 3 -similarity of the distance in pixels the closer right edge. Later, these three parameters are submited to a fuzzy inference system which assigns fuzzy-similarity values for the zero up to d max disparity levels. Finally, the value of disparity is assigned as the index that maximizes the fuzzy-similarity value. 
Correlation computation
In majority of area-based algorithms, a rectangular vicinity centered on a reference pixel in one of the images from a stereo pair is compared with similar vicinities for some pixels in the same raster line of the other image. Vicinities are called correlation windows and can be compared using a correlation-based measure such as SAD, equation 1; where 2w + 1 is the size of the correlation window centered in the pixel located in the (x, y) position. I l , I r are the grayscale values of the pixels in the images left and right respectively and z takes values from 0 up to d max .
Crl(x, y, z) = u=w,v=w u=−w,v=−w
Similarity of the edge distances
The euclidean distance between each pixel with coordenates I l (x, y 1 ) or I r (x, y 1 ) and the closer left edge (d 1 ), considering images of X × Y resolution and a maximum level of expected disparity equal to d max is computed via equations 2 -3; where β represents the value of threshold to determine an edge. δ is defined as l or r for the left or right images, respectively. z ranges from 0 up to d max . y 1 ranges from 1 up to Y − d max and g 1 is defined by equation 4.
In Figure 2 the left image of the Venus scene is shown. On the other hand, Figure 3 shown the distance to the closer left edge for all the pixels that integrate the scene. The darker values represent small distances whilst the brighter values represent great distances. The euclidean distance between each pixel with coordenates I l (x, y 1 ) or I r (x, y 1 ) and the closer right edge (d 2 ), considering images of X × Y resolution and a maximum level of expected disparity equal to d max is computed via equations 5 -6; where β represents the value of threshold to determine an edge. δ is defined as l or r for the left or right images, respectively. z ranges from 0 up to d max . y 1 ranges from 1 up to Y − d max and g 1 is defined by equation 7. In Figure 4 the distance to the closer right edge for all the pixels that integrate the Venus scene is shown. In order to determine the similarity between the distances to the edges d l , d r of the left and right images, I l and I r , it is proposed the use of the equation 8; where k is defined as {l, r} and z ranged from 0 up to d max .
Fuzzy Inference System
The three similarity parameters, correlation (Crl) and the edge distances similarities (D l , D r ), ineherent to each (x, y) pixel for each z level of disparity, are submited to a fuzzy inference system that determines a fuzzy-similarity value. In the proposed fuzzy inference system, input membership functions for Crl, D 1 and D 2 are used. These functions determine the degree of truth to the fuzzy sets good , medium and bad, that represent the degree of similarity between the pixels of the left and right images, being good a high value of similarity, bad a low value of similarity and medium an intermediate value between high and low similarity.
Due to the linearity of the input data it is proposed to use triangular type functions with binary based slopes. Functions of this type allow to simplify the products and quotients to an shift operation over registers. This will be useful when implementing the proposed method in FPGA devices, this allow to reduce the hardware resources demand and maintaining low mathematical complexity. The input membership functions of for the good, medium and bad sets were defined as shown in equations 9 -11; where α, β and γ are equal to 0, 128 and 255 respectively. h corresponds to the similarity parameters values.
In the proposed method the output with the larger numerical value regarding to the diferent z disparity levels. In order to assign output values three fuzzy output sets are proposed good, medium and bad. Where good represents the major numerical output values, descending to the values pertaining to the medium set and finally the values from the bad set that represents the minimum numerical values that can define the output. The degrees of truth to these sets are determined via the functions ψ good , ψ medium and ψ bad , equations 12 -14; where α, β, γ, ρ and σ are equal to 0, 128, 255, 96 and 160 respectively.
To relating the input sets to the output sets, the use of rules IF-THEN type and the AND operator of Zadet is proposed [28] . The design rules are shown in the Table 1 . In order to use a fuzzy inference process with low hardware resource consumption, it is proposed the following fuzzy inference mechanism. First, the precedent of each rule is computed replacing the numerical values of the Crl, D 1 and D 2 inputs in equations 9-11; where ε n (x, y) is the precedent of the nth rule and µ 1 (x, y), µ 2 (x, y) are both fuzzy sets included in each rule (Table 1). On the other hand, the consequent to each rule is determined as shown in equation 16; where ϑ n (x, y) is the consequent of the nth rule. inp is the input value for the corresponding similarity parameter and f s is an point-slope type equation that describes a linear segment of the output membership functions, equations 12-14. In the column f r of the Table 1 the assignation of functions f s for each rule is shown. Finally, the result of the fuzzy inference is determined using the Mean-Max method, equation 20; where ϑ m1 (x, y), ϑ m2 (x, y) are the consequence of the two rules with higher implication in the fuzzy inference process while the corresponding disparity value is determined via the equation 21.
Output mapping
In order to turn the disparity values defined by values from 0 up to (d max ) to RGB values with 10 bits of depth, it is proposed to applying the equation 22; where d max is the maximum expected disparity value ∀ η {1, 2, 3}. disparity mapped (x, y, η) = disparity(x, y) * 1023/d max (22) 
Computational complexity
In order to explain the computational complexity of the proposed algorithm, first, the SAD computational complexity is analyzed. In this case the computational complexity is defined as following:
where M is the size of the input stereo pair. S is the size of the correlation window. D is the maximum expected disparity and d is the increment regarding to the disparity values. Like SAD, the proposed algorithm possesses a computational complexity defined in the same terms, Table 3 
Based on the high efficient of to the SAD algorithm and considering that his computational complexity is equal to the complexity of the proposed algorithm. It is possible to affirm the high efficient of the proposed method.
On the other hand, although the proposed method possesses mathematical complexity similar to SAD, due to edges extraction, similarity computation stage and the fuzzy inference process, the runtime is increased. In Table 2 the runtime of the SAD algorithm considering different synthetic stereo pairs and different values for the maximum expected disparity compared with the proposed method runtime considering similar settings is presented. As can be seen, in all the cases an increment of the runtime is observed, near to 10 times. However, most of the new runtime corresponds to the fuzzy inference process. In MatLab this is an iterative process that computes in sequential form the degree of truth for all the input fuzzy sets. Then evaluates in sequential form all the fuzzy rules and assigns the corresponding antecedent and consequent values. Finally the fuzzy similarity value is computed. However, this process could be parallelized when the proposed method is implemented in dedicated hardware such as FPGA devices that enables to reach real-time processing. Furthermore, considering the proposed fuzzy inference method any FPGA architecture must maintain low hardware resources consumption. Finally, in Table 3 the detailed pseudocode for the proposed method is shown. As can be seen this consist into several recursive and sequential operations, such as edge extraction stage, edge similarity computation, fuzzy inference process and so on, characteristics that allow parallel-pipelined design suitable for FPGA devices. On the other hand, the developed architecture has one output, disparity [10:0], corresponding to disparity value for the selected pixel. The developed FPGA architecture was set to process input stereo pairs of 1200 × 1024 pixel resolution, and considering a maximum expected disparity equal to 15. Its general behavior can be described as following: first, the image acquisition modules capture stereo video streams. Then, the storage vector modules store gray scale values of pixels contained in 5 horizontal lines for both left and right images of input stereo pair. Later, the disparity value are computed via disparity computation module. Finally, the mapping convert the final disparity value to grayscale values of 10 bits of depth. In the following subsections the architecture of all the individual modules is shown in detail.
The image acquisition module
In order to acquire input stereo pairs, CCD sensors connected in a TRDB DC2 board is used. This board provides stereo pairs of 1280×1024 pixel resolution in RGB scale. In most of the FPGA stereo matching implementations reported in the literature, grayscale values with 8 bits of depth are used. However, the TRDB DC2 board provided images in RBG format with 10 bits of depth. Although the developed architecture could be configured to process 10 bits of depth, this increases the hardware resource consumption. In order to perform appropriate comparisons regarding to other methods, only the most significant 8 bits of the data provided by the TRDB DC2 board are used. On the other hand, in order to determine the grayscale value for the input images, the value of the green channel is used as grayscale value.
In Figure 6 the detailed architecture of the image acquisition module is shown. The settings module send instructions regarding to the exposition and operation mode to the TRDB DC2 board, P 
The storage vector module
In order to store necessary data for the disparity computation, the use of the storage vector module is proposed. This module consists into five logic vectors. The size of four vector is equal to horizontal resolution×bits per pixel−1 while one vector size is defined as d max × bits per pixel − 1.
Although the proposed module possesses similar behavior with respect to a shift register unit, this allows to read multiple data in one clock cycle. In general, when a image line begins, the gray scale value of the pixel with coordinate (1) is stored in index [7: 0] of one storage vector, in the following clock cycle, this value is moved to index [15:8] and the gray scale value of the pixel with coordinate (2) is stored in index [7:0] . A similar process is repeated for all the pixels that integrate the line. In Figure 7 , behavior of storage vector module with settings as follows: number of lines to process = 5, horizontal resolution = 1200. 
The disparity computation module
In Figure 8 the detailed architecture of the image acquisition module is shown.
An pixel-parallel window-parallel architecture was designed. Using the appropriate indexes for the storage vector is possible to process the image to the video cadence, giving as result a disparity map of (X − w)*(Y − w) pixels of resolution, where X, Y are the values of resolution of the input frames and 2w + 1 is the dimension of the correlation window used. In the following subsections, details regarding to the operation of the module disparity computation is shown.
The similarity parameter module
The similarity parameter module calculates the similarity parameters necessary to estimate the disparity value for one particular pixel, Crl(x, y, z), D l , D r . This module consists into three sub-modules implemented in parallel form. Furthermore, a 5×5 correlation window and maximum expected disparity equal to 15 are used. In order to compute the first parameter (Crl(x, y, z)); the first sub-modules performs the absolute difference between all pixels in the correlation window in parallel form. Then, all absolute difference values are summed.
The second sub-module computes the second similarity parameter D l . First, this sub-module applies the equations 3 and 4 and computes the k 1 value. The Figure 9 shows the position of the pixels used in this stage; by maintaining this position it is possible to compute the d l value at the video cadence. In addition, this figure shows a scenario in which the storage process for the five lines of the first correlation window have been completed; as can be seen the third vector correspond to the central line of the correlation. Due to shift behavior it is possible to compute the k 1 value at video cadence. After the k l value is computed , it is determines the d 1 value by assigning to β (equation 2) a constant value equal to 32. Finally, the similarity value between the edges of the left and right images is computed via equation 8. Finally, the third sub-module computes the third similarity parameter, D r . This sub-module possesses a similar behavior to the D l computation sub-module. First, it is applied the equations 6 and 7 and it is computed the k r value. After the k r value is computed , it is determines the d r value by assigning to β (equation 5) a constant value equal to 32. Finally, the similarity value between the edges of the left and right images is computed via equation 8. However, the storage vector module, Figure 7 only allows to determine the left edge distance values. In order to compute the right edge distance values it is proposed the use of the right direction vector module.
The right direction vector module
This module consist into three stages. First, it is determined when all the pixels from one line of the input frame are stored in the number 1 vector of the storage vector module, Figure 7. i.e., it is determined when the storage process for all the pixels of the line 1 or 2 or 3 and so on is completed. Then, in the second stage the values contained in the number 1 vector of the storage vector module are copied and placed in the k r vector, Figure 11-(a) . The k r vector possesses a similar behavior to the vectors of the storage vector module, however, the order in the data of the k r are inverted. Using the k r vector it is possible to obtain the D r value. With the purpose of reserving the logical elements (LEs) of the FPGA for the defuzzification stage, the equations 9-11 were evaluated with values of 0 up to 255 with increases equal to 1, the obtained values were integrated in a CASE structure. This structure assigns the degrees of truth to the fuzzy input sets good, medium and bad for the three similarity parameters. Later, in the second sub-module the 18 rules that integrate the proposed bank of rules are evaluated in parallel form, the antecedent and consequent regarding to each rule are calculated according to the equations 15-16. Then, the 18 calculated consequent enter to the max third module that determines the two rules with higher implication in the inference process. Finally, the fourth module computes the arithmetic mean of its two inputs, defuzzification process. i.e. computes the fuzzy-similatity value.
The max module
In order to reach an appropriate propagation of the processed data, the use of the max actor is proposed, Figure 10 
The mapping module
In order to display the calculated disparity maps a 4,3" LCD screen of 800x480 pixels of resolution of the terasIC brand was used. Due to the used screen operates with color values of 10 bits per pixel, the mapping module, Fig. 5 , turns the disparity values, in this case velues ranged from 0 to 15, to grayscale values with 10 bits per pixel, equation 22. In order to reduce the hardware resources requirements the equation 22 was evaluated with disparity values from 0 up to 15 with increases equal to 1, the obtained values were integrated in a CASE structure which assigns disparity values with 10 bits per pixel.
Discussion and analysis of results
The architecture presented in Section 3 was implemented using a top-down approach. All modules were codified in Verilog and were simulated using post-synthesis simulations performed in ModelSim-Altera 6.6c in order to verify its functionality. Quartus II Web Edition version 10.1SP1 was used for the synthesis and FPGA implementation process. An FPGA Cyclone II EP2C35F672C6 embedded in an Altera DE2 development board was used. The hardware resource consumption for the developed FPGA architecture is shown in Table 5 . In order to evaluate the performance of the proposed method, the developed architecture was analyzed using different values for the β parameter (equations 2, 5) and different sizes for the correlation windows. The tests were conducted using as test images the Tsukuba, Venus, Teddy and Cones scenes. β was evaluated using the values {8, 16, 32, 64, 128} while the window sizes used (2w + 1) were {3, 5, 7, 9, 11, 13, 15}. In the Figure 12 is observed the behavior of the error obtained in the disparity maps generated for different values of β considering 2w + 1 = 5. On the other hand, in Figure 13 the behavior of the error obtained in the disparity maps generated for different values of 2w + 1 considering β = 32 is shown. To determine the number of erroneous pixels, the corresponding disparity maps were submited in the website for evaluation of stereo systems of the Middlebury University [29] .
In the Table 6 it is presented quantitative results of the number of erroneous pixels obtained by the proposed method for a correlation window of 5x5 (2w + 1 = 5) and β = 32 compared with other methods reported in the Literature. Disparity maps have been compared using the method proposed in [30] using the following configuration of images: Tsukuba (384×288, d max = 15), Venus (434×383, d max = 19), Teddy (450×375, d max = 59),Cones (450×375, d max = 59). In order to process and to collect the data presented in the Table 6 , the developed architecture was scaled to operate with 64 levels of disparity. In Figure 14 the generated disparity maps for the proposed algorithm are shown. By analyzing Table 6 , due to use of geometrical features such as edge distances the proposed algorithm allows high accuracy level in homogeneous areas, therefore, the results of the proposed algorithm improved most of SAD, SSD or NCC-based algorithms reported in literature, [31-33, 36, 37] . In addition, similar to most of stereo matching algorithms in the literature, the proposed algorithm has high performance with small values of maximum disparity (Tsukuba, Venus scenes) while medium performance with large values of maximum disparity (Teddy, Cones scenes). On the other hand, the proposed algorithm performance is lower than the census based algorithms, [34, 35] , however, census-based algorithms involve high mathematical complexity and the hardware resources requirements are high. Table 7 presents comparisons of processing speed regarding to other real-time stereo matching algorithms reported in the literature. Due to the mathematical simplicity of the proposed algorithm, the developed architecture does not require complex arithmetical operations such as calculation of quotients and radicals (which require a high runtime), hence, it maintains high processing speed. When comparison of processing speed is conducted, Table 7 , it is observed an increase with respect to other algorithms implemented in FPGA devices up to 78,725,120, minimum 14,417,920 pixels per second. In all cases except for the algorithm presented in [33] ,the processing speed for the proposed method is higher. However, considering the clock specifications for the algorithm described in [33] , it is expected higher speed processing than the proposed algorithm. It is possible to affirm that if the same clock specifications are used, the proposed algorithm outperforms the speed processing for most of the FPGA-based stereo matching algorithms reported in the literature. Table 5 presents a comparison of the use of hardware resource regarding to other FPGA implementations in the literature. By analyzing the Figure 8 , the higher performance for the proposed algorithm can be reached using a 5×5 correlation window. It is the main characteristic and difference regarding to all others stereo matching algorithms in the literature whose performance is proportional to the correlation window size. In order to reach high performance, large correlation window sizes are required. However, the correlation window size required for the proposed method is small, therefore, it is possible to decrease the hardware resource consumption, see Table 8 . Finally, in Tables 9 and 10 present a comparison of the use of hardware resource regarding to all the synthesized and simulated configurations of the developed FPGA architecture. as can bee seen the hardware consumption for the developed FPGA architecture is appropriate for the majority of the low gamma of FPGA devices such as the Cyclone II family of Altera or Spartan II family of Xilinx. Low hardware resources requirements is useful in autonomous applications such as robotic applications, where the use of small FPGA devices that implies relatively few hardware resources is needed. On the other hand, due to high speed processing, low hardware resources consumption and high accuracy level compared with most of the FPGA implementations in the literature, any real-time application wich involve real-time stereo matching algorithms will take advantages by applying the proposed algorithm. 
Conclusions
In this article, an area-based algorithm suitable for real-time stereo matching using similarity measures and geometric features was presented. Geometric features such as edge sitantaces allow high accuracy level in homogeneous areas. Then, in order to relate the similarity measures with the geometrical features, the use of a fuzzy inference system is proposed. Furthermore, in order to maintain low mathematical complexity suitable for FPGA implementations, a fuzzy inference mechanism based in point-slope equations is proposed. The main difference and advantage for the proposed algorithm is the use of small correlation windows in order to reach high accuracy level. Small correlation windows allow higher speed processing and lower computational resources requirement in comparison with the large correlation windows used in most of the SAD-based stereo matching algorithms reported in the literature.
In order to improve its processing speed, the proposed algorithm was implemented in a FPGA device. The developed FPGA architecture outperforms most of the real-time stereo matching algorithms in the literature, allowing high accuracy level and enables both increasing the processing speed and decreases the hardware resources consumption.
Finally, an important characteristic of the presented architecture is the scalability permissible; all the modules in the developed FPGA architecture, allow to be adapted for processing larger correlation windows than the simulated and implemented correlation windows. On the other hand, the FPGA architecture enables to configure different levels of maximum expected disparity (d max ), consequently, it is possible to configure the module for the computation of disparity maps with appropriate values to the environmental characteristics of the input video streams. This allows that the developed architecture can be applied to a wide range of applications of real-time stereo vision such as positioning systems for mobile robots and recognition, detection and so on.
Referencias

