Abstract-The aim of this research is to implement sensorless vector control algorithms on a single, eventually reconfigurable, chip, with a computation timing constraint of, at most, 1-6 microseconds, and a concern for implementation cost. In this article, we discuss the implementation problems and tradeoffs involved in meeting these goals on Field-Programmable Gate Arrays (FPGAs). To be able to fit a complete induction motor vector controller on a single, inexpensive FPGA chip, we estimate the area/time requirements of each module involved in sensorless vector control. We discuss, in particular, the tradeoffs of implementing the key modules, the speed and flux observers and the Clarke and Park transformations. The speed and flux observers here under consideration are extended Kalman filter-based.
I. INTRODUCTION
Technological progress depends more and more on miniaturization and increasingly fast and powerful computing systems, and we witness a true revolution in microelectronics. In almost every field, significant progress now greatly depends on advances in nanotechnologies and their applications. The design of lighter, less cumbersome, and more economic applicationspecific computing processors becomes required in order to reach increasingly demanding performances.
The very fast evolution of CMOS integrated circuit fabrication technologies already makes it possible to design complete digital systems integrated on the same chip.
The reconfigurable chips known as FPGAs (FieldProgrammable Gate Arrays) currently on the market are manufactured at a level of integration of around 90 nanometers or less, and usually comprise several millions of gates on the same chip. They offer considerable advantages for accelerating the time to market, and reduce the development and production costs.
In field of electric motor control, with which we are concerned in this research, the real-time computing capacity of traditional approaches using PC computers and off-the-shelf digital signal processors (DSPs), is largely superseded. One must now turn to nanotechnologies in order to obtain adequate hardware acceleration. FPGAs, light and relatively inexpensive, are usually more powerful than traditional devices, and appear therefore ideal for implementing real-time control systems without involving the considerable costs traditionally related to the design and fabrication of application-specific integrated circuits (ASIC).
The semiconductor industry is trying to design digital signal controllers (DSCs) having a computing time of only a few microseconds for the precise and robust control of electric motors. It would actually be possible of increasing the current operating efficiency of electric motors, now in the order of approximately 40-60%, up to 90%.
The high cost and complexity of the required electronics always constituted a significant impediment to the implementation of complex algorithms within a dynamic of only a few microseconds. The DSC technology, traditionally composed of DSPs coupled to a microcontroller unit or a microprocessor, is presently reaching its physical limits, with a minimal computing time about 6 microseconds for the most minimalistic implementation of vector control using a speed sensor. The challenge to which we attack ourselves here is to realize on a single chip, without using a speed sensor, and in an even shorter lapse of time, the most sophisticated vector control algorithms, where the speed and the flux will not be measured, but estimated by hardware. We will see the complexity of these estimates in the following discussion.
II. INDUCTION MOTOR VECTOR CONTOL
The characteristics of the induction motor are basically nonlinear. Vector control, also called flux directed control, is the first method which makes it possible to artificially give a certain linearity to the torque control of the induction motor. Speed sensors are however necessited, in general, for the implementation of vector control. This does not pose any problem as long as the induction motor is used for regular motion control using a position or speed encoder, but whenever it is impossible to connect a sensor to the motor shaft, the implementation proves difficult. It is then necessary to carry out vector control without using a speed sensor, with all the difficulties that that poses.
Precise induction motor control requires the independent control of the components of its input current producing the field and the couple, as it is the case with the DC motor. The only theoretical solution which makes it possible to realize such an independence of control consists in breaking up the stator current into its components in a domain or a suitable frame of reference. By taking a synchronously revolving frame of reference, with the space vector of rotor flux as phasor of reference, it is mathematically possible to separate the stator current in two independent components i sd and i sq , respectively controlling the field and the torque of the motor. Decoupling between the effects of the components of the current then makes it possible to simplify the control of the mechanical variables of the drive, and to impose on the motor fast variations of the rotor flux at starting and at constant power.
A. Induction Motor Model in Park Domain
The electromechanical model of the induction motor in the Park reference frame (d, q), known as Park domain, synchronously revolving at speed ω, is stated as follows: 
B. System Block Diagram
In the following, we develop the mathematical expressions for the blocs in Fig. 1 . The starred ( * ) variables are the input references for the PI controllers. In general, denotes the error signal, and k p , k i , the PI controller parameters. The α and β subscripts denote the components of the corresponding variables in the stationnary (α, β) reference frame. The a, b, and c subscripts denote the components of the corresponding variables in the stationnary (a, b, c) reference frame. θ is the angular position of the rotor flux vector, with θ = ω dt.
a) Speed PI Controller:
c) Rotor Flux Estimator: 
When it is necessary to carry out vector control without using a speed sensor, the speed can be calculated from the values of the current and voltage of an AC motor. Openloop solutions give a certain speed estimate, but inherently bear a large error. For better results, it is necessary to design an estimator or a filter. The Kalman filter has a good dynamic behavior, a good resistance to perturbations, and it can function at stand still. Designing a filter for an AC motor remains however a very complex problem which requires the calculation of the motor model in real time. Moreover, it is necessary to calculate the filter equations, which normally implies several matrix multiplications as well as one matrix inversion. These requirements can nevertheless be satisfied by a high-performance computing processor. Fig. 2 shows the speed-sensorless induction motor vector control scheme including the speed and rotor flux estimator based on an extended kalman filter (EKF). The tradeoffs involved in the implementation of this estimator constitute the core of the present research and require in-depth analysis and careful considerations which will make the object of a subsequent study.
III. IMPLEMENTATION ANALYSIS
The basic IM vector control scheme (Fig.1 ) comprises 24 multiplications (including multiplication by a constant and squaring), 3 divisions, and only one square root operation. We further observe that the division and square root operators are localized in close mutual coupling inside the rotor flux and ω estimators. Because of the fact that cos θ = Ψ rα /Ψ r and sin θ = Ψ rβ /Ψ r , no actual sine or cosine computation is involved.
Most modern FPGAs embed several tens, even up to a couple hundreds, of small, ultra-fast, 18x18 VLSI multipliers, which can readily be used in signal processing applications. All multiplications involved in an implementation of the IM vector control scheme can efficiently be implemented using these embedded multipliers.
The division and square root operators, thanks to their small number and the modest operand width required, can efficiently be implemented by traditional hardware modules without being overcostly in area. An extensive comparative study of divider implementations on FPGAs, including all kinds of restoring, non-restoring, and SRT dividers has been presented in [1] . It is even possible to implement dividers based on the small embedded 18x18 multiplier blocks [3] , [4] .
The matrix operations involved in implementing the EKFbased estimator are outside the scope of the present article.
We analysed the system of Fig.1 using the Xilinx blockset in Simulink. This analysis revealed that the required internal precision would be of at least 32 bits. We will show in the next section the results of a straightforward FPGA implementation of this design using System Generator.
IV. AREA AND TIME ESTIMATES
We did a worst case analysis of the system implementation as a network of on-line modules following the methodology proposed in [2] . Table II shows our area cost estimates for a 16-bit on-line implementation. The arithmetic modules implementation data presented in Table I have been taken from [2] for the on-line (ol-XXX) modules and SRT-DIV, while NR-SQRT is our home implementation of a non-restoring parallelsequential square root. We see from Table III that the online delay on the bottleneck path of the system is about 116 clock cycles. One lap of the whole control loop therefore takes about 164 clock cycles to complete, taking into account the required conversions back and forth between standard binary and redundant number representations. Estimating that this online design could be clocked at 100 MHz on a Virtex-II FPGA, its total computation time would therefore be of around 1.6 microseconds.
For a crude, but rather convincing, comparision with a 32-bit parallel arithmetic implementation, the synthesis results of the VHDL code generated using System Generator, targetting a Virtex II xc2c2000-4 FPGA, are presented in Table IV . The dividers and square root operators used in this design are all sequential, based on the non-restoring algorithm. On lap of the complete control loop of an implementation of this design using the minimum possible number of registers takes 131 clock cycles at a maximum frequency of 50 MHz. The total computation time is therefore of about 2,5 microseconds. In addition, two variants of this design have been evaluated where each arithmetic module outputs are registered. The first, Variant A, uses sequential dividers and square root operators, while the second, Variant B, uses a 1-clock version of the same operators. Variant A can be clocked up to around 100 MHz and takes 120 clock cycles to complete one lap of the whole control loop. The maximum frequency of Variant B is about 8 MHz, and one lap takes 27 clock cycles. From this data, a simple calculation tells us that the computation time of Variant A is about 1.2 microseconds, while that of Variant B is about 3.4 microseconds.
V. CONCLUSION
For the basic IM vector control scheme using a speed sensor, our analyses and estimations showed that, thanks to the absence of actual sine and cosine functions computations, and to the localized, small number of dividers (3 of them) and square root operators (only 1) involved, an FPGA implementation in parallel arithmetic using the embedded multipliers present in modern FPGAs has the potential to outperform one in on-line arithmetic, while not being overcostly in area. Even at twice the number of bits of precision than used in the on-line design, the parallel designs compare advantageously with the on-line one, both in area cost and computation time.
On-line arithmetic might reveal superior characteristics, however, when implementing the sensorless, EKF-based, vector control scheme, because of the intrinsic complexity of the matrix operations involved and the consequently increased system size. This will make the object of a further study.
