Abstract -This paper proposes an on-line system for feature extraction from ECG signal.
INTRODUCTION
Features from ECG signal are of fundamental importance for monitoring health status and diagnosis of many diseases. Detection of QRS complexes is the first step in determining heart rate (HR), heart rate variability (HR V) and detecting cardiac arrhythmias (CA). Many algorithms for QRS detection exist and several of them employ Wavelet Transform (WT), which demonstrated good performances in analyzing non stationary signals, like ECG [1] . Very often, the algorithms for feature extraction from ECG, including QRS detection, are implemented off-line using dedicated software running on personal computer.
On the other hand, the field of telemedicine and wearable health care systems are among the fastest growing areas, where autonomous, miniature, low-cost, ultra-low-power devices, usually, based on a single chip, play main role. In addition to digitalization, data storage and communication, these chips need to perform complex signal processing, all in real-time. It is not a trivial task considering the limitations in arithmetic power, memory resources, consumption budget, etc.
Thus, there is an essential need to optimize different signal processing algorithms, including those for ECG processing, for their on-chip implementation. Some trials are presented in [2] , [3] and [4] , which use Digital Signal Processors (DSPs), Microcontrollers (MCs) or Field Programmable Gate Arrays (FPGAs). Among them, FPGAs have many advantages including low price, inherent parallelism, design and testing, flexibility and feasibility to transform to Application Specific Integrated Circuit (ASIC) design. Comparing to the implementations in DSPs or MCs, the FPGA design can work with much higher throughput, can be integrated with other modules on the same chip and has more processing power for further development of the system. This paper presents a trial to compromise good characteristics of FPGA and wavelets through the development and implementation of on-line single-chip system for QRS detection and calculation of RR intervals (RRI) and HR. First, ECG signal is decomposed by Discrete Wavelet Transform (DWT) optimized in [3] . Then, QRS complexes are detected by technique based on modulus maxima and adaptive thresholding. Last, RRI and HR are calculated by counter circuits and sent out in a user defmed format.
In the following text, the theoretical background, the system architecture and preliminary testing results are presented.
IT. DWT TRANSFORM AND QRS DETECTION
In practice, DWT is computed by passing the signal through a Low-Pass (Ld) and a High-Pass (Hd) filters successively according to the Mallat's decomposition scheme shown in Fig. 1 Haar wavelet is considered to be the simplest one where filters are of two elements wide.
Haar transform (HT) has a number of advantages; it is (i) conceptually simple, (ii) fast, (iii) memory efficient, since it can be calculated in a place without a temporary array. Also, it is reversible without the edge effect that can be a problem with O,(n) some other WTs. But, this transform has several limitations, mainly in signal compression and noise removal from faster signals, which is not an issue in case of ECG. Like any other WT, the HT can be generalized to an integer to integer version. One technique is proposed in [3] , where:
and L J denotes rounding operator. As seen, the approximation and detail coefficients can be computed by simple arithmetic units like adderlsubtractor and shifter, without multiplier, because lxl2 J=x» 1, where "»" is the right shift operator.
DWT is capable of distinguishing the QRS-complexes within ECG signal. The Dln) coefficients across the scales show that the peak of the QRS complex, position of R wave, corresponds to the zero crossing (zc) between two modulus maxima within Dln). Fig. 2 illustrates the decomposition of real discrete ECG signal X(n) up to the 4 th level, Dln), D2(n), D3(n) and Din). For each decomposition level, the QRS complex produces two modulus maxima (min and max) with opposite signs, with a zc between. Thus, the problem of QRS detection translates to the problem of finding the occurrence of local min, zc and local max points. Energy of QRS complex mainly belongs to Din) and D�(n), while higher levels contain components produced by motion artifacts and baseline wandering. It means that three levels of decomposition, DJ, D2 and D3, are enough for QRS detection that is used in this project. 
FPGA IMPLEMENTATION
A. System architecture Block diagram of the overall system is given in Fig. 3 . It consists of several modules and uses 50 MHz clock rate. Budva, Montenegro of their occurrence in true sequence, the Pulse _i is generated. Condition for successful detection of QRS complex is appearance of at least one Pulse _i at the inputs of OR gate. Afterwards, the Final yulse generator produces a Pulse Jar _QRS which is sent out of chip. Simultaneously, the Shortyulse, which represents the nsmg edge of Pulse Jar _QRS, is forwarded to the module RR _interval and HR calculator, the outputs of which are 7-segment display bits (for HR) and serial communication signal (RRT). Fig. 4 .
B. Local min, max and zero crossing
Each module Local min, Local max and Zero-cross. consists of 5 sub-modules: LocaC min. detector; LocaC max. detector; Zero_cross. detector; State machine for QRS recognition and Pulse generator,
Clock Reset
Local_min. The architecture of Local min. detector and Local max. The positive and negative thresholds are adapted by: [[(REG7 < REG6) and (REG6 > TpJ and (REGS < REG6) then LocaLmax = REG6 (6) [[(REG7 > REG6) and (REG6 < Tni) and (REGS> REG6) then LocaLmin = REG6 C. QRS recognition QRS recognition is carried out using a state machine whose combinational inputs are Min_detected, ZC _detected and Max_detected. State machine has four states, Fig. 8 , and the initial state is state of initial adaptation of the QRS detector, after powering-up. The initial state lasts for 5 seconds and it serves for initial suiting of the adaptive thresholds, because of loading of registers REG1, REG2, REG3 and REG4. In the process of adaptation the quest for the QRS complex is not done, and the output signals QRS _detected and Enable_measuring are at low. After initial adaptation, the signal Enable_measuring becomes high and process of QRS detection begin. From the initial adaptation state, a state machine goes to the state in which the local minimum of D; is expected, state-Local minimum search.
Di
When high level appears to the input of the state machine that is connected to the circuit for local minimum detection, the machine switches to state for searching zero-crossing, state Zero crossing search, and after crossing zero, the state machine switches to the search for a local maximum of D;, state-Local maximum search. Detection of QRS complex is 2n d Mediterranean Conference on Embedded Computing ,,/" MECD -2013 Budva, Montenegro successful if local max is detected after local min and zc. Then QRS _detected synchronized output is generated.
QRS _detected from the state machine is fed to the Pulse Jjenerator, which generates Pulse _i of 30 ms duration. Pulse _1, Pulse j and Pulse _3 are passing through the OR gate creating OR Jjate yulse which is in fact the overlapped summation. ORJjateyulse is fed to the Finalyulse generator which produces Pulse Jar _QRS of 20 ms duration. Pulse Jar _QRS happens only if between two sequent OR Jjate yulses passed 200 ms that is considered as down physiological limit for RR interval. Simultaneously with Pulse Jar _QRS this circuit generates Short yulse of one clock duration, which is forwarded to RRT and HR calculators. 
D. Calculating RRI and HR
The circuit for RRI and HR calculation consists of four counters (Cl, C2, C3 and C4), two latch registers and one shifer, Fig. 9 . It starts after initial adaptation, when signal Enable_measuring is high. Output RR _int[I 0" OJ gives binary equivalent of RR intervals in ms, while HR[S" OJ gives beats per min. C2 is clocked each 1 ms using overflow (of) of Cl, and latched by Shortyulse. C4 is clocked by Shortyulse and latched each 15 s using C3 's overflow. The final HR in beats per minute is obtained by left shifting "«2", that is in fact "x4", 60s= 15x4s. 
IV. TESTING AND RESULTS
Designed system was implemented on Cyclone 11 FPGA chip EP2C70F896C6N [6] . All modules, functions and routines are implemented in VHDL. Prototyping and testing was performed on-line using Altera's DE2-70 development board. TLC2543 AID converter from Tl is employed for digitizing ECG signals. RR intervals were transferred out via serial port, while HR values are observed by 7-segment displays.
The records from the MTT-BTH Arrhythmia Database [7] were used to test the system. These records represent different heart rhythms, and contain variety of waveforms and artifacts that ECG feature extractor might encounter in everyday clinical practice. The signals are applied directly to the designed system, without prior filtering for noise removal or stabilization in term of base line.
Only feature extractor occupies 1459 LEs of FPGA chip. Together with AID controller, RS232 serial communication module and 7-segment display module, it is 1838 LEs, that is 2.68% of EP2C70F896C6N chip capacity. Fig. 10 shows the oscilographs of ECG signals and digital pulses at output of the QRS detector, at the pin of the FPGA chip, Pulse Jar _QRS. Fig. 10 (b) shows the result of the detection for signal affected by noise and baseline wandering. As seen, the designed system has good characteristics in noise environment. There is some delay in the detection of QRS complexes, because of the algorithm, Fig. 11 . After the detection of local minimum and zero-crossing, the system should detect the local maximum and then to decide. The result of QRS detection at three levels of decomposition is shown in Fig. 11 (a) , Pulse_1, Pulsej and Pulse_3, while Fig. 11 (b) shows PulseJor _QRS. The delay in the detection of QRS complex for ECG signal in Fig. 11 (b) is about 16 ms. The results of the detection of QRS complexes in 10 half hour recordings from the MLT-BIH Arrhythmia database are summarized in Table I . Test of QRS detector lasted for five hours, Testing was conducted on a total of TB=22031 beats, where there were FP=1 05 false detections and FN=424 missed detections, Using the formula ACC[%]= 1 OO(I-(FP+FN)/TB), for every of the records, the average accuracy of the QRS detector is ACCav=97,53%, It may be noted that the detection results vary for different signals. 
V. CONCLUSION
A system and methodology for on-line QRS detection and RR interval and heart rate calculation are presented. All tasks are implemented in single FPGA chip. The system is imune on various noise, achieving accuracy of more than 97% behind low occupation of silicium resources. As such, system is ideal for embeded systems or wearable health care devices.
ACKNOWLEDGMENT
The work is supported by National project of Ministry of Science of Montenegro, "Development and implementation of embedded systems for medical applications", MESI, and EU TEMPUS project "Studies in Bioengineering and Medical Informatics", BioEMIS. The authors are thankfull for great support.
