# FPGA based Accelerators for ECG signal classifications using Convolutional Neural Network– A Brief Review

## Soumyashree Mangaraj School of Electronics Sciences Odisha University of Technology and Research, Bhubaneswar, Odisha

Abstract - Electrocardiogram (ECG) is the recorded form of electrical potential induced by cardiac impulse, measured through the electrodes placed on body surface on opposite sides of the heart. These signals are a vital feature to identify a healthy body. CNN uses a huge amount of data and large number of multiply and accumulate (MAC) operations. Graphical Processing Units (GPU) is proven to be an efficient solution for CNN implementations with huge training dataset but is a computationally expensive solution. In recent times, several Field Programmable Gate Array (FPGA) based solutions are accepted as a choice for CNN implementation. The implementation is likely to demand pipeline structure for synchronization and pooling purpose. Reconfigurable FPGA would allow use of multipliers rather than MAC and hence would provide a better solution for hardware realization. FPGA is certainly a well thought out solution that would additionally facilitate energy efficiency. For applications involving healthcare IoT, FPGA would provide the required flexibility and hence would support IoT device interfacing with outside world with low power, minimal latency and probably best determinism. A limited amount of work has been reported in literature that explores the usage of resource constrained FPGA acceleration for classifying ECG signals via CNN for detecting abnormalities in heart beats. In this investigation, an attempt is made to bring out the state of the art where FPGA acceleration is a potent solution for implementing CNN in identifying/classifying heart diseases and using in such healthcare IoT systems.

Keywords - Electrocardiogram (ECG) classification, FPGA, Convolutional Neural Networks, hardware accelerators, IoT

### I INTRODUCTION

In a cardiac cycle that begins with one heartbeat till the next one appears; an action potential is generated spontaneously in the sinus node that travels from the atria (primary pumps) into the ventricles while at the same time pumping blood in to a body's vascular system. Electrocardiogram (ECG) is the signal generated by heart in the form of electrical voltage signal that are recorded from the surface of a body by electrocardiograph. Cardiac arrest is the cessation of all electrical control signals in heart leading to abnormality of cardiac rhythmicity. In an adult abnormal Sinus Rhythms are classified as: "tachycardia", fast heart rate (greater than 100 beats/min) and "bradycardia", slow heart rate (lesser than 60 beats/min) [1].

Electrical signals during heart muscle depolarization triggered via each heartbeat can be measured through ECG. Generally, waveforms recorded through standard 12-electrodes are used in diagnostic centers for detecting abnormalities in the PQRST wave. In the research front, apart from heavy duty, costly biomedical devices, hardware prototypes such as the reported work [2, 3] for Internet of Things (IoT) platform has been highly cited. Field Programmable Gate Arrays (FPGA) has also been widely used as a resource constrained, low-power, parallel processing, higher throughput and low-cost prototype to detect ECG signals and their metrics. This has been used reportedly for features like R-peak heart rate detection, as accelerators for filtering and denoising signals via various algorithmic methodologies, among many others.

A surge in number of cardiovascular patients, lack of skilled medical fraternity and unavailability of proper resources has made the area of automatically detecting and classifying ECG signals of prominent significance, which has been reported since long [4]. Beginning from premature ventricular contractions essential for arrhythmias [5], a number of articles can be found in the literature. For detecting abnormalities via classifying ECG signals using CNN can be found in the literature aplenty as well [6]. But attempting to use FPGA, a resource constrained prototype, as accelerators for ECG signal classifications using CNN is scarcely available. Thus an attempt has been made in this paper to accumulate the resources available on state-of-the-art methodologies and is showcased for enthusiasts working in this domain.

The paper is organized in the following ways. Section II outlines the performance analysis of various FPGA prototypes processing the ECG signals available in the literature. Section III accumulates and diversifies the FPGA acceleration of ECG signals using CNN models, selectively reported on literature. Finally, concluding the discussion on state-of-the-art work reported on the topic for continuing further work, with a glimpse towards possible future activities in this direction.

## II ECG signal performance analysis

This section is organized to discuss briefly the performance analysis of ECG signal high-lighting ECG signal compression, denoising and various methods of classification in a few state-of-the-art FPGA platforms.

Association for the Advancement of Medical Instrumentation (AAMI) recommends five types of heart beats: normal beat (N), supraventricular ectopic beat (S), ventricular ectopic beat (V), fusion beat (F) and unclassified beat (Q) [7]. The different types of heart arrhythmias are Atrial Fibrillation (AF), Premature Atrial Contractions (PAC), Atrial Tachycardia, Atrial Flutter, Premature Ventricular Contractions (PVC), Ventricular Tachycardia (VT), and Ventricular Fibrillation (VF). The most common form of arrhythmia is premature ventricular contraction (PVC) [8] [9].

The fundamental stages of ECG signal analysis are, pre-processing, feature extraction and classification.

**Pre-processing:** In this stage ECG signal is made noise free to be further used for QRS complex detection. The various noises present in ECG signal are, baseline wander (electrode contact noise and electrode motion artifacts), power line interference (PLI), electromyogram (EMG) noise, and instrument associated noise. The filters used ECG signal filtration are: moving average, median, frequency-selective, adaptive, polynomial, and Wiener, and other methods like singular value decomposition (SVD), discrete cosine transform (DCT), discrete wavelet transform (DWT), empirical mode decomposition (EMD), nonlinear Bayesian filter, mathematical morphological operators, independent component analysis (ICA), nonlocal means method, variational mode decomposition and EMD-wavelet method [10]. The filtered signal is in the frequency range of 5Hz to 50Hz.

**QRS complex detection:** The detected QRS peak or complex is used for various feature extraction. QRS complex detection has been carried out in numerous ways available in literature; among them the most widely used methods Pan and Tompkin's algorithm (PAT), Wavelet transform, and sloping method for hardware implementation are highlighted here.

Pan and Tompkins algorithm (PAT) or modified version of it [11] [12] [13] [14], Sloping method-based techniques [15] [16] [17] [18] and Discrete/ Integer Wavelet transform based algorithms [19] [20] [21] [22] [23] [24] are verified on Xilinx System Generator for Digital signal processor (DSP) tool and implemented on various platforms of Field Programmable Gate Array (FPGA).

Feature extraction and classification: Extracted feature set and a variety of machine learning techniques like support vector machines (SVMs), naive Bayes, random forest, multiple layer perceptron (MLP), recurrent neural network (RNN) probabilistic neural networks (PNN), artificial neural networks (ANN) and deep neural network (DNN) have been used in literature to classify respected arrhythmias.

The QRS complex detection and feature extractions methods are implemented on various platforms of microcontroller [25] [26], Analog signal processor (ASP) is also found as an option for feature extraction in order to avoid intensive computation and more power consuming DSP-based techniques [27].

A tabular summary of the state-of-the-art techniques for analyzing various platforms has been reported in Table 1, citing available resources in literature. FPGAs have been chosen as an efficient option to validate various target applications, some of which are discussed in the following section.

## 1. ECG Compressor

An ECG compressor and decompressor is designed and verified on Terasic DE1-SOC platform embedded with ARM processor in Altera FPGA platform for MIT-BIH arrhythmia database in [28]. Here an 11 bit digitized ECG signal is generated at the output of an ADC and then prediction error is computed through the predictor. Two Huffman tables with six stages of look-up table are used to generate the encoded ECG signal, with a latency of 50ns. The maximum six ECG samples are compressed and packed into 16-bit words to be stored in a memory block. The total latency between capturing ECG signal and storing in memory is less than 14ms. The decompressor is implemented on ARM processor and correctness of the resulting signal is checked with original ECG signal.

| Paper | Database                                                                                                 | Methodology/Model                                                                     | FPGA Platform           | Accuracy (in %) | Power               | Target Application                            |
|-------|----------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------|-------------------------|-----------------|---------------------|-----------------------------------------------|
| [28]  | MIT-BIH                                                                                                  | Multi-Stage Huffman<br>Coding                                                         | Terasic DE1-<br>SOC     | -               | -                   | ECG compressor                                |
| [29]  | InCarTDb                                                                                                 | Pipelined<br>Architecture with<br>Probabilistic Neural<br>Network (PNN)<br>classifier | Zynq-7 ZC702            | 97              | 192mW<br>(4classes) | Embedded ECG<br>classifier                    |
| [30]  | MIT-BIH                                                                                                  | Artificial Neural<br>Network (ANN)-<br>Cardiac Arrhythmia<br>Classifier (CAC)         | Xilinx Pynq-Z2          | 98              | 13.34 μW            | ASIC (0.18 μm<br>CMOS) ANN-<br>CAC classifier |
| [31]  | Recording from Tongji<br>Hospital (Huazhong<br>University of Science<br>and Technology,<br>Wuhan, China) | Deep Convolutional<br>Neural Network<br>(DCNN)                                        | Xilinx Zynq<br>XC-7Z020 | 86.7            | -                   | homecare ECG<br>diagnosis                     |

TABLE 1 ECG PERFORMANCE ANALYSIS ON VARIOUS FPGA PLATFORMS

| [32] | MIT-BIH                                  | K-NN         | Zybo Zynq 7000<br>(xc7z010clg400) | 99.5                     | EigenDecompo<br>sition-<br>1.857(100)<br>SVD-<br>1.725(100) | ECG based bio-<br>metric system         |
|------|------------------------------------------|--------------|-----------------------------------|--------------------------|-------------------------------------------------------------|-----------------------------------------|
| [33] | MIT-BIH                                  | Trace- STDP  | Zynq-7020                         | 83                       | 230mW (4<br>classes of heart<br>conditions)                 | Wearable Bio-<br>signal Processor       |
| [3]  | Acquired ECG via ECG<br>module AD8232    | Modified PAT | Zynq-7000 Zybo                    | Real time<br>observation | -                                                           | IoT enabled ECG<br>monitoring<br>system |
| [34] | MIT-BIH ECG Sensor<br>VS100 and Shimmer3 | AES & PCA    | Xilinx ZC702                      | -                        | 107mW                                                       | ECG based bio-<br>metric system         |

## 2. ECG arrhythmia Classifier:

Probabilistic Neural Network (PNN) and Artificial Neural Network (ANN) based cardiac arrhythmia classifier (CAC) and the implementation on FPGA boards has been discussed here.

The authors in [29] have proposed a pipelined structure for embedded devices and smart systems with stages: de-noising, detection and segmentation of filtered ECG signal, dynamic morphological features extraction, and 5 types of heartbeat classification. The implementation utilized less than 30% of the FPGA resource usage.

The system is validated with InCarTDb database (from PhysioNet) featured with Lead 1 seventy-five 30-minutes recordings of 32 different patients data sampled at 257Hz, and a maximum 180 bpm heartbeat rate. In the design, the FIFO generates buffered ECG lead signal and digitized samples to batches of 500ms window size. Then another FIFO block stores the signal filtered through the Moving Average filter. Pre-processing is done for each sample batch with low-pass filter and normalization by comparing with peak amplitude value of the previous PQRS wave. Morphological Feature Extraction and Dynamic Feature Extraction Unit extract the features which are the input for Probabilistic Neural Network (PNN) classifier, resulting in the classified heartbeats: (i) normal beat (ii) APC (iii) RBBB and (iv) PVC.

This prototype is developed on Zynq-7 ZC702 Evaluation board with operating frequency of 35MHz, total power consumption found to be 192mW, 200mW and 208mW for 4, 8 and 16 classes of heart conditions with accuracy of 97%.

CAC-ANN designed in [30] introduced continuous-in-time-discrete-in-amplitude (CTDA) aiming to reduce number of multiplication, conditional grouping scheme (CGS) and biased training (BT) to manage the non-uniform imbalanced training samples of the MIT-BIH database, event driven approach to reduce input complexity, ReLU activation function, and arithmetic unit with only one multiplier based on 16-bit three-dimensional reduction multiplication (TDM) and 24-bit Sklansky-Tree (ST) adder for better power efficiency.

The Level-Crossing ADC (LC-ADC) does sampling based on event-driven approach and generates CTDA signal. Each generated sample is featured by two pulses and the time labels. The polarity of these pulses is presented with 1-bit ("1" or "0") per sample, reducing the data volume to 91% as compared to 8-bit Nyquist ADC for CAC. This ANN-CAC has 3 layers ( $32 \times 16 \times 5$ ) and takes 96 bits of input data frame, where two adjacent RR intervals are presented with 22 bits and pulses centered at R peak with last 74 bits. Training and verification is done in 7:3 ratio of the total samples of each record.

The ANN-CAC classifies N, S, V and F types by achieving 98% accuracy, 97% sensitivity, and 94% positive predictivity. The model is verified on Xilinx Pynq-Z2 FPGA board, with operating frequency of 2.5 MHz, and uses 5269 look up tables (LUT), 1024 LUT based RAM (LUTRAM) and 1311 flip-flops (FF). These resource usages are less than 10% of the total logic gate resources from Xilinx. The ASIC implementation of this design claims average power of  $13.34\mu$ W for heart rate of 75 bpm, synthesized with 1.8V, 0.18 $\mu$ m CMOS process at 25MHz clock frequency. This implementation can work with frequency 10 KHz to 25MHz and resulting in lower leakage as the average current drops.

## 3. ECG monitoring system:

Targeting a real time ECG monitoring system, the prototypes implemented on FPGA board have been verified with acquired ECG signal. These have been performed using existing dataset as well as recorded signals from patient's history.

Deep convolutional neural network (DCNN) based homecare directed ECG monitoring system is proposed in [31] with pruning scheme in channel-level a reduction of parameters up to the extent of 75.4% is gained. Parameters quantization helped to reduce

the floating-point operations by 42.7%. The design is implemented for 12-lead ECG raw signals classification on an embedded platform Xilinx Zynq XC-7Z020 FPGA chip, with two ARM Cortex-A9 processors and Artix-7 FPGA. For evaluation of the proposed DCNN model the dataset of 206468 ECG recordings from Tongji Hospital is distributed in random manner with 80%, 10%, and 10% for training, validation, and test sets, respectively.

The 3 main modules of proposed DCNN model are: 1) pre-convolution module 2) multiple-block CNN 3) multilevel classifier. This model is having convolutional layers of 13 stages and two subsequent fully connected (FC) layers. The first three convolution layers with kernel size 21, 32 filters and stride 2 are fed with long input data. The rest convolution layers stacked with kernel size 5, stride 1 and a filter size of 64 filters in the first group, and gradual increase up-to 256 by a multiplication factor 2. Fast down-sampling is done by three max-pooling operations in the last convolution block. Batch normalization (BN), and multilevel classification with ReLU activation function is done after each convolutional layer. Two dropout layers are added to the fully connected (FC) layers to address overfitting. At the end of the model, C-sigmoid activation functions are used to convert real values into class probabilities, and the binary cross-entropy loss function is applied.

The prototype of ECG detection system has ECG acquisition unit- ADS1298 and transmit raw signal sampled at 500Hz to STM32. Then the processed ECG recording undergoes noise elimination, normalization, quantization (8-bit integer), and then convolutional computation is done in the programmable logic (PL) part for of Xilinx Zynq XC-7Z020 FPGA chip.

The diagnosing time achieved for 10-s ECG data is 2.895s, F1 score is 0.913, the average area under the receiver operating characteristic curve (AUC), and the average specificity are 0.994 and 0.997 respectively, the average sensitivity 0.891, and exact match ratio, extension of the prediction accuracy 86.7%. This prototype being a wearable one has a major limitation with complex sensing component of 12-lead ECG sample collection system which can be improved with 3 or 6-laed arrangement.

In the work [3], an IoT based ECG monitoring system is proposed with optimized Pan and Tompkins algorithm, and implemented on Xilinx Zynq-7000 Zybo board. The patient's heart rate and temperature are measured by ECG module AD8232 and analog thermometer 3950 NTC (negative temperature coefficient) respectively. The heart rate and temperature computed are displayed in the web page with ESP8266 WiFi module using WiFi 802.11b/g/n operated at 2.4 GHz channel.

The collected ECG signal and temperature from patient's body are converted to digital signals through Xilinx Analog-to-Digital Converter (XADC) which is a dual 12-bit ADC. XADC is a part of the PL of Zynq device and it interfaces with PS through the PS-XADC interface. The IP core is created with modified Pan-Tomkins algorithm by eliminating the derivative and integration parts for QRS detection and heart rate measurement. The ECG simulator and the temperature sensor data received trough FPGA board are processed and displayed on the web page via the ESP module by serial UART-lite interface.

A wearable neuromorphic Bio-signal Processor is proposed [33]for four class arrhythmia detection using MIT-BIH ECG dataset. A single-layer excitatory inhibitory Spiking Neural Network (SNN) is modeled on unsupervised Spiking-Timing-Dependent-Plasticity (STDP) algorithm. The excitatory neurons (EN) receive spikes from input neurons through a fully connected layer, and then each EN connects to inhibitory neuron (IN) through positive weights in one-to-one connected layer. Each IN is connected with all EN except the previously connected one through negative weights. The input neuron is modeled with Leaky-Integrate-and-Fire (LIF) model and spikes have been generated based on the membrane potential. Based on the generated spikes, weights are updated and an event driven architecture is modeled.

The real time input and output events, reception or generation of input or output spikes are managed through Address Event Representation (AER) asynchronous protocol. The AER packets having spiking neuron ID and the respective timestamp are generated by AER bus and stored in two FIFOs. Three event handlers: Integrate Handler, Leaky Handler and File Handler are meant to work in event-trigger mode. To model low power architecture, the processor was only activated on completion of the previous event and set idle for rest cases.

The SNN model has 251 input neurons and 251 ENs, and time step of 100 to encode the raw ECG signal. The processor is implemented on Zynq 7020 FPGA platform and has achieved recognition accuracy of 83% on MIT BIH dataset. The processor has utilized 344 and 579, LUT and FF respectively, and the consumed power of 230mw.

Quite a few works available in literature has used CNN as an efficient metric for feature extraction of ECG signal, which has been reported for a brief understanding in the next section.

### III FPGA AS ACCELERATORS FOR ECG SIGNAL CLASSIFICATION USING CNN

This section gives a short preview on various acceleration boosted by FPGAs for medical arena, specifically for the ECG signals classifying types of heartbeats. Table 2 shows the comparative list of articles depicting the summary of reported metrics. A brief zest of each article has been shown in the rest of this section.

For Real time ECG classification on low-powered edge devices Deep learning model- CNN is over-parameterized with huge volume of dataset and computational complexity. To get savings on computational effort, power usage and memory for storage, a group of recent works has introduced compression techniques like, sparse model, pruning scheme and binarized CNN for CNN implementation on FPGA.

The above-mentioned CNN models are implemented on both 1D and 2D. In literature 1D CNN models are less resource intensive compared to 2D CNN models, whereas 2D based models are robust to noise. The details of network arrangements are discussed in subsequent section.

In [35] the authors has used Pruning technology to develop a small-scale sparse 1-D CNN model with five significant modules: group of RAM, array of processing elements (PEs), pixel scatter, central controller and auxiliary circuits, and works on 16bit fixed point ECG data of MIT-BIH database.60% sparsity is achieved in input feature map by taking the non-zero indexed weight values. These weights, instructions, signal activations are transferred to on-chip RAM from external memory. The Pes takes one clock cycle for completion of one multiplication and addition. The non-zero weight and index distribution for respective PEs along with selection of data path is done by a weight decoder unit. The valid activation will be loaded to PE by MUX and shift register in Pixel (weight/ activation) scatter. The central controller generates the control signal for the pipelined operations of the PEs. The auxiliary circuits add bias; do the pooling, and truncating operations on the partial sum obtained from the specific data path.

This is implemented on Xilinx Zynq ZC706 FPGA platform, with an accuracy of 99.17% on five types of ECG beats (normal beat (NOR), LBBB, RBBB, PVC and APB) utilizing 1995 LUTs, 3011 FFs and 12 DSPs.

Three network pruning algorithms based on magnitude of weights and bias: a) simple pruning, b) fine-tuning based pruning c) multistage pruning with reference to a 3layered base-line CNN model are implemented in [36]. Zeroing and fine-tuning have been done on each layer of this model. They have evaluated performance and complexity with respect to different sparsity levels ( $\eta$ ), ranging from 10% to 90%. They claimed of achieving accuracy of 98.12%, sensitivity of 98.07%, and specificity of 98.29% in classifying ECG into N, SVEB, VEB, F and Q standard through 10layers of the model. This evaluation may be extended for other datasets along with CNN models.

[37] This is paper aims to develop a flexible, efficient wearable ECG classifier based a small-scale unstructured 1D-CNN accelerator with 70% sparsity. It is designed with a tile-first dataflow excluding zero weight multiplications, for data compression and power gating on activation. A configurable array of four 12-stage cascade, 32-bit instruction processing element (PE) is designed for multiplication of weights and addition of bias of a tile. This model supports pipelined processing of multiple pooling types (none, max pooling, average pooling and global average pooling (GAP)), varying kernel sizes and number of feature maps. The accelerator can be a good option for wearable ECG device with computing efficiency of 118.75% and energy efficiency of 3.93µJ per classification by completing a beat classification in 4.6ms under 2MHz clock frequency.

A wearable Artificial Intelligence-of-Things (AIoT) device targeting to classify ECG excluding feature extraction or preprocessing steps is modeled in [38]. 2D-bCNN classifier for classifying class 1 for the true label all related V beat and the rest as class 0 is modeled with quantized multi-layer perceptron (qMLP) and bCNN perform image generation and image classification, respectively. The model categorized as better performance (BP) and low power (LP) models taking the image of size  $16\times20$  and  $10\times10$ . The dense network qMLP consists of 3 layers. The first layer, qDense1 takes 11-bit ADC output and multiplies with quantized 8bit weights and accumulates. The activation unit quantized rectified linear unit (qReLU), quantized the 32-bit accumulator output to 8bit. qDense2 layer performs the multiplication and activation using qRelu. The output of qDense3 layer uses binary hyperbolic tangent (bTanH) and generates binary inputs for the succeeding bCNN layer to perform multiplication operation loss (CL) and classifies the binary image as V or non-V beat deploying bCONP (convolution and pooling), bDense1 and bDense2 layer with sigmoid activation function. The device chosen for implementing model is Lattice Semiconductor's ultralow power iCE40 UltraPlus iCE40UP5k. The accuracy is 98.5% and 98.2%; sensitivity is 85.4% and 80.6%; and F1 score is 89.2% and 86.2% for BP and LP model respectively. Dynamic power consumption by BP and LP models are of 55.4  $\mu$ W and 34.9  $\mu$ W; and static power are 377.9  $\mu$ W and 376.6  $\mu$ W, respectively at 100 kHz. This proposed CNN model comparatively uses less MAC operations and made it preferable for wearable edge IoT device.

Resource constrained FPGA may be an attractive option for accelerating CNN models for ECG classification with scaling down the input feature map or the introducing quantization in various layers.

### IV CONCLUSION

The state-of-the-art technologies used for classifying ECG signal using FPGAs as accelerators previews the power of resource constrained devices that can be utilized in the health sector domain. Taking advantage of convolutional neural networks, various applications can be tapped and a huge market can be explored. This paper gives a glimpse of ongoing activities in this domain of CNN based classifications with the assistance of FPGAs as accelerators to diagnose small variations of ECG signals. Further activities to understand heart beats in a large scale can be done in the Internet of Things arena, where sensors and actuators can securely communicate with the server cloud without hampering the data.

#### REFERENCES

- [1] J. E. Hall, Guyton and Hall Textbook of Medical Physiology, Elsevier, 2011.
- [2] M. Yasin, T. Tekeste, H. Saleh, B. Mohammad, O. Sinanoglu and M. Ismail, "Ultra-Low Power, Secure IoT Platform for Predicting Cardiovascular Diseases," *IEEE Transactions on Circuits and Systems I: Regular Papers*, vol. 64, no. 9, pp. 2624-2637, Sept. 2017.
- [3] I. S. R. M. N. S. N. A. a. T. A. Shaker F. K. Abushukor, "FPGA Implementation of IoT-Based Health Monitoring System," in 15th International Conference on Telecommunication Systems, Services, and Applications (TSSA), Bandung, Indonesia, 2021.
- [4] C. B. B. L. K. P. E. R. a. L. S. L. Lapidus, "Distribution of adipose tissue and risk of cardiovascular disease and death: a 12 year follow up of participants in the population study of women in Gothenburg, Sweden," *British Medical Journal*, vol. 289, no. 6454, pp. 1257-1261, 1984.
- [5] M. B. S. a. G. D. C. O. Sayadi, "Robust Detection of Premature Ventricular Contractions Using a Wave-Based Bayesian Framework," IEEE Transaction Biomedocal Engineering, vol. 57, no. 2, pp. 353-362, 2010.
- [6] T. I. a. M. G. S. Kiranyaz, "Real-Time Patient-Specific ECG Classification by 1-D Convolutional Neural Networks," *IEEE Transactions on Biomedical Engineering*, vol. 63, no. 3, pp. 664-675, 2016.
- [7] A. J. Prakash and S. Ari, "AAMI Standard Cardiac Arrhythmia Detection with Random Forest Using Mixed Features," in *IEEE 16th India Council International Conference* (INDICON), 2019.
- [8] N. S. E. W. C. A. I. S. &. L. Y. Clark, "A wearable ECG monitoring system for real-time arrhythmia detection," in IEEE 61st International Midwest Symposium on Circuits and Systems (MWSCAS), 2018.
- [9] M. I. H. M. B. &. U. A. F. M. N. Khan, "Performance analysis of modified zero crossing counts method for heart arrhythmias detection and implementation in HDL," in *International Conference on Informatics, Electronics and Vision (ICIEV)*, 2013.
- [10] U. Satija, B. Ramkumar and M. S. Manikandan, "A Review of Signal Processing Techniques for Electrocardiogram Signal Quality Assessment," IEEE REVIEWS IN BIOMEDICAL ENGINEERING, vol. 11, pp. 36-52, 2018.
- [11] E. H. E. Mimouni and M. Karim, "A Real-Time FPGA-Based QRS Detector Using Adaptive Threshold with the Previous Smallest Peak," Progress in Computing Applications, vol. 1, no. 2, pp. 97-111, 2012.
- [12] N. S. Madiraju, N. Kurella and R. Valapudasu, "FPGA Implementation of ECG feature extraction using Time domain analysis," 2018.
- [13] A. E. Hassen, A. Histace, M. Terosiet and O. Romain, "FPGA-Based Detection of QRS Complexes in ECG Signal," in Design and Architectures for Signal and Image Processing (DASIP), 2015.
- [14] J. Kovačević and et al., "FPGA Low-Power Implementation of QRS Detectors," in IEEE 3rd Mediterranean Conference on Embedded Computing (MECO), 2014.
- [15] M. R. Arefin, K. Tavakolian and R. F. Rezai, "QRS Complex Detection in ECG Signal for Wearable Devices," in Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015.
- [16] X. Xu and Y. Liu, "ECG QRS Complex Detection Using Slope Vector Waveform (SVW) Algorithm," in The 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society., San Francisco, CA, USA, 2004.
- [17] K. F. Tan, K. F. Tan, K. L. Chan and K. Choi, "Detection of the QRS complex, P wave and T wave in electrocardiogram," in *First International Conference on Advances in Medical Signal and Information Processing*, 2000.
- [18] J. Sarma and R. Biswas, "VLSI based Adaptive Power Management Architecture for ECG Monitoring in WBAN," in 2020 33rd International Conference on 19th International Conference on VLSI Design and Embedded Systems (VLSID), 2020.
- [19] Y. Zou, J. Han, X. Weng and X. Zeng, "An Ultra-Low Power QRS Complex Detection Algorithm Based on Down-Sampling Wavelet Transform," *IEEE Signal Processing Letters*, vol. 20, no. 5, pp. 515-518, 2013.
- [20] S. Acharya, H. Kabra, P. V. Kasambe and S. S. Rathod, "Performance evaluation of an integer wavelet transform for FPGA implementation," in International Conference on Nascent Technologies in the Engineering Field (ICNTE), 2015.
- [21] C. I. Ieong, M. I. Vai and P. U. Mak, "ECG QRS Complex detection with programmable hardware," in 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2008.
- [22] E. M. E. Hassan and M. Karim, "An FPGA-based implementation of a pre-processing stage for ECG signal analysis using DWT," in Second World Conference on Complex Systems (WCCS), 2014.
- [23] B. Zhang, L. Sieler, Y. Morère, B. Bolmont and G. Bourhis, "Dedicated wavelet QRS complex detection for FPGA implementation," in *International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)*, 2017.
- [24] Y. Li, H. Yu, L. Jiang, L. Ma and Z. Ji, "Adaptive Lifting Scheme for ECG QRS complexes detection and its FPGA implementation," in 3rd International Conference on Biomedical Engineering and Informatics, 2010.
- [25] B. Mishra, N. Arora and Y. Vora, "Wearable ECG for Real Time Complex P-QRS-T Detection and Classification of Various Arrhythmias," in 11th International Conference on Communication Systems & Networks (COMSNETS), 2019.
- [26] E. H. E. Mimouni and M. Karim, "A MicroBlaze-Based Multiprocessor System on Chip for Real-Time Cardiac Monitoring," in International Conference on Multimedia Computing and Systems (ICMCS), 2014.
- [27] P. K. Jha, M. K. Rajendran, P. K. Lenka, A. Acharyya and A. Dutta, "A Fully Analog Autonomous QRS Complex Detection and Low-Complexity Asystole, Extreme Bradycardia, and Tachycardia Classification System," *IEEE Transactions on Instrumentation and Measurement*, vol. 71, 2022.
- [28] K. J. Lin, H. H. Huang and Y. Y. Lin, "An FPGA Implementation of Lossless ECG Compressors Based on Multi-Stage Huffman Coding," in 7th Global Conference on Consumer Electronics (GCCE 2018), 2018.
- [29] P. Sá, H. Aidos, N. Roma and P. Tomás, "Heart Disease Detection Architecture for Lead I Off-the-Person ECG Monitoring Devices," in 27th European Signal Processing Conference (EUSIPCO), 2019.
- [30] Y. Zhao, Z. Shang and Y. Lian, "13.34 μW Event Driven Patient-Specific ANN Cardiac Arrythmia Classification for wearable ECG Sensors," IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, vol. 14, no. 2, pp. 186-197, 2020.

- [31] S. Ran, X. Yang, M. Liu, Y. Zhang, C. Cheng, H. Zhu and Y. Yuan, "Homecare-Oriented ECG Diagnosis With Large-Scale Deep Neural Network for Continuous Monitoring on Embedded Devices," *IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT*, vol. 71, 2022.
- [32] C. Fang, Z. Shen, F. Tian, J. Yang and M. Sawan, "A Compact Online-Learning Spiking Neuromorphic Biosignal Processor," in IEEE International Symposium on Circuits and Systems (ISCAS), 2022.
- [33] J. Lu, D. Liu, A. Hu, C. Zhang, C. Mo, R. Guo and H. Li, "A Low-cost and Configurable Hardware Architecture of Sparse 1-D CNN for ECG Classification," in 2022 IEEE 16th International Conference on Solid-State & Integrated Circuit Technology (ICSICT), China, 2022.
- [34] L. Xiaolin, R. C. Panicker, B. Cardiff and D. John, "Multistage Pruning of CNN Based ECG Classifiers for Edge Devices," in 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Mexcico, 2021.
- [35] J. Lu, D. Liu, X. Cheng, L. Wei, A. Hu and X. Zou, "An Efficient Unstructured Sparse Convolutional Neural Network Accelerator for Wearable ECG Classification Device," IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I, vol. 69, no. 11, pp. 4572-4582, 2022.
- [36] D. L. T. Wong, Y. Li, D. John, W. K. Ho and C.-H. Heng, "Low Complexity Binarized 2D-CNN Classifier for Wearable Edge AI Devices," *IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS*, vol. 16, no. 5, pp. 822-831, 2022.