

# Pedro Miguel Lobato Garcia

Licenciado em Ciências da Engenharia Eletrotécnica e de Computadores

# Design of passive $\Sigma\Delta$ modulator for ESP32

Dissertação para obtenção do Grau de Mestre em Engenharia Electrotécnica e de Computadores

Orientador: Nuno Filipe Silva Veríssimo Paulino, Professor Auxiliar, Universidade Nova de Lisboa

Júri

Presidente:Doutor Fernando José Almeida Vieira do Coito - FCT/UNLArguente:Doutor João Pedro Abreu de Oliveira - FCT/UNLVogal:Doutor Nuno Filipe Silva Veríssimo Paulino - FCT/UNL



Março, 2020

Copyright © Pedro Miguel Lobato Garcia, Faculty of Sciences and Technology, NOVA University Lisbon.

The Faculty of Sciences and Technology and the NOVA University Lisbon have the right, perpetual and without geographical boundaries, to file and publish this dissertation through printed copies reproduced on paper or on digital form, or by any other means known or that may be invented, and to disseminate through scientific repositories and admit its copying and distribution for non-commercial, educational or research purposes, as long as credit is given to the author and editor.

Este documento foi gerado utilizando o processador (pdf) LTEX, com base no template "novathesis" [1] desenvolvido no Dep. Informática da FCT-NOVA [2]. [1] https://github.com/joaomlourenco/novathesis [2] http://www.di.fct.unl.pt

Aos meus pais, Gabriela e Francisco, e à minha irmã Ana.

## ACKNOWLEDGEMENTS

First and foremost, I would like to acknowledge and show my appreciation to my supervisor Prof. Nuno Paulino, for his advice, orientation, patience and availability, without which I would have been utterly lost.

I would like to thank Prof. Rui Tavares for his technical support.

I would also like to show my appreciation and gratitude to my colleagues and friends for all their support.

Finally, I would like to thank my family for their encouragement, motivation and patience.

### ABSTRACT

This thesis addresses the problem of voice recognition, by studying how to build a lowcost solution for data acquisition, presents the analysis, design implementation and simulation of a low-cost continuous-time(CT) passive sigma-delta modulator( $\Sigma\Delta M$ ) with signal-to-noise-plus-distortion ratio(SNDR) > 86dB to increase fidelity in voice recognition applications.

Employing passive RC integrators and a differential signal structure, the CT passive  $\Sigma\Delta M$  is optimized to work as independently from a specific comparator module as possible, by decreasing the necessary gain of the comparator in the modulator gain. Nevertheless, the loop gain is restricted by the comparator's noise, aggravated by the high attenuation of the passive RC integrators on the signal, causing low voltage swing at the comparator's input.

It is discussed the viability of utilizing a microprocessor ( $\mu$ P) as substitute of the comparator block in a CT passive  $\Sigma\Delta M$ , specifically, the esp32. Due to hardware limitations and the high-frequency requirements of the CT passive  $\Sigma\Delta M$ , it is proven this substitution is not viable.

Along the course of the design implementation a comparison in the performance of three different simulation software is presented, these software being, the open-source *LTspice VII*, the open-source *Ngspice* and the private *Cadence Virtuoso*. As it was necessary to change simulation software at different stages in the design of the circuit.

### Resumo

Esta tese discute o problema de reconhecimento de voz, estudando como construir uma solução baixo custo de acquisição de dados, apresenta ainda uma análise, projecto e simulação de um conversor analógico-digital(ADC), usando um modulador sigma-delta( $\Sigma\Delta M$ ) passivo de tempo contínuo(CT) com SNDR > 86dB, para o efeito de melhorar a resolução de aplicações de reconhecimento de voz.

Através da utilização de integradores passivos RC e uma estrutura de sinais differenciais, o  $\Sigma\Delta M$  é optimizado para funcionar o mais independentemente possivel de um comparador especifico, sendo para isso minimizado o ganho do comparador no ganho do modulador. Contudo, o ganho do modulador é limitado pelo ruido do comparador, agravado pelo efeito de atenuação dos integradores passivos no sinal, causando um sinal de baixa tensão na entrada do comparador.

É discutida a viabilidade de utilizar um micro processador( $\mu$ P) para substituir o módulo do comparador no  $\Sigma\Delta$ M, especificamente, o  $\mu$ P esp32. Devido às limitações de *hardware* e à alta frequência que é necessária para o  $\Sigma\Delta$ M, é provado que esta substituição não é viável.

Ao decorrer do projecto do  $\Sigma\Delta M$  é apresentada a comparação de desempenho de três software de simulação utilizados, especificamente, o *LTspice VII*, o *Ngspice* e o *Cadence Virtuoso*. Visto ter sido necessário a alteração de software de simulação em diferentes fases do projecto do circuito.

# Contents

| List of Figures xv |                    |                  |                                         |     |  |  |
|--------------------|--------------------|------------------|-----------------------------------------|-----|--|--|
| Li                 | List of Tables xvi |                  |                                         |     |  |  |
| Ac                 | rony               | ms               |                                         | xix |  |  |
| 1                  | Intr               | oductio          | n                                       | 1   |  |  |
|                    | 1.1                | Motiva           | ation                                   | 1   |  |  |
|                    | 1.2                | Object           | tives                                   | 2   |  |  |
|                    | 1.3                | Docun            | nent Overview                           | 2   |  |  |
| 2                  | Stat               | e Of Ar          | ·t                                      | 5   |  |  |
|                    | 2.1                | Voice l          | Recognition Systems                     | 5   |  |  |
|                    |                    | 2.1.1            | Vocca PRO Light                         | 5   |  |  |
|                    |                    | 2.1.2            | Honeywell Smart Thermostat              | 6   |  |  |
|                    |                    | 2.1.3            | Amazon Echo Dot                         | 6   |  |  |
|                    |                    | 2.1.4            | Apple HomePod                           | 7   |  |  |
|                    | 2.2                | Voice l          | Recognition Software's                  | 8   |  |  |
|                    |                    | 2.2.1            | Speech Data Acquisition                 | 9   |  |  |
|                    | 2.3                | Microo           | controllers                             | 10  |  |  |
|                    |                    | 2.3.1            | ESP32                                   | 10  |  |  |
|                    |                    | 2.3.2            | Arduino                                 | 11  |  |  |
|                    |                    | 2.3.3            | Raspberry Pi                            | 12  |  |  |
|                    |                    | 2.3.4            | RDA5981A                                | 13  |  |  |
|                    | 2.4                | System           | ns on a chip                            | 14  |  |  |
|                    |                    | 2.4.1            | ESP32                                   | 14  |  |  |
|                    |                    | 2.4.2            | Arduino                                 | 15  |  |  |
|                    |                    | 2.4.3            | RDA5981A                                | 16  |  |  |
|                    | 2.5                | $\Sigma\Delta$ M | odulators                               | 17  |  |  |
|                    |                    | 2.5.1            | Basic Operation of an ADC               | 17  |  |  |
|                    |                    | 2.5.2            | Principles of $\Sigma\Delta$ Modulators | 18  |  |  |
|                    |                    | 2.5.3            | $Σ\Delta M$ Loop Filter                 | 22  |  |  |

| 3  | Arc            | hitecture Development                              | 27 |
|----|----------------|----------------------------------------------------|----|
|    | 3.1            | Circuit Design                                     | 27 |
|    |                | 3.1.1 Theoretical analysis                         | 27 |
|    |                | 3.1.2 $\Sigma\Delta 1^{st}$ Order Modulator        | 27 |
|    | 3.2            | Simulation programs                                | 30 |
| 4  | $\Sigma\Delta$ | 2 <sup>nd</sup> Order Modulator Development        | 33 |
|    | 4.1            | $\Sigma\Delta 2^{nd}$ Order Modulator Single Ended | 33 |
|    |                | 4.1.1 Theoretical analysis and calculations        | 34 |
|    |                | 4.1.2 Electrical Simulation                        | 36 |
|    | 4.2            | $\Sigma\Delta 2^{nd}$ Order Differential Modulator | 38 |
| 5  | ESP            | 32 Development & Implementation                    | 45 |
|    | 5.1            | Viability Study for ESP32                          | 45 |
|    |                | 5.1.1 Theoretical Viability Study                  | 45 |
|    |                | 5.1.2 Practical Viability Study                    | 46 |
| 6  | Sun            | nmary & Future Work                                | 49 |
|    | 6.1            | Summary                                            | 49 |
|    | 6.2            | Future Work                                        | 50 |
| Bi | bliog          | raphy                                              | 51 |

# List of Figures

| 2.1  | VOCCA Pro                                                                                                                            | 5  |
|------|--------------------------------------------------------------------------------------------------------------------------------------|----|
| 2.2  | Honeywell Wi-Fi Thermostat RTH9590WF1011                                                                                             | 6  |
| 2.3  | Amazon Echo Dot 3rd Generation                                                                                                       | 6  |
| 2.4  | Apple HomePod                                                                                                                        | 7  |
| 2.5  | ESP32 Development board DevKitC                                                                                                      | 10 |
| 2.6  | Arduino Micro development board                                                                                                      | 11 |
| 2.7  | Raspberry Pi 2                                                                                                                       | 12 |
| 2.8  | RDA5981a development board                                                                                                           | 13 |
| 2.9  | System structure block diagram                                                                                                       | 14 |
| 2.10 | System address mapping                                                                                                               | 15 |
| 2.11 | (a) Conceptual block diagram of a generic LP ADC (b) Signal Processing $\ . \ .$                                                     | 17 |
| 2.12 | Representation of anti-aliasing filtering for (a) nyquist-rate (b) oversampling ADC                                                  | 18 |
| 2.13 | Quantization Process (a) Ideal characteristic of a quantizer (b) Quantization                                                        |    |
|      | error                                                                                                                                | 19 |
| 2.14 | Quantization white noise (a) probability density function (b) power spectral                                                         |    |
|      | density                                                                                                                              | 20 |
| 2.15 | Time domain representation of first order $\Sigma\Delta$ modulator $\ldots$                                                          | 20 |
| 2.16 | Frequency-domain representation, the relation of noise characteristic to the operation of $\Sigma\Delta Ms$                          | 20 |
| 2.17 | Oversampling results in quantization noise shaping                                                                                   | 21 |
| 2.18 | RC passive integrator                                                                                                                | 24 |
| 2.19 | $2^{nd}$ order $\Sigma \Delta M$ with feedforward structure (a) block diagram (b) circuit rep-                                       |    |
|      | resentation                                                                                                                          | 24 |
| 3.1  | Solution Block Diagram                                                                                                               | 27 |
| 3.2  | $1^{st}$ order $\Sigma\Delta$ modulator Block Diagram $\ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots \ldots$ | 28 |
| 3.3  | $1^{st}$ order $\Sigma\Delta$ modulator Simulation Schematic                                                                         | 28 |
| 3.4  | $1^{st}$ order $\Sigma\Delta$ modulator FFT in <i>LTspice VII</i>                                                                    | 29 |
| 3.5  | $2^{nd}$ Order $\Sigma\Delta$ SE Modulator FFT with 0dBV Input Signal Amplitude (a) <i>LTspice</i>                                   |    |
|      | VII (b) Ngspice                                                                                                                      | 30 |

#### LIST OF FIGURES

| 3.6  | $2^{nd}$ Order $\Sigma\Delta$ SE Modulator FFT with -13dBV Input Signal Amplitude (a)<br>LTspice VII (b) Ngspice | 30 |
|------|------------------------------------------------------------------------------------------------------------------|----|
| 4.1  | $2^{nd}$ Order $\Sigma\Delta$ Modulator Block Diagram                                                            | 33 |
| 4.2  | 2 <sup><i>nd</i></sup> Integrator with Zero Block Diagram                                                        | 34 |
| 4.3  | $2^{nd}$ Order $\Sigma\Delta$ Modulator Circuit Schematic                                                        | 34 |
| 4.4  | STF & NTF                                                                                                        | 36 |
| 4.5  | $2^{nd}$ Order $\Sigma\Delta$ Single-ended Modulator SINAD                                                       | 36 |
| 4.6  | $2^{nd}$ Order $\Sigma\Delta$ Single-ended Modulator FFT in <i>Ngspice</i>                                       | 37 |
| 4.7  | $2^{nd}$ Order $\Sigma\Delta$ Single-ended Modulator Noiseless FFT in <i>Virtuoso</i>                            | 37 |
| 4.8  | $2^{nd}$ Order $\Sigma\Delta$ SE Modulator Noise FFT in <i>Virtuoso</i> (a) Noise < 16MHz (b)                    |    |
|      | Noise < 32MHz (c) Noise < 80MHz                                                                                  | 38 |
| 4.9  | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Electrical Schematic                                        | 38 |
| 4.10 | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator SINAD in <i>Virtuoso</i>                                    | 39 |
|      | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Noiseless FFT in <i>Virtuoso</i>                            | 39 |
|      | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Noise FFT in <i>Virtuoso</i> (a) Noise < 16MHz              |    |
|      | (b) Noise < 32MHz (c) Noise < 80MHz                                                                              | 40 |
| 4.13 | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Capacitors 10% Variation FFT in <i>Virtu</i> -              |    |
|      | <i>oso</i> (a) C1 (b) C2                                                                                         | 40 |
| 4.14 | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Resistors 10% Variation FFT in <i>Virtuoso</i>              |    |
|      | (a) R1 (b) R2 (c) R3 (d) R4                                                                                      | 41 |
| 4.15 | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Capacitors 20% Variation FFT in <i>Virtu</i> -              |    |
|      | <i>oso</i> (a) C1 (b) C2                                                                                         | 41 |
| 4.16 | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Resistors 20% Variation FFT in <i>Virtuoso</i>              |    |
|      | (a) R1 (b) R2 (c) R3 (d) R4                                                                                      | 42 |
| 4.17 | $2^{nd}$ Order $\Sigma\Delta$ Differential Modulator Comparator Offset Variation FFT in Vir-                     |    |
|      | <i>tuoso</i> (a) 10% (200mV) (b) 7% (100mV)                                                                      | 42 |
| 5.1  | ESP32 GPIO Loop at 8KHz (a) Input (b) Output                                                                     | 46 |
| 5.2  | ESP32 GPIO Loop at 80KHz (a) Input (b) Output                                                                    | 47 |
| 5.3  | ESP32 GPIO Loop at 8MHz (a) Input (b) Output                                                                     | 47 |

# LIST OF TABLES

| 2.1 | System examples comparison                    | 7  |
|-----|-----------------------------------------------|----|
| 2.2 | Audio data rate vs. sound quality             | 9  |
| 2.3 | Passive $\Sigma \Delta Ms$ : CT VS DT         | 25 |
| 3.1 | Resistors values                              | 29 |
| 3.2 | Simulation Comparison: LTspice VII VS Ngspice | 31 |
| 4.1 | Circuit and Frequency values                  | 35 |
| 4.2 | Theoretical Results                           | 38 |
| 4.3 | Result of Component Variation                 | 43 |
| 6.1 | Simulation Programs Comparison                | 49 |

# ACRONYMS

| ΣΔΜ    | Sigma-Delta Modulator.                               |
|--------|------------------------------------------------------|
| Δ      | Delta.                                               |
| Σ      | Sigma.                                               |
| μP     | Micro Processor.                                     |
| $B_W$  | Bandwidth.                                           |
| $f_P$  | Pole Frequency.                                      |
| $f_S$  | Sampling Frequency.                                  |
| AAF    | Anti-aliasing Filter.                                |
| ADC    | Analog to Digital Converter.                         |
| ARM    | Advanced RISC Machine.                               |
| BLE    | Bluetooth Low Energy.                                |
| С      | Capacitor.                                           |
| CPU    | Central Processing Unit.                             |
| СТ     | Continuous Time.                                     |
| DAC    | Digital to Analog Converter.                         |
| DIY    | Do It Yourself.                                      |
| DR     | Dynamic Range.                                       |
| DT     | Discrete Time.                                       |
| EEPROM | Electrically Erasable Programmable Read-only Memory. |
| FAT    | File Allocation Table.                               |
| FFT    | Fast Fourier Transform.                              |
| GPIO   | General-purpose Input/Output.                        |

| I/O                                    | Input/Output.                          |  |  |  |  |
|----------------------------------------|----------------------------------------|--|--|--|--|
| I2C                                    | Multi-Master Bus.                      |  |  |  |  |
| I2S                                    | Integrated Inter-IC Sound Bus.         |  |  |  |  |
| IC                                     | Integrated Circuit.                    |  |  |  |  |
| ICSP                                   | In-Circuit Serial Programming.         |  |  |  |  |
| IDE                                    | Integrated Development Environment.    |  |  |  |  |
| IoT                                    | Internet of Things.                    |  |  |  |  |
| IP                                     | Internet Protocol.                     |  |  |  |  |
|                                        |                                        |  |  |  |  |
| MCU                                    | Microcontroller.                       |  |  |  |  |
| MQTT                                   | MQ Telemetry Transport.                |  |  |  |  |
|                                        |                                        |  |  |  |  |
| NLP                                    | Natural Language Processing.           |  |  |  |  |
| NTF                                    | Noise Transfer Function.               |  |  |  |  |
|                                        |                                        |  |  |  |  |
| OSR                                    | Oversampling Ratio.                    |  |  |  |  |
| OTA Operational Transconductance Ampli |                                        |  |  |  |  |
| DOD                                    |                                        |  |  |  |  |
| PSD                                    | Power Spectral Density.                |  |  |  |  |
| PSRAM                                  |                                        |  |  |  |  |
| PWM                                    | Pulse Width Modulation.                |  |  |  |  |
| R                                      | Resistor.                              |  |  |  |  |
| RAM                                    | Random-access Memory.                  |  |  |  |  |
| rms                                    | root-square-mean.                      |  |  |  |  |
| RTOS                                   | Real-time Operating System.            |  |  |  |  |
| K105                                   | Keal-time Operating System.            |  |  |  |  |
| S/H                                    | Sample and Hold.                       |  |  |  |  |
| SC                                     | Switched-capacitor.                    |  |  |  |  |
| SDIO                                   | Secure Digital Input/Output.           |  |  |  |  |
| SINAD                                  | Signal-to-Noise and Distortion ratio.  |  |  |  |  |
| SNDR                                   | Signal-to-Noise-plus-Distortion Ratio. |  |  |  |  |
| SNR                                    | Signal-to-Noise Ratio.                 |  |  |  |  |
| SoC                                    | System on a Chip.                      |  |  |  |  |
| SPi                                    | Serial Peripheral Interface.           |  |  |  |  |
| SRAM                                   | Static Random-access Memory.           |  |  |  |  |
|                                        | ·                                      |  |  |  |  |

| SSL  | Secure Sockets Layer.                        |
|------|----------------------------------------------|
| STF  | Signal Transfer Function.                    |
| TCP  | Transmission Control Protocol.               |
| TEQ  | Time Encoded Quantization.                   |
| UART | Universal Asynchronous Receiver/Transmitter. |
| ULP  | Ultra-Low Power.                             |

USB Universal Serial Bus.



## INTRODUCTION

#### 1.1 Motivation

Technology is ever more present in everyday life. Nowadays, more than ever before, we are dependant on the ease and comfort proportioned by the technological advances that have boomed in the past few years. This dependence tends to become heavier with even more advances and tends to accelerate the development of technology. More so on the side of IoT (Internet of Things) as it is the development of this type of technology which allows us more comfort and connectivity to the world and others.

Also due to the spread and ease of access of DIY development kits and cheap technology, the IoT and home automation industries have seen rapid advances, and have garnered significant interest from the industry to the tech-savvy hobbyists.

In IoT and home automation, there are various areas to take into account when developing a device, or various devices depending on the needs of the user. However, one area that is always needed and of utmost importance is sensors and analysis of the data they gather.

IoT works through sensors, they are the bridge between the physical world and the digital world [1]. Home automation as an application of IoT works in much the same way. With a special focus on the interaction of the user with the home, through interfaces such as *"Alexa"* and other such systems which we will delve into deeper detail further on in this document.

These interactions usually happen through various ways which have been developed to cope with specific necessities and convenience of the user. As is the example of voice recognition systems or software application, in either a mobile phone or computer, even gesture recognition has been attempted to interface with home automation systems [2–4].

So that these systems can work on a single device, we need devices powerful and capable enough to receive and process the information created by the user. It is the case that microcontrollers (MCUs) are just one of the cheapest and supported tools for that end. There is however, the caveat that the cheaper and more powerful options among these devices as is the case of the ESP32 do not have the best on analog to digital converter (ADC) technologies, which are of high importance when you want to have reliable, accurate voice control over a device.

When talking about home automation, due to its many concerns about privacy and security, many, choose to go about it on theirs own terms. Turning to small developers, or trying at their own hand to implement a system in their homes. For these people, securing cheap but quality products is paramount, even at the cost of a full natural language recognition system. For many cases, this might be unnecessary, as simple commands can do the job, while not having to put their private information at risk.

### 1.2 Objectives

This work, aims to develop a cost-effective high resolution external ADC shield for the ESP32 to further improve its voice recognition capabilities and therefore augmenting its versatility and usability. With a special focus on uses such as home automation that can work both over the internet or locally without a connection.

This work's goal is to design a sigma-delta passive modulator, due to its specific characteristics in integrated circuits, to work as a high resolution ADC in conjunction with the ESP32, in which the ESP32 will substitute the the sigma-delta passive modulator's comparator module.

In order to have a proper solution, a succinct market search will be necessary. An appraisal of what is already available, that might compete with our solution to solve domestic implementation of home automation with voice control.

### **1.3 Document Overview**

This document is structured in the following way:

- **Chapter 2** presents a small market search representative of today's mainstream voice recognition devices. Is also presented the State-of-Art technology in voice recognition software, as well as an introduction to the concept of speech acquisition and the function of sigma-delta modulators. The microprocessors with biggest community support are discussed and compared.
- **Chapter 3** describes some accomplished preparatory research into the first steps of development of the proposed project.
- Chapter 4 presents a time table for the work to be developed.

- Chapter 5 Studies the viability of using an ESP32 microprocessor( $\mu P$ ), both per its theoretical capabilities as well as with practical tests.
- **Chapter 6** Summarises this work along with what can be interesting to make a follow-up study, in a future work section.



# STATE OF ART

### 2.1 Voice Recognition Systems

As we have seen before, voice recognition is being more and more developed and sought after, due to the possibilities of convenience and ease of use it can provide when fully matured.

As can be seen in these next few examples the more widespread usage of voice recognition in home automation restricts the user to specific devices or specific platforms such as the HomePod being attached to Siri and Apple compatible smart devices. Though there might be a few devices trying to make their stand, usually, the cost to make an appealing device pushes its price up more than users are willing to spend, besides the compatibility issues that might arise.

From here, we will see some examples of developed systems already available for customers, from simple gadgets to smart home hubs:

#### 2.1.1 Vocca PRO Light



VOCCA Pro, a smart lamp adapter for any standard lamp and socket. Works through Bluetooth 4.0 with a smartphone for voice recognition. It works optimally up to four and a half meters. Priced at around  $70 \in .$  [5] Through the accompanying app it allows the user to set his own trigger words. It does not connect to a wireless network or cloud. [6]

Figure 2.1: VOCCA Pro Source: Reprinted from [5].

### 2.1.2 Honeywell Smart Thermostat



Figure 2.2: Honeywell Wi-Fi Thermostat RTH9590WF1011

Honeywell Smart Thermostat, this

RTH9590WF1011 version of smart thermostat uses "far-field"voice control technology to recognize voice commands over ambient noise. It features a learning algorithm to learn more voice commands as interaction with the user increases. It Maintains all functions that should be expected of a smart thermostat, including connection to the cloud, and encrypted access. Priced at around  $176 \in$ . [7] Which might explain it being discontinued, and substituted by a more affordable option without integrated voice control, compatible with Alexa and/or Siri. This RTH6580WF version, for example, is accompanied by a price tag of  $70 \in$ . Also allowing a

connection to your cloud and Wi-Fi network. [8]

### 2.1.3 Amazon Echo Dot



Figure 2.3: Amazon Echo Dot 3rd Generation

Amazon Echo Dot, the cheapest option in the Amazon family of smart home hubs, the Echo Dot is tagged at 44€ along with the most basic features to function as smart home hub controlled by voice in conjunction with the Alexa app one will need to install on his devices. The Echo Dot needs a connection to the home Wi-Fi, and it is through cloud and Wi-Fi connectivity that the Echo Dot will manage the various smart devices around the house. [9]

### 2.1.4 Apple HomePod



Figure 2.4: Apple HomePod

Apple HomePod, similar in function to Amazon's Echo family the HomePod from Apple is more restricted in its compatibilities with devices to manage and has only one option priced at  $307 \in$ . Employing all the technologies Apple has access to, and like the Echo needs access to Wi-Fi and cloud as it can only be used along with Siri. [10]

As explained previously, both the devices from Amazon and Apple need a connection to their cloud to access the Speech-to-Text software they employ. Then processing the commands of the user with the use of Natural Language processing (NLP).

In reality, the physical devices only process a trigger word, i.e."'*Alexa'* or '*Siri*", and then connect the user to the cloud for the rest of the processing of the commands. This is mainly due to the space this type of software occupies in physical devices, and the processing capabilities they require can not be met by most IoT devices.

Following, table 2.1 illustrates a more straightforward comparison between the discussed examples.

| System                                   | Function                    | Word<br>Recognition | Cloud Processing | Cost |
|------------------------------------------|-----------------------------|---------------------|------------------|------|
| Vocca Pro                                | Smart light bulb<br>adapter | Trigger word No     |                  | 70€  |
| Honeywell<br>Thermostat<br>RTH9590WF1011 | Smart<br>Thermostat         | NLP                 | Yes              | 176€ |
| Amazon Echo<br>Dot                       | Home hub                    | NLP                 | Yes              | 44€  |
| Apple HomePod                            | Home hub                    | NLP                 | Yes              | 307€ |

Table 2.1: System examples comparison

### 2.2 Voice Recognition Software's

Voice recognition technology has already come a long way in its development. Being one of the big focus of the developers of this technology to assure high accuracy, for that is what makes or breaks a good voice recognition software.

As the technology becomes more and more accurate, it will start to have a more widespread impact on our interactions with the digital world, as humans have an easier time speaking more words than they can write in the same time.

Some big companies though, already have very satisfying levels. An example of such companies, we have Baidu, China's search engine has achieved 96% accuracy in low noise environments; Hound from SoundHound company, Siri from Apple and Google Now from Google, are others such that have achieved similar rates of accuracy above 90%. And while some of these might not achieve a rate as high as shown by Baidu's, they compensate by integrating learning algorithms that learn the user's speech specificities such as dialect and way of speaking, the more the user interacts with the program. As is the case of Alexa from Amazon. [11]

These technologies though, are all under restricted access to their parent companies and not accessible to developers and the public. There are, however, already some open source tools to allow developers and hobbyists to use speech recognition in their various applications.

Some platforms, like Mycroft, offer free and reliable tools for wake word detection and speech to text for use in any project a developer might need, it requires, however, further development to be ready for mainstream availability, being more suited for developers and hobbyists. [12]

These tools include mainly a package developed by Carnegie Mellon University, the CMUSphinx, and more specifically the PocketSphinx is a lighter more portable version of their speech recognition engine which bases its technology on a database of sounds-[12], there is also the more recently adopted, Precise, which is based on a neural network to train wake words. [12]

There is also the community-oriented Home Assistant being developed, and while not being a voice recognition software, this technology is encompassed in their work, and because there is a big community feeling, everything is open source and mostly modular. One thing they do try to do differently though is to make sure the home automation system does not need to rely on the cloud to work. As the founder Paulus Schoutsen said, *"The cloud should be treated as an extension to your smart home instead of running it."* [13]

While there are numerous more advanced open-source and restricted speech recognition software, most would not be able to be utilised locally due to their size and computing resource constraints.

#### 2.2.1 Speech Data Acquisition

To capture voice signals, there needs to be a microphone so it can emulate the auditory system, converting pressure waves into a time-varying electrical signal. For this purpose, one can use condenser, electret, dynamic or carbon button microphones, each with their advantages and disadvantages. Typically, for high-quality recordings, condenser microphones are preferred, however, even the cheap electret microphones can deliver a good recording.

Besides the circuit on which a microphone is based, there also needs to be some consideration on the directional pattern one uses, depending on the use the microphone will be given. An omnidirectional microphone will be preferred when trying to pick up sound from all directions, however, it will also pick up more noise than a cardioid microphone which is more precise when the location of the speaker can be guaranteed. [14]

To process the analog signal produced by the microphone, the necessity of sampling and digitization through an analog-to-digital converter (ADC), will be made apparent. These operations will be explained in more detail further in the document.

| Sound Quality Required                        | Bandwidth            | Sampling<br>rate | Number<br>of bits | Data rate<br>(bits/sec) | Comments                                                                         |
|-----------------------------------------------|----------------------|------------------|-------------------|-------------------------|----------------------------------------------------------------------------------|
| High fidelity music<br>(compact disc)         | 5 Hz to<br>20 kHz    | 44.1 kHz         | 16 bit            | 706k                    | Satisfies even the most picky<br>audiophile. Better than<br>human hearing.       |
| Telephone quality speech                      | 200 Hz to<br>3.2 kHz | 8 kHz            | 12 bit            | 96k                     | Good speech quality, but<br>very poor for music.                                 |
| (with companding)                             | 200 Hz to<br>3.2 kHz | 8 kHz            | 8 bit             | 64k                     | Nonlinear ADC reduces the data rate by 50%. A very common technique.             |
| Speech encoded by Linear<br>Predictive Coding | 200 Hz to<br>3.2 kHz | 8 kHz            | 12 bit            | 4k                      | DSP speech compression<br>technique. Very low data<br>rates, poor voice quality. |

Table 2.2: Audio data rate vs. sound quality Source: Reprinted from [15]

To assure a minimum accuracy in our recording of the sound signal, there needs to be a sufficiency of samples. The maximum audio frequency a human can hear is taken to be between 20Hz-20KHz, though this window is shorter (between 200Hz-16KHz) for most people. Due to the Nyquist theorem, any frequency needs double the samples per second to be accurately described. This implies the sampling frequency should be at least 40KHz, for a faithful representation of most sound signals. [15]

Speech recognition in practical systems, start from sampling rates of 8KHz up to 44,1KHz, as illustrated in table 2.2 above, corresponding in use, to telephone quality up to audio CD quality. For very good quality speech recognition, it is sufficient to sample at 16KHz wide band.

After a signal has been sampled it needs to be quantized and turned into digital information, these samples for the aforementioned values, are typically restricted to a finite set of digital values, N = 8 to 16, being more common, and representing high-fidelity at 16bit quantization. [14]

### 2.3 Microcontrollers

This section will discuss, generally, what are some of the more popular and interesting microprocessors ( $\mu$ P) to use in the development of a small cost-effective device with voice recognition capabilities, which creates a certain limiting factor in processing power and cost, of said  $\mu$ Ps.

### 2.3.1 ESP32

This  $\mu$ P was chosen for this project, due to its compromise regarding price and computing power. Being one of the development board SoCs with the highest specifications in its price range, starting at just around 4€ at the time of writing.

This microprocessor was developed by Espressif Systems based out of Shanghai, as a Wi-Fi and Bluetooth/Bluetooth Low Energy (BLE) networking solution which can also run self-contained local applications. Appearing as a successor to the ESP8266, it is more powerful containing a new version of the Tenselica range processors. [16]

Due to the precedent and compatibility, the ESP32 has with the ESP8266, the community has wholly embraced this new more powerful  $\mu$ P, creating the third biggest community following in the microprocessors and DIY world, after Arduino's and Raspberry Pi's.



Figure 2.5: ESP32 Development board DevKitC Source: Reprinted from [17]

### 2.3.2 Arduino

The Ivrea Interaction Design Institute created an open-source electronics platform that was developed as an easy to use hardware and software prototyping tool. Initially aimed at students with no background in electronics, it started to change and adapt as the community around it grew, and it began to be used for more projects that presented new and more challenges. [18]

The ATmega range based boards development owes much to its worldwide community of users, ranging from hobbyists and students to professionals, who all contributed to the increase of accessible knowledge about the various boards that came about, being used in all different types of projects.



Figure 2.6: Arduino Micro development board Source: Reprinted from [19]

Its popularity has much to do with its relative inexpensiveness, with the modules being found online costing from around  $3 \in$  and up. And from  $18 \in$  up on their official website. The Arduino Software integrated development environment (IDE) being beginner-friendly as well as flexible enough for advanced usage, is also a big appeal.

The biggest downside to this  $\mu P$  in this project is the need for further expenses to integrate Wi-Fi support, and not being as powerful a device as the chosen.

### 2.3.3 Raspberry Pi

The Raspberry Pi Foundation, much in the same fashion as the Arduino idea, set out to promote basic computer science, by creating a series of small single-board computers. This single-board computer differs from regular  $\mu$ Ps in not only size, being a credit card sized board, but also on power, seeing as it can be considered a full-fledged computer. [20]

Their cheapest option is an affordable  $5,22 \in$ , cutting expenses in both power and versatility to provide nonetheless a useful learning tool that can be applied to a wide range of small scale projects. [21]



Figure 2.7: Raspberry Pi 2 Source: Reprinted from [22]

### 2.3.4 RDA5981A

Appearing at around the same time as the ESP32 in 2016, it is a competitor of the ESP8266 and not the more powerful ESP32, being based on ARM core technology, it has some features not found in the hardware of the ESP8266 and ESP32. This RDA chip is also maybe the biggest competitor to the ESP in terms of price at just under  $2 \in$  with Wi-Fi integrated. [23] One available development board is illustrated in (Fig. 2.8).

The drawback of this  $\mu$ P, along with most new ones, is exactly the fact that they are new, while not bringing innovation. Due to this, they cannot capture the community following the ESP32 experiences, which makes it more inconvenient to make projects based on these boards.



Figure 2.8: RDA5981a development board Source: Reprinted from [23]

## 2.4 Systems on a chip

In this section, we will discuss the various features and specifications of the devices in study. Due to the wide range of devices on each brand, we will study the most affordable devices on each.

The Raspberry Pi will not be included, as it is not a microprocessor and its price towers above the others under consideration.

### 2.4.1 ESP32

The ESP32 is a dual-core system of two Harvard Architecture Xtensa LX6 CPUs, which are called "PRO\_CPU" and "APP\_CPU", for protocol and application respectively. Even though for most uses they are interchangeable, as they have the same memory accesses. Illustrated in (Fig. 2.9) is the block diagram of the system as according to Espressif, along with the system address mapping in (Fig. 2.10).



Figure 2.9: System structure block diagram Source: Reprinted from [24]

Being a range of four chips, the one in question is the "ESP32-D0WDQ6", featuring ultra-low-power solutions such as fine-grained clock gating, various power modes (standard, sleep, deep-sleep) and dynamic power scaling.

This device's specifications as described by the manufacturer are as follows: 3.3V to operate, dual 160MHz clock speed, 520KB RAM, has thirty-four general-purpose-input/outputs (GPIOs) of which seven are analog to digital, supports Wi-Fi 802.11b/g/n/e/i, Bluetooth low energy, support for up to sixteen concurrent TCP connections, three SPI ports, and two ports each for I2S, I2C and UART. [24]



Figure 2.10: System address mapping Source: Reprinted from [24]

Various modules have been created for the ESP32, the one in question in this project is the "ESP32-DEVKITC" module created by Espressif themselves so they can bring their module to a broader audience, as it is a breadboard-friendly module. [16] This module comes with a micro USB adapter and two buttons, "ENABLE" and "BOOT". These buttons are used to flash or download a new code or application to the board.

Due to its powerful and dual cores, as well as high clock frequency and respective flexibility, this microprocessor is very capable of providing the necessary sampling rates and digitization, as will be explained ahead in this document.

#### 2.4.2 Arduino

The smallest board in the Arduino family is the Arduino Micro, also the cheapest in the official website at  $18 \in$ .

This  $\mu$ P based on the ATmega32U4 was developed along with Adafruit. It has 20 digital I/O pins out of which 7 can be used as PWM outputs and 12 as analog inputs, a 16MHz crystal oscillator, micro USB connection, an ICSP header and reset button. All in a 48mm by 18mm board. [19]

To power up this  $\mu$ P it is recommended to use a source between 7-12V, with a maximum limit supported of 20V and the minimum, 5V.

Atmel's ATmega32U4 which is the basis of this board features a low power CMOS AVR<sup>®</sup> 8-bit microcontroller, with a maximum clock of 16MHz. An ICSP of 32KB, 2.5KB of SRAM and 1KB of EEPROM allows for it to be programmed on the fly. [25]

### 2.4.3 RDA5981A

Developed with IoT, smart home, Wi-Fi speaker/home audio and smartwatch applications as its main focus, this low-power µP provides TCP/IP protocols along with SSL with its fully integrated 2.4GHz 802.11b/g/n MAC/PHY/radio. Contained in a compact 5x5mm<sup>2</sup> QFN package, 0,4mm pitch QFN-40.

The Board is based on an ARM Contex M4 + FPU/MPU core, with a clock frequency of 160MHz. The  $\mu$ P leaves up to 288KB SRAM for the user and uses 160KB SRAM for the Wi-Fi stack and flash cache making a total of 448KB SRAM. Despite that, it also includes 32Mb SPI flash memory and allows for a PSRAM expansion of up to 64M. [23]

On another note, it comes integrated with a USB 2.0 interface, supporting only the FAT file system, and has an SDIO that allows a 256G SD card. To power it up, one would need between 3.0-3.5V.

In the matter of pins, we have two serial pins, two I2S, one I2C, four SPI, eight PWM, two 10-bit ADCs, fourteen GPIOs available for interrupts.

In terms of software, this  $\mu$ P supports MQTT, smartconfig, airkiss, between others, all on a mbed RTOS operating system. [26]

## **2.5** $\Sigma \Delta$ **Modulators**

This section will present a simple and general introduction to the functioning and basic architectures of  $\Sigma\Delta$  modulators ( $\Sigma\Delta$ M), as well as go in more depth on the realization of passive continuous-time  $\Sigma\Delta$ Ms purposed for speech recognition.

### 2.5.1 Basic Operation of an ADC

The fundamental operations an analog-to-digital converter (ADC) utilizes for the digitization of an analog signal are sampling and quantization. These processes transform the signal from continuous-to-discrete, respectively in time and amplitude. These transformations will, therefore, place speed and accuracy limitations on the performance of an ADC.

A general ADC consists of mainly an anti-aliasing filter (AAF), a sampler, a quantizer and a coder [27] as illustrated in (Fig.2.11a).



Figure 2.11: (a) Conceptual block diagram of a generic LP ADC (b) Signal Processing Source: Reprinted from [28]

These blocks work in such a way, as illustrated by (Fig.2.11b), in which the AAF first receives the analog signal,  $x_a(t)$ , to prevent folding or aliasing from high-frequency components to contaminate the bandwidth,  $B_W$ , according to the Nyquist sampling theorem [29]. Afterwards, the signal now band-limited is sampled at  $f_S$  by the sample and hold (S/H) circuit, producing a discrete-time signal,  $x_S(n)$ , following which a quantizer maps its continuous range of amplitudes into a finite number of discrete levels.

Finishing up is the coder that transforms each discrete level into a unique binary code,  $y_d(n)$ . This code represents the digital output of the ADC. [30]

#### **2.5.2** Principles of $\Sigma\Delta$ Modulators

#### 2.5.2.1 Oversampling

According to the Nyquist theorem,  $x_a(t)$ , must be sampled at a minimum of  $f_S = 2B_W$ . While any ADC that follows this relation is called Nyquist-rate ADC, some follow an  $f_S > 2B_W$  relation. These are called *oversampling* ADCs, having an *oversampling ratio* (OSR), such that:

$$OSR = f_S / (2B_W) \tag{2.1}$$

A  $\Sigma\Delta M$  is an example of such an ADC. Due to oversampling the sampling frequency, the design of the AAF can be simplified, by not needing a high-order filter to achieve anti-aliasing, as illustrated in (Fig.2.12).



Figure 2.12: Representation of anti-aliasing filtering for (a) nyquist-rate (b) oversampling ADC

Source: Reprinted from [31]

#### 2.5.2.2 Quantization

By mapping continuous-value levels into discrete levels, the quality of the input signal is degraded. This transformation in amplitude generates an error, the *quantization error*. This causes a loss of resolution due to the non-reversibility of the quantization operation, as opposed to the sampling operation. [29]

A quantizer outputs a digital signal with  $2^B$  discrete levels, which depends on its *B*-bit resolution. The ideal operation of a quantizer is illustrated in (Fig.2.13).



Figure 2.13: Quantization Process (a) Ideal characteristic of a quantizer (b) Quantization error

Source: Reprinted from [28].

The quantization step,  $\Delta$ , can be calculated as  $\Delta = Y_{FS}/(2^B - 1)$ , and the quantization error, as seen in (Fig. 2.14b), is limited to  $\pm \Delta/2$  with a corresponding average of zero. [32] This allows for the assumption that the quantization error, e(n), is distributed across the range  $X_{FS}$  uniformly, with a rectangular probability density, [30] making a similar power spectral density, as shown in (Fig. 2.14).

The power of the quantization error can thus be:

$$e^{2} = \int S_{E}(f)df = S_{E} \int_{-f_{S}/2}^{+f_{S}/2} df = \frac{\Delta^{2}}{12}$$
(2.2)

As such, the power spectral density (PSD) in this range is:

$$S_E = \frac{e^2}{f_S} = \frac{\Delta^2}{12 * f_S}$$
(2.3)

Therefore, the in-band quantization noise power can be calculated by:

$$P_Q = \int_{-B_W}^{+B_W} S_E df = \frac{\Delta^2}{12 * OSR}$$
(2.4)

Theoretically, and according to [28, 30, 32, 33], by doubling OSR the  $P_Q$  decreases by 3dB, and the larger the OSR the smaller the in-band noise power.

By applying a filter to the quantization noise in such a way that most of the power is kept outside the signal band, the accuracy of an oversampling ADC can be improved. [30]



Figure 2.14: Quantization white noise (a) probability density function (b) power spectral density

Source: Reprinted from [28]

#### 2.5.2.3 Basic Architecture

 $\Sigma\Delta$ Ms being an *oversampling* converter, along with the  $\Delta$ modulator, have the advantage of a feedback loop. The modulator is composed of a difference amplifier, ( $\Delta$ ), an integrator, ( $\Sigma$ ), a comparator and a DAC looping to the  $\Delta$ , as illustrated in a first-order modulator with (Fig. 2.15).



Figure 2.15: Time domain representation of first order  $\Sigma\Delta$  modulator Source: Reprinted from [34]



Figure 2.16: Frequency-domain representation, the relation of noise characteristic to the operation of  $\Sigma\Delta Ms$ 

Source: Reprinted from [34]

As can be observed in (Fig. 2.16), through the frequency domain representation, the system can be represented in function of the noise transfer function (NTF) and the signal transfer function (STF), by:

$$D_{OUT} = V_{IN}(f) * STF(f) + e(n) * NTF(f)$$
(2.5)

If the Z-transform is applied, there results:

$$Y(z) = X(z) * STF(z) + E(z) * NTF(z)$$
(2.6)

The respective transfer functions are given by:

$$STF(z) = \frac{H(z)}{1+H(z)}; NTF(z) = \frac{1}{1+H(z)}$$
 (2.7)

Ideally, a DC signal would have the quantization noise cancelled, in which case

$$NTF(z) = (1 - z^{-1})^L$$
(2.8)

L, representing the order of the filter. Therefore the power quantization noise can be seen as a white noise source, such that [32]:

$$P_Q = \int_{-B_W}^{+B_W} \frac{\Delta^2}{12 * F_S} * |NTF(f)|^2 df = \frac{\Delta^2}{12} * \frac{\pi^{2L}}{(2 * L + 1) * OSR^{2*L+1}}$$
(2.9)

Through noise shaping, then, the noise is *sent* outside the in-band  $(f_0)$ , as seen in (Fig. 2.17)



Figure 2.17: Oversampling results in quantization noise shaping Source: Reprinted from [31]

After going through these processes, by measuring the influence of noise in the signal, the performance of the modulator can be ascertained. This can be done through the signal-to-noise ratio (SNR), [28, 32] where considering a sinusoid signal input of amplitude A,

$$SNR = \frac{A^2}{2*P_Q} \tag{2.10}$$

Another important performance metric is the dynamic range (DR) at the output of the modulator, representing the ratio of the output power at the frequency of an input sinusoid with maximum amplitude ( $A_{MAX}$ ) to the output power for a small input amplitude for which SNR = 0 dB. [28]

#### 2.5.2.4 Decimation Filters

After the input signal passes through the modulator, the resulting bitstream is output to a digital decimator filter which will average and downsample the stream. [31] Therefore producing an n-bit sample at the nyquist frequency, resulting in a digital signal that corresponds to direct sampling of the original waveform. [15]

This operation of averaging and *decimating* has an effect similar to a low-pass filter of the signal in the frequency domain, from which results attenuation of quantization noise and removes aliases in the band of interest. [34]

Typically, decimation filters are implemented in hardware along with the modulator. [34] Due to dealing with digital data, one can emulate the decimation filter with a processor, such as the ESP32 microprocessor. In this case, the only part of the converter that needs to be implemented is the modulator, reducing costs at no expense to performance.

#### **2.5.3** $\Sigma \Delta \mathbf{M}$ Loop Filter

There are a variety of ways to implement the loop filter in a  $\Sigma\Delta M$ , such as:

- Switched-Capacitor (SC)-ΣΔMs;
- Continuous-Time (CT)- $\Sigma\Delta Ms$ ;
- Time encoded quantization (TEQ)- $\Sigma\Delta Ms$ ;
- Re-configurable  $\Sigma \Delta Ms$ , either SC or CT;
- Passive-ΣΔMs;
- Hybrid  $\Sigma\Delta$ Ms, these being generally a match between different architectures.

One can also separate the various  $\Sigma\Delta Ms$  in three categories, based on the implementation of the integrators. These being, active, hybrid or passive integrator  $\Sigma\Delta Ms$ .

#### **2.5.3.1** Active Integrator $\Sigma \Delta Ms$

Traditionally,  $\Sigma\Delta Ms$  use amplifiers for the integrator in the same number as the order of the modulator. There are, however, some techniques which allow the use of fewer amplifiers, such as presented by Matsukawa [35].

Amplifiers are very power demanding, as such, inverter-based operational transconductance amplifiers (OTAs) can be used for better energy efficiency. Using amplifiers allows better control of the loop gain in the modulator, there is though, more restrictions in the power consumption, efficiency, size and design and circuit complexity of the modulators.

#### 2.5.3.2 Hybrid Integrator $\Sigma \Delta Ms$

As an approach to minimize power consumption and achieve higher energy efficiency, reducing the number of amplifiers and implementing passive integrators such as RC filters, allows to achieve not only that, as to also increase the modulator's linearity. By using less active components, the loop gain becomes more concentrated on the comparator, this gain is defined by the root-square-mean (rms) value ratio between the comparator's output and input.

#### **2.5.3.3** Passive Integrator $\Sigma \Delta Ms$

Removing the utilization of amplifiers completely, the integrators are implemented by switched-capacitors or RC filters. This approach causes the loop gain to be entirely focused on the comparator, making a good comparator design imperative. In addition due to not using amplifiers, the circuit design can be simplified and the circuit area reduced. This modulators generally have high energy efficiency and low power dissipation. Due to the nature of the integrators, they can be relatively easier to manipulate for a stable loop.

A **passive discrete time (DT)**  $\Sigma \Delta \mathbf{M}$  utilizes switched-capacitors (SC) to implement the integrator, and due to their naturally discrete-time functionality, they also take the role of sample and hold in the entry of the circuit.

These implementations allow to more easily utilize a multi-bit system which allows a lower sampling frequency and helps reducing power dissipation. They have the sampling frequency restricted by the quantization regeneration time and the feedback loop update rate and are generally resistant to clock jitter noise.

According to *de la Rosa* [36], SC modulators are favoured when working with bandwidths lower than 100KHz and DRs higher than 14bits.

A passive continuous-time (CT)  $\Sigma \Delta M$  generally uses a low-pass RC filter as integrator, while in some cases where the bandwidth is desired around a central frequency a band-pass filter can be used instead while maintaining the same characteristics and function.

These modulators are more linear due to the utilization of the filters, being, therefore, more important to use single bit comparators and DACs. Due to the noise shaping abilities of the loop, they don't need an anti-aliasing filter. They can also work with very high sampling frequencies while being more sensitive to clock jitter noise as the signal enters the discrete part of the circuit.

CT modulators are favoured by their energy efficiency, high stability and ability to be scaled down to a small area. According to *de la Rosa* [36], CT modulators are favoured when working with bandwidths higher than 1MHz and DRs lower than 15bits.

Analyzing the passive RC integrator illustrated in (Fig. 2.18) results in the following transfer function:



Figure 2.18: RC passive integrator Source: Reprinted from [37]

$$H(s) = \frac{V_{out_x}}{V_{in_x}} = \frac{1}{s * R_1 * C_1 + 1}$$
(2.11)

As seen from expression 2.11, the RC integrator is a low-pass filter with pole frequency larger than 0Hz, as would be expected, since an ideal integrator is impossible to build.

From further analysis according to [37, 38], due to the attenuated output of the passive filter, the gain of the comparator will need to be increased accordingly, as to allow the modulator to have the projected loop gain.

There have been successful attempts at implementations based on this design with [37–40], such as illustrated in (Fig. 2.19).



Figure 2.19:  $2^{nd}$  order  $\Sigma \Delta M$  with feedforward structure (a) block diagram (b) circuit representation

Source: Reprinted from [37].

A simplified and illustrative comparison is made in table 2.3 between CT and DT passive  $\Sigma\Delta$ Ms. In this work, the CT passive  $\Sigma\Delta$ M was chosen to be implemented, mainly due to its low cost and simple design.

|                      | СТ                                                           | DT                                                                              |  |
|----------------------|--------------------------------------------------------------|---------------------------------------------------------------------------------|--|
| Clock jitter noise   | Sensitive                                                    | Resistant                                                                       |  |
| Anti-aliasing Filter | Unnecessary due to the<br>loop's noise-shaping<br>capability |                                                                                 |  |
| Price                | Very inexpensive                                             | Inexpensive                                                                     |  |
| Energy efficiency    | High                                                         | High                                                                            |  |
| Circuit Design       | Simple and of small print                                    | $\begin{array}{ l l l l l l l l l l l l l l l l l l l$                          |  |
| Sampling Frequency   | Very high                                                    | Restricted by feedback<br>loop update and<br>quantization regeneration<br>times |  |

Table 2.3: Passive  $\Sigma \Delta Ms$ : CT VS DT



## ARCHITECTURE DEVELOPMENT

This chapter presents the solution proposed to complete the objectives defined in 1.2. Here will be presented the preparatory study for the implementation of the  $\Sigma\Delta$  modulator.

## 3.1 Circuit Design

### 3.1.1 Theoretical analysis

The proposed solution, as presented in figure 3.1, is composed of a passive loop filter connected to a microprocessor( $\mu$ P) by GPIOs which will function as the comparator and DAC. The  $\mu$ P will then be responsible for maintaining the closed loop and retrieving, decimating and processing the signal.



## **3.1.2** $\Sigma \Delta \ 1^{st}$ Order Modulator

First, to better understand how this process works, a 1<sup>st</sup> order modulator is analyzed.



Figure 3.2:  $1^{st}$  order  $\Sigma\Delta$  modulator Block Diagram

Deducing eq. 3.1 by examining fig. 3.2, it should be noticed the substantial impact the thermal noise and comparator noise will have on the circuit.

$$Dout \approx \frac{V_{in} * |H_1(s)| * G_C}{1 + |H_1(s)| * G_C} + \frac{E_{TN} * G_C}{1 + |H_1(s)| * G_C} + \frac{E_C * G_C}{1 + |H_1(s)| * G_C} + \frac{E_Q}{1 + |H_1(s)| * G_C}$$
(3.1)

Due to the nature of the passive filter  $|H_1(s)| \approx 1$ , the input signal at the comparator is rather small, the thermal noise will significantly limit the SNR of the modulator, along with the comparator noise due to having similar weight to the output like the input signal.

The quantization noise will, however, be suppressed by the comparator gain, being therefore of low importance to the circuit.



Figure 3.3:  $1^{st}$  order  $\Sigma\Delta$  modulator Simulation Schematic

The passive, continuous-time  $1^{st}$  order  $\Sigma\Delta$  modulator starts with designing a lowpass RC filter. This will be the integrator as represented by  $H_1(s)$  in Fig. 3.2 and the RC circuit in Fig. 3.3.

The filter is designed for an amplitude of 40dB, as that will more easily allow achieving 86dB when adding the second integrator in cascade to model a  $2^{nd}$  order modulator later. Having the filter amplitude and considering a sampling frequency of 5MHz, it is now possible to calculate the necessary values for the passive components  $R_{eq}$  and C through the use of eq. 3.2

$$\frac{1}{A} = \frac{1}{2\pi * R_{eq}C + 1}$$
(3.2)

Fixating the value of C at 1nF and using a system of equations with the voltage divider eq. 3.3 and resistors parallel eq. 3.4 the initial values of R1 and R2 are calculated.

$$0 = 1.5 * \frac{R2}{R1 + R2} + 1.5 * \frac{R1}{R1 + R2}$$
(3.3)

$$R_{eq} = \frac{R1 + R2}{R1 * R2} \tag{3.4}$$

For the high-level simulation, the comparator will be an inverter followed by a D flip-flop as presented in Fig. 3.3. As this will emulate the GPIOs of the  $\mu$ P.

Not having achieved the expected results from the simulation with the calculated values, adjustments were made, taking into account the relation between the resistors and considering the already published circuit in [39] as represented in table 3.1.

| Iterations            | $R1[\Omega]$ | $R2[\Omega]$ | SNDR[dB] |
|-----------------------|--------------|--------------|----------|
| 1 <sup>st</sup> value | 165.5        | 146.6        | 19       |
| 2 <sup>nd</sup> value | 9.3k         | 1.1k         | 3        |
| 3 <sup>rd</sup> value | 3.1M         | 8.86k        | Undef.   |
| 4 <sup>th</sup> value | 465k         | 400k         | Undef.   |
| Final value           | 930k         | 3.72M        | 46       |

Table 3.1: Resistors values

The final values of R1 and R2 accomplish the design of this modulator as shown in figure 3.4, having achieved a peak signal-to-noise-plus-distortion ratio (SNDR) of 46dB, which is substantially less than the theoretical result of 71dB.



Figure 3.4:  $1^{st}$  order  $\Sigma\Delta$  modulator FFT in *LTspice VII* 

## 3.2 Simulation programs

The simulations were run on the simulation program LTspiceVII, due to its simplicity of creating a circuit and running its spice simulation. It should be taken note that all simulations were extremely slow and processor taxing which shouldn't, as the circuit is rather simple. The few reasons for which this might have happened were too much strain due to the high sampling frequency requested, as well as lack of optimisation on the part of the inverter and D flip-flop modules.

For a more explicit presentation of the problem figures 3.5, 3.6 and table 3.2 show the fast Fourier transforms (FFTs) and SNDR for the same circuit, in the two simulation software, for different input parameters. Through analysis of the figures and table it can be observed the software only coincide when the circuit is simulated at saturation with 0dBV input signal amplitude. This work presents only one example corresponding to the best FFT of *Ngspice*, as other input variations on *LTspice VII* resulted equally to figure 3.6a.



Figure 3.5:  $2^{nd}$  Order  $\Sigma\Delta$  SE Modulator FFT with 0dBV Input Signal Amplitude (a) *LTspice VII* (b) *Ngspice* 



Figure 3.6:  $2^{nd}$  Order  $\Sigma \Delta$  SE Modulator FFT with -13dBV Input Signal Amplitude (a) *LTspice VII* (b) *Ngspice* 

| Input Signal<br>Amplitude[dBV] | LTspice VII<br>SNDR[dB] | Ngspice<br>SNDR[dB] |
|--------------------------------|-------------------------|---------------------|
| -20                            | Undef.                  | 82.9                |
| -13                            | Undef.                  | 86.9                |
| -10                            | Undef.                  | 85.3                |
| 0                              | 9.5                     | 9.5                 |

Table 3.2: Simulation Comparison: LTspice VII VS Ngspice

CHAPTER

# $\Sigma\Delta$ 2<sup>nd</sup> Order Modulator Development

To achieve an SNDR of 86dB it is necessary to decrease the quantization noise, which depends on the loop gain.

## **4.1** $\Sigma \Delta 2^{nd}$ Order Modulator Single Ended

The loop gain inside the signal band for the filter is given by eq. 4.1

$$|H(f_{band})| * K_G = \frac{|H(f_{band})|}{|H(f_s/2)|} \approx \frac{f_s}{2 * f_p}$$
(4.1)

where  $f_s$  is the sampling frequency and  $f_p$  is the pole frequency. This expression shows that either by increasing  $f_s$  or decreasing  $f_p$  the quantization noise in the signal band can be suppressed. Due to the goal of the ADC to be a high-resolution voice audio receiver, the pole frequency is set at 10KHz, which is around the middle of the audible spectrum, and the sampling frequency is increased to 8MHz.



Figure 4.1:  $2^{nd}$  Order  $\Sigma\Delta$  Modulator Block Diagram

Following on the work presented in [37], the order of the modulator is increased by adding an integrator to the loop and a zero to maintain the stability of the closed loop.



Figure 4.2: 2<sup>nd</sup> Integrator with Zero Block Diagram

Source: Reprinted from [37].

This further ensures an increase in the SNDR. The block diagram of the proposed circuit is presented in Fig. 4.1 and the feed-forward structure is presented in Fig. 4.2.

### 4.1.1 Theoretical analysis and calculations



Figure 4.3:  $2^{nd}$  Order  $\Sigma\Delta$  Modulator Circuit Schematic

By analysing the diagram presented in Fig. 4.1 the transfer function can be deduced to be as shown in eq. 4.2 from which the poles are deduced in eq. 4.3 and eq. 4.4, and the zero, in eq. 4.5.

$$H(s) = \frac{C2R1R5s + R1}{C1R1R2s(C2s(R4 + R5) + 1) + C2R1s(R2 + R4 + R5) + C2R2s(R4 + R5) + R1 + R2}$$
(4.2)

 $s \rightarrow -\frac{\sqrt{(C1R1R2+C2R1(R2+R4+R5)+C2R2(R4+R5))^2-4C1C2R1R2(R1+R2)(R4+R5)}+C1R1R2+C2R1R2+C2R1R4+C2R1R5+C2R2R4+C2R2R5)}{2C1C2R1R2(R4+R5)}+C1R1R2+C2R1R2+C2R1R4+C2R1R5+C2R2R4+C2R2R5)}$ 

(4.3)



$$s \to -\frac{1}{C2R5}$$
 (4.5)

Due to the complexity of the transfer function represented by equation 4.2, a simplified version, eq. 4.6, was used for the calculation of the noise transfer function(eq. 4.10).

$$H(s) = \frac{K_0 V_{ref}(\frac{s}{z_1} + 1)}{(\frac{s}{p_1} + 1)(\frac{s}{p_2} + 1)}$$
(4.6)

Starting by converting the transfer function(TF) from Laplace domain to time domain and afterwards to discrete-time, so we can apply the z-transform and arrive at expression 4.7 which represents the transfer function in z domain.

$$H(z) = \frac{K_0 V_{ref}((p_1 - p_2)z_1 + e^{Tclk(p_1 + p_2)}z(p_1 - p_2)z_1 + e^{Tclkp_1}((-1 + z)p_1p_2 - zp_1z_1 + p_2z_1) + e^{Tclkp_2}(zp_2z_1 - p_1((-1 + z)p_2 + z_1)))}{(-1 + e^{Tclkp_2}z)^2(p_1 - p_2)z_1}$$

$$(4.7)$$

As the z domain transfer function has been deduced, it is now possible to conclude the noise transfer function(4.10) by using the comparator gain equation 4.8 and the closed loop expression 4.9.

$$K_G = \frac{1}{|H(e^{\frac{j*2\pi}{4}})|}$$
(4.8)

$$Dout = V_{NQ} - Dout * H(z) * K_G$$
(4.9)



Fixing the value for both capacitors(C1 and C2) and the circuit entry resistor(R1) as presented in the circuit electrical schematic in Fig. 4.3 and adjusting the poles and zero frequencies, it becomes possible to solve the system of equations 4.3, 4.4 and 4.5, calculating the remaining component's values. After the necessary adjustments to achieve an SNDR of at least 86dB and to minimize the necessary comparator gain, the final values for the components and the poles and zero frequencies are presented in table 4.1.

| Components                                                | Value            | Poles & Zero         | Frequency[Hz] |
|-----------------------------------------------------------|------------------|----------------------|---------------|
| $ \begin{array}{c} R1[\Omega] \\ R2[\Omega] \end{array} $ | 700<br>2.01K     | 1 <sup>st</sup> pole | -22.9K        |
| $\begin{array}{c} R3[\Omega] \\ R4[\Omega] \end{array}$   | 219.52K<br>6.25K | 2 <sup>nd</sup> pole | -20.7K        |
| C1[F]<br>C2[F]                                            | 90n<br>200p      | zero                 | -800K         |

Table 4.1: Circuit and Frequency values

Once all components have been calculated, a theoretical simulation was made to verify the calculations as is presented in the amplitude bode diagram of figure 4.4. Here the calculated theoretical value of SNDR is 95dB



### 4.1.2 Electrical Simulation

To proceed with the electrical simulation, the open-source program, *Ngspice*, was utilized in expectation of better performance than the previously used *LTspice VII*.

The electrical circuit to simulate is the one represented in figure 4.3. The input signal uses a frequency of 2.1KHz with an amplitude of -13dBV or 0.224V, for all *Ngspice* simulations. The simulations were made with noise at the input, showing remarkable results as presented in the SINAD and Fourier transform of the results, respectively figures 4.5 and 4.6. Achieving both the projected SNDR and demonstrating a good noise resistance at the input.



Figure 4.5:  $2^{nd}$  Order  $\Sigma\Delta$  Single-ended Modulator SINAD



Figure 4.6:  $2^{nd}$  Order  $\Sigma\Delta$  Single-ended Modulator FFT in Ngspice

The simulation for the differential circuit having failed to run in *Ngspice*, a second simulation was then run in *Cadence Virtuoso*. This time showing substantially more conservative results. [*Ngspice* SNDR: 86.9dB VS *Virtuoso* SNDR: 83.5dB]





While the SNDR achieved in the new simulation does not reach the projected value, the differential structure can bridge the gap missing to achieve 86dB SNDR. It is now expected, though, through analysing figure 4.8, which presents Transient noise simulations for different maximum noise frequencies, that the circuit will be considerably more sensitive to noise at the input than the previous *Ngspice* simulation made believe.



Figure 4.8:  $2^{nd}$  Order  $\Sigma\Delta$  SE Modulator Noise FFT in *Virtuoso* (a) Noise < 16MHz (b) Noise < 32MHz (c) Noise < 80MHz

## **4.2** $\Sigma \Delta 2^{nd}$ Order Differential Modulator

A differential structure is used to minimize error and noise interference at the input and to maximize the SNDR. The fully differential circuit to simulate will be the same as previously studied, as shown in figure 4.9.



Figure 4.9:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Electrical Schematic

Through the SINAD, it is concluded the best result is obtained for -2.5dBV of input signal amplitude, as such, all following simulation of the differential circuit will have this input signal amplitude, while keeping the previous frequency of 2.1KHz.

Table 4.2: Theoretical Results

| SNDR[dB] | DR[dB] |
|----------|--------|
| 95       | 98     |

The SINAD in figure 4.10 and fast fourier transform(FFT) in figure 4.11 both confirm



Figure 4.10:  $2^{nd}$  Order  $\Sigma \Delta$  Differential Modulator SINAD in *Virtuoso* 



Figure 4.11:  $2^{nd}$  Order  $\Sigma \Delta$  Differential Modulator Noiseless FFT in Virtuoso

the advantage of utilizing a differential circuit to maximize the SNDR, achieving simulation results of 99.3dB SNDR. This one is higher than the theoretical value presented in table 4.2.

This is good to accommodate any impact that noise might have on the circuit.

A transient noise simulation in figure 4.12 shows clearly how even though the circuit is highly sensitive to noise, due to the maximization of the SNDR, the results become more acceptable than previously with the single-ended circuit.



Figure 4.12:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Noise FFT in *Virtuoso* (a) Noise < 16MHz (b) Noise < 32MHz (c) Noise < 80MHz

It is also important to study the effect of small variances in the components of the circuit. For this purpose a simulation was run for a 10% and 20% variance on each electrical component of the circuit.



Figure 4.13:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Capacitors 10% Variation FFT in *Virtuoso* (a) C1 (b) C2

Through the analysis of figures 4.13 to 4.16 and table 4.3, it can be concluded that for the usual components tolerance (5% for resistors and 15% for capacitors) there will not exist problems for the circuit, as it proves to be sufficiently resistant against these variations.



Figure 4.14:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Resistors 10% Variation FFT in *Virtuoso* (a) R1 (b) R2 (c) R3 (d) R4



Figure 4.15:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Capacitors 20% Variation FFT in *Virtuoso* (a) C1 (b) C2

The comparator's offset is the difference between the reference voltage in each of the comparators as seen in figure 4.9. With a nominal voltage of 1.5V each, a variation simulation was made at 10% of the nominal value, equating to 200mV, adding 10% to the reference of the positive branch's comparator and subtracting from the negative branch's comparator.

The circuit shows high sensitivity to a 10% variation, as it causes the SNDR to drop very significantly from the nominal 99.3dB to 78.3dB as presented by figure 4.17a.



Figure 4.16:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Resistors 20% Variation FFT in *Virtuoso* (a) R1 (b) R2 (c) R3 (d) R4

A second simulation was ran following the same characteristics as before, except for the value of variation. At 7% variation, or 100mV, the circuit still keeps its function without problem as seen in figure 4.17b.

As a variation higher than 100mV is not likely to happen in the ESP32's GPIO's digital buffers' threshold voltage, it can be expected for this offset to not become a problem.

Table 4.3 illustrates the effects of variation of the components and the comparator's offset in the circuit, by comparing with the nominal simulated value of the circuit's SNDR.



Figure 4.17:  $2^{nd}$  Order  $\Sigma\Delta$  Differential Modulator Comparator Offset Variation FFT in *Virtuoso* (a) 10% (200mV) (b) 7% (100mV)

|                   | 10% Variation[dB]                                  | 20% Variation[dB] |  |
|-------------------|----------------------------------------------------|-------------------|--|
| C1                | 98.4                                               | 86.1              |  |
| C2                | 99.5                                               | 86.1              |  |
| R1                | 99.3                                               | 86.1              |  |
| R2                | 97.9                                               | 85.8              |  |
| R3                | 98.9                                               | 86.2              |  |
| R4                | 99.0                                               | 86.1              |  |
| Nominal SNDR[dB]  | 99.3                                               |                   |  |
|                   | 100mV (7%) Variation[dB] 200mV (10%) Variation[dB] |                   |  |
| Comparator Offset | 96.3                                               | 78.3              |  |

Table 4.3: Result of Component Variation

## **ESP32** Development & Implementation

After the circuit is studied and simulated, this chapter presents the study for the implementation of the circuit with the chosen  $\mu$ P.

## 5.1 Viability Study for ESP32

### 5.1.1 Theoretical Viability Study

To know whether the ESP32 will be able to take on the work of digital decimation filter and DAC, there needs to be a study on the speed of CPU operations.

According to community benchmarks [41], at 240MHz clock frequency, or one clock every 4,17ns, the ESP32 can do one integer addition operation in 1.6ns, and one integer multiplication operation in 23.4ns. This corresponds to one clock cycle per addition operation and six clock cycles per multiplication operation.

It needs to be considered that this test was at full power without interference. For a better understanding of how the processing of the digital output from the  $\Sigma\Delta M$  should be done, and considering the ultra low power capabilities (ULP) of the ESP32 more testing should be done with the ULP co-processor, which has a frequency of 8MHz to 40MHz.

In order to achieve the best optimization of quality, cost and power-efficiency, it is important to know the range of specifications of the  $\Sigma\Delta M$  ADC

According to the table 2.2 in page 9, to which can be added the limitation range of DR and values per sample, [74dB - 98dB] and [4096 - 65536] respectively, a compromise can then be made after the prioritization of specifications to consider.

A fundamental task the esp32 must accomplish is the feedback of the circuit, this would require to synchronize both the GPIOs connected to the filters, as shown previously in figure 3.1. The way to do this is through the use of either external or software interrupts,

however, as the interrupts take 2.5µs to solve it is too slow to maintain the projected 8MHz sampling frequency.

#### 5.1.2 Practical Viability Study

An issue came up with the initially chosen programming environment, Arduino IDE. The GPIOs could not transmit a signal with a frequency higher than 1MHz, due to the way the environment itself programs the low-level code, to guarantee compatibility between C++, its programming language, and C, the language in which the esp-IDF is programmed. To solve this, another programming environment was used, an extension of *Visual Studio Code*, PIO. This allowed the integration of both Arduino and esp-IDF libraries, independently from which programming language was chosen.

Regardless of the inability to utilize interrupts to keep synchronous action between the GPIOs, due to the high sampling frequency, there is a possibility the  $\mu$ P can keep the signal flowing well on its own. Therefore, a simple test was made, which consisted of generating an external square signal to enter through one GPIO and exit through one different one, where it would be read by an oscilloscope. From the oscilloscope results presented in figures 5.1, 5.2, 5.3 can be ascertained that the processor fails to guarantee the stability and integrity of the signal at the projected sampling frequency(8MHz). It is also of note there was a need of at least 2V amplitude in the input signal for the output GPIO to have a signal recognizable by the oscilloscope.

It should be observed that each input/output pair in figures 5.1, 5.2, 5.3 is in the same time scale and all three of the figures have the same voltage scale.





As the use of the GPIOs was deemed incapable to accomplish the objective, by using the incorporated ADC and DAC pins of the ESP32 and slowing the sampling frequency to 20KHz the previous experiment was remade, only to show similar results.



Figure 5.2: ESP32 GPIO Loop at 80KHz (a) Input (b) Output



Figure 5.3: ESP32 GPIO Loop at 8MHz (a) Input (b) Output

СНАРТЕК

## SUMMARY & FUTURE WORK

## 6.1 Summary

In this work, a  $2^{nd}$  order CT  $\Sigma\Delta$  modulator employing passive RC integrators with at least 86dB SNDR was proposed. A differential pair structure maximized the loop gain and minimized input noise, to try and minimize the necessary comparator gain. The circuit was optimized to keep the loop gain as independent from the comparator's as possible. Nevertheless, due to the high attenuation of the signal by the passive integrators the comparator's noise substantially limits the gain.

A performance comparison was registered between various spice simulation software, observing limitations on high-frequency simulations with the increasing complexity of circuitry, as is illustrated in table 6.1.

| Circuit                                | LTspice VII                                               | Ngspice                                                                   | Cadence Virtuoso              |
|----------------------------------------|-----------------------------------------------------------|---------------------------------------------------------------------------|-------------------------------|
| $1^{st}$ Order $\Sigma\Delta M$        | Simulation ran as expected                                | _                                                                         | _                             |
| $2^{nd}$ Order $\Sigma \Delta M$ SE    | Simulation ran for<br>too long with<br>unexpected results | Simulation ran as<br>expected                                             | _                             |
| $2^{nd}$ Order $\Sigma \Delta M$ Diff. |                                                           | Would not run the<br>simulation, giving<br>error of data size<br>overflow | Simulation ran as<br>expected |

Table 6.1: Simulation Programs Comparison

A study on the possibility of utilizing an ESP32  $\mu$ P to complete the circuit by substituting on the task of comparator was done, both by studying technical capabilities and by testing them. Due to hardware limitations of the ESP32, it was not able to work according to the required high-frequency, thus being deemed inappropriate to substitute the comparator block in a CT passive  $\Sigma\Delta M$ .

## 6.2 Future Work

Despite the circuit not being viable to work in the desired fashion with the ESP32, it might be possible to substitute the comparator with a  $\mu$ P that can overcome these hardware limitations in the GPIOs.

It can be interesting to study optimization of the circuit with different approaches in which the sampling frequency can be reduced, making the circuit more compatible with the use of  $\mu$ P in exchange of a comparator. It is also important to verify approaches which cause less attenuation to the input signal of the comparator block and therefore make more resistant to the comparator's noise.

Other implementations of passive  $\Sigma \Delta M$  might also cover some of the limitations noticed along with this work, such as the use of SC filters and using multi-bit techniques, which would allow to use a significantly lower sampling frequency and keep the resolution of the modulator intact.

## BIBLIOGRAPHY

- [1] M. Conner. "Sensors empower the 'Internet of Things'." In: EDN (2010), pp. 31–37.
- [2] M. A. A. Ahasan. "Smart Home Automation Based on Voice Command Using Smart Phone." In: *International Journal of Advances in Computer and Electronics Engineering* 2.2 (2017), pp. 34–40.
- [3] N. Prathima, P Sai Kumar, S. L. Ahmed, and G. Chakradhar. "Voice Recognition Based Home Automation System for Paralyzed People." In: *International Journal for Modern Trends in Science and Technology* 03.02 (2017), pp. 101–106. URL: http: //ijmtst.com/ncee2017/23.NCEE{\\_}24.pdf.
- [4] D. Purohit and M. Ghosh. "Challenges and Types of Home Automation Systems." In: *International Journal of Computer Science and Mobile Computing* 6.4 (2017), pp. 369– 375.
- [5] Amazon. Vocca Pro Light Smart Lights Voice Activated Lights Adapter Record Your Own Voice Triggers - - Amazon.com. URL: https://www.amazon.com/Vocca-Pro-Light-Activated-Triggers/dp/B0180S3P68 (visited on 02/11/2019).
- [6] Verbal Machines. VPRO Vocca Pro Voice Activated Bulb Adapter User Manual Vocca Pro - Users Manual Activocal LTD. URL: https://fccid.io/2ADI5-VPRO/User-Manual/User-Manual-2547185 (visited on 02/11/2019).
- [7] Honeywell. Honeywell Wifi Smart Thermostat with Voice Control RTH9590WF | Honeywell Store. URL: https://www.honeywellstore.com/store/products/wifi-smart-thermostat-with-voice-rth9590wf1003u.htm (visited on 02/05/2019).
- [8] Amazon. Honeywell Wi-Fi 7-Day Programmable Thermostat (RTH6580WF), Requires C Wire, Works with Alexa - - Amazon.com. URL: https://www.amazon.com/ Honeywell-Programmable-Thermostat-RTH6580WF-Requires/dp/B00Y6M20UC/ ref=sr{\\_}1{\\_}3?ie=UTF8{\&}qid=1549912687{\&}sr=8-3{\&}keywords= honeywell+thermostat (visited on 02/11/2019).
- [9] Amazon. Echo Dot (3rd Generation) | Alexa-enabled Bluetooth Speaker. URL: https: //www.amazon.com/All-new-Echo-Dot-3rd-Gen/dp/B0792KTHKJ/ref= sr{\\_}1{\\_}cc{\\_}1?s=aps{\&}ie=UTF8{\&}qid=1549374695{\&}sr=1-1catcorr{\&}keywords=echo+dot+3rd+generation (visited on 02/05/2019).

- [10] Apple. HomePod Apple. URL: https://www.apple.com/homepod/ (visited on 02/05/2019).
- [11] K. J. Ryan. 7 of the World's Most Accurate Voice Recognition Systems | Inc.com. 2016. URL: https://www.inc.com/kevin-j-ryan/internet-trends-7-mostaccurate-word-recognition-platforms.html (visited on 02/05/2019).
- [12] Mycroft. About Mycroft Mycroft. URL: https://mycroft.ai/documentation/ mycroft-software-hardware/ (visited on 02/05/2019).
- [13] P. Schoutsen. Perfect Home Automation Home Assistant. URL: https://www. home-assistant.io/blog/2016/01/19/perfect-home-automation/ (visited on 02/06/2019).
- [14] B. Raj and R. Singh. Design and Implementation of Speech Recognition Systems Class 2: Data Capture. 2013.
- [15] S. W. Smith. "Chapter 22: Audio Processing." In: The Scientist and Engineer's Guide to Digital Signal Processing. 1. 2013, pp. 2–5.
- [16] N. Kolban. Kolban's Book on ESP32. September. Unpublished entry, 2018, p. 1228.
- [17] ESP32-DevKitC Development Boards Espressif Systems | Mouser Portugal. URL: https://pt.mouser.com/new/espressif/espressif-esp32-devkitc-boards/ (visited on 02/14/2019).
- [18] Arduino Introduction. URL: https://www.arduino.cc/en/Guide/Introduction (visited on 02/14/2019).
- [19] Arduino Micro. URL: https://store.arduino.cc/arduino-micro (visited on 02/14/2019).
- [20] Raspberry Pi Foundation. What is a Raspberry Pi? URL: https://www.raspberrypi. org/help/what-is-a-raspberry-pi/ (visited on 02/06/2019).
- [21] Raspberry Pi Zero v1.3 KUBII. URL: https://www.kubii.fr/pi-zero-v13/1401raspberry-pi-zero-v13-kubii-3272496006973.html (visited on 02/06/2019).
- [22] Raspberry Pi 2 Model B Raspberry Pi. URL: https://www.raspberrypi.org/ products/raspberry-pi-2-model-b/ (visited on 02/14/2019).
- [23] A. Allan. The RDA5981, an ESP8266 Competitor? Hackster Blog. URL: https: //blog.hackster.io/the-rda5981-an-esp8266-competitor-da161c9c37e2 (visited on 02/07/2019).
- [24] T. R. Manual. "Technical Reference Manual." In: (2018).
- [25] Atmel. "ATmega16U4/ATmega32U4 Datasheet." In: (2016). URL: http://ww1. microchip.com/downloads/en/DeviceDoc/Atmel-7766-8-bit-AVR-ATmega16U4-32U4{\\_}Datasheet.pdf.
- [26] ElectroDragon RDA5981a-WIFI. URL: https://www.electrodragon.com/w/ RDA5981 (visited on 02/14/2019).

- [27] B. Razavi. Principles of Data Conversion System Design. IEEE Press, 1995, p. 134.
- [28] J. M. de la Rosa. SIGMA-DELTA CONVERTERS Practical Design Guide. 2nd. Wiley-IEEE Press, 2018, p. 533. ISBN: 9781119275756.
- [29] A. V. Oppenheim and R. W. Schafer. Discrete-Time Signal Processing. 3rd. Pearson Higher Education, Inc., 2009, p. 1108. ISBN: 0131988425.
- [30] J. M. De La Rosa. "Sigma-delta modulators: Tutorial overview, design guide, and state-of-the-art survey." In: *IEEE Transactions on Circuits and Systems I: Regular Papers* 58.1 (2011), pp. 1–21. ISSN: 15498328. DOI: 10.1109/TCSI.2010.2097652.
- [31] NAtionnal Instruments Corp. Benefits of Delta-Sigma Analog-to-Digital Conversion -National Instruments. 2017. URL: http://www.ni.com/white-paper/11342/en/ (visited on 02/13/2019).
- [32] B. Nowacki. "Design of Sigma-Delta Modulators for Analog-to-Digital Conversion Intensively Using Passive Circuits." In: (2016), p. 151. URL: http://hdl.handle. net/10362/18507.
- [33] S. Pavan, R. Schreier, and G. C. Temes. "Sampling, Oversampling, and Noise-shaping." In: Understanding Delta-Sigma Data Converters. 10.1002/97. 2017, pp. 27–61. DOI: 10.1002/9781119258308.ch2.
- [34] B. Baker. *Best of Baker's Best Precision Data Converters Delta-Sigma ADCs*. Texas Instruments Incorporated, 2015, p. 19.
- [35] K. Matsukawa, Y. Mitani, M. Takayama, K. Obata, S. Dosho, and A. Matsuzawa. "A Fifth-Order Continuous-Time Delta-Sigma Modulator With Single-Opamp Resonator." In: *IEEE Journal of Solid-State Circuits* 45.4 (2010), pp. 697–706.
- [36] J. M. De La Rosa. "An empirical and statistical comparison of state-of-the-art sigma-delta modulators." In: Proceedings - IEEE International Symposium on Circuits and Systems (2013), pp. 825–828. ISSN: 02714310. DOI: 10.1109/ISCAS.2013. 6571974.
- [37] J. L. De Melo, N. Paulino, and J. Goes. "Continuous-time delta-sigma modulators based on passive RC integrators." In: *IEEE Transactions on Circuits and Systems I: Regular Papers* 65.11 (2018), pp. 3662–3674. ISSN: 15498328. DOI: 10.1109/TCSI. 2018.2855649.
- [38] J. L. A. D. Melo. "Sigma-Delta Modulators with Passive RC Integrators : Theory , Design Methodology for Optimization and Silicon Results Motivation ΣΔ Ms – Background / Trends Passive versus Active ΔΣ Ms Proposed Idea Proposed Circuit / Implementation Measurement Results Co." Doctoral dissertation. FCT - Universidade Nova de Lisboa, 2017.

- [39] J. L. De Melo. "A low power 1-MHz continuous-time ΣΔM Using a passive loop filter designed with a genetic algorithm tool." In: *Proceedings IEEE International Symposium on Circuits and Systems* 1 (2013), pp. 586–589. ISSN: 02714310. DOI: 10.1109/ISCAS.2013.6571910.
- [40] J. L. De Melo, P. V. Leitao, J. Goes, and N. Paulino. "A simple clab-D audio power amplifier using a pabive CT ΣΔ modulator for medium quality sound systems." In: *Proceedings of the 22nd International Conference Mixed Design of Integrated Circuits and Systems, MIXDES 2015* (2015), pp. 543–546. ISSN: 0044-4251 (Print). DOI: 10.1109/MIXDES.2015.7208582.