960 research outputs found
Can my chip behave like my brain?
Many decades ago, Carver Mead established the foundations of neuromorphic systems. Neuromorphic systems are analog circuits that emulate biology. These circuits utilize subthreshold dynamics of CMOS transistors to mimic the behavior of neurons. The objective is to not only simulate the human brain, but also to build useful applications using these bio-inspired circuits for ultra low power speech processing, image processing, and robotics. This can be achieved using reconfigurable hardware, like field programmable analog arrays (FPAAs), which enable configuring different applications on a cross platform system. As digital systems saturate in terms of power efficiency, this alternate approach has the potential to improve computational efficiency by approximately eight orders of magnitude. These systems, which include analog, digital, and neuromorphic elements combine to result in a very powerful reconfigurable processing machine.Ph.D
Continuous Integration for Fast SoC Algorithm Development
Digital systems have become advanced, hard to design and optimize due to ever-growing technology. Integrated Circuits (ICs) have become more complicated due to complex computations in latest technologies. Communication systems such as mobile networks have evolved and become a part of our daily lives with the advancement in technology over the years. Hence, need of efficient, reusable and automated processes for System-on-a-Chip (SoC) development has been increased. Purpose of this thesis is to study and evaluate currently used SoC development processes and presents guidelines on how these processes can be streamlined.
The thesis starts by evaluating currently used SoC development flows and their advantages and disadvantages. One important aspect is to identify step which cause duplication of work and unnecessary idle times in SoC development teams. A study is conducted and input from SoC development experts is taken in order to optimize SoC flows and use of Continuous Integration (CI) system. An algorithm model is implemented that can be used in multiple stages of SoC development at adequate complexity and is “easy enough” to be used for a person not mastering the topic. The thesis outcome is proposal for CI system in SoC development for accelerating the speed and reliability of implementing algorithms to RTL code and finally into product. CI system tool is also implemented to automate and test the model design so that it also remains up to date
Didactic platform with a DSP to support the teaching of digital signal processing
Si ens posem en context d'un estudiant d'enginyeria, descobrirem que una de les majors motivacions de l'aprenentatge són les pràctiques de laboratori. Aquest treball de fi de grau tractarà sobre la recerca i el desenvolupament d'una plataforma didàctica per a l'assignatura de «Processament Digital del Senyal», impartida durant el tercer curs acadèmic del grau d'Enginyeria de Sistemes TIC. Aquesta plataforma que desenvoluparem inclourà un processador de senyals digitals (DSP) i els perifèrics necessaris perquè els estudiants i professors creïn projectes en un entorn de prototipatge ràpid. A més, els annexos proporcionats haurien de complir amb els requisits per a que aquells que estiguin interessats puguin fabricar el nostre disseny amb poques dificultats.If we put ourselves in the context of an engineering student, we will discover that one of the greatest motivations for learning are laboratory works. This final degree thesis will be about the research and development of a didactic platform for the subject of Digital Signal Processing, taught during the third academical year of the ICT Systems Engineering degree. This platform we are going to develop will encase a digital signal processor (DSP) and the required peripherals for the students and teachers to quickly create projects in a fast prototyping environment. Additionally, the provided annexes should meet with the requirements for those who are interested to manufacture our design with little trouble
Energy reconstruction on the LHC ATLAS TileCal upgraded front end: feasibility study for a sROD co-processing unit
Dissertation presented in ful lment of the requirements for the degree of:
Master of Science in Physics
2016The Phase-II upgrade of the Large Hadron Collider at CERN in the early 2020s
will enable an order of magnitude increase in the data produced, unlocking the
potential for new physics discoveries. In the ATLAS detector, the upgraded Hadronic
Tile Calorimeter (TileCal) Phase-II front end read out system is currently being
prototyped to handle a total data throughput of 5.1 TB/s, from the current 20.4 GB/s.
The FPGA based Super Read Out Driver (sROD) prototype must perform an energy
reconstruction algorithm on 2.88 GB/s raw data, or 275 million events per second.
Due to the very high level of pro ciency required and time consuming nature of
FPGA rmware development, it may be more e ective to implement certain complex
energy reconstruction and monitoring algorithms on a general purpose, CPU based
sROD co-processor. Hence, the feasibility of a general purpose ARM System on Chip
based co-processing unit (PU) for the sROD is determined in this work.
A PCI-Express test platform was designed and constructed to link two ARM
Cortex-A9 SoCs via their PCI-Express Gen-2 x1 interfaces. Test results indicate that
the latency of the PCI-Express interface is su ciently low and the data throughput is
superior to that of alternative interfaces such as Ethernet, for use as an interconnect
for the SoCs to the sROD. CPU performance benchmarks were performed on ve ARM
development platforms to determine the CPU integer,
oating point and memory
system performance as well as energy e ciency. To complement the benchmarks,
Fast Fourier Transform and Optimal Filtering (OF) applications were also tested.
Based on the test results, in order for the PU to process 275 million events per
second with OF, within the 6 s timing budget of the ATLAS triggering system, a
cluster of three Tegra-K1, Cortex-A15 SoCs connected to the sROD via a Gen-2 x8
PCI-Express interface would be suitable. A high level design for the PU is proposed
which surpasses the requirements for the sROD co-processor and can also be used
in a general purpose, high data throughput system, with 80 Gb/s Ethernet and
15 GB/s PCI-Express throughput, using four X-Gene SoCs
A Radar Kit for Hands-On Distance-Learning
We present an approach to experimental radar systems education based on a
combination of commercial low-cost hardware with modern open-source software
technologies. Following a discussion of the general top-level architecture of
flexible, software-defined radar systems, we introduce the specific selection
of subsystems, their capabilities, and current system limitations. Compared to
existing approaches to practical radar education, a more top-level modular
design with a greater focus on performance and flexibility of baseband
processing is selected while reducing the complexity of circuit and subsystem
assembly and total system cost. We present example measurements obtained from
the radar kit. The radar kit allows for bringing a radar lab to the students
instead of students into the labs. It enables practical hands-on radar
education also in distance-only-learning scenarios.Comment: Presented at the European Microwave Week 2021, Focussed Session on on
Teaching Methods for Microwave Engineerin
Programmable flexible cores for SoC applications
Tese de mestrado. Engenharia Electrotécnica e de Computadores. Faculdade de Engenharia. Universidade do Porto. 200
Dynamic Power Management for Neuromorphic Many-Core Systems
This work presents a dynamic power management architecture for neuromorphic
many core systems such as SpiNNaker. A fast dynamic voltage and frequency
scaling (DVFS) technique is presented which allows the processing elements (PE)
to change their supply voltage and clock frequency individually and
autonomously within less than 100 ns. This is employed by the neuromorphic
simulation software flow, which defines the performance level (PL) of the PE
based on the actual workload within each simulation cycle. A test chip in 28 nm
SLP CMOS technology has been implemented. It includes 4 PEs which can be scaled
from 0.7 V to 1.0 V with frequencies from 125 MHz to 500 MHz at three distinct
PLs. By measurement of three neuromorphic benchmarks it is shown that the total
PE power consumption can be reduced by 75%, with 80% baseline power reduction
and a 50% reduction of energy per neuron and synapse computation, all while
maintaining temporary peak system performance to achieve biological real-time
operation of the system. A numerical model of this power management model is
derived which allows DVFS architecture exploration for neuromorphics. The
proposed technique is to be used for the second generation SpiNNaker
neuromorphic many core system
Wearable electroencephalography for long-term monitoring and diagnostic purposes
Truly Wearable EEG (WEEG) can be considered as the future of ambulatory EEG
units, which are the current standard for long-term EEG monitoring. Replacing
these short lifetime, bulky units with long-lasting, miniature and wearable devices
that can be easily worn by patients will result in more EEG data being collected for
extended monitoring periods. This thesis presents three new fabricated systems, in
the form of Application Specific Integrated Circuits (ASICs), to aid the diagnosis of
epilepsy and sleep disorders by detecting specific clinically important EEG events
on the sensor node, while discarding background activity. The power consumption
of the WEEG monitoring device incorporating these systems can be reduced since
the transmitter, which is the dominating element in terms of power consumption,
will only become active based on the output of these systems.
Candidate interictal activity is identified by the developed analog-based interictal
spike selection system-on-chip (SoC), using an approximation of the Continuous
Wavelet Transform (CWT), as a bandpass filter, and thresholding. The spike
selection SoC is fabricated in a 0.35 μm CMOS process and consumes 950 nW.
Experimental results reveal that the SoC is able to identify 87% of interictal spikes
correctly while only transmitting 45% of the data.
Sections of EEG data containing likely ictal activity are detected by an analog
seizure selection SoC using the low complexity line length feature. This SoC is
fabricated in a 0.18 μm CMOS technology and consumes 1.14 μW. Based on experimental
results, the fabricated SoC is able to correctly detect 83% of seizure
episodes while transmitting 52% of the overall EEG data.
A single-channel analog-based sleep spindle detection SoC is developed to aid
the diagnosis of sleep disorders by detecting sleep spindles, which are characteristic
events of sleep. The system identifies spindle events by monitoring abrupt changes
in the input EEG. An approximation of the median frequency calculation, incorporated
as part of the system, allows for non-spindle activity incorrectly identified
by the system as sleep spindles to be discarded. The sleep spindle detection SoC
is fabricated in a 0.18 μm CMOS technology, consuming only 515 nW. The SoC
achieves a sensitivity and specificity of 71.5% and 98% respectively.Open Acces
- …