CouCouGen is a lightweight pseudorandom number generator (PRNG) based on three coupled counters with nonlinear updates and a PCG-style output mixer. This document provides a complete introduction to the algorithm, including its formal... more
The design and development of image processing units (IPUs) has traditionally involved trade-offs between cost, real-time properties, portability, and ease of programming. A standard PC can be turned into an IPU relatively easily with the... more
The design and development of image processing units (IPUs) has traditionally involved trade-offs between cost, real-time properties, portability, and ease of programming. A standard PC can be turned into an IPU relatively easily with the... more
A rigorous introduction to feedforward architectures, activation functions, and the mathematics of supervised learning. Covers universal approximation theory, backpropagation derived from first principles, regularisation, and the geometry... more
-Neste artigo apresentam-se os resultados da implementação em uma placa FPGA de equalizadores adaptativos LMS largamente linear (LMS-LL). Foram utilizadas e comparadas três estruturas de implementação: a paralela, a semi-paralela e a... more
Both modern datacenter and embedded FPGAs provide great opportunities for high-performance and high energy-eiciency computing. With the growing public availability of FPGAs from major cloud service providers such as AWS, Alibaba, and... more
In this paper, we develop a framework called MAPLE to enable the aging-aware FPGA architecture exploration. The core idea is to efficiently model the aging-induced delay degradation at the coarse-grained FPGA basic block level using deep... more
With the public availability of FPGAs from major cloud service providers like AWS, Alibaba, and Nimbix, hardware and software developers can now easily access FPGA platforms. However, it is nontrivial to develop efficient FPGA... more
A fundamental difference between ASICs and FPGAs is that wires in ASICs are designed such that they match the requirements of a particular design. Wire parameters such as length, width, layout and the number of wires can be varied to... more
Most existing soft-processors on FPGAs today support a fixed-latency instruction pipeline. Therefore, for integer division, a simple fixed-latency radix-2 integer divider is typically used, or algorithm-level changes are made to avoid... more
Designing Systems on-Chip is becoming increasingly popular as die sizes increase and technology sizes decrease. The complexity of integrating different types of Processing Elements (PEs) that use different communication protocols and... more
Process technology has reduced in size such that it is possible to implement complete applicationspecific architectures as Systems-on-Chip (SoCs) using both Application-Specific Integrated Circuits (ASICs) and Field Programmable Gate... more
Many complex systems require the use of floating point arithmetic that is exceedingly time consuming to perform on personal computers. However, floating point operators are also hardware resource intensive and require longer latencies... more
The author has granted a nonexclusive licence aIlowing the National Library of Canada to reproduce, loan, distribute or sen copies of this thesis in microform, paper or electronic formats. The author retains ownership of the copyright in... more
When designing modern embedded computing systems, most software programmers choose to use multicore processors, possibly in combination with general-purpose graphics processing units (GPGPUs) and/or hardware accelerators. They also often... more
Many aerospace and automotive applications use FPGAs in their designs due to their low power and reconfigurability requirements. Meanwhile, such applications also pose a high standard on system reliability, which makes the early-stage... more
Historically designers created computing systems by combining Integrated Circuits (ICs) on Printed Circuit Boards (PCBs), whereas now they are able to form complete Systems-on-Chip (SoCs). For the purpose of this study, SoCs are defined... more
The importance of fetal monitoring through the different stages of pregnancy has recently expanded the interest of the research community in Non-Invasive Fetal Electrocardiography (NI-fECG). This technique makes use of multi-channel,... more
The speed of reverse converters in Residue Number System is one the most important and effective factors which is strictly dependent on the selected moduli set. In this paper, the four-moduli set {2 n+k , 2 2n-1 -1, 2 n/2 + 1, 2 n/2 -1}... more
The speed of reverse converters in Residue Number System is one the most important and effective factors which is strictly dependent on the selected moduli set. In this paper, the four-moduli set {2 n+k , 2 2n-1 -1, 2 n/2 + 1, 2 n/2 -1}... more
In this paper, a new four-moduli set {2 2n , 2 2n+1 -1, 2 n/2 + 1, 2 n/2 -1} for even n is introduced. This moduli set has 5n-bit dynamic range and well-formed moduli which result in a high speed RNS arithmetic unit. Furthermore, an... more
In this paper, a new force observation approach is proposed to realize a superior wideband and periodicity-free sensorless force control system. The proposed force observer is established by the periodicity estimation integrated singular... more
Hardware Trojans are deliberate malicious hardware modifications inserted in semiconductor Integrated circuits (ICs) for the purpose of stealing or leaking sensitive information, as well as disrupting critical systems upon activation,... more
Self-timed circuits and scalable parallel computers seem to be a natural match, but one that has not been exploited by traditional system designers. One reason for the lack of experimentation with self-timed systems such as these is the... more
Figure 1: Test scenes used to evaluate the DRPU ASIC: Conference (282k triangles) , Mafia (15k triangles), Skeleton (16k triangles), Helix (78k triangles), and DynGael (85k triangles). For more test scenes see Figure 6.
ABSTRACT This paper presents a radically new approach to time discretization in nonlinear dissipative systems. Unlike the classical uniform time grid, the authors develop and theoretically validate a binary modulation of the integration... more
A frequency generating system, capable of producing 36 different sound frequencies, corresponding to tones and semitones of 3rd, 4th and 5th music octaves, is manufactured, programmed and presented here. The system uses two passive... more
We describe the implementation of a crosscoupled parity Built-In Self-Test (BIST) approach for the global routing resources in Field Programmable Gate Arrays (FPGAs). The BIST approach facilitates systemlevel testing of the FPGA global... more
Spread over an area in Western Australia, this telescope is being designed to study the sun and its inner heliosphere, and time-varying astronomical phenomena.
Protecting large-scale networks, especially Software-Defined Networks (SDNs), against distributed attacks in a cost-effective manner plays a prominent role in cybersecurity. One of the pervasive approaches to plug security holes and... more
This article discusses the design of an application specific MPSoC architecture dedicated to Multiple Target Tracking (MTT). This application has its utility in driver assistant systems, more precisely in collision avoidance and warning... more
This paper presents efficient distributed arithmetic (DA)-based approaches for high-throughput reconfigurable implementation of finite impulse response (FIR) filters whose filter coefficients change during runtime. Conventionally, for... more
Numerous threats are associated with the globalized integrated circuit (IC) supply chain, such as piracy, reverse engineering, overproduction, and malicious logic insertion. Many obfuscation approaches have been proposed to mitigate these... more
MIMO is a modern mobile scheme used for the transmission of digitized data at high speed, which has a large number of narrow band subcarriers. It needs multiple antennas at both the transmitter and receiver side. In real time, the... more
Physical Unclonable Function (PUF) has recently attracted interest from both industry and academia as a potential alternative approach to secure Internet of Things (IoT) devices from the more traditional computational-based approach using... more
Physical Unclonable Function (PUF) has recently attracted interested from both industry and academia as a potential alternative approach to secure Internet of Things (IoT) devices from the more traditional computational based approach... more
Penulisan kode program yang besar dan kompleks pada bahasa pemrograman C sering kali menghadapi kendala keterbacaan serta tingginya risiko duplikasi data akibat pendekatan monolitik. Penelitian ini bertujuan untuk menganalisis penerapan... more
In this paper, a networked embedded control of modular robot manipulators without using joint torque sensing is presented. The proposed solution uses an effective control and communication mechanism based on the virtual decomposition... more
The proliferation of Internet-of-Things (IoT) devices has catalyzed interest in Tiny Machine Learning (TinyML), the practice of deploying machine-learning models on ultralow-power microcontrollers (MCUs) with memory budgets measured in... more
During the last few years more and more functionalities of RNA have been discovered that were previously thought of being carried out by proteins alone. One of the most striking discoveries was the detection of microRNAs, a class of... more
Next-Generation Sequencing technologies generate a vast and exponentially increasing amount of sequence data. The Interleaved Bloom Filter (IBF) is a novel indexing data structure which is stateof-the-art for distributing approximate... more
In this paper we present the architecture and implementation of a hardware NIC scheduler to guarantee QoS on servers for high speed LAN/SAN. Our proposal employs a programmable logic device based on an FPGA in order to store and update... more
Embedded applications usually require Software/Hardware (SW/HW) designs to meet the hard timing constraints and the required design flexibility. Exhaustive exploration for SW/HW designs is a very time consuming task, while the adhoc... more
A systematic methodology for near-optimal software/hardware codesign mapping onto an FPGA platform with microprocessor and HW accelerators is proposed. The mapping steps deal with the inter-organization, the foreground memory management,... more
Battery-free radiofrequency (RF) and microwave systems have emerged as a promising solution for enabling long-lifetime, maintenance-free wireless sensing and communication platforms by eliminating conventional battery dependence. These... more