246 research outputs found
Process-induced skew reduction in nominal zero-skew clock trees
Abstract — This work develops an analytic framework for clock tree analysis considering process variations that is shown to correspond well with Monte Carlo results. The analysis frame-work is used in a new algorithm that constructs deterministic nominal zero-skew clock trees that have reduced sensitivity to process variation. The new algorithm uses a sampling approach to perform route embedding during a bottom-up merging phase, but does not select the best embedding until the top-down phase. This results in clock trees that exhibit a mean skew reduction of 32.4 % on average and a standard deviation reduction of 40.7 % as verified by Monte Carlo. The average increase in total clock tree capacitance is less than 0.02%. I
Analysis and mitigation of variability in subthreshold design
Subthreshold circuit design is a compelling method for ultra-low power applications. However, subthreshold designs show dramati-cally increased sensitivity to process variations due to the exponen-tial relationship of subthreshold drive current with Vth variation. In this paper, we present an analysis of subthreshold energy efficiency considering process variation, and propose methods to mitigate its impact. We show that, unlike superthreshold circuits, random dopant fluctuation is the dominant component of variation in subthreshold operation. We investigate how this variability can be ameliorated with proper circuit sizing and choice of circuit logic depth. We then present a statistical analysis of the energy efficiency of subthreshold circuits considering process variations. We show that the energy optimal supply voltage increases due to process variations and study its dependence on circuit parameters. We verify our analytical mod-els against Monte Carlo SPICE simulations and show that they accu-rately predict the minimum energy and energy optimal supply voltage. Finally, we use the developed statistical energy model to determine the optimal pipelining depth in subthreshold designs
ConSmax: Hardware-Friendly Alternative Softmax with Learnable Parameters
The self-attention mechanism sets transformer-based large language model
(LLM) apart from the convolutional and recurrent neural networks. Despite the
performance improvement, achieving real-time LLM inference on silicon is
challenging due to the extensively used Softmax in self-attention. Apart from
the non-linearity, the low arithmetic intensity greatly reduces the processing
parallelism, which becomes the bottleneck especially when dealing with a longer
context. To address this challenge, we propose Constant Softmax (ConSmax), a
software-hardware co-design as an efficient Softmax alternative. ConSmax
employs differentiable normalization parameters to remove the maximum searching
and denominator summation in Softmax. It allows for massive parallelization
while performing the critical tasks of Softmax. In addition, a scalable ConSmax
hardware utilizing a bitwidth-split look-up table (LUT) can produce lossless
non-linear operation and support mix-precision computing. It further
facilitates efficient LLM inference. Experimental results show that ConSmax
achieves a minuscule power consumption of 0.43 mW and area of 0.001 mm2 at
1-GHz working frequency and 22-nm CMOS technology. Compared to state-of-the-art
Softmax hardware, ConSmax results in 14.5x energy and 14.0x area savings with a
comparable accuracy on a GPT-2 model and the WikiText103 dataset
Identifying target areas for risk-based surveillance and control of Transboundary Animal Diseases: A seasonal analysis of slaughter and live-trade cattle movements in Uganda
Abstract Animal movements are a major driver for the spread of Transboundary Animal Diseases (TADs). These movements link populations that would otherwise be isolated and hence create opportunities for susceptible and infected individuals to meet. We used social network analysis to describe the seasonal network structure of cattle movements in Uganda and unravel critical network features that identify districts or sub-regions for targeted risk-based surveillance and intervention. We constructed weighted, directed networks based on 2019 between-district cattle movements using official livestock mobility data; the purpose of the movement (‘slaughter’ vs. ‘live trade’) was used to subset the network and capture the risks more reliably. Our results show that cattle trade can result in local and long-distance disease spread in Uganda. Seasonal variability appears to impact the structure of the network, with high heterogeneity of node and edge activity identified throughout the seasons. These observations mean that the structure of the live trade network can be exploited to target influential district hubs within the cattle corridor and peripheral areas in the south and west, which would result in rapid network fragmentation, reducing the contact structure-related trade risks. Similar exploitable features were observed for the slaughter network, where cattle traffic serves mainly slaughter hubs close to urban centres along the cattle corridor. Critically, analyses that target the complex livestock supply value chain offer a unique framework for understanding and quantifying risks for TADs such as Foot-and-Mouth disease in a land-locked country like Uganda. These findings can be used to inform the development of risk-based surveillance strategies and decision making on resource allocation. For instance, vaccine deployment, biosecurity enforcement and capacity building for stakeholders at the local community and across animal health services with the potential to limit the socio-economic impact of outbreaks, or indeed reduce their frequency
The Grizzly, April 23, 1990
Earth Day: Success! • Ready for Graduation • Letters: New Curriculum Comments; Studio Cottage Vandal Victims; In Support of Sanger • Condom Commission Report • New Meal Plans Next Year • Tracksters Finish Season at MACs • Lady Bears Run Ahead • Netters Roll • Bears Bounce • Hackers No More • Why Earth Day? • Earth Day: Serious? • RAs for 90-91 • Support Your Local RA • Art at UC • Tunnel Steams BWC • Lisa\u27s Heart\u27s in Art • Soda Can\u27s Debut • Video Reviews • Promotions for Faculty • Desktop Aids New Editor • Exam Schedulehttps://digitalcommons.ursinus.edu/grizzlynews/1256/thumbnail.jp
The Grizzly, February 19, 1991
Wismer Renovations • Movies Moved • Clark Wins NEH Grant: Travels to China • Teaching Catalan, An Alternative Language • Book Fair Comes to Ursinus • Italian Market: A Way of Life • Unchangeable South Street • Screaming Trees • Death of a Salesperson • Murder at Toranno\u27s • Swimmers Prepare for MAC Championship • Men\u27s Basketball Finishes Season with 11-14 Record • Black History Month • He\u27s Back! David Lee Roth, Stud or Stroker? • Modern Shakespeare • Wrestlers Romp • Schafer Tells of Track Troubles • Women Race at MAC Championships • Gymnasts Having Fun • Letter: Olin, for Students or Posterity? • Domestic Policy: A Matter of Perspective • Ah, Um, Uh... • An Elegantly Simple Plan • Radioactivity at Ursinus Collegehttps://digitalcommons.ursinus.edu/grizzlynews/1271/thumbnail.jp
Fully Autonomous Mixed Signal SoC Design & Layout Generation Platform
We present FASoC, the world’s first autonomous mixed-signal SoC framework driven entirely by user constraints, along with a suite of automated generators for analog blocks. The process agnostic framework takes high-level user intent as inputs to generate optimized and fully verified analog blocks using a cell-based design methodology.
Our approach is highly scalable and silicon-proven by an SoC prototype which includes 2 PLLs, 3 LDOs, 1 SRAM, and 2 temperature sensors fully integrated with a processor in a 65nm CMOS process. The physical design of all blocks, including analog, is achieved using optimized synthesis and APR flows in commercially available tools. The framework is portable across different processes and requires no human in the loop, dramatically accelerating design time.This material is based on research sponsored by Air Force Research Laboratory (AFRL)
and Defense Advanced Research Projects Agency (DARPA) under agreement number
FA8650 18 2 7844. The U.S. Government is authorized to reproduce and distribute
reprints for Governmental purposes notwithstanding any copyright notation thereon.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/165331/1/Fully Autonomous Mixed Signal SoC Design & Layout Generation Platform.pdfDescription of Fully Autonomous Mixed Signal SoC Design & Layout Generation Platform.pdf : Main articleSEL
- …