81 research outputs found

Recommended from our members

Father Knows Best: The Interactive Effects of Fathering Quantity and Quality on Child Self-Regulation

Author: Chary Mamatha Chetlur
Publication venue: ScholarWorks@UMass Amherst
Publication date: 16/07/2020

In the past decade, developmental research has seen a surge of work on fathers and their influence on various aspects of child outcomes, both cognitive and socioemotional. Studies show that father involvement, or the “quantity” of time the father spends with the child, and fathering “quality”, or the characteristics marking the father-child relationship (warmth, supportiveness, sensitivity, etc.), can both contribute to variance in child outcomes such as language skills, academic success, and psychological well-being. One facet of adaptive development, self-regulation (SR), is a robust and consistent predictor of high academic success, fulfilling interpersonal relationships, and overall life satisfaction. SR has been studied extensively in relation to maternal parenting effects. Some work with fathers shows that positive fathering (autonomy-supportiveness, sensitivity, responsiveness, cognitive stimulation) is related to higher levels of both cognitive and emotional SR. However, to our knowledge, no fathering studies have examined the potential additive or interactive effects of quantity of involvement and quality of caretaking on self-regulatory capacity in children. In this study, I used a sample of fathers and 3-5-year-olds in two cities (Springfield, MA and Philadelphia, PA; N = 88 dyads) to examine how father involvement (self-reported “quantity”) and father parenting behaviors (observed and self-reported “quality”) relate to child self-regulation (cognitive regulation, measured as observed executive function [EF], and emotion regulation, measured as father-reported effortful control [EC]). Results showed a crossover interaction between quantity of father involvement and fathering positivity (warm affect, responsiveness, positive control) in predicting variance in child EF and EC, controlling for family socioeconomic status and child vocabulary skills. Father involvement positively predicted EF and EC only when self-reported fathering positivity was high. When self-reported fathering positivity was low, the relationship between quantity of father involvement and child EF and EC became negative. This work points to the importance of taking a comprehensive view when assessing paternal parenting effects on development and also suggests potential targets for fathering intervention studies.
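
A crossover interaction of the kind described in this abstract can be illustrated with a moderated regression, in which the slope of the outcome on involvement "quantity" depends on the level of "quality". The sketch below is only an illustration on synthetic data: the function names, the coefficient values, and the noise level are assumptions, and the covariates the study controls for (socioeconomic status, vocabulary) are omitted.

```python
import numpy as np

def fit_moderated_regression(quantity, quality, outcome):
    """Least-squares fit of outcome = b0 + b1*quantity + b2*quality + b3*quantity*quality."""
    design = np.column_stack(
        [np.ones_like(quantity), quantity, quality, quantity * quality]
    )
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef

def simple_slope(coef, quality_level):
    """Slope of the outcome on quantity at a fixed (standardized) quality level."""
    return coef[1] + coef[3] * quality_level

# Synthetic, standardized data (NOT the study's data): the outcome depends
# only on the quantity-by-quality product, which yields a crossover pattern.
rng = np.random.default_rng(0)
n = 88  # same number of dyads as in the abstract, purely for scale
quantity = rng.normal(size=n)
quality = rng.normal(size=n)
outcome = 0.5 * quantity * quality + rng.normal(scale=0.1, size=n)

coef = fit_moderated_regression(quantity, quality, outcome)
slope_high = simple_slope(coef, 1.0)   # quality one SD above the mean
slope_low = simple_slope(coef, -1.0)   # quality one SD below the mean
print(f"slope at high quality: {slope_high:.2f}, at low quality: {slope_low:.2f}")
```

With these assumed values the simple slope is positive at high quality and negative at low quality, which is the sign flip that defines a crossover interaction.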

ScholarWorks@UMass Amherst

Learning to infer: RL-based search for DNN primitive selection on Heterogeneous Embedded Systems

Author: abadi
anderson
baker
chetlur
cortes
dong
he
hsu
kim
li
real
sutton
tan
watkins
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 18/11/2018

Deep Learning is increasingly being adopted by industry for computer vision applications running on embedded devices. While the accuracy of Convolutional Neural Networks (CNNs) has reached a mature and remarkable state, inference latency and throughput remain a major concern, especially when targeting low-cost and low-power embedded platforms. CNN inference latency may become a bottleneck for Deep Learning adoption by industry, as it is a crucial specification for many real-time processes. Furthermore, deploying CNNs across heterogeneous platforms presents major compatibility issues due to vendor-specific technology and acceleration libraries. In this work, we present QS-DNN, a fully automatic search based on Reinforcement Learning which, combined with an inference engine optimizer, efficiently explores the design space and empirically finds the optimal combinations of libraries and primitives to speed up the inference of CNNs on heterogeneous embedded devices. We show that an optimized combination can achieve a 45x speedup in inference latency on CPU compared to a dependency-free baseline and 2x on average on GPGPU compared to the best vendor library. Further, we demonstrate that the quality of results and the time-to-solution are much better than with Random Search, with up to 15x better results for a short-time search.
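
To make the idea of a Reinforcement Learning search over per-layer primitives concrete, the following is a minimal tabular Q-learning sketch. It is not the QS-DNN implementation: the latency table `layer_cost`, the fixed `switch_cost` charged when consecutive layers use different primitives, and all hyperparameters are hypothetical values chosen for illustration.

```python
import numpy as np

def rollout_cost(layer_cost, switch_cost, layer, prev, action):
    """Latency of one layer, plus a penalty when the primitive changes."""
    penalty = switch_cost if layer > 0 and action != prev else 0.0
    return layer_cost[layer, action] + penalty

def q_learning_search(layer_cost, switch_cost, episodes=2000,
                      alpha=0.1, gamma=1.0, eps=0.2, seed=0):
    """Epsilon-greedy tabular Q-learning; the reward is negative latency."""
    rng = np.random.default_rng(seed)
    n_layers, n_prims = layer_cost.shape
    # q[layer, previous primitive, chosen primitive]
    q = np.zeros((n_layers, n_prims, n_prims))
    for _ in range(episodes):
        prev = 0  # dummy "previous primitive" for the first layer
        for layer in range(n_layers):
            if rng.random() < eps:
                action = int(rng.integers(n_prims))   # explore
            else:
                action = int(np.argmax(q[layer, prev]))  # exploit
            reward = -rollout_cost(layer_cost, switch_cost, layer, prev, action)
            future = q[layer + 1, action].max() if layer + 1 < n_layers else 0.0
            q[layer, prev, action] += alpha * (reward + gamma * future
                                               - q[layer, prev, action])
            prev = action
    # Greedy rollout with the learned table.
    choice, prev, total = [], 0, 0.0
    for layer in range(n_layers):
        action = int(np.argmax(q[layer, prev]))
        total += rollout_cost(layer_cost, switch_cost, layer, prev, action)
        choice.append(action)
        prev = action
    return choice, total

# Hypothetical per-layer latencies (rows: layers, columns: primitives).
layer_cost = np.array([[1.0, 5.0],
                       [5.0, 1.0],
                       [1.0, 5.0]])
choice, total = q_learning_search(layer_cost, switch_cost=0.5)
print(choice, total)
```

Including the previous primitive in the state lets the agent weigh a faster primitive against the cost of switching, which is one simple way to model the compatibility overhead between libraries that the abstract mentions.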

arXiv.org e-Print Archive

Crossref

Shortest Path Distance in Manhattan Poisson Line Cox Process

Author: Chetlur Vishnu Vardhan
Dettmann Carl P.
Dhillon Harpreet S.
Publication date: 05/06/2020

While the Euclidean distance characteristics of the Poisson line Cox process (PLCP) have been investigated in the literature, the analytical characterization of the path distances is still an open problem. In this paper, we solve this problem for the stationary Manhattan Poisson line Cox process (MPLCP), which is a variant of the PLCP. Specifically, we derive the exact cumulative distribution function (CDF) for the length of the shortest path to the nearest point of the MPLCP, in the sense of path distance, measured from two reference points: (i) the typical intersection of the Manhattan Poisson line process (MPLP), and (ii) the typical point of the MPLCP. We also discuss the application of these results in infrastructure planning, wireless communication, and transportation networks.

arXiv.org e-Print Archive

Explore Bristol Research

Wafer-Scale Fast Fourier Transforms

Author: Chetlur Sharan
Jacquelin Mathias
Orenes-Vera Marcelo
Schreiber Robert
Sharapov Ilya
Vandermersch Philippe
Publication date: 29/09/2022

We have implemented fast Fourier transforms for one-, two-, and three-dimensional arrays on the Cerebras CS-2, a system whose memory and processing elements reside on a single silicon wafer. The wafer-scale engine (WSE) encompasses a two-dimensional mesh of roughly 850,000 processing elements (PEs) with fast local memory and equally fast nearest-neighbor interconnections. Our wafer-scale FFT (wsFFT) parallelizes an $n^3$ problem with up to $n^2$ PEs. At this point a PE processes only a single vector of the 3D domain (known as a pencil) per superstep, where each of the three supersteps performs an FFT along one of the three axes of the input array. Between supersteps, wsFFT redistributes (transposes) the data to bring all elements of each one-dimensional pencil being transformed into the memory of a single PE. Each redistribution causes an all-to-all communication along one of the mesh dimensions. Given the level of parallelism, the size of the messages transmitted between pairs of PEs can be as small as a single word. In theory, a mesh is not ideal for all-to-all communication due to its limited bisection bandwidth. However, the mesh interconnecting PEs on the WSE lies entirely on-wafer and achieves nearly peak bandwidth even with tiny messages. This high efficiency on fine-grain communication allows wsFFT to achieve unprecedented levels of parallelism and performance. We analyse in detail computation and communication time, as well as the weak and strong scaling, using both FP16 and FP32 precision. With 32-bit arithmetic on the CS-2, we achieve 959 microseconds for a 3D FFT of a $512^3$ complex input array using a 512x512 subgrid of the on-wafer PEs. This is the largest ever parallelization for this problem size and the first implementation that breaks the millisecond barrier.
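
The three-superstep structure described in this abstract (one batch of 1D FFTs per axis, with a data redistribution in between) can be checked on a single machine with NumPy. This is only a sketch of the decomposition, not the wsFFT implementation: the function name `fft3d_by_pencils` is an assumption, and `np.moveaxis` merely stands in for the on-wafer transposes.

```python
import numpy as np

def fft3d_by_pencils(x):
    """3D FFT computed as three supersteps of 1D FFTs, one axis at a time."""
    out = np.asarray(x, dtype=np.complex128)
    for axis in range(3):
        # "Redistribute": bring the axis being transformed to the last
        # position, so each pencil is one contiguous 1D vector.
        out = np.moveaxis(out, axis, -1)
        # Superstep: independent 1D FFTs over all n^2 pencils.
        out = np.fft.fft(out, axis=-1)
        # Restore the original axis order before the next superstep.
        out = np.moveaxis(out, -1, axis)
    return out

rng = np.random.default_rng(0)
n = 8  # small cube for a quick check; the abstract reports n = 512
x = rng.standard_normal((n, n, n)) + 1j * rng.standard_normal((n, n, n))
result = fft3d_by_pencils(x)
reference = np.fft.fftn(x)
print(np.allclose(result, reference))
```

Because the multidimensional DFT is separable, the per-axis passes agree with a direct `np.fft.fftn` call; each pass touches n^2 independent pencils, which is where the "up to n^2 PEs" parallelism comes from.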

arXiv.org e-Print Archive

System-Level Performance Analysis in 3D Drone Mobile Networks

Author: A Shojaeifard
E Turgut
P Kumaraswamy
S Srinivasa
S Zhang
SN Chiu
VV Chetlur
Publication venue: Springer Fachmedien Wiesbaden GmbH
Publication date: 29/01/2020

We present a system-level analysis of drone mobile networks in a finite three-dimensional (3D) space. We develop a performance bound derived from a deterministic random (Brownian) motion model over Nakagami-m fading interfering channels. This method allows us to circumvent the extremely complex realistic model and obtain upper and lower performance bounds for actual drone mobile networks. The validity and advantages of the proposed framework are confirmed via extensive Monte-Carlo (MC) simulations. The results reveal several important trends and design guidelines for the practical deployment of drone mobile networks.

Crossref

UCL Discovery