16 research outputs found

    Understanding temporally weakly supervised training: A case study for keyword spotting

    Full text link
    The currently most prominent algorithm to train keyword spotting (KWS) models with deep neural networks (DNNs) requires strong supervision i.e., precise knowledge of the spoken keyword location in time. Thus, most KWS approaches treat the presence of redundant data, such as noise, within their training set as an obstacle. A common training paradigm to deal with data redundancies is to use temporally weakly supervised learning, which only requires providing labels on a coarse scale. This study explores the limits of DNN training using temporally weak labeling with applications in KWS. We train a simple end-to-end classifier on the common Google Speech Commands dataset with increased difficulty by randomly appending and adding noise to the training dataset. Our results indicate that temporally weak labeling can achieve comparable results to strongly supervised baselines while having a less stringent labeling requirement. In the presence of noise, weakly supervised models are capable to localize and extract target keywords without explicit supervision, leading to a performance increase compared to strongly supervised approaches

    C1-continuous space-time discretization based on Hamilton's law of varying action

    Full text link
    We develop a class of C1-continuous time integration methods that are applicable to conservative problems in elastodynamics. These methods are based on Hamilton's law of varying action. From the action of the continuous system we derive a spatially and temporally weak form of the governing equilibrium equations. This expression is first discretized in space, considering standard finite elements. The resulting system is then discretized in time, approximating the displacement by piecewise cubic Hermite shape functions. Within the time domain we thus achieve C1-continuity for the displacement field and C0-continuity for the velocity field. From the discrete virtual action we finally construct a class of one-step schemes. These methods are examined both analytically and numerically. Here, we study both linear and nonlinear systems as well as inherently continuous and discrete structures. In the numerical examples we focus on one-dimensional applications. The provided theory, however, is general and valid also for problems in 2D or 3D. We show that the most favorable candidate -- denoted as p2-scheme -- converges with order four. Thus, especially if high accuracy of the numerical solution is required, this scheme can be more efficient than methods of lower order. It further exhibits, for linear simple problems, properties similar to variational integrators, such as symplecticity. While it remains to be investigated whether symplecticity holds for arbitrary systems, all our numerical results show an excellent long-term energy behavior.Comment: slightly condensed the manuscript, added references, numerical results unchange

    Simultaneous Bright- and Dark-Field X-ray Microscopy at X-ray Free Electron Lasers

    Full text link
    The structures, strain fields, and defect distributions in solid materials underlie the mechanical and physical properties across numerous applications. Many modern microstructural microscopy tools characterize crystal grains, domains and defects required to map lattice distortions or deformation, but are limited to studies of the (near) surface. Generally speaking, such tools cannot probe the structural dynamics in a way that is representative of bulk behavior. Synchrotron X-ray diffraction based imaging has long mapped the deeply embedded structural elements, and with enhanced resolution, Dark Field X-ray Microscopy (DFXM) can now map those features with the requisite nm-resolution. However, these techniques still suffer from the required integration times due to limitations from the source and optics. This work extends DFXM to X-ray free electron lasers, showing how the 101210^{12} photons per pulse available at these sources offer structural characterization down to 100 fs resolution (orders of magnitude faster than current synchrotron images). We introduce the XFEL DFXM setup with simultaneous bright field microscopy to probe density changes within the same volume. This work presents a comprehensive guide to the multi-modal ultrafast high-resolution X-ray microscope that we constructed and tested at two XFELs, and shows initial data demonstrating two timing strategies to study associated reversible or irreversible lattice dynamics

    The Processing And Mental Representation Of Ing Variation

    Get PDF
    This dissertation examines the processing and mental representation of the sociolinguistic variable ING (thinking~thinkin\u27). Sociolinguists have asked questions about the locus of the ING variable using naturalistic speech data, which has resulted in a debate on whether the variable is phonological or morphological. These accounts of ING are not well-defined, and it is hard to isolate these representational properties in conversational data. I propose that locus of variation questions can be thought of as questions about the mental representation of variation, and that it would be fruitful to explore them using a highly-controllable tool from psycholinguistic research: primed lexical decision experiments. This tool is used to show that semantic, phonological, and morphological aspects of representation facilitate processing in different ways. I integrate sociolinguistic knowledge of variable ING with psycholinguistic knowledge on researching mental representations to ask: how are the socially meaningful variants -ing and -in\u27 mentally represented, and which aspects of shared representation contribute to how they are processed? Based on a framework of relevant aspects of representation, I establish a baseline understanding of the mental representation of -ing and -in\u27 across six experiments. Chapter 4 shows that -ing and -in\u27 prime both themselves and each other in words with unrelated stems (e.g. jumping-thinking), and uncovers an asymmetrical priming pattern between -ing and -in\u27 targets; -in\u27-in\u27 prime-target pairs enjoy a processing boost over -ing-ing, -in\u27-ing, and -ing-in\u27 pairs. Chapter 5 finds that this -in\u27 boost\u27 is temporally weak. Chapter 6 establishes that surface phonology does not contribute to the -in\u27 boost. Chapter 7 shows that the -in\u27 boost is insensitive to shared representation between prime-target stems. The results show robust and replicable affix priming for -ing and -in\u27. They also show a processing difference between the variants, and demonstrate properties of the -in\u27 boost. Taken together, the -in\u27 boost can be interpreted under a representation-based account, which suggests that -ing is the underlying phonological form, and that this can change to -in\u27 via application of a phonological rule. Finally, I propose future avenues of research that test this account and elaborate our understanding of the mental representation of variable ING

    Bare and indefinite NPs in predicative position in French

    Get PDF
    This paper proposes a new analysis of the use of bare nouns vs. indefinite NPs in predicative position in French. We distinguish between predicational sentences (with the bare noun version) and equative sentences (with the indefinite version). We argue that bare nouns ascribe permanent properties to aspects of entities. As for the indefinites, we claim that they exhibit their specific reading and introduce an individual in a new situation, which is identified with the referent of the subject

    Electrophysiological correlates of event segmentation: how does the human mind process ongoing activity?

    Get PDF
    The human mind decodes, processes, and makes sense of a continual flow of dynamic information, taken from an array of sensory inputs. Compelling behavioural and neuroimaging evidence reveals that humans segment activities into meaningful chunks for processing, and this phenomenon has profound implications for learning, memory and understanding the world around us (Newtson, 1973; Zacks and Tversky, 2001; Zacks et al., 2001). Whilst the existence of event segmentation is widely accepted, it remains unclear what cognitive mechanisms drive this ability.This thesis constitutes a series of behavioural and neuroimaging experiments that investigate top-down and bottom-up influences on event segmentation. The neuroimaging studies presented here are novel; they extend the field by investigating event segmentation using scalp-recorded electroencephalography (EEG). Event Related Potentials (ERPs, derived from EEG using signal-averaging procedures) showed that the perceptual processing of event boundaries is differentially sensitive to the segmentation of activities into small or large chunks, consistent with findings from previous neuroimaging research (Zacks et al., 2001). In contrast with previous findings, the electrophysiological investigations elicited responses that were clearly affected by manipulating top-down information (e.g., participant's knowledge about the activity being segmented). The results from the studies reported in the thesis support an account of the perceptual processing of event boundaries, which incorporates both top-down and bottom-up influences

    Spatial management of groundfish resources in the Gulf of Maine and Georges Bank

    Get PDF
    In the marine environment, studies suggest that overfishing suppresses sustainability and resilience in fish populations. In the 1990s, the National Marine Fisheries Service (NMFS) implemented five large year-round fishery closures in the Gulf of Maine and Georges Bank. These closures are partially protected marine protected areas (MPAs) restricting commercial fishing with bottom-tending gears. To date, evaluation of their performance has focused on the productivity of individual species. This research aims to understand if fish biodiversity changed after implementation of the five large fisheries closures. Here, a variety of datasets and statistical approaches are used. Modern patterns constructed from NMFS bottom-trawl surveys (1971--2005) demonstrated that fish species richness increased in some areas, as species evenness declined. Fish species composition changed from demersal species to more abundant pelagic species inside and outside of the fisheries closures. Likely drivers of these changes included destruction of habitat by over fishing, and climate and fishery driven disturbance. After constructing a historical baseline for the Western Gulf of Maine and Georges Bank, total genera detected by historical Fishhawk research trawl surveys in the late 19th century were 2 times greater than the number appearing in NMFS bottom-trawl surveys (1963-2007); however, in contrast, NMFS bottom-trawl surveys detected more fish genera. Guild composition was similar for both surveys with respect to demersal, but not pelagic species. These differences were explained by changes in survey sampling technology, fishing pressure, and ecosystem resilience. Recent seasonal patterns of fish species richness from multiple fisheries-independent and dependent datasets showed that inshore areas (such as Massachusetts and Ipswich Bays), the transition between the Gulf of Maine and Georges Bank, and eastern Georges Bank exhibited high fish species richness. When data was available from observed commercial otter-trawl tows adjacent to or within closures, high richness areas occurred, regardless of season. Likely processes influencing species richness over the region and near the fishery closures consisted of seasonal prey availability, oceanographic conditions, and fishing pressure. High effort around fishery closure boundaries had important implications for the effective design of MPAs and future management of groundfish
    corecore