1,856 research outputs found

    Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization

    Full text link
    This report describes our approach for the Audio-Visual Diarization (AVD) task of the Ego4D Challenge 2022. Specifically, we present multiple technical improvements over the official baselines. First, we improve the detection performance of the camera wearer's voice activity by modifying the training scheme of its model. Second, we discover that an off-the-shelf voice activity detection model can effectively remove false positives when it is applied solely to the camera wearer's voice activities. Lastly, we show that better active speaker detection leads to a better AVD outcome. Our final method obtains 65.9% DER on the test set of Ego4D, which significantly outperforms all the baselines. Our submission achieved 1st place in the Ego4D Challenge 2022.Comment: Validation report for the Ego4D challenge at ECCV 202

    Possible quantum phase-manipulation of a two-leg ladder in mixed-dimensional fermionic cold atoms

    Full text link
    The recent realization of mixed-dimensional systems of cold atoms has attracted much attention from both experimentalists and theorists. Different effective interactions and novel correlated quantum many-body phases may be engineered in these systems, with the different phases being tunable via external parameters. In this article we investigate a two-species Fermi atom mixture: one species of atom exists in two hyperfine states and is confined to move in a two-leg ladder, interacting with an on-site interaction, and the other moves freely in a two dimensional square lattice that contains the two-leg ladder. The two species of atoms interact via an on-site interaction on the ladder. In the limit of weak inter-species interactions, the two-dimensional gas can be integrated out, leading to an effective long-range mediated interaction in the ladder, generated by to the on-site inter-species interaction. We show that the form of the mediated interaction can be controlled by the density of the two-dimensional gas and that it enhances the charge density wave instability in the two-leg ladder after the renormalization group transformation. Parameterizing the phase diagram with various experimentally controllable quantities, we discuss the possible tuning of the macroscopic quantum many-body phases of the two-leg ladder in this mixed-dimensional fermionic cold atom system.Comment: 4 pages and 3 figure

    The impact of urban sprawl on journey to work times for mass transit and all other commuters in the United States: A research note

    Get PDF
    As government budgets get tighter, there has been considerable public outcry about the continued investment in public mass transit systems and their financial viability. Amid this outcry, a number of studies have been conducted to determine which factors influence the use and efficiency of publiclyfunded mass transit systems. These factors include population density and less sprawl (or greater urban compactness). However, their impact on mass transit usage is somewhat contradictory in that the heavy concentration of populations in the urban area and greater compactness is believed to increase mass transit usage due to a bigger number of potential passengers. In fact, greater compactness and greater transit ridership have played a role in lengthening the journey to work for most commuters and thus discouraged the use of mass transit systems. Thus, some questioned the wisdom of mass transit subsidies and “smart growth” policies. To attempt to answer this question and avoid any further confusion, this paper examines how urban sprawl affects the journey to work commute time of mass transit riders and other commuters throughout the United States after controlling for variables such as the volume of ridership, local per capita income, the presence of a local rail transit system, and local weather. The findings for this research note defy some conventional wisdom and point to several public policy recommendations on how to improve public mass transit at the local level. For instance, we find that greater urban compactness can be turned into a mass transit advantage if mass transit riders can use a commuter rail option

    Disruption of the nuclear p53-GAPDH complex protects against ischemia-induced neuronal damage

    Get PDF
    Glyceraldehyde 3-phosphate dehydrogenase (GAPDH) is conventionally considered a critical enzyme that involves in glycolysis for energy production. Recent previous studies have suggested that GAPDH is important in glutamate-induced neuronal excitotoxicity, while accumulated evidence also demonstrated that GAPDH nuclear translocation plays a critical role in cell death. However, the molecular mechanisms underlying this process remain largely unknown. In this study, we showed that GAPDH translocates to the nucleus in a Siah1-dependent manner upon glutamate stimulation. The nuclear GAPDH forms a protein complex with p53 and enhances p53 expression and phosphorylation. Disruption of the GAPDH-p53 interaction with an interfering peptide blocks glutamate-induced cell death and GAPDH-mediated up-regulation of p53 expression and phosphorylation. Furthermore, administration of the interfering peptide in vivo protects against ischemia-induced cell death in rats subjected to tMCAo. Our data suggest that the nuclear p53-GAPDH complex is important in regulating glutamate-mediated neuronal death and could serve as a potential therapeutic target for ischemic stroke treatment

    SViTT: Temporal Learning of Sparse Video-Text Transformers

    Full text link
    Do video-text transformers learn to model temporal relationships across frames? Despite their immense capacity and the abundance of multimodal training data, recent work has revealed the strong tendency of video-text models towards frame-based spatial representations, while temporal reasoning remains largely unsolved. In this work, we identify several key challenges in temporal learning of video-text transformers: the spatiotemporal trade-off from limited network size; the curse of dimensionality for multi-frame modeling; and the diminishing returns of semantic information by extending clip length. Guided by these findings, we propose SViTT, a sparse video-text architecture that performs multi-frame reasoning with significantly lower cost than naive transformers with dense attention. Analogous to graph-based networks, SViTT employs two forms of sparsity: edge sparsity that limits the query-key communications between tokens in self-attention, and node sparsity that discards uninformative visual tokens. Trained with a curriculum which increases model sparsity with the clip length, SViTT outperforms dense transformer baselines on multiple video-text retrieval and question answering benchmarks, with a fraction of computational cost. Project page: http://svcl.ucsd.edu/projects/svitt.Comment: CVPR 202

    Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection

    Full text link
    Active speaker detection (ASD) in videos with multiple speakers is a challenging task as it requires learning effective audiovisual features and spatial-temporal correlations over long temporal windows. In this paper, we present SPELL, a novel spatial-temporal graph learning framework that can solve complex tasks such as ASD. To this end, each person in a video frame is first encoded in a unique node for that frame. Nodes corresponding to a single person across frames are connected to encode their temporal dynamics. Nodes within a frame are also connected to encode inter-person relationships. Thus, SPELL reduces ASD to a node classification task. Importantly, SPELL is able to reason over long temporal contexts for all nodes without relying on computationally expensive fully connected graph neural networks. Through extensive experiments on the AVA-ActiveSpeaker dataset, we demonstrate that learning graph-based representations can significantly improve the active speaker detection performance owing to its explicit spatial and temporal structure. SPELL outperforms all previous state-of-the-art approaches while requiring significantly lower memory and computational resources. Our code is publicly available at https://github.com/SRA2/SPELLComment: ECCV 2022 camera ready (Supplementary videos: on ECVA soon). This paper supersedes arXiv:2112.0147

    Pay for Success: A Roadmap for Implementation in Minnesota

    Get PDF
    Professional paper for the fulfillment of the Master of Public Policy degree.Pay for Success (PFS) is a promising financing model that encourages investment in programs that produce improved social outcomes resulting in future cost-savings for the government. In a PFS project, investors provide initial capital to scale-up effective social programs and the government pays back the investors only if the desired outcomes are achieved. Minnesota emerged as a pioneer in this field, being the first U.S. state to enact legislation authorizing a Pay for Performance pilot in 2011, even before the first PFS program was launched in New York. However, despite having the legislation in effect for more than 6 years now, no PFS project has been implemented in Minnesota. Meanwhile, over 20 PFS programs have been launched in other states such as Illinois, Ohio, Colorado, and South Carolina among others, some of which have also seen their first success payments made out to the investors

    Pay for Success: A Roadmap for Implementation in Minnesota

    Get PDF
    Pay for Success (PFS) is a promising financing model that encourages investment in programs that produce improved social outcomes resulting in future cost-savings for the government. In a PFS project, investors provide initial capital to scale-up effective social programs and the government pays back the investors only if the desired outcomes are achieved. Minnesota emerged as a pioneer in this field, being the first U.S. state to enact legislation authorizing a Pay for Performance pilot in 2011, even before the first PFS program was launched in New York. However, despite having the legislation in effect for more than 6 years now, no PFS project has been implemented in Minnesota. Meanwhile, over 20 PFS programs have been launched in other states such as Illinois, Ohio, Colorado, and South Carolina among others, some of which have also seen their first success payments made out to the investors. This report describes the PFS financing mechanism and provides a set of recommendations for the state government and other stakeholders to advance the implementation of PFS projects in Minnesota recognizing the roadblocks that stalled implementation of the state Pay for Performance Act. We present a set of steps through which PFS funding can be approached in Minnesota and provide a list of program areas where PFS projects can be launched. The report also highlights the legislative action that could move PFS projects ahead in the state of Minnesota and discusses some ways to move forward in the absence of legislative involvement
    corecore