11,998 research outputs found

    Security and Privacy Problems in Voice Assistant Applications: A Survey

    Full text link
    Voice assistant applications have become omniscient nowadays. Two models that provide the two most important functions for real-life applications (i.e., Google Home, Amazon Alexa, Siri, etc.) are Automatic Speech Recognition (ASR) models and Speaker Identification (SI) models. According to recent studies, security and privacy threats have also emerged with the rapid development of the Internet of Things (IoT). The security issues researched include attack techniques toward machine learning models and other hardware components widely used in voice assistant applications. The privacy issues include technical-wise information stealing and policy-wise privacy breaches. The voice assistant application takes a steadily growing market share every year, but their privacy and security issues never stopped causing huge economic losses and endangering users' personal sensitive information. Thus, it is important to have a comprehensive survey to outline the categorization of the current research regarding the security and privacy problems of voice assistant applications. This paper concludes and assesses five kinds of security attacks and three types of privacy threats in the papers published in the top-tier conferences of cyber security and voice domain.Comment: 5 figure

    The Metaverse: Survey, Trends, Novel Pipeline Ecosystem & Future Directions

    Full text link
    The Metaverse offers a second world beyond reality, where boundaries are non-existent, and possibilities are endless through engagement and immersive experiences using the virtual reality (VR) technology. Many disciplines can benefit from the advancement of the Metaverse when accurately developed, including the fields of technology, gaming, education, art, and culture. Nevertheless, developing the Metaverse environment to its full potential is an ambiguous task that needs proper guidance and directions. Existing surveys on the Metaverse focus only on a specific aspect and discipline of the Metaverse and lack a holistic view of the entire process. To this end, a more holistic, multi-disciplinary, in-depth, and academic and industry-oriented review is required to provide a thorough study of the Metaverse development pipeline. To address these issues, we present in this survey a novel multi-layered pipeline ecosystem composed of (1) the Metaverse computing, networking, communications and hardware infrastructure, (2) environment digitization, and (3) user interactions. For every layer, we discuss the components that detail the steps of its development. Also, for each of these components, we examine the impact of a set of enabling technologies and empowering domains (e.g., Artificial Intelligence, Security & Privacy, Blockchain, Business, Ethics, and Social) on its advancement. In addition, we explain the importance of these technologies to support decentralization, interoperability, user experiences, interactions, and monetization. Our presented study highlights the existing challenges for each component, followed by research directions and potential solutions. To the best of our knowledge, this survey is the most comprehensive and allows users, scholars, and entrepreneurs to get an in-depth understanding of the Metaverse ecosystem to find their opportunities and potentials for contribution

    Examples of works to practice staccato technique in clarinet instrument

    Get PDF
    Klarnetin staccato tekniğini güçlendirme aşamaları eser çalışmalarıyla uygulanmıştır. Staccato geçişlerini hızlandıracak ritim ve nüans çalışmalarına yer verilmiştir. Çalışmanın en önemli amacı sadece staccato çalışması değil parmak-dilin eş zamanlı uyumunun hassasiyeti üzerinde de durulmasıdır. Staccato çalışmalarını daha verimli hale getirmek için eser çalışmasının içinde etüt çalışmasına da yer verilmiştir. Çalışmaların üzerinde titizlikle durulması staccato çalışmasının ilham verici etkisi ile müzikal kimliğe yeni bir boyut kazandırmıştır. Sekiz özgün eser çalışmasının her aşaması anlatılmıştır. Her aşamanın bir sonraki performans ve tekniği güçlendirmesi esas alınmıştır. Bu çalışmada staccato tekniğinin hangi alanlarda kullanıldığı, nasıl sonuçlar elde edildiği bilgisine yer verilmiştir. Notaların parmak ve dil uyumu ile nasıl şekilleneceği ve nasıl bir çalışma disiplini içinde gerçekleşeceği planlanmıştır. Kamış-nota-diyafram-parmak-dil-nüans ve disiplin kavramlarının staccato tekniğinde ayrılmaz bir bütün olduğu saptanmıştır. Araştırmada literatür taraması yapılarak staccato ile ilgili çalışmalar taranmıştır. Tarama sonucunda klarnet tekniğin de kullanılan staccato eser çalışmasının az olduğu tespit edilmiştir. Metot taramasında da etüt çalışmasının daha çok olduğu saptanmıştır. Böylelikle klarnetin staccato tekniğini hızlandırma ve güçlendirme çalışmaları sunulmuştur. Staccato etüt çalışmaları yapılırken, araya eser çalışmasının girmesi beyni rahatlattığı ve istekliliği daha arttırdığı gözlemlenmiştir. Staccato çalışmasını yaparken doğru bir kamış seçimi üzerinde de durulmuştur. Staccato tekniğini doğru çalışmak için doğru bir kamışın dil hızını arttırdığı saptanmıştır. Doğru bir kamış seçimi kamıştan rahat ses çıkmasına bağlıdır. Kamış, dil atma gücünü vermiyorsa daha doğru bir kamış seçiminin yapılması gerekliliği vurgulanmıştır. Staccato çalışmalarında baştan sona bir eseri yorumlamak zor olabilir. Bu açıdan çalışma, verilen müzikal nüanslara uymanın, dil atış performansını rahatlattığını ortaya koymuştur. Gelecek nesillere edinilen bilgi ve birikimlerin aktarılması ve geliştirici olması teşvik edilmiştir. Çıkacak eserlerin nasıl çözüleceği, staccato tekniğinin nasıl üstesinden gelinebileceği anlatılmıştır. Staccato tekniğinin daha kısa sürede çözüme kavuşturulması amaç edinilmiştir. Parmakların yerlerini öğrettiğimiz kadar belleğimize de çalışmaların kaydedilmesi önemlidir. Gösterilen azmin ve sabrın sonucu olarak ortaya çıkan yapıt başarıyı daha da yukarı seviyelere çıkaracaktır

    Anuário científico da Escola Superior de Tecnologia da Saúde de Lisboa - 2021

    Get PDF
    É com grande prazer que apresentamos a mais recente edição (a 11.ª) do Anuário Científico da Escola Superior de Tecnologia da Saúde de Lisboa. Como instituição de ensino superior, temos o compromisso de promover e incentivar a pesquisa científica em todas as áreas do conhecimento que contemplam a nossa missão. Esta publicação tem como objetivo divulgar toda a produção científica desenvolvida pelos Professores, Investigadores, Estudantes e Pessoal não Docente da ESTeSL durante 2021. Este Anuário é, assim, o reflexo do trabalho árduo e dedicado da nossa comunidade, que se empenhou na produção de conteúdo científico de elevada qualidade e partilhada com a Sociedade na forma de livros, capítulos de livros, artigos publicados em revistas nacionais e internacionais, resumos de comunicações orais e pósteres, bem como resultado dos trabalhos de 1º e 2º ciclo. Com isto, o conteúdo desta publicação abrange uma ampla variedade de tópicos, desde temas mais fundamentais até estudos de aplicação prática em contextos específicos de Saúde, refletindo desta forma a pluralidade e diversidade de áreas que definem, e tornam única, a ESTeSL. Acreditamos que a investigação e pesquisa científica é um eixo fundamental para o desenvolvimento da sociedade e é por isso que incentivamos os nossos estudantes a envolverem-se em atividades de pesquisa e prática baseada na evidência desde o início dos seus estudos na ESTeSL. Esta publicação é um exemplo do sucesso desses esforços, sendo a maior de sempre, o que faz com que estejamos muito orgulhosos em partilhar os resultados e descobertas dos nossos investigadores com a comunidade científica e o público em geral. Esperamos que este Anuário inspire e motive outros estudantes, profissionais de saúde, professores e outros colaboradores a continuarem a explorar novas ideias e contribuir para o avanço da ciência e da tecnologia no corpo de conhecimento próprio das áreas que compõe a ESTeSL. Agradecemos a todos os envolvidos na produção deste anuário e desejamos uma leitura inspiradora e agradável.info:eu-repo/semantics/publishedVersio

    Procedure-Aware Pretraining for Instructional Video Understanding

    Full text link
    Our goal is to learn a video representation that is useful for downstream procedure understanding tasks in instructional videos. Due to the small amount of available annotations, a key challenge in procedure understanding is to be able to extract from unlabeled videos the procedural knowledge such as the identity of the task (e.g., 'make latte'), its steps (e.g., 'pour milk'), or the potential next steps given partial progress in its execution. Our main insight is that instructional videos depict sequences of steps that repeat between instances of the same or different tasks, and that this structure can be well represented by a Procedural Knowledge Graph (PKG), where nodes are discrete steps and edges connect steps that occur sequentially in the instructional activities. This graph can then be used to generate pseudo labels to train a video representation that encodes the procedural knowledge in a more accessible form to generalize to multiple procedure understanding tasks. We build a PKG by combining information from a text-based procedural knowledge database and an unlabeled instructional video corpus and then use it to generate training pseudo labels with four novel pre-training objectives. We call this PKG-based pre-training procedure and the resulting model Paprika, Procedure-Aware PRe-training for Instructional Knowledge Acquisition. We evaluate Paprika on COIN and CrossTask for procedure understanding tasks such as task recognition, step recognition, and step forecasting. Paprika yields a video representation that improves over the state of the art: up to 11.23% gains in accuracy in 12 evaluation settings. Implementation is available at https://github.com/salesforce/paprika.Comment: CVPR 202

    OLIG2 neural progenitor cell development and fate in Down syndrome

    Full text link
    Down syndrome (DS) is caused by triplication of human chromosome 21 (HSA21) and is the most common genetic form of intellectual disability. It is unknown precisely how triplication of HSA21 results in the intellectual disability, but it is thought that the global transcriptional dysregulation caused by trisomy 21 perturbs multiple aspects of neurodevelopment that cumulatively contribute to its etiology. While the characteristics associated with DS can arise from any of the genes triplicated on HSA21, in this work we focus on oligodendrocyte transcription factor 2 (OLIG2). The progeny of neural progenitor cells (NPCs) expressing OLIG2 are likely to be involved in many of the cellular changes underlying the intellectual disability in DS. To explore the fate of OLIG2+ neural progenitors, we took advantage of two distinct models of DS, the Ts65Dn mouse model and induced pluripotent stem cells (iPSCs) derived from individuals with DS. Our results from these two systems identified multiple perturbations in development in the cellular progeny of OLIG2+ NPCs. In Ts65Dn, we identified alterations in neurons and glia derived from the OLIG2 expressing progenitor domain in the ventral spinal cord. There were significant differences in the number of motor neurons and interneurons present in the trisomic lumbar spinal cord depending on age of the animal pointing both to a neurodevelopment and a neurodegeneration phenotype in the Ts65Dn mice. Of particular note, we identified changes in oligodendrocyte (OL) maturation in the trisomic mice that are dependent on spatial location and developmental origin. In the dorsal corticospinal tract, there were significantly fewer mature OLs in the trisomic mice, and in the lateral funiculus we observed the opposite phenotype with more mature OLs being present in the trisomic animals. We then transitioned our studies into iPSCs where we were able to pattern OLIG2+ NPCs to either a spinal cord-like or a brain-like identity and study the OL lineage that differentiated from each progenitor pool. Similar to the region-specific dysregulation found in the Ts65Dn spinal cord, we identified perturbations in trisomic OLs that were dependent on whether the NPCs had been patterned to a brain-like or spinal cord-like fate. In the spinal cord-like NPCs, there was no difference in the proportion of cells expressing either OLIG2 or NKX2.2, the two transcription factors whose co-expression is essential for OL differentiation. Conversely, in the brain-like NPCs, there was a significant increase in OLIG2+ cells in the trisomic culture and a decrease in NKX2.2 mRNA expression. We identified a sonic hedgehog (SHH) signaling based mechanism underlying these changes in OLIG2 and NKX2.2 expression in the brain-like NPCs and normalized the proportion of trisomic cells expressing the transcription factors to euploid levels by modulating the activity of the SHH pathway. Finally, we continued the differentiation of the brain-like and spinal cord-like NPCs to committed OL precursor cells (OPCs) and allowed them to mature. We identified an increase in OPC production in the spinal cord-like trisomic culture which was not present in the brain-like OPCs. Conversely, we identified a maturation deficit in the brain-like trisomic OLs that was not present in the spinal cord-like OPCs. These results underscore the importance of regional patterning in characterizing changes in cell differentiation and fate in DS. Together, the findings presented in this work contribute to the understanding of the cellular and molecular etiology of the intellectual disability in DS and in particular the contribution of cells differentiated from OLIG2+ progenitors

    Animating potential for intensities and becoming in writing: challenging discursively constructed structures and writing conventions in academia through the use of storying and other post qualitative inquiries

    Get PDF
    Written for everyone ever denied the opportunity of fulfilling their academic potential, this is ‘Chloe’s story’. Using composite selves, a phrase chosen to indicate multiplicities and movement, to story both the initial event leading to ‘Chloe’s’ immediate withdrawal from a Further Education college and an imaginary second chance to support her whilst at university, this Deleuzo-Guattarian (2015a) ‘assemblage’ of post qualitative inquiries offers challenge to discursively constructed structures and writing conventions in academia. Adopting a posthuman approach to theorising to shift attention towards affects and intensities always relationally in action in multiple ‘assemblages’, these inquiries aim to decentre individual ‘lecturer’ and ‘student’ identities. Illuminating movements and moments quivering with potential for change, then, hoping thereby to generate second chances for all, different approaches to writing are exemplified which trouble those academic constraints by fostering inquiry and speculation: moving away from ‘what is’ towards ‘what if’. With the formatting of this thesis itself also always troubling the rigid Deleuzo-Guattarian (2015a) ‘segmentary lines’ structuring orthodox academic practice, imbricated in these inquiries are attempts to exemplify Manning’s (2015; 2016) ‘artfulness’ through shifts in thinking within and around an emerging PhD thesis. As writing resists organising, the verb thesisising comes into play to describe the processes involved in creating this always-moving thesis. Using ‘landing sites’ (Arakawa and Gins, 2009) as a landscaping device, freely creating emerging ‘lines of flight’ (Deleuze and Guattari, 2015a) so often denied to students forced to adhere to strict academic conventions, this ‘movement-moving’ (Manning, 2014) opens up opportunities for change as in Manning’s (2016) ‘research-creation’. Arguing for a moving away from writing-representing towards writing-inquiring, towards a writing ‘that does’ (Wyatt and Gale, 2018: 127), and toward writing as immanent doing, it is hoped to animate potential for intensities and becoming in writing, offering opportunities and glimmerings of the not-yet-known

    Learning disentangled speech representations

    Get PDF
    A variety of informational factors are contained within the speech signal and a single short recording of speech reveals much more than the spoken words. The best method to extract and represent informational factors from the speech signal ultimately depends on which informational factors are desired and how they will be used. In addition, sometimes methods will capture more than one informational factor at the same time such as speaker identity, spoken content, and speaker prosody. The goal of this dissertation is to explore different ways to deconstruct the speech signal into abstract representations that can be learned and later reused in various speech technology tasks. This task of deconstructing, also known as disentanglement, is a form of distributed representation learning. As a general approach to disentanglement, there are some guiding principles that elaborate what a learned representation should contain as well as how it should function. In particular, learned representations should contain all of the requisite information in a more compact manner, be interpretable, remove nuisance factors of irrelevant information, be useful in downstream tasks, and independent of the task at hand. The learned representations should also be able to answer counter-factual questions. In some cases, learned speech representations can be re-assembled in different ways according to the requirements of downstream applications. For example, in a voice conversion task, the speech content is retained while the speaker identity is changed. And in a content-privacy task, some targeted content may be concealed without affecting how surrounding words sound. While there is no single-best method to disentangle all types of factors, some end-to-end approaches demonstrate a promising degree of generalization to diverse speech tasks. This thesis explores a variety of use-cases for disentangled representations including phone recognition, speaker diarization, linguistic code-switching, voice conversion, and content-based privacy masking. Speech representations can also be utilised for automatically assessing the quality and authenticity of speech, such as automatic MOS ratings or detecting deep fakes. The meaning of the term "disentanglement" is not well defined in previous work, and it has acquired several meanings depending on the domain (e.g. image vs. speech). Sometimes the term "disentanglement" is used interchangeably with the term "factorization". This thesis proposes that disentanglement of speech is distinct, and offers a viewpoint of disentanglement that can be considered both theoretically and practically

    Embodying entrepreneurship: everyday practices, processes and routines in a technology incubator

    Get PDF
    The growing interest in the processes and practices of entrepreneurship has been dominated by a consideration of temporality. Through a thirty-six-month ethnography of a technology incubator, this thesis contributes to extant understanding by exploring the effect of space. The first paper explores how class structures from the surrounding city have appropriated entrepreneurship within the incubator. The second paper adopts a more explicitly spatial analysis to reveal how the use of space influences a common understanding of entrepreneurship. The final paper looks more closely at the entrepreneurs within the incubator and how they use visual symbols to develop their identity. Taken together, the three papers reject the notion of entrepreneurship as a primarily economic endeavour as articulated through commonly understood language and propose entrepreneuring as an enigmatic attractor that is accessed through the ambiguity of the non-verbal to develop the ‘new’. The thesis therefore contributes to the understanding of entrepreneurship and proposes a distinct role for the non-verbal in that understanding
    corecore