50 research outputs found

    Generalization on the Unseen, Logic Reasoning and Degree Curriculum

    Full text link
    This paper considers the learning of logical (Boolean) functions with focus on the generalization on the unseen (GOTU) setting, a strong case of out-of-distribution generalization. This is motivated by the fact that the rich combinatorial nature of data in certain reasoning tasks (e.g., arithmetic/logic) makes representative data sampling challenging, and learning successfully under GOTU gives a first vignette of an 'extrapolating' or 'reasoning' learner. We then study how different network architectures trained by (S)GD perform under GOTU and provide both theoretical and experimental evidence that for a class of network models including instances of Transformers, random features models, and diagonal linear networks, a min-degree-interpolator is learned on the unseen. We also provide evidence that other instances with larger learning rates or mean-field networks reach leaky min-degree solutions. These findings lead to two implications: (1) we provide an explanation to the length generalization problem (e.g., Anil et al. 2022); (2) we introduce a curriculum learning algorithm called Degree-Curriculum that learns monomials more efficiently by incrementing supports.Comment: To appear in ICML 202

    On the multiplicative independence between nn and ⌊αn⌋\lfloor \alpha n\rfloor

    Full text link
    In this article we investigate different forms of multiplicative independence between the sequences nn and ⌊nα⌋\lfloor n \alpha \rfloor for irrational α\alpha. Our main theorem shows that for a large class of arithmetic functions a,b ⁣:N→Ca, b \colon \mathbb{N} \to \mathbb{C} the sequences (a(n))n∈N(a(n))_{n \in \mathbb{N}} and (b(⌊αn⌋))n∈N(b ( \lfloor \alpha n \rfloor))_{n \in \mathbb{N}} are asymptotically uncorrelated. This new theorem is then applied to prove a 22-dimensional version of the Erd\H{o}s-Kac theorem, asserting that the sequences (ω(n))n∈N(\omega(n))_{n \in \mathbb{N}} and (ω(⌊αn⌋)n∈N(\omega( \lfloor \alpha n \rfloor)_{n\in \mathbb{N}} behave as independent normally distributed random variables with mean log⁥log⁥n\log\log n and standard deviation log⁥log⁥n\sqrt{ \log \log n}. Our main result also implies a variation on Chowla's Conjecture asserting that the logarithmic average of (λ(n)λ(⌊αn⌋))n∈N(\lambda(n) \lambda ( \lfloor \alpha n \rfloor))_{n \in \mathbb{N}} tends to 00.Comment: 34 pages; fixed misspelled author name; December 7 2023: updated authors affiliation, light edits, typos, added chart of main proof in introductio

    The Effectiveness and Cost-Effectiveness of Screening for HIV in Migrants in the EU/EEA: A Systematic Review

    Get PDF
    Migrants, defined as individuals who move from their country of origin to another, account for 40% of newly-diagnosed cases of human immunodeficiency virus (HIV) in the European Union/European Economic Area (EU/EEA). Populations at high risk for HIV include migrants, from countries or living in neighbourhoods where HIV is prevalent, and those participating in high risk behaviour. These migrants are at risk of low CD4 counts at diagnosis, increased morbidity, mortality, and onward transmission. The aim of this systematic review is to evaluate the effectiveness and cost-effectiveness of HIV testing strategies in migrant populations and to estimate their effect on testing uptake, mortality, and resource requirements. Following a systematic overview, we included four systematic reviews on the effectiveness of strategies in non-migrant populations and inferred their effect on migrant populations, as well as eight individual studies on cost-effectiveness/resource requirements. We assessed the certainty of our results using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach. The systematic reviews reported that HIV tests are highly accurate (rapid test >90% sensitivity, Western blot and ELISA >99% sensitivity). A meta-analysis showed that rapid testing approaches improve the access and uptake of testing (risk ratio = 2.95, 95% CI: 1.69 to 5.16), and were associated with a lower incidence of HIV in the middle-aged women subgroup among marginalised populations at a high risk of HIV exposure and HIV related stigma. Economic evidence on rapid counselling and testing identified strategic advantages with rapid tests. In conclusion, community-based rapid testing programmes may have the potential to improve uptake of HIV testing among migrant populations across a range of EU/EEA settings

    TOG–tubulin binding specificity promotes microtubule dynamics and mitotic spindle formation

    Get PDF
    XMAP215, CLASP, and Crescerin use arrayed tubulin-binding tumor overexpressed gene (TOG) domains to modulate microtubule dynamics. We hypothesized that TOGs have distinct architectures and tubulin-binding properties that underlie each family’s ability to promote microtubule polymerization or pause. As a model, we investigated the pentameric TOG array of a Drosophila melanogaster XMAP215 member, Msps. We found that Msps TOGs have distinct architectures that bind either free or polymerized tubulin, and that a polarized array drives microtubule polymerization. An engineered TOG1-2-5 array fully supported Msps-dependent microtubule polymerase activity. Requisite for this activity was a TOG5-specific N-terminal HEAT repeat that engaged microtubule lattice-incorporated tubulin. TOG5–microtubule binding maintained mitotic spindle formation as deleting or mutating TOG5 compromised spindle architecture and increased the mitotic index. Mad2 knockdown released the spindle assembly checkpoint triggered when TOG5–microtubule binding was compromised, indicating that TOG5 is essential for spindle function. Our results reveal a TOG5-specific role in mitotic fidelity and support our hypothesis that architecturally distinct TOGs arranged in a sequence-specific order underlie TOG array microtubule regulator activity

    OSIRIS-REx Encounters Bennu: Initial Assessment from the Approach Phase

    Get PDF
    The OSIRIS-REx spacecraft launched on September 8, 2016, on a seven-year journey to return samples from asteroid (101955) Bennu. This presentation summarizes the scientific results from the Approach and Preliminary Survey phases. Bennu observations are set to begin on August 17, 2018,when the asteroid is bright enough for detection by the PolyCam. PolyCam and MapCam collect data to survey the asteroid environment for any hazards and characterize the asteroid point-source photometric properties. Resolved images acquired during final approach, starting in late October 2018, allow the creation of a shape model using stereophotoclinometry (SPC), needed by both the navigation team and science planners. The OVIRS and OTES spectrometers characterize the point- source spectral properties over a full rotation period, providing a first look at any features and thermophysical properties. TAGSAM is released from the launch container and deployed into the sampling configuration then returned to the stow position.Preliminary Survey follows the Approach Phase in early December 2018. This phase consists of a series of hyperbolic trajectories that cross over the North and South poles and the equator of Bennu at a close-approach distance of 7 km. Images from these Preliminary Survey passes provide data to complete the 75-cm resolution SPC global shape model and solve for the rotation state. Once the shape model is complete, the asteroid coordinate system is defined for co-registration of all data products. These higher-resolution images also constrain the photometric properties and allow for an initial assessment of the geology. In Preliminary Survey the team also obtains the first OLA data, providing a measure of the surface topography. OVIRS and OTES collect data as "ride-along" instruments, with the spacecraft pointing driven by imaging constraints. These data provide a first look at the spectral variation across the surface of Bennu. Radio science measurements, combined with altimetry and imagery, determine Bennu's mass, a prerequisite to placing the spacecraft into orbit in late December 2018. Together, data from the Approach and Preliminary Survey phases set the stage for the extensive mapping planned for 2019. These dates are the baseline plan. Any contingency or unexpected discovery may change this mission profile

    Burnout among surgeons before and during the SARS-CoV-2 pandemic: an international survey

    Get PDF
    Background: SARS-CoV-2 pandemic has had many significant impacts within the surgical realm, and surgeons have been obligated to reconsider almost every aspect of daily clinical practice. Methods: This is a cross-sectional study reported in compliance with the CHERRIES guidelines and conducted through an online platform from June 14th to July 15th, 2020. The primary outcome was the burden of burnout during the pandemic indicated by the validated Shirom-Melamed Burnout Measure. Results: Nine hundred fifty-four surgeons completed the survey. The median length of practice was 10 years; 78.2% included were male with a median age of 37 years old, 39.5% were consultants, 68.9% were general surgeons, and 55.7% were affiliated with an academic institution. Overall, there was a significant increase in the mean burnout score during the pandemic; longer years of practice and older age were significantly associated with less burnout. There were significant reductions in the median number of outpatient visits, operated cases, on-call hours, emergency visits, and research work, so, 48.2% of respondents felt that the training resources were insufficient. The majority (81.3%) of respondents reported that their hospitals were included in the management of COVID-19, 66.5% felt their roles had been minimized; 41% were asked to assist in non-surgical medical practices, and 37.6% of respondents were included in COVID-19 management. Conclusions: There was a significant burnout among trainees. Almost all aspects of clinical and research activities were affected with a significant reduction in the volume of research, outpatient clinic visits, surgical procedures, on-call hours, and emergency cases hindering the training. Trial registration: The study was registered on clicaltrials.gov "NCT04433286" on 16/06/2020

    Large expert-curated database for benchmarking document similarity detection in biomedical literature search

    Get PDF
    Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe

    Run-time remapping algorithm of dataflow actors on NoC-based heterogeneous MPSoCs

    No full text
    International audienceMultiprocessor system-on-chip (MPSoC) platforms have been emerging as the main solution to cope with processor frequency ceiling and power density issues while still improving performances. Then, network-on-chip (NoC) has been adopted to provide the increasing number of processors with the required communication bandwidth as well as with the necessary flexibility. Video processing and streaming applications are adopting dynamic dataflow model of computation as the need for high performance parallel computing is growing. Dataflow applications executed on modern MPSoC-based architectures are becoming increasingly dynamic and more data-dependent. Different tasks execute concurrently with significant modifications in their workloads and resource demanding over time depending on the input data. Hence, adopting any static or offline dynamic scheduling for mapping tasks will not cope with the computation variations. This paper introduces an original run-time mapping algorithm based on the Move Based (MB) method targeting a dedicated heterogeneous NoC-based MPSoC architecture to achieve workload balancing and optimized communication traffic. The performance of the proposed algorithm is verified by conducting cycle-accurate SystemC simulations of the adopted NoC implementing a real MPEG4-SP decoder. The obtained results reveal the effectiveness of our proposed algorithm. For various real-life videos, the proposed algorithm systematically succeeded to enhance significantly the performance
    corecore