28 research outputs found

    Giraffe: Adventures in Expanding Context Lengths in LLMs

    Modern large language models (LLMs) that rely on attention mechanisms are typically trained with fixed context lengths which enforce upper limits on the length of input sequences that they can handle at evaluation time. To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of which focus on modifying the system of positional encodings used in the attention mechanism to indicate where tokens or activations are located in the input sequence. We conduct a wide survey of existing methods of context length extrapolation on a base LLaMA or LLaMA 2 model, and introduce some of our own design as well -- in particular, a new truncation strategy for modifying the basis for the position encoding. We test these methods using three new evaluation tasks (FreeFormQA, AlteredNumericQA, and LongChat-Lines) as well as perplexity, which we find to be less fine-grained as a measure of long context performance of LLMs. We release the three tasks publicly as datasets on HuggingFace. We discover that linear scaling is the best method for extending context length, and show that further gains can be achieved by using longer scales at evaluation time. We also discover promising extrapolation capabilities in the truncated basis. To support further research in this area, we release three new 13B parameter long-context models which we call Giraffe: 4k and 16k context models trained from base LLaMA-13B, and a 32k context model trained from base LLaMA2-13B. We also release the code to replicate our results

    SCAN: Learning Hierarchical Compositional Visual Concepts

    The seemingly infinite diversity of the natural world arises from a relatively small set of coherent rules, such as the laws of physics or chemistry. We conjecture that these rules give rise to regularities that can be discovered through primarily unsupervised experiences and represented as abstract concepts. If such representations are compositional and hierarchical, they can be recombined into an exponentially large set of new concepts. This paper describes SCAN (Symbol-Concept Association Network), a new framework for learning such abstractions in the visual domain. SCAN learns concepts through fast symbol association, grounding them in disentangled visual primitives that are discovered in an unsupervised manner. Unlike state of the art multimodal generative model baselines, our approach requires very few pairings between symbols and images and makes no assumptions about the form of symbol representations. Once trained, SCAN is capable of multimodal bi-directional inference, generating a diverse set of image samples from symbolic descriptions and vice versa. It also allows for traversal and manipulation of the implicit hierarchy of visual concepts through symbolic instructions and learnt logical recombination operations. Such manipulations enable SCAN to break away from its training data distribution and imagine novel visual concepts through symbolically instructed recombination of previously learnt concepts

    An Upper Lip Swelling: A Case Report, Differential Diagnosis and Review of Literature

    Pleomorphic adenoma is the most common tumor of the salivary glands. About 90% of these tumors occur in the parotid gland and 10% in the minor salivary glands. The most common site for pleomorphic adenoma of the minor salivary glands is the palate, followed by the upper lip. Surgical removal with adequate margins is the principal treatment. Due to its microscopic projections, this tumor requires a wide resection to avoid recurrence. We report a case of pleomorphic adenoma in the upper lip of an elderly female, with emphasis on differential diagnosis and a review of the relevant literature.

    Fusarium Disease of Maize and Its Management through Sustainable Approach

    Fusarium disease of maize is probably one of the most serious diseases of crop plants worldwide. It not only damages the maize plant and reduces its potential yield and nutritional value, but also threatens human life through the induction of mycotoxin development. F. graminearum and F. moniliforme (syn. Fusarium verticillioides) are two important maize pathogens that cause substantial damage to the ear, stalk and foliage and contaminate grains with mycotoxins. Since conventional control methods, including chemical ones, have proved insufficient for total control of the disease and have made conditions worse for the surrounding environment, the application of plant growth-promoting rhizobacteria (PGPR) and fungi (PGPF) can play a significant role in controlling the damage caused by Fusarium.

    Search for post-merger gravitational waves from the remnant of the binary neutron star merger GW170817

    In Advanced LIGO, detection and astrophysical source parameter estimation of the binary black hole merger GW150914 requires a calibrated estimate of the gravitational-wave strain sensed by the detectors. Producing an estimate from each detector's differential arm length control loop readout signals requires applying time domain filters, which are designed from a frequency domain model of the detector's gravitational-wave response. The gravitational-wave response model is determined by the detector's opto-mechanical response and the properties of its feedback control system. The measurements used to validate the model and characterize its uncertainty are derived primarily from a dedicated photon radiation pressure actuator, with cross-checks provided by optical and radio frequency references. We describe how the gravitational-wave readout signal is calibrated into equivalent gravitational-wave-induced strain and how the statistical uncertainties and systematic errors are assessed. Detector data collected over 38 calendar days, from September 12 to October 20, 2015, contain the event GW150914 and approximately 16 days of coincident data used to estimate the event false alarm probability. The calibration uncertainty is less than 10% in magnitude and 10 degrees in phase across the relevant frequency band, 20 Hz to 1 kHz.

    First narrow-band search for continuous gravitational waves from known pulsars in advanced detector data


    The X Chromosome of Hemipteran Insects: Conservation, Dosage Compensation and Sex-Biased Expression

    Insects of the order Hemiptera (true bugs) use a wide range of mechanisms of sex determination, including genetic sex determination, paternal genome elimination, and haplodiploidy. Genetic sex determination, the prevalent mode, is generally controlled by a pair of XY sex chromosomes or by an XX/XO system, but different configurations that include additional sex chromosomes are also present. Although this diversity of sex determining systems has been extensively studied at the cytogenetic level, only the X chromosome of the model pea aphid Acyrthosiphon pisum has been analyzed at the genomic level, and little is known about X chromosome biology in the rest of the order. In this study, we take advantage of published DNA- and RNA-seq data from three additional Hemiptera species to perform a comparative analysis of the gene content and expression of the X chromosome throughout this clade. We find that, despite showing evidence of dosage compensation, the X chromosomes of these species show female-biased expression, and a deficit of male-biased genes, in direct contrast to the pea aphid X. We further detect an excess of shared gene content between these very distant species, suggesting that despite the diversity of sex determining systems, the same chromosomal element is used as the X throughout a large portion of the order