1,194 research outputs found

    Whole genome sequence and analysis of the Marwari horse breed and its genetic origin

    Get PDF
    Background: The horse (Equus ferus caballus) is one of the earliest domesticated species and has played an important role in the development of human societies over the past 5,000 years. In this study, we characterized the genome of the Marwari horse, a rare breed with unique phenotypic characteristics, including inwardly turned ear tips. It is thought to have originated from the crossbreeding of local Indian ponies with Arabian horses beginning in the 12th century. Results: We generated 101 Gb (similar to 30 x coverage) of whole genome sequences from a Marwari horse using the Illumina HiSeq2000 sequencer. The sequences were mapped to the horse reference genome at a mapping rate of similar to 98% and with similar to 95% of the genome having at least 10 x coverage. A total of 5.9 million single nucleotide variations, 0.6 million small insertions or deletions, and 2,569 copy number variation blocks were identified. We confirmed a strong Arabian and Mongolian component in the Marwari genome. Novel variants from the Marwari sequences were annotated, and were found to be enriched in olfactory functions. Additionally, we suggest a potential functional genetic variant in the TSHZ1 gene (p.Ala344>Val) associated with the inward-turning ear tip shape of the Marwari horses. Conclusions: Here, we present an analysis of the Marwari horse genome. This is the first genomic data for an Asian breed, and is an invaluable resource for future studies of genetic variation associated with phenotypes and diseases in horses.open1

    Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

    Full text link
    Previous research in multi-document news summarization has typically concentrated on collating information that all sources agree upon. However, to our knowledge, the summarization of diverse information dispersed across multiple articles about an event has not been previously investigated. The latter imposes a different set of challenges for a summarization model. In this paper, we propose a new task of summarizing diverse information encountered in multiple news articles encompassing the same event. To facilitate this task, we outlined a data collection schema for identifying diverse information and curated a dataset named DiverseSumm. The dataset includes 245 news stories, with each story comprising 10 news articles and paired with a human-validated reference. Moreover, we conducted a comprehensive analysis to pinpoint the position and verbosity biases when utilizing Large Language Model (LLM)-based metrics for evaluating the coverage and faithfulness of the summaries, as well as their correlation with human assessments. We applied our findings to study how LLMs summarize multiple news articles by analyzing which type of diverse information LLMs are capable of identifying. Our analyses suggest that despite the extraordinary capabilities of LLMs in single-document summarization, the proposed task remains a complex challenge for them mainly due to their limited coverage, with GPT-4 only able to cover less than 40% of the diverse information on average

    Structure-aware shape processing

    Full text link

    Structure-aware shape processing

    Full text link

    From Classical to Modern Computational Approaches to Identify Key Genetic Regulatory Components in Plant Biology

    Get PDF
    The selection of plant genotypes with improved productivity and tolerance to environmental constraints has always been a major concern in plant breeding. Classical approaches based on the generation of variability and selection of better phenotypes from large variant collections have improved their efficacy and processivity due to the implementation of molecular biology techniques, particularly genomics, Next Generation Sequencing and other omics such as proteomics and metabolomics. In this regard, the identification of interesting variants before they develop the phenotype trait of interest with molecular markers has advanced the breeding process of new varieties. Moreover, the correlation of phenotype or biochemical traits with gene expression or protein abundance has boosted the identification of potential new regulators of the traits of interest, using a relatively low number of variants. These important breakthrough technologies, built on top of classical approaches, will be improved in the future by including the spatial variable, allowing the identification of gene(s) involved in key processes at the tissue and cell levels

    Do rent-seeking and interregional transfers contribute to urban primacy in sub-Saharan Africa?

    Get PDF
    We develop an economic geography model where mobile skilled workers choose to either work in a production sector or to become part of an unproductive elite. The elite sets income tax rates to maximize its own welfare by extracting rents, thereby influencing the spatial structure of the economy and changing the available range of consumption goods. We show that either unskilled labor mobility, or rent-seeking behavior, or both, are likely to favor the occurence of agglomeration and of urban primacy. In equilibrium, the elite may tax the unskilled workers but does not tax the skilled workers, and there are rural-urban transfers towards the agglomeration. The size of the elite and the magnitude of the tax burden that falls on the unskilled decrease with product differentiation and with the expenditure share for manufacturing goods. All these results are broadly in line with observed patterns of urban primacy and economic development in sub-Saharan African countries.economic geography; rent-seeking; interregional transfers; urban primacy; Sub-Saharan Africa.
    corecore