93 research outputs found

    Probing the Statistical Properties of Unknown Texts: Application to the Voynich Manuscript

    While statistical physics methods have helped unveil many patterns in large text corpora, no comprehensive investigation has been performed on the interdependence between syntactic and semantic factors. In this study we propose a framework for determining whether a text (e.g., written in an unknown alphabet) is compatible with a natural language and to which language it could belong. The approach is based on three types of statistical measurements: those obtained from first-order statistics of word properties in a text, from the topology of complex networks representing texts, and from intermittency concepts in which text is treated as a time series. Comparative experiments were performed with the New Testament in 15 different languages and with distinct books in English and Portuguese in order to quantify how the different measurements depend on the language and on the story being told in the book. The metrics found to be informative in distinguishing real texts from their shuffled versions include assortativity, degree and selectivity of words. As an illustration, we analyze an undeciphered medieval manuscript known as the Voynich Manuscript. We show that it is mostly compatible with natural languages and incompatible with random texts. We also obtain candidates for keywords of the Voynich Manuscript which could be helpful in the effort of deciphering it. Because we were able to identify statistical measurements that are more dependent on the syntax than on the semantics, the framework may also serve for text analysis in language-dependent applications.

    Rapid plant DNA and RNA extraction protocol using a bench drill

    Plant DNA and RNA extraction methods are well established, with a wide range of protocols suited to the purposes of each laboratory or research project. Quick, inexpensive and easy plant DNA and RNA extraction methods are now highly sought after. We developed an optimized protocol for plant DNA and RNA extraction that uses an inexpensive bench drill and plastic bags and does not require liquid nitrogen. DNA from leaves and RNA from leaves and roots of banana, pineapple, citrus, papaya, passion fruit and cassava were extracted using a basic cetyltrimethylammonium bromide method. Both nucleic acids were quantified and evaluated for quality based on agarose gel electrophoresis. The DNA and RNA extractions were successful for all species, and RNA quality in pellets was maintained after storage at room temperature for three weeks. This protocol can reduce costs considerably in laboratories that routinely extract DNA and RNA for genetic diversity and gene expression analyses, where other conventional methods have not been successful owing to the explant, the condition of samples, and the quantity and quality of nucleic acids. This is especially relevant for many laboratories in developing countries, where the cost and availability of liquid nitrogen may be a constraint.

    Strong correlations between text quality and complex networks features

    Concepts of complex networks have been used to obtain metrics that were correlated with text quality, as established by scores assigned by human judges. Texts produced by high-school students in Portuguese were represented as scale-free networks (word adjacency model), from which typical network features such as the in- and outdegree, clustering coefficient and shortest path were obtained. Another metric was derived from the dynamics of the network growth, based on the variation of the number of connected components. The scores assigned by the human judges according to three text quality criteria (coherence and cohesion, adherence to standard writing conventions and theme adequacy/development) were correlated with the network measurements. Text quality for all three criteria was found to decrease with increasing average values of outdegrees, clustering coefficient and deviation from the dynamics of network growth. Among the criteria employed, cohesion and coherence showed the strongest correlation, which probably indicates that the network measurements are able to capture how the text is developed in terms of the concepts represented by the nodes in the networks. Though based on a particular set of texts and a specific language, the results presented here point to potential applications in other instances of text analysis. Comment: 8 pages, 8 figures
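The word adjacency model mentioned in the abstract above can be illustrated with a minimal sketch. It assumes whitespace tokenization and lowercasing, which are simplifications chosen here; the abstract does not state how the texts were preprocessed. The function names `word_adjacency_network` and `average_outdegree` are hypothetical, not taken from the paper.

```python
def word_adjacency_network(text):
    """Build a directed graph with an edge u -> v when word v directly follows word u.

    Assumption: words are split on whitespace and lowercased; the original
    study may have used a different preprocessing pipeline.
    """
    words = text.lower().split()
    # Every distinct word is a node, even if it has no outgoing edge.
    graph = {word: set() for word in words}
    for current_word, next_word in zip(words, words[1:]):
        graph[current_word].add(next_word)
    return graph


def average_outdegree(graph):
    """Mean number of distinct successors per node (0.0 for an empty graph)."""
    if not graph:
        return 0.0
    return sum(len(successors) for successors in graph.values()) / len(graph)
```

For example, `word_adjacency_network("the cat saw the dog")` yields four nodes, with "the" pointing to both "cat" and "dog". Measures such as the clustering coefficient would be computed over the same graph.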

    Quantifying the interdisciplinarity of scientific journals and fields

    There is an overall perception of increased interdisciplinarity in science, but this is difficult to confirm quantitatively owing to the lack of adequate methods to evaluate subjective phenomena. This is no different from the difficulties in establishing quantitative relationships in human and social sciences. In this paper we quantified the interdisciplinarity of scientific journals and science fields by using an entropy measurement based on the diversity of the subject categories of journals citing a specific journal. The methodology consisted of building citation networks using the Journal Citation Reports database, in which the nodes were journals and edges were established based on citations among journals. The overall network for the 11-year period (1999-2009) studied was small-world and scale-free with regard to the in-strength. Upon visualizing the network topology, an overall structure of the various science fields could be inferred, especially their interconnections. We confirmed quantitatively that science fields are becoming increasingly interdisciplinary, with the degree of interdisciplinarity (i.e. entropy) correlating strongly with the in-strength of journals and with the impact factor. Comment: 23 pages, 6 figures
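As a rough illustration of an entropy-based diversity measure like the one described above, the sketch below computes the Shannon entropy of the subject categories of citing journals. The choice of Shannon entropy with natural logarithms is an assumption; the abstract does not give the exact formula. The function name `citation_entropy` and the input format (one category label per citing journal) are hypothetical.

```python
from collections import Counter
from math import log


def citation_entropy(citing_categories):
    """Shannon entropy (in nats) of the subject categories of citing journals.

    Assumption: each element of `citing_categories` is the subject category
    of one citing journal; the paper's actual weighting may differ.
    """
    counts = Counter(citing_categories)
    total = sum(counts.values())
    # H = -sum(p * ln p) over categories with non-zero share.
    return -sum((n / total) * log(n / total) for n in counts.values())
```

Under this sketch, a journal cited only from one category scores zero, while citations spread evenly over k categories give the maximum value, ln k.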

    The microstructure and development of male genital organs of Spix's yellow-toothed cavy (Galea spixii) bred in captivity

    Spix's yellow-toothed cavy (Galea spixii), found in the semi-arid Northeast of Brazil, is a rodent of the family Caviidae. It occurs in the Caatinga and Cerrado regions of Brazil and breeds throughout the year, with a gestation period of 48 days and litters of 2 to 4 pups. The aim of this study was to characterize histologically the structural components of the genital organs of male cavies and relate them to the chronological development of these organs in the species. Animals were analyzed at birth and at 15, 30, 45, 60, 75, 90, 105, 120 and 150 days of age. Fragments of the epididymis, ductus deferens, penis and accessory sex glands were collected, fixed and processed for description by light microscopy. The epididymis had a simple columnar epithelium, and each sexual phase showed differences in the size of the tubular lumen, with spermatozoa present in the lumen at 45 days of age. The epithelium of the ductus deferens was pseudostratified columnar, with stereocilia increasingly present with advancing age. The vesicular gland had a mucosa with variable folding, according to the phase of sexual development. The prostate was poorly developed, with a small lumen, in cavies at birth and at 15 days of age; at 45 days it showed variable folding of the epithelium. The male genital organs of the cavy underwent morphological changes with age and sexual development, which helped determine the onset of puberty, which in the species studied occurred at 45 days of age.

    Study for the assessment and management of water availability in the Lagoa Mirim basin

    Lagoa Mirim is one of the main natural systems in Rio Grande do Sul. The region's development depends fundamentally on the availability of water resources and their interaction with demands, both existing and those foreseen in future scenarios. The region's main water resource, Lagoa Mirim, is shared with Uruguay, so its water resources management involves the state and federal administrations and Brazil's strategy in this context. Regional environmental constraints include the conservation of the Taim ecological station, administered by IBAMA (Instituto Brasileiro de Meio Ambiente), which is subject to anthropogenic pressure because large volumes of water are withdrawn for irrigation in its watershed. This study is aimed fundamentally at an overall assessment of the factors governing water availability and demand in the region. The study was commissioned by the state of Rio Grande do Sul. Its specific objectives are: to characterize the behavior of Lagoa Mirim and the Taim system; and to establish the water balance of Lagoa Mirim by assessing its water availability in light of the existing water demands in the region.