
    Isoenzyme electrophoresis of 30 isolates of Giardia from humans and felines

    Thirty isolates of Giardia duodenalis from humans and felines were compared by isoenzyme electrophoresis. Using 10 enzyme systems, 13 different zymodemes were distinguished. The majority of zymodemes could be divided into two groups: one group comprising human and feline isolates with worldwide geographic distribution; the other group containing human isolates restricted to Western Australia. A number of isolates showed multiple-banded patterns and the genetic significance of these findings is discussed. The marked heterogeneity of G. duodenalis demonstrated in this study is considered in relation to the epidemiology of giardiasis. The findings are consistent with felines serving as a reservoir of infection to humans

    Microdata Deduplication with Spark

    The web is transforming from a traditional web into a web of data, where information is presented so that it is readable by machines as well as humans. As part of this transformation, more and more websites embed structured data (e.g. products, persons, organizations, and places) into their HTML pages every day. Different encoding formats, such as RDFa, microdata, and microformats, are used to embed the structured data. Microdata is the most recent addition to these structured-data embedding standards, but it has gained popularity over the other formats in a relatively short time. Similarly, progress has been made in extracting structured data from web pages, which has resulted in open-source tools such as Apache Any23 and the non-profit Common Crawl project. Any23 allows microdata to be extracted from web pages with little effort, whereas Common Crawl extracts data from websites and makes it publicly available for download. However, microdata extraction tools only take care of the parsing and data transformation steps of data cleansing. Although microdata can easily be extracted with these state-of-the-art tools, duplicates should be removed and the data unified before the extracted data is used in potential applications. Since microdata originates from arbitrary web resources, its quality is arbitrary as well, and it should be treated accordingly.
    The main purpose of this thesis is to develop an effective mechanism for deduplicating microdata at web scale. Although deduplication algorithms have reached relative maturity, they still need to be fine-tuned for specific datasets. In particular, the most suitable length of the sorting key must be identified for the sort-based deduplication approach. The present work identifies the optimum length of the sorting key in the context of deduplicating extracted product microdata. Because large volumes of data must be processed continuously, Apache Spark is used to implement the necessary procedures.
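
The role of the sorting-key length can be illustrated with a minimal single-machine sketch. It assumes a sorted-neighbourhood-style procedure (sort records by a key prefix, then compare only records that fall inside a small sliding window); the abstract does not spell out the exact algorithm, and the thesis itself targets Apache Spark rather than plain Python. The function names, the `name` field, the window size, and the similarity threshold are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher


def sorting_key(record: dict, key_length: int) -> str:
    """Build a key from the first `key_length` alphanumeric characters of the name.

    The `name` field and this normalisation are assumptions for illustration.
    """
    normalised = "".join(ch for ch in record["name"].lower() if ch.isalnum())
    return normalised[:key_length]


def find_duplicates(records, key_length=6, window=3, threshold=0.85):
    """Return index pairs of likely duplicates using a sliding window over sorted keys."""
    # Sort record indices by their (truncated) sorting key.
    order = sorted(range(len(records)), key=lambda i: sorting_key(records[i], key_length))
    pairs = set()
    for pos, i in enumerate(order):
        # Compare each record only with the next `window - 1` records in sorted order.
        for j in order[pos + 1 : pos + window]:
            a = records[i]["name"].lower()
            b = records[j]["name"].lower()
            if SequenceMatcher(None, a, b).ratio() >= threshold:
                pairs.add((min(i, j), max(i, j)))
    return pairs


# Hypothetical product records, not taken from the thesis data.
products = [
    {"name": "Acme Phone X100 Black"},
    {"name": "Zeta Laptop 15"},
    {"name": "ACME Phone X100 black"},
    {"name": "Acme Phone Case"},
]
print(find_duplicates(products))
```

A shorter key tends to place more unrelated records next to each other in the sorted order, while a longer key can separate true duplicates that differ early in the string, which is one plausible reason the key length needs tuning per dataset.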

    The EU as Provider of Frames and Scripts: Evidence on Law and Courts from Europe, Africa, Asia, and the Americas

    Regional trade agreements (RTAs) differ a great deal in both their legal and judicial dimensions. Accounting for RTAs means, in good part, to explain those differences. A rationalist approach focused on interests, calculations, and utility-maximizing outcomes can offer limited insight into those differences. RTA officials do not operate in a vacuum. First, at the intra-regional level, they work in environments with established, and often similar, national legal and judicial traditions. Those traditions, rather than the EU, provide the frames through which officials interpret and solve the regulatory challenges associated with integration: officials develop legal frameworks and judicial mechanisms that mirror, in their overall character, what is already in place in the member states. But, second, officials are aware of RTAs elsewhere in the world – above all, the EU. The EU provides ready-made detailed instructions, or scripts, for the formulation of specific laws and judicial processes in other RTAs. If consistent with the national traditions in a given RTA, officials often adopt or mimic those scripts. Thus, overall, choices about legal and judicial design have little to do with what is ‘best’ for trade liberalization and the fulfillment of national interests. They have a lot more to do with continuity, legitimacy, and expediency

    Clustering in an Object-Oriented Environment

    This paper describes the incorporation of seven stand-alone clustering programs into S-PLUS, where they can now be used in a much more flexible way. The original Fortran programs carried out new cluster analysis algorithms introduced in the book of Kaufman and Rousseeuw (1990). These clustering methods were designed to be robust and to accept dissimilarity data as well as objects-by-variables data. Moreover, they each provide a graphical display and a quality index reflecting the strength of the clustering. The powerful graphics of S-PLUS made it possible to improve these graphical representations considerably. The integration of the clustering algorithms was performed according to the object-oriented principle supported by S-PLUS. The new functions have a uniform interface, and are compatible with existing S-PLUS functions. We will describe the basic idea and the use of each clustering method, together with its graphical features. Each function is briefly illustrated with an example.
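
As a rough illustration of a quality index computed directly from dissimilarity data, the sketch below calculates silhouette widths from a dissimilarity matrix and a given cluster assignment. The abstract does not name the index used by the programs, so the choice of silhouette width is an assumption, and the code is plain Python rather than the S-PLUS functions the paper describes. It assumes at least two clusters and that not all dissimilarities are zero.

```python
def silhouette_widths(dissim, labels):
    """Silhouette width s(i) = (b - a) / max(a, b) for every object.

    dissim: square, symmetric matrix (list of lists) of pairwise dissimilarities.
    labels: cluster label for each object; at least two distinct labels assumed.
    """
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)

    widths = []
    for i, lab_i in enumerate(labels):
        own = clusters[lab_i]
        if len(own) == 1:
            # Convention: an object alone in its cluster gets width 0.
            widths.append(0.0)
            continue
        # a: mean dissimilarity to the other members of the object's own cluster.
        a = sum(dissim[i][j] for j in own if j != i) / (len(own) - 1)
        # b: smallest mean dissimilarity to the members of any other cluster.
        b = min(
            sum(dissim[i][j] for j in members) / len(members)
            for lab, members in clusters.items()
            if lab != lab_i
        )
        widths.append((b - a) / max(a, b))
    return widths


# Hypothetical data: four points on a line, dissimilarity = absolute difference.
points = [0.0, 1.0, 10.0, 11.0]
dissim = [[abs(p - q) for q in points] for p in points]
widths = silhouette_widths(dissim, [0, 0, 1, 1])
print(sum(widths) / len(widths))  # average width; values near 1 suggest well-separated clusters
```

Because the function only reads the dissimilarity matrix, it works the same way whether the dissimilarities were supplied directly or derived from objects-by-variables data.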

    GPML: an XML-based standard for the interchange of genetic programming trees

    We propose a Genetic Programming Markup Language (GPML), an XML based standard for the interchange of genetic programming trees, and outline the benefits such a format would bring in allowing the deployment of trained genetic programming (GP) models in applications as well as the subsidiary benefit of allowing GP researchers to directly share trained trees. We present a formal definition of this standard and describe details of an implementation. In addition, we present a case study where GPML is used to implement a model predictive controller for the control of a building heating plant

    Characterizing practices and the sanitary status of farms of Creole and Creole-like pigs in Martinique

    The study aimed at characterizing the management of 27 pig farms in the framework of a conservation program of the Creole pig of Martinique. Blood samples were collected on 67 pigs for serological analysis. A multivariate analysis discriminated six groups of farms: group 1 (n = 9) consisted of family farms with fewer than three boars and a higher age of replacement of reproducers than the other farms (6.2 vs 5.0 years, p < 0.01); group 2 (n = 5) consisted of small farms with two matings per sow per year and with more than two livestock species; group 3 (n = 5) consisted of farms with more than four boars and more than four sows, with batch management and a feeding diet mainly based on industrial concentrates; group 4 (n = 4) consisted of family farms with pigs reared outdoors and a feeding diet mainly based on local resources (50 to 100% of the total diet content); group 5 (n = 3) consisted of mixed farming systems with more than three boars and more than 10 sows; and group 6 consisted of a farm with a high number of females without batch management. The analysis of serological data showed that, regardless of the farming system (outdoor, semi-outdoor, indoor), the sanitary status of pig farms in Martinique was good, with no identified contagious disease. The typology obtained and the results on the sanitary status of farms are encouraging signs to pursue the project aiming at the conservation of Creole pigs by exchange of healthy reproducers between farmers and the development of a niche market around heritage values

    Repeatability of traits for characterizing feed intake patterns in dairy goats: a basis for phenotyping in the precision farming context

    In ruminants, feeding behaviour variables are parameters involved in feed efficiency that show variation among individuals. This study aimed to evaluate the repeatability of the feed intake pattern, an important aspect of feeding behaviour, during the first two production cycles. Thirty-five dairy goats of the Alpine or Saanen breeds were housed in individual pens during four periods (end of first gestation, middle of first and second lactations, and middle of second gestation, which is also the end of first lactation) and fed a total mixed ration (TMR) ad libitum. Individual cumulative dry matter intake (DMI) was automatically measured every 2 min during the last 4 days of each period. The feed intake pattern was characterized by several measures related to the quantity of feed eaten or to the rate of intake during the 15 h following the afternoon feed delivery. Two main methods were used: modelling the evolution of cumulative DMI with an exponential model or with a segmentation-clustering method. The goats' ability to sort against dietary fibre was also evaluated. For a given goat, the aggregate measures were highly repeatable between days within a period, as estimated by the day effect within breed and goat, tested on the residual variance (P > 0.95). The correlations between periods were highest between the second period and either the third or the fourth period. With increasing age, goats sorted more against the fibrous part of the TMR and increased their initial rate of intake. Alpine goats ate more slowly than Saanen goats but ate for a longer time. Principal component analysis (PCA) was performed on all the aggregate measures of feed intake patterns. The factor score plots generated by the PCA highlighted the opposition between the different measures of feed intake patterns and the sorting behaviour. The projection of the animals on the score plots showed a breed effect and a continuum in the feed intake patterns of goats. In conclusion, this study showed that the feed intake pattern was highly repeatable for an animal within a given period and between periods. This means that phenotyping goats at a younger age might be of interest, either to select them on feeding behaviour, preferring the slow eaters, or to adapt the quantity offered and restrict feed delivery to the fast eaters in order to increase feed efficiency and welfare, for example by limiting the occurrence of acidosis