
    Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines

    A cross-disciplinary examination of the user behaviours involved in seeking and evaluating data is surprisingly absent from the research data discussion. This review explores the data retrieval literature to identify commonalities in how users search for and evaluate observational research data. Two analytical frameworks rooted in information retrieval and science technology studies are used to identify key similarities in practices as a first step toward developing a model describing data retrieval

    Automatically assembling a full census of an academic field

    The composition of the scientific workforce shapes the direction of scientific research, directly through the selection of questions to investigate, and indirectly through its influence on the training of future scientists. In most fields, however, complete census information is difficult to obtain, complicating efforts to study workforce dynamics and the effects of policy. This is particularly true in computer science, which lacks a single, all-encompassing directory or professional organization. A full census of computer science would serve many purposes, not the least of which is a better understanding of the trends and causes of unequal representation in computing. Previous academic census efforts have relied on narrow or biased samples, or on professional society membership rolls. A full census can be constructed directly from online departmental faculty directories, but doing so by hand is prohibitively expensive and time-consuming. Here, we introduce a topical web crawler for automating the collection of faculty information from web-based department rosters, and demonstrate the resulting system on the 205 PhD-granting computer science departments in the U.S. and Canada. This method constructs a complete census of the field within a few minutes, and achieves over 99% precision and recall. We conclude by comparing the resulting 2017 census to a hand-curated 2011 census to quantify turnover and retention in computer science, in general and for female faculty in particular, demonstrating the types of analysis made possible by automated census construction. Comment: 11 pages, 6 figures, 2 tables

    Pervasive Parallel And Distributed Computing In A Liberal Arts College Curriculum

    We present a model for incorporating parallel and distributed computing (PDC) throughout an undergraduate CS curriculum. Our curriculum is designed to introduce students early to parallel and distributed computing topics and to expose students to these topics repeatedly in the context of a wide variety of CS courses. The key to our approach is the development of a required intermediate-level course that serves as an introduction to computer systems and parallel computing. It serves as a requirement for every CS major and minor and is a prerequisite to upper-level courses that expand on parallel and distributed computing topics in different contexts. With the addition of this new course, we are able to easily make room in upper-level courses to add and expand parallel and distributed computing topics. The goal of our curricular design is to ensure that every graduating CS major has exposure to parallel and distributed computing, with both a breadth and depth of coverage. Our curriculum is particularly designed for the constraints of a small liberal arts college; however, many of its ideas and much of its design are applicable to any undergraduate CS curriculum.

    Guilt By Genetic Association: The Fourth Amendment and the Search of Private Genetic Databases by Law Enforcement

    Over the course of 2018, a number of suspects in unsolved crimes have been identified through the use of GEDMatch, a public online genetic database. Law enforcement’s use of GEDMatch to identify suspects in cold cases likely does not constitute a search under the Fourth Amendment because the genetic information hosted on the website is publicly available. Transparency reports from direct-to-consumer (DTC) genetic testing providers like 23andMe and Ancestry suggest that federal and state officials may now be requesting access to private genetic databases as well. Whether law enforcement’s use of private DTC genetic databases to search for familial relatives of a suspect’s genetic profile constitutes a search within the meaning of the Fourth Amendment is far less clear. A strict application of the third-party doctrine suggests that individuals have no expectation of privacy in genetic information that they voluntarily disclose to third parties, including DTC providers. This Note, however, contends that the U.S. Supreme Court’s recent decision in Carpenter v. United States overwhelmingly supports the proposition that genetic information disclosed to third-party DTC providers is subject to Fourth Amendment protection. Approximately fifteen million individuals in the United States have already submitted their genetic information to DTC providers. The genetic information held by these providers can reveal a host of highly intimate details about consumers’ medical conditions, behavioral traits, genetic health risks, ethnic background, and familial relationships. Allowing law enforcement warrantless access to investigate third-party DTC genetic databases circumvents their consumers’ reasonable expectations of privacy by exposing this sensitive genetic information to law enforcement without any meaningful oversight. 
    Furthermore, individuals likely reasonably expect that they retain ownership over their uniquely personal genetic information despite their disclosure of that information to a third-party provider. This Note therefore asserts that the third-party doctrine does not permit law enforcement to conduct warrantless searches for suspects on private DTC genetics databases under the Fourth Amendment.

    A review into the factors affecting declines in undergraduate Computer Science enrolments and approaches for solving this problem

    There has been a noticeable drop in enrolments in Computer Science (CS) courses and in interest in CS careers in recent years, while demand for CS skills is increasing dramatically. Such skills are useful not only for CS jobs but for all forms of business and, to some extent, personal life, as Information Technology (IT) is becoming ubiquitous and essential for most aspects of modern life. It is therefore essential to address this lack of interest and skills, not only to fill the demand for CS employees but also to provide students with the CS skills they need for modern life, especially for improving their employability and their skills for further study. This report looks at possible reasons for the lack of interest in CS and at different approaches used to enhance CS education and improve the appeal of CS.

    Static and Dynamic Aspects of Scientific Collaboration Networks

    Collaboration networks arise when we map the connections between scientists which are formed through joint publications. These networks thus display the social structure of academia, and also allow conclusions about the structure of scientific knowledge. Using the computer science publication database DBLP, we compile relations between authors and publications as graphs and proceed with examining and quantifying collaborative relations with graph-based methods. We review standard properties of the network and rank authors and publications by centrality. Additionally, we detect communities with modularity-based clustering and compare the resulting clusters to a ground truth based on conferences and thus topical similarity. In a second part, we are the first to combine DBLP network data with data from the Dagstuhl Seminars: We investigate whether seminars of this kind, as social and academic events designed to connect researchers, leave a visible track in the structure of the collaboration network. Our results suggest that such single events are not influential enough to change the network structure significantly. However, the network structure seems to influence a participant's decision to accept or decline an invitation. Comment: ASONAM 2012: IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

    A case study of the Hirsch index for 26 non-prominent physicists

    The h index was introduced by Hirsch to quantify an individual's scientific research output. It has been widely used in different fields to show the relevance of the research work of prominent scientists. I have worked out 26 practical cases of physicists who are not so prominent. This case study should therefore be more relevant for discussing various features of the Hirsch index which are interesting or disturbing or both for the more average situation. In particular, I investigate quantitatively some pitfalls in the evaluation and the influence of self-citations. Comment: 13 pages, 3 figures, updated after extensive language editing, no other changes to first version
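
    For readers unfamiliar with the metric: under the standard definition, the h index is the largest h such that h of an author's papers each have at least h citations. The sketch below illustrates that definition and one simple way self-citations might be discounted. It is not taken from the paper; the function names and the citation counts are hypothetical, chosen only for illustration.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def h_index_without_self_citations(citations, self_citations):
    """Recompute h after subtracting per-paper self-citation counts.

    This per-paper subtraction is an assumption made for illustration,
    not necessarily the procedure used in the case study.
    """
    adjusted = [max(c - s, 0) for c, s in zip(citations, self_citations)]
    return h_index(adjusted)


# Hypothetical citation record for one author (five papers).
citations = [10, 8, 5, 4, 3]
self_cites = [2, 4, 1, 1, 0]

print(h_index(citations))
print(h_index_without_self_citations(citations, self_cites))
```

    With these made-up numbers, removing self-citations lowers the index by one, which is the kind of sensitivity the abstract refers to.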

    Experience and Challenges from Clinical Trials with Malaria Vaccines in Africa.

    Malaria vaccines are considered amongst the most important modalities for potential elimination of malaria disease and transmission. Research and development in this field has been an area of intense effort by many groups over the last few decades. Despite this, there is currently no licensed malaria vaccine. Researchers, clinical trialists and vaccine developers have been working on many approaches to make a malaria vaccine available. African research institutions have developed and demonstrated a great capacity to undertake clinical trials in accordance with the International Conference on Harmonization-Good Clinical Practice (ICH-GCP) standards in the last decade, particularly in the field of malaria vaccines and anti-malarial drugs. This capacity is a result of networking among African scientists in collaboration with other partners; this has traversed both clinical trials and malaria control programmes as part of the Global Malaria Action Plan (GMAP). GMAP outlines and supports global strategies toward the elimination and eradication of malaria in many areas, translating into a reduction in the public health burden, especially for African children. In the sub-Saharan region the capacity to undertake more clinical trials remains small in comparison to the actual need. However, sustainability of the already developed capacity is essential and crucial for the evaluation of different interventions and diagnostic tools/strategies for other diseases like TB, HIV, neglected tropical diseases and non-communicable diseases. There is urgent need for innovative mechanisms for the sustainability and expansion of the capacity in clinical trials in sub-Saharan Africa as the catalyst for health improvement and maintained