4,510 research outputs found

    Source Retrieval for Plagiarism Detection

    Get PDF
    Plagiarism has become a serious problem mainly because of the electronically available documents. An online document retrieval is a weighty part of a modern anti-plagiarism tool. This paper describes an architecture and concepts of a real-world document retrieval system, which is a part of a general anti-plagiarism software. Up to date systems for plagiarism detection are discussed from the source retrieval perspective. The key approaches of source retrieval are compared. The system recommendations stem from design, implementation, and several years of operation experience of a nationwide plagiarism solution at Masaryk University in the Czech Republic. The design can be adapted to many situations. Proper usage of such systems contributes to the gradual improvement of the quality of student theses.Plagiarism has become a serious problem mainly because of the electronically available documents. An online document retrieval is a weighty part of a modern anti-plagiarism tool. This paper describes an architecture and concepts of a real-world document retrieval system, which is a part of a general anti-plagiarism software. Up to date systems for plagiarism detection are discussed from the source retrieval perspective. The key approaches of source retrieval are compared. The system recommendations stem from design, implementation, and several years of operation experience of a nationwide plagiarism solution at Masaryk University in the Czech Republic. The design can be adapted to many situations. Proper usage of such systems contributes to the gradual improvement of the quality of student theses

    Extending the institutional repository to include undergraduate research

    Get PDF
    While a primary strategy of scholarly communication initiatives has been to encourage faculty participation in institutional repositories (IRs), with some process and workflow customization, IR participation can be successfully extended to undergraduate students, with benefits to both the student and institution. Drawing observations from the University of New Hampshire Library\u27s work collecting undergraduate honors theses and other student research, this article discusses customization strategies for creating an effective workflow for student self-deposit using an iterative, feedback-based approach, and the benefits, challenges,and potential concerns of encouraging undergraduate participation in institutional repositories

    Evaluation of options for a UK electronic thesis service: study report

    Get PDF
    The British Library (BL), JISC, UK HE institutions and CURL have funded an 18-month project to develop a national framework for the provision, preservation and open access to electronic theses produced in UK HE institutions. The project, called EThOS (Electronic Theses Online Service) was developed in response to a competitive tender invitation released by the JISC and proposes a service set up and run by the British Library. The British Library’s current service, the British Thesis Service, offers access to around 180,000 doctoral theses, predominantly from 1970 onwards, though it is estimated that overall some half million theses dating from the 1600s are in existence in the UK. Around 80% of requests are for theses published within the last 13 years and almost all of these exist only in hardcopy. Through this service, theses are acquired ‘on demand’ and delivered on microfilm at a cost of just over £60 to the user (and at this price the service runs at a loss). Whilst this service, coupled with the Index to Theses (Expert Information), enables the location of and access to relatively recent British theses by the determined seeker, no one could argue that the process is optimised. As a result, usage of theses is much lower than it might be and much research is going unnoticed and unused as a result. Conversely, it has been shown that when theses are easy to locate and access, usage is high: at Virginia Tech, a pioneer site in the provision of a formal, systematised ETD (electronic theses and dissertations) service, downloads have been shown to increase over 30-fold when a thesis is available free online and easily located. A national service for the UK that provides discovery and access to theses in electronic form via the Web will increase the utility of doctoral scholarship. A single interface that directs users to theses wherever they are held, and which addresses the issues of intellectual property, permissions, royalties, preservation, discovery, and other matters associated with the public provision of theses in electronic form, will be of great benefit to the scholarly community in the UK and across the world. The EThOS project (Electronic Theses Online Service) was commissioned to develop a model for a workable, sustainable and acceptable national service for the provision of open access to electronic doctoral theses. The EThOS project team have completed the task and UCL Library Services in partnership with Key Perspectives Ltd have been asked to undertake a consultative study to assess the acceptability of the proposed model to the UK higher education community in the context of other potential models. This document reports the results of this consultative study, including a set of recommendations to JISC and other stakeholders for setting up a UK national e-theses service. The stakeholders other than JISC are: The British Library University administrators (registrars) Graduate students and recent PhDs Librarians Institutional repository managers Other e-theses services including: DART-Europe DiVA DissOnline Australasian Digital Theses Theses Canada Networked Digital Library for Theses and Dissertations The EThOS tea

    Approaches for Candidate Document Retrieval

    Get PDF
    Plagiarism has become a serious problem mainly because of the electronically available documents. An online document retrieval is weighty part of a modern anti-plagiarism tool. This paper describes an architecture and concepts of a real-world document retrieval system, which is a part of a general anti-plagiarism software. A similar system was developed as a part of nationwide plagiarism solution at Masaryk University. The design can be adapted into many situations. Provided recommendation stem from experience of the system operation for several years. The proper usage of such systems contributes to gradual improvement of the quality of student theses

    Neural Machine Translation Inspired Binary Code Similarity Comparison beyond Function Pairs

    Full text link
    Binary code analysis allows analyzing binary code without having access to the corresponding source code. A binary, after disassembly, is expressed in an assembly language. This inspires us to approach binary analysis by leveraging ideas and techniques from Natural Language Processing (NLP), a rich area focused on processing text of various natural languages. We notice that binary code analysis and NLP share a lot of analogical topics, such as semantics extraction, summarization, and classification. This work utilizes these ideas to address two important code similarity comparison problems. (I) Given a pair of basic blocks for different instruction set architectures (ISAs), determining whether their semantics is similar or not; and (II) given a piece of code of interest, determining if it is contained in another piece of assembly code for a different ISA. The solutions to these two problems have many applications, such as cross-architecture vulnerability discovery and code plagiarism detection. We implement a prototype system INNEREYE and perform a comprehensive evaluation. A comparison between our approach and existing approaches to Problem I shows that our system outperforms them in terms of accuracy, efficiency and scalability. And the case studies utilizing the system demonstrate that our solution to Problem II is effective. Moreover, this research showcases how to apply ideas and techniques from NLP to large-scale binary code analysis.Comment: Accepted by Network and Distributed Systems Security (NDSS) Symposium 201
    • …
    corecore