18,237 research outputs found

    The Contemporary Tax Journal Volume 6, No. 1 – Summer/Fall 2016

    Get PDF

    Programmable Insight: A Computational Methodology to Explore Online News Use of Frames

    Get PDF
    abstract: The Internet is a major source of online news content. Online news is a form of large-scale narrative text with rich, complex contents that embed deep meanings (facts, strategic communication frames, and biases) for shaping and transitioning standards, values, attitudes, and beliefs of the masses. Currently, this body of narrative text remains untapped due—in large part—to human limitations. The human ability to comprehend rich text and extract hidden meanings is far superior to known computational algorithms but remains unscalable. In this research, computational treatment is given to online news framing for exposing a deeper level of expressivity coined “double subjectivity” as characterized by its cumulative amplification effects. A visual language is offered for extracting spatial and temporal dynamics of double subjectivity that may give insight into social influence about critical issues, such as environmental, economic, or political discourse. This research offers benefits of 1) scalability for processing hidden meanings in big data and 2) visibility of the entire network dynamics over time and space to give users insight into the current status and future trends of mass communication.Dissertation/ThesisDoctoral Dissertation Computer Science 201

    Advanced document data extraction techniques to improve supply chain performance

    Get PDF
    In this thesis, a novel machine learning technique to extract text-based information from scanned images has been developed. This information extraction is performed in the context of scanned invoices and bills used in financial transactions. These financial transactions contain a considerable amount of data that must be extracted, refined, and stored digitally before it can be used for analysis. Converting this data into a digital format is often a time-consuming process. Automation and data optimisation show promise as methods for reducing the time required and the cost of Supply Chain Management (SCM) processes, especially Supplier Invoice Management (SIM), Financial Supply Chain Management (FSCM) and Supply Chain procurement processes. This thesis uses a cross-disciplinary approach involving Computer Science and Operational Management to explore the benefit of automated invoice data extraction in business and its impact on SCM. The study adopts a multimethod approach based on empirical research, surveys, and interviews performed on selected companies.The expert system developed in this thesis focuses on two distinct areas of research: Text/Object Detection and Text Extraction. For Text/Object Detection, the Faster R-CNN model was analysed. While this model yields outstanding results in terms of object detection, it is limited by poor performance when image quality is low. The Generative Adversarial Network (GAN) model is proposed in response to this limitation. The GAN model is a generator network that is implemented with the help of the Faster R-CNN model and a discriminator that relies on PatchGAN. The output of the GAN model is text data with bonding boxes. For text extraction from the bounding box, a novel data extraction framework consisting of various processes including XML processing in case of existing OCR engine, bounding box pre-processing, text clean up, OCR error correction, spell check, type check, pattern-based matching, and finally, a learning mechanism for automatizing future data extraction was designed. Whichever fields the system can extract successfully are provided in key-value format.The efficiency of the proposed system was validated using existing datasets such as SROIE and VATI. Real-time data was validated using invoices that were collected by two companies that provide invoice automation services in various countries. Currently, these scanned invoices are sent to an OCR system such as OmniPage, Tesseract, or ABBYY FRE to extract text blocks and later, a rule-based engine is used to extract relevant data. While the system’s methodology is robust, the companies surveyed were not satisfied with its accuracy. Thus, they sought out new, optimized solutions. To confirm the results, the engines were used to return XML-based files with text and metadata identified. The output XML data was then fed into this new system for information extraction. This system uses the existing OCR engine and a novel, self-adaptive, learning-based OCR engine. This new engine is based on the GAN model for better text identification. Experiments were conducted on various invoice formats to further test and refine its extraction capabilities. For cost optimisation and the analysis of spend classification, additional data were provided by another company in London that holds expertise in reducing their clients' procurement costs. This data was fed into our system to get a deeper level of spend classification and categorisation. This helped the company to reduce its reliance on human effort and allowed for greater efficiency in comparison with the process of performing similar tasks manually using excel sheets and Business Intelligence (BI) tools.The intention behind the development of this novel methodology was twofold. First, to test and develop a novel solution that does not depend on any specific OCR technology. Second, to increase the information extraction accuracy factor over that of existing methodologies. Finally, it evaluates the real-world need for the system and the impact it would have on SCM. This newly developed method is generic and can extract text from any given invoice, making it a valuable tool for optimizing SCM. In addition, the system uses a template-matching approach to ensure the quality of the extracted information

    Spartan Daily, May 12, 1981

    Get PDF
    Volume 76, Issue 69https://scholarworks.sjsu.edu/spartandaily/6773/thumbnail.jp

    Official Proceedings of the One Thousand Two Hundred and Eighty-Second Meeting of the Board of Trustees, Columbus, Ohio, May 1, 1992

    Get PDF
    This item is the meeting minutes of the Board of Trustees of The Ohio State University held on Friday, May 1, 1992 at the Ohio State University Fawcett Center for Tomorrow, Columbus, Ohio. Topics discussed include announcement of officers elected for the 1992-93 Board, the President's Report, the Research Foundation Report, Report of Research Contracts, Grants, and Gifts, Hospitals Board Report, Agricultural Affairs Committee Report, Academic Affairs Committee Report, Personnel Actions, Resolutions in Memoriam, Student Affairs Committee Report, Investments Committee Report, the Designation of the University Foundation as Primary Fundraising and Gift Receipt Organization, Fiscal Affairs Committee Report, and the Lease of Land and Building Space to the Alumni Association

    Disagreeable Privacy Policies: Mismatches between Meaning and Users’ Understanding

    Get PDF
    Privacy policies are verbose, difficult to understand, take too long to read, and may be the least-read items on most websites even as users express growing concerns about information collection practices. For all their faults, though, privacy policies remain the single most important source of information for users to attempt to learn how companies collect, use, and share data. Likewise, these policies form the basis for the self-regulatory notice and choice framework that is designed and promoted as a replacement for regulation. The underlying value and legitimacy of notice and choice depends, however, on the ability of users to understand privacy policies. This paper investigates the differences in interpretation among expert, knowledgeable, and typical users and explores whether those groups can understand the practices described in privacy policies at a level sufficient to support rational decision-making. The paper seeks to fill an important gap in the understanding of privacy policies through primary research on user interpretation and to inform the development of technologies combining natural language processing, machine learning and crowdsourcing for policy interpretation and summarization. For this research, we recruited a group of law and public policy graduate students at Fordham University, Carnegie Mellon University, and the University of Pittsburgh (“knowledgeable users”) and presented these law and policy researchers with a set of privacy policies from companies in the e-commerce and news & entertainment industries. We asked them nine basic questions about the policies’ statements regarding data collection, data use, and retention. We then presented the same set of policies to a group of privacy experts and to a group of non-expert users. The findings show areas of common understanding across all groups for certain data collection and deletion practices, but also demonstrate very important discrepancies in the interpretation of privacy policy language, particularly with respect to data sharing. The discordant interpretations arose both within groups and between the experts and the two other groups. The presence of these significant discrepancies has critical implications. First, the common understandings of some attributes of described data practices mean that semi-automated extraction of meaning from website privacy policies may be able to assist typical users and improve the effectiveness of notice by conveying the true meaning to users. However, the disagreements among experts and disagreement between experts and the other groups reflect that ambiguous wording in typical privacy policies undermines the ability of privacy policies to effectively convey notice of data practices to the general public. The results of this research will, consequently, have significant policy implications for the construction of the notice and choice framework and for the US reliance on this approach. The gap in interpretation indicates that privacy policies may be misleading the general public and that those policies could be considered legally unfair and deceptive. And, where websites are not effectively conveying privacy policies to consumers in a way that a “reasonable person” could, in fact, understand the policies, “notice and choice” fails as a framework. Such a failure has broad international implications since websites extend their reach beyond the United States

    Global Financial crisis and Islamic finance

    Get PDF
    The world economy is still suffering the crisis, considered the most severe since the Great Depression, where economic downturn at historic magnitude and many countries across the globe, irrespective of their development level, are still under strain dealing with this crisis. The severe global crisis that has spilled from the financial sector to the real economy, including international trade in manufactures, commodities and services. The onset of the present crisis can be traced back to July 2007 with the liquidity crisis due to the loss of confidence in the mortgage credit markets in the United States. At first, there was uncertainty about the possible spillovers to the rest of the economy, and there was also discussion about the risks of contagion and decoupling, that is to say, the capacity of other countries – especially developing countries – to isolate themselves from the problems originating in the United States (which is the largest market for many countries). The hope was that the crisis would be restricted to financial markets, with few repercussions on the real economy and the rest of the world. This hope was shattered in September 2008 as the crisis entered an acute phase, with strong downward fluctuations in the stock markets, substantially reduced rates of economic growth, volatile exchange rates, and squeezes in demand and consumption, leading to falls in industrial production and decreasing flows of international trade and FDI, and causing impacts on related areas such as transfer of technology. The crisis has also been accompanied by increases in unemployment, with concomitant declining incomes and demand. The severity of the current crisis has led to the evaluation of the foundations of the capitalist financial system and the search for ideas and solutions. While some have proposed that the Islamic finance serves as a vehicle for recovering from the international financial crisis and The Islamic banking industry may be able to strengthen its position in the international market as investors and companies seek alternate sources of financing. Other economists have argued that Islamic finance, is a different way of structuring financial dealings; but, it is not a totally different financial system. This paper tries to note the main causes and the impacts of the current financial and economic crisis. In addition to discuss the belief that the Islamic finance and its prospective is a viable alternative to the ailing global financial system.financial and economic crisis , Islamic finance
    • …
    corecore