4,855 research outputs found

    Text-to-picture tools, systems, and approaches: a survey

    Get PDF
    Text-to-picture systems attempt to facilitate high-level, user-friendly communication between humans and computers while promoting understanding of natural language. These systems interpret a natural language text and transform it into a visual format as pictures or images that are either static or dynamic. In this paper, we aim to identify current difficulties and the main problems faced by prior systems, and in particular, we seek to investigate the feasibility of automatic visualization of Arabic story text through multimedia. Hence, we analyzed a number of well-known text-to-picture systems, tools, and approaches. We showed their constituent steps, such as knowledge extraction, mapping, and image layout, as well as their performance and limitations. We also compared these systems based on a set of criteria, mainly natural language processing, natural language understanding, and input/output modalities. Our survey showed that currently emerging techniques in natural language processing tools and computer vision have made promising advances in analyzing general text and understanding images and videos. Furthermore, important remarks and findings have been deduced from these prior works, which would help in developing an effective text-to-picture system for learning and educational purposes. - 2019, The Author(s).This work was made possible by NPRP grant #10-0205-170346 from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors

    The Dialectics of the Archaic and the Post-Modern in Maghrebian Literature Written in French

    Get PDF
    Maghrebian literature written in French has been since its inception a literature of and about the abyss. For the Maghrebian the abyss is esentially the space of modernity, that forbidden citadel of art, science and technology from which s/he was excluded and marginalized. Recently, writing of/in French has become the site/scene of a polemos between the archaic (identity) and the post-modern (difference). Our study of the archaic focuses on cultural, literary and critical knowledge and centers around two main themes: that of a beginning, that is a search for events in the past that explain the abyss (or retardation vis-à-vis the West), and that of an excavation, mainly of the collective unconscious, through the revamping of traditional and oral materials. On the other hand, the post-modern is not only that moment of delegitimization of modernity, as expounded by J.-F. Lyotard and other social theorists of post-modern knowledge; it is also a project, an esthetics and a theory to be, an epistemology of the future. In short, Maghrebian literature written in French, because it makes use of the Other\u27s alphabet, is faced with a formidable challenge: can/will the alliance of the archaic and the post-modern bridge the abyss of modernity

    Artificial Intelligence for Multimedia Signal Processing

    Get PDF
    Artificial intelligence technologies are also actively applied to broadcasting and multimedia processing technologies. A lot of research has been conducted in a wide variety of fields, such as content creation, transmission, and security, and these attempts have been made in the past two to three years to improve image, video, speech, and other data compression efficiency in areas related to MPEG media processing technology. Additionally, technologies such as media creation, processing, editing, and creating scenarios are very important areas of research in multimedia processing and engineering. This book contains a collection of some topics broadly across advanced computational intelligence algorithms and technologies for emerging multimedia signal processing as: Computer vision field, speech/sound/text processing, and content analysis/information mining

    Advanced document data extraction techniques to improve supply chain performance

    Get PDF
    In this thesis, a novel machine learning technique to extract text-based information from scanned images has been developed. This information extraction is performed in the context of scanned invoices and bills used in financial transactions. These financial transactions contain a considerable amount of data that must be extracted, refined, and stored digitally before it can be used for analysis. Converting this data into a digital format is often a time-consuming process. Automation and data optimisation show promise as methods for reducing the time required and the cost of Supply Chain Management (SCM) processes, especially Supplier Invoice Management (SIM), Financial Supply Chain Management (FSCM) and Supply Chain procurement processes. This thesis uses a cross-disciplinary approach involving Computer Science and Operational Management to explore the benefit of automated invoice data extraction in business and its impact on SCM. The study adopts a multimethod approach based on empirical research, surveys, and interviews performed on selected companies.The expert system developed in this thesis focuses on two distinct areas of research: Text/Object Detection and Text Extraction. For Text/Object Detection, the Faster R-CNN model was analysed. While this model yields outstanding results in terms of object detection, it is limited by poor performance when image quality is low. The Generative Adversarial Network (GAN) model is proposed in response to this limitation. The GAN model is a generator network that is implemented with the help of the Faster R-CNN model and a discriminator that relies on PatchGAN. The output of the GAN model is text data with bonding boxes. For text extraction from the bounding box, a novel data extraction framework consisting of various processes including XML processing in case of existing OCR engine, bounding box pre-processing, text clean up, OCR error correction, spell check, type check, pattern-based matching, and finally, a learning mechanism for automatizing future data extraction was designed. Whichever fields the system can extract successfully are provided in key-value format.The efficiency of the proposed system was validated using existing datasets such as SROIE and VATI. Real-time data was validated using invoices that were collected by two companies that provide invoice automation services in various countries. Currently, these scanned invoices are sent to an OCR system such as OmniPage, Tesseract, or ABBYY FRE to extract text blocks and later, a rule-based engine is used to extract relevant data. While the system’s methodology is robust, the companies surveyed were not satisfied with its accuracy. Thus, they sought out new, optimized solutions. To confirm the results, the engines were used to return XML-based files with text and metadata identified. The output XML data was then fed into this new system for information extraction. This system uses the existing OCR engine and a novel, self-adaptive, learning-based OCR engine. This new engine is based on the GAN model for better text identification. Experiments were conducted on various invoice formats to further test and refine its extraction capabilities. For cost optimisation and the analysis of spend classification, additional data were provided by another company in London that holds expertise in reducing their clients' procurement costs. This data was fed into our system to get a deeper level of spend classification and categorisation. This helped the company to reduce its reliance on human effort and allowed for greater efficiency in comparison with the process of performing similar tasks manually using excel sheets and Business Intelligence (BI) tools.The intention behind the development of this novel methodology was twofold. First, to test and develop a novel solution that does not depend on any specific OCR technology. Second, to increase the information extraction accuracy factor over that of existing methodologies. Finally, it evaluates the real-world need for the system and the impact it would have on SCM. This newly developed method is generic and can extract text from any given invoice, making it a valuable tool for optimizing SCM. In addition, the system uses a template-matching approach to ensure the quality of the extracted information

    Curse and/or blessing? Turkish soap operas and their impact on contemporary Algerian women: audience reception and audio-visual analysis of the historical TV drama Harim al-Sultan (The Magnificent Century)

    Get PDF
    Turkey is now the second biggest TV market exporter after the US finding huge audiences among Arab women. This study examines how Algerian women perceive the Dizi genres, what makes them captivating and influential, and determines the dominant positive and negative themes tackled in these TV dramas. Based on literature review on globalized media, media effect theories, and gender studies, an online focus group discussion, online questionnaires, and interpretative phenomenological analysis techniques this study aims to understand the reception and perception of Turkish TV series by Algerian female viewers. The thesis implements the audio-visual content analysis method on the most-watched Dizi TV drama titled Harim al-Sultan as a case study to identify the prominent features of those soap operas. The findings show that the lack of Algerian local cinematic production, cultural proximity between Turkey and Algeria, Syrian dialect, creative scenarios, and modern lifestyles broadcasted in the Dizi TV dramas has rapidly increased its popularity among Algerian female viewers. The results also indicate that Turkish TV dramas depict women who appear to be modern and independent and who challenge imposed traditional patriarchal values. At the same time, they also promote gender stereotypes, romanticizing rape stories, and spreading Western ideologies under the umbrella of modernity that could threaten or weaken Algerian females’ traditional values. Audio-visual content analysis shows that Harim al-Sultan contains controversial topics, that generated many critiques in terms of history, power, religion, identity, and representation of women. Nonetheless, this study examines how participants incorporate some Dizi TV series’ features in a relevant way to Algerian female traditional values and lifestyles, thus, arguing for the creation of an ambiguous identity which sits at the cusp of traditional principles and modern hybrid identities
    corecore