15 research outputs found

    SentiALG: Automated Corpus Annotation for Algerian Sentiment Analysis

    Full text link
    Data annotation is an important but time-consuming and costly procedure. To sort a text into two classes, the very first thing we need is a good annotation guideline, establishing what is required to qualify for each class. In the literature, the difficulties associated with an appropriate data annotation has been underestimated. In this paper, we present a novel approach to automatically construct an annotated sentiment corpus for Algerian dialect (a Maghrebi Arabic dialect). The construction of this corpus is based on an Algerian sentiment lexicon that is also constructed automatically. The presented work deals with the two widely used scripts on Arabic social media: Arabic and Arabizi. The proposed approach automatically constructs a sentiment corpus containing 8000 messages (where 4000 are dedicated to Arabic and 4000 to Arabizi). The achieved F1-score is up to 72% and 78% for an Arabic and Arabizi test sets, respectively. Ongoing work is aimed at integrating transliteration process for Arabizi messages to further improve the obtained results.Comment: To appear in the 9th International Conference on Brain Inspired Cognitive Systems (BICS 2018

    Subjectivity and Sentiment Analysis of Arabic: A Survey

    Full text link
    Abstract. Subjectivity and sentiment analysis (SSA) has recently gained consid-erable attention, but most of the resources and systems built so far are tailored to English and other Indo-European languages. The need for designing systems for other languages is increasing, especially as blogging and micro-blogging web-sites become popular throughout the world. This paper surveys different tech-niques for SSA for Arabic. After a brief synopsis about Arabic, we describe the main existing techniques and test corpora for Arabic SSA that have been intro-duced in the literature.

    A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

    Get PDF
    In the literature, limited work has been conducted to develop sentiment resources for Saudi dialect. The lack of resources such as dialectical lexicons and corpora are some of the major bottlenecks to the successful development of Arabic sentiment analysis models. In this paper, a semi-supervised approach is presented to construct an annotated sentiment corpus for Saudi dialect using Twitter. The presented approach is primarily based on a list of lexicons built by using word embedding techniques such as word2vec. A huge corpus extracted from twitter is annotated and manually reviewed to exclude incorrect annotated tweets which is publicly available. For corpus validation, state-of-the-art classification algorithms (such as Logistic Regression, Support Vector Machine, and Naive Bayes) are applied and evaluated. Simulation results demonstrate that the Naive Bayes algorithm outperformed all other approaches and achieved accuracy up to 91%

    FLEXURAL BEHAVIOR OF TWO-LAYER BEAM MADE WITH LIGHT WEIGHT STEEL FIBRE CONCRETE AND RECYCLED AGGREGATE CONCRETE

    No full text
    <h2>Abstract</h2><p>In structural design, it is extremely desirable to use as low-material as possible while keeping integrity and usefulness. Reducing the structure's weight is one strategy for achieving this objective. Steel fibres have recently been added to reinforced concrete beams to increase flexural and shear strength. Fibre reinforcement in structural elements has drawn considerable interest from the building sector. Steel fibre has received the greatest attention and utilization among all fibre types. When compared to plain concrete, incorporating fibres into concrete may result in better crack management and greater strength. This study examines how two-layer beams made of lightweight steel fibre concrete and recycled aggregate concrete flex under bending loads. Twelve distinct beams with cross sections measuring 100 mm, 150 mm, and 1500 mm (width, depth, and length) are prepared and tested as part of the study. These beams are evaluated under four-point bending. In the tension zone of the lightweight concrete layer, different percentages of steel fibre ranging from 0% to 1.5% by volume were introduced. In the concrete compression layer, recycled block aggregate was substituted for natural coarse aggregate in varying percentages (0%, 25%, and 50%). According to the findings, the flexural strength of beams with a higher steel fibre percentage is higher than that of beams with a higher recycled aggregate component. The study also shows that two-layer beams with higher steel fibre content have superior crack management and deflection behavior than those with lower steel content. The results of the flexural reinforced concrete beam test were contrasted with the calculated design strength determined using British Standards.</p&gt
    corecore