Search CORE

15 research outputs found

SentiALG: Automated Corpus Annotation for Algerian Sentiment Analysis

Author: AZ Khan
JH AlKhateeb
JH AlKhateeb
JH AlKhateeb
M Al-Ayyoub
M Mataoui
M Rushdi-Saleh
M Taboada
N Al-Twairesh
SM Mohammad
Publication venue
Publication date: 15/08/2018
Field of study

Data annotation is an important but time-consuming and costly procedure. To sort a text into two classes, the very first thing we need is a good annotation guideline, establishing what is required to qualify for each class. In the literature, the difficulties associated with an appropriate data annotation has been underestimated. In this paper, we present a novel approach to automatically construct an annotated sentiment corpus for Algerian dialect (a Maghrebi Arabic dialect). The construction of this corpus is based on an Algerian sentiment lexicon that is also constructed automatically. The presented work deals with the two widely used scripts on Arabic social media: Arabic and Arabizi. The proposed approach automatically constructs a sentiment corpus containing 8000 messages (where 4000 are dedicated to Arabic and 4000 to Arabizi). The achieved F1-score is up to 72% and 78% for an Arabic and Arabizi test sets, respectively. Ongoing work is aimed at integrating transliteration process for Arabizi messages to further improve the obtained results.Comment: To appear in the 9th International Conference on Brain Inspired Cognitive Systems (BICS 2018

arXiv.org e-Print Archive

Crossref

Subjectivity and Sentiment Analysis of Arabic: A Survey

Author: A. Abbasi
B. Pang
J. Kim
M. Rushdi-Saleh
N. Habash
S. AbdelRahman
Z. Zhai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Abstract. Subjectivity and sentiment analysis (SSA) has recently gained consid-erable attention, but most of the resources and systems built so far are tailored to English and other Indo-European languages. The need for designing systems for other languages is increasing, especially as blogging and micro-blogging web-sites become popular throughout the world. This paper surveys different tech-niques for SSA for Arabic. After a brief synopsis about Arabic, we describe the main existing techniques and test corpora for Arabic SSA that have been intro-duced in the literature.

CiteSeerX

Crossref

A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

Author: A Assiri
AB Soliman
AS Alqarafi
HK Aldayel
K Dashtipour
K Khalifa
M Rushdi-Saleh
Publication venue: Springer
Publication date: 01/01/2018
Field of study

In the literature, limited work has been conducted to develop sentiment resources for Saudi dialect. The lack of resources such as dialectical lexicons and corpora are some of the major bottlenecks to the successful development of Arabic sentiment analysis models. In this paper, a semi-supervised approach is presented to construct an annotated sentiment corpus for Saudi dialect using Twitter. The presented approach is primarily based on a list of lexicons built by using word embedding techniques such as word2vec. A huge corpus extracted from twitter is annotated and manually reviewed to exclude incorrect annotated tweets which is publicly available. For corpus validation, state-of-the-art classification algorithms (such as Logistic Regression, Support Vector Machine, and Naive Bayes) are applied and evaluated. Simulation results demonstrate that the Naive Bayes algorithm outperformed all other approaches and achieved accuracy up to 91%

Crossref

Stirling Online Research Repository (RIOXX)

Stirling Online Research Repository

Repository@Napier

Recommended from our members

Occurrence and sources of natural and anthropogenic lipid tracers in surface soils from arid urban areas of Saudi Arabia

Author: Al-Mutlaq Khalid F.
Al-Saleh Mohammed A.
El-Mubarak Aarif H.
El-Otaibi Mubarak T.
Ibrahim Sami M. M.
Rushdi Ahmed I.
Simoneit Bernd R. T.
Publication venue: 'Elsevier BV'
Publication date
Field of study

Soil particles contain a variety of natural and anthropogenic organic components, and in urban areas can be considered as local collectors of pollutants. Surface soil samples were taken from ten urban areas in Riyadh during early winter of 2007. They were extracted with dichloromethane-methanol mixture and the extracts were analyzed by gas chromatography-mass spectrometry. The major compounds were unresolved complex mixture (UCM), plasticizers, n-alkanes, carbohydrates, n-alkanoic acids, hopanes, n-alkanols, and sterols. Vegetation detritus was the major natural source of organic compounds (24.0 ± 15.7%) in samples from areas with less human activities and included n-alkanes, n-alkanoic acids, n-alkanols, sterols and carbohydrates. Vehicular emission products and discarded plastics were the major anthropogenic sources in the soil particles (53.3 ± 21.3% and 22.7 ± 10.7%, respectively). The anthropogenic tracers were UCM, plasticizers, n-alkanes, hopanes and traces of steranes. Vegetation and human activities control the occurrence and distribution of natural and anthropogenic extractable organic matter in this arid urban area.Keywords: Petroleum residues, Biomarkers, Soils, Lipids, Plasticizer

ScholarsArchive@OSU

LIWC-Based Sentiment Analysis in Spanish Product Reviews

Author: B. Pang
I. Peñalver-Martínez
J. Nahar
J.W. Pennebaker
J.W. Pennebaker
L. Chen
M. Rushdi Saleh
M. Rushdi Saleh
M.D. Molina González
R. Kohavi
R. Moraes
R. Xia
R.R. Bouckaert
S.S. Keerthi
W.B. Stiles
Y. Chen
Y. He
Z. Zhai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

FLEXURAL BEHAVIOR OF TWO-LAYER BEAM MADE WITH LIGHT WEIGHT STEEL FIBRE CONCRETE AND RECYCLED AGGREGATE CONCRETE

Author: ABDUL AZIZ AL-BADI
AHMED W. AL ZAND
HAMDAN AL JABRI
MALEK AL-RUSHDI
SALEH MAZEN
WADHAH M. TAWFEEQ
Publication venue: Zenodo
Publication date: 20/10/2023
Field of study

<h2>Abstract</h2><p>In structural design, it is extremely desirable to use as low-material as possible while keeping integrity and usefulness. Reducing the structure's weight is one strategy for achieving this objective. Steel fibres have recently been added to reinforced concrete beams to increase flexural and shear strength. Fibre reinforcement in structural elements has drawn considerable interest from the building sector. Steel fibre has received the greatest attention and utilization among all fibre types. When compared to plain concrete, incorporating fibres into concrete may result in better crack management and greater strength. This study examines how two-layer beams made of lightweight steel fibre concrete and recycled aggregate concrete flex under bending loads. Twelve distinct beams with cross sections measuring 100 mm, 150 mm, and 1500 mm (width, depth, and length) are prepared and tested as part of the study. These beams are evaluated under four-point bending. In the tension zone of the lightweight concrete layer, different percentages of steel fibre ranging from 0% to 1.5% by volume were introduced. In the concrete compression layer, recycled block aggregate was substituted for natural coarse aggregate in varying percentages (0%, 25%, and 50%). According to the findings, the flexural strength of beams with a higher steel fibre percentage is higher than that of beams with a higher recycled aggregate component. The study also shows that two-layer beams with higher steel fibre content have superior crack management and deflection behavior than those with lower steel content. The results of the flexural reinforced concrete beam test were contrasted with the calculated design strength determined using British Standards.</p&gt

ZENODO

Multi-dimensional Sentiment Analysis for Large-Scale E-commerce Reviews

Author: A. Hogenboom
B. Pang
G. Qiu
M. Rushdi-Saleh
P. Jin
P. Turney
Q. Zhang
R. Feldman
X. Fu
Z. Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

The Nearest Centroid Based on Vector Norms: A New Classification Algorithm for a New Document Representation Model

Author: A. Khan
A. Mountassir
A.F. Smeaton
G. Salton
G. Salton
H. Bhavsar
I.H. Witten
J. Platt
M. Rushdi-Saleh
M.F. Porter
T.G. Dietterich
Y. Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref