21,304 research outputs found

    An Intelligent System For Arabic Text Categorization

    Get PDF
    Text Categorization (classification) is the process of classifying documents into a predefined set of categories based on their content. In this paper, an intelligent Arabic text categorization system is presented. Machine learning algorithms are used in this system. Many algorithms for stemming and feature selection are tried. Moreover, the document is represented using several term weighting schemes and finally the k-nearest neighbor and Rocchio classifiers are used for classification process. Experiments are performed over self collected data corpus and the results show that the suggested hybrid method of statistical and light stemmers is the most suitable stemming algorithm for Arabic language. The results also show that a hybrid approach of document frequency and information gain is the preferable feature selection criterion and normalized-tfidf is the best weighting scheme. Finally, Rocchio classifier has the advantage over k-nearest neighbor classifier in the classification process. The experimental results illustrate that the proposed model is an efficient method and gives generalization accuracy of about 98%

    Early texts on Hindu-Arabic calculation

    Get PDF
    This article describes how the decimal place value system was transmitted from India via the Arabs to the West up to the end of the fifteenth century. The arithmetical work of al-KhwÂŻarizm¯ı’s, ca. 825, is the oldest Arabic work on Indian arithmetic of which we have detailed knowledge. There is no known Arabic manuscript of this work; our knowledge of it is based on an early reworking of a Latin translation. Until some years ago, only one fragmentary manuscript of this twelfth-century reworking was known (Cambridge, UL, Ii.6.5). Another manuscript that transmits the complete text (New York, Hispanic Society of America, HC 397/726) has made possible a more exact study of al-KhwÂŻarizm¯ı’s work. This article gives an outline of this manuscript’s contents and discusses some characteristics of its presentation

    Component-based Segmentation of words from handwritten Arabic text

    Get PDF
    Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words segmentation. Meanwhile, an improved projection based method is also employed for baseline detection. The proposed method has been successfully tested on IFN/ENIT database consisting of 26459 Arabic words handwritten by 411 different writers, and the results were promising and very encouraging in more accurate detection of the baseline and segmentation of words for further recognition

    A Study for the Necessity of Risk Assessment for Heavy metal Pollution in the Barada Basin, Syria

    Get PDF
    Manufacturing industries are blooming rapidly in Barada Basin carrying high risk to environment and human health due to generating huge amounts of heavy metals to environmental media, particularly rivers. Few studies show that concentrations of chromium, cadmium, and lead exceed the standards in down streams of rivers. Risk assessment on human health urges for immediate measures to control the emission of these metals. In this paper, we discuss the implementation of comprehensive risk management policy as an element to introduce an optimal policy to reduce water pollution and improve water quality in the Barada Basin.
    • 

    corecore