4 research outputs found

    A Survey of OCR in Arabic Language: Applications, Techniques, and Challenges

    No full text
    Optical character recognition (OCR) is the process of extracting handwritten or printed text from a scanned or printed image and converting it to a machine-readable form for further data processing, such as searching or editing. Automatic text extraction using OCR helps to digitize documents for improved productivity and accessibility and for preservation of historical documents. This paper provides a survey of the current state-of-the-art applications, techniques, and challenges in Arabic OCR. We present the existing methods for each step of the complete OCR process to identify the best-performing approach for improved results. This paper follows the keyword-search method for reviewing the articles related to Arabic OCR, including the backward and forward citations of the article. In addition to state-of-art techniques, this paper identifies research gaps and presents future directions for Arabic OCR

    Promoting the suitability of rice husk ash concrete in the building sector via contemporary machine intelligence techniques

    No full text
    Eco-friendly concrete is in great demand, and as a consequence, the necessity to find sustainable alternatives to ordinary cement has become critical. In recent years, there has been considerable interest in the potential benefits of utilizing rice husk ash (RHA) as a cement substitute in concrete. Evaluating concrete properties in a laboratory setup like compressive strength (CS) is a time-consuming and expensive process. The development of trustworthy and precise models for CS estimation might be a better strategy. To determine the CS of RHA concrete, this research used boosting ensemble algorithms, namely gradient boosting (GB), AdaBoost regressor (ABR), and extreme gradient boosting (XGB), over a vast literature database. The estimation models were validated using statistical measures, error distribution by plotting violin graphs, and k-fold analysis. Furthermore, SHapley Additive exPlanations (SHAP) analysis was utilized to assess the importance of contributing elements. The results showed that XGB had the greatest performance in estimating the CS of RHA concrete out of the three methods tested. While the GB and ABR models both had R2 values of 0.90 and 0.94, respectively, the XGB model achieved a value of 0.96. The violin graphs indicated that the average absolute error values for the GB, ABR, and XGB were 5.82, 4.65, and 3.53 MPa, implying the higher precision of the XGB model in estimating the CS of RHA concrete. The SHAP study demonstrated that the three most influential factors in increasing the strength were cement, specimen age, and rice husk ash. The construction sector may benefit from the application of such technologies by facilitating the development of quick and low-cost methods for identifying material qualities and the influence of input parameters

    A data-driven approach to predict the compressive strength of alkali-activated materials and correlation of influencing parameters using SHapley Additive exPlanations (SHAP) analysis

    No full text
    This research used gene expression programming (GEP) and multi expression programming (MEP) to determine the compressive strength (CS) of alkali-activated material (AAM) to compare and develop more reliable genetic algorithm-based prediction models. To learn more about how raw ingredients affect and interact with the CS of AAM, a SHapley Additive exPlanations (SHAP) analysis was conducted. A comprehensive dataset containing 676 points with fifteen influential parameters was formulated from the previously published literature. According to this study, considering the impact of 15 input variables, both genetic algorithms produced results close to the experimental CS (retrieved from the literature). When the performance of the GEP and MEP models were compared, it was found that the MEP model, with an R2 of 0.86, performed better than the GEP model, with an R2 of 0.82. The assessment of the statistical parameters of generated models revealed that the MEP model was more effective. Additionally, SHAP analysis revealed that slag content, followed by the specimen's age, sodium silicate, and curing temperature, showed a positive correlation with CS of AAM, which were the most important parameters. The results also revealed the importance of chemical contents, i.e., CaO, SiO2, Al2O3, of FA and slag on the CS of AAM. The built models might be used to compute the CS of AAMs with varying input parameter values, minimizing the effort, time, and cost of unnecessary lab tests. Furthermore, the outcomes of the SHAP study might help researchers and the industry determine the quantity or composition of raw ingredients when producing AAMs
    corecore