107 research outputs found

    A Semi-automatic Approach to Identifying and Unifying Ambiguously Encoded Arabic-Based Characters

    Get PDF
    In this study, we outline a potential problem in normalising texts that are based on a modified version of the Arabic alphabet. One of the main resources available for processing resource-scarce languages is raw text collected from the Internet. Many less-resourced languages, such as Kurdish, Farsi, Urdu, Pashtu, etc., use a modified version of the Arabic writing system. Many characters in harvested data from the Internet may have exactly the same form but encoded with different Unicode values (ambiguous characters). The existence of ambiguous characters in words leads to word duplication, thus it is important to identify and unify ambiguous characters during the normalisation stage. Here, we demonstrate cases related to ambiguous Kurdish and Farsi characters and propose a semi-automatic approach to identifying and unifying them

    Improved Arabic Characters Recognition by Combining Multiple Machine Learning Classifiers

    Get PDF
    In this paper, we investigate a range of strategies for combining multiple machine learning techniques for recognizing Arabic characters, where we are faced with imperfect and dimensionally variable input characters. Experimental results show that combined confidence-based backoff strategies can produce more accurate results than each technique produces by itself and even the ones exhibited by the majority voting combination

    Identification of Code-Switched Sentences and Words Using Language Modeling Approaches

    Get PDF
    Globalization and multilingualism contribute to code-switching—the phenomenon in which speakers produce utterances containing words or expressions from a second language. Processing code-switched sentences is a significant challenge for multilingual intelligent systems. This study proposes a language modeling approach to the problem of code-switching language processing, dividing the problem into two subtasks: the detection of code-switched sentences and the identification of code-switched words in sentences. A code-switched sentence is detected on the basis of whether it contains words or phrases from another language. Once the code-switched sentences are identified, the positions of the code-switched words in the sentences are then identified. Experimental results show that the language modeling approach achieved an F-measure of 80.43% and an accuracy of 79.01% for detecting Mandarin-Taiwanese code-switched sentences. For the identification of code-switched words, the word-based and POS-based models, respectively, achieved F-measures of 41.09% and 53.08%

    A Combined DNA-Affinic Molecule and N-Mustard Alkylating Agent Has an Anti-Cancer Effect and Induces Autophagy in Oral Cancer Cells

    Get PDF
    Although surgery or the combination of chemotherapy and radiation are reported to improve the quality of life and reduce symptoms in patients with oral cancer, the prognosis of oral cancer remains generally poor. DNA alkylating agents, such as N-mustard, play an important role in cancer drug development. BO-1051 is a new 9-anilinoacridine N-mustard-derivative anti-cancer drug that can effectively target a variety of cancer cell lines and inhibit tumorigenesis in vivo. However, the underlying mechanism of BO-1051-mediated tumor suppression remains undetermined. In the present study, BO-1051 suppressed cell viability with a low IC50 in oral cancer cells, but not in normal gingival fibroblasts. Cell cycle analysis revealed that the tumor suppression by BO-1051 was accompanied by cell cycle arrest and downregulation of stemness genes. The enhanced conversion of LC3-I to LC3-II and the formation of acidic vesicular organelles indicated that BO-1501 induced autophagy. The expression of checkpoint kinases was upregulated as demonstrated with Western blot analysis, showing that BO-1051 could induce DNA damage and participate in DNA repair mechanisms. Furthermore, BO-1051 treatment alone exhibited a moderate tumor suppressive effect against xenograft tumor growth in immunocompromised mice. Importantly, the combination of BO-1051 and radiation led to a potent inhibition on xenograft tumorigenesis. Collectively, our findings demonstrated that BO-1051 exhibited a cytotoxic effect via cell cycle arrest and the induction of autophagy. Thus, the combination of BO-1051 and radiotherapy may be a feasible therapeutic strategy against oral cancer in the future

    Oncologic impact of delay between diagnosis and radical nephroureterectomy

    Get PDF
    PurposeThis study aimed to evaluate the oncological outcome of delayed surgical wait time from the diagnosis of upper tract urothelial carcinoma (UTUC) to radical nephroureterectomy (RNU).MethodsIn this multicenter retrospective study, medical records were collected between 1988 and 2021 from 18 participating Taiwanese hospitals under the Taiwan UTUC Collaboration Group. Patients were dichotomized into the early (≤90 days) and late (>90 days) surgical wait-time groups. Overall survival, disease-free survival, and bladder recurrence-free survival were calculated using the Kaplan–Meier method and multivariate Cox regression analysis. Multivariate analysis was performed using stepwise linear regression.ResultsOf the 1251 patients, 1181 (94.4%) were classifed into the early surgical wait-time group and 70 (5.6%) into the late surgical wait-time group. The median surgical wait time was 21 days, and the median follow-up was 59.5 months. Our study showed delay-time more than 90 days appeared to be associated with worse overall survival (hazard ratio [HR] 1.974, 95% confidence interval [CI] 1.166−3.343, p = 0.011), and disease-free survival (HR 1.997, 95% CI 1.137−3.507, p = 0.016). This remained as an independent prognostic factor after other confounding factors were adjusted. Age, ECOG performance status, Charlson Comorbidity Index (CCI), surgical margin, tumor location and adjuvant systemic therapy were independent prognostic factors for overall survival. Tumor location and adjuvant systemic therapy were also independent prognostic factors for disease-free survival.ConclusionsFor patients with UTUC undergoing RNU, the surgical wait time should be minimized to less than 90 days. Prolonged delay times may be associated with poor overall and disease-free survival

    Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector

    Get PDF
    A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements

    Search for single production of vector-like quarks decaying into Wb in pp collisions at s=8\sqrt{s} = 8 TeV with the ATLAS detector

    Get PDF
    corecore