9,134 research outputs found

    A Biased Topic Modeling Approach for Case Control Study from Health Related Social Media Postings

    Get PDF
    abstract: Online social networks are the hubs of social activity in cyberspace, and using them to exchange knowledge, experiences, and opinions is common. In this work, an advanced topic modeling framework is designed to analyse complex longitudinal health information from social media with minimal human annotation, and Adverse Drug Events and Reaction (ADR) information is extracted and automatically processed by using a biased topic modeling method. This framework improves and extends existing topic modelling algorithms that incorporate background knowledge. Using this approach, background knowledge such as ADR terms and other biomedical knowledge can be incorporated during the text mining process, with scores which indicate the presence of ADR being generated. A case control study has been performed on a data set of twitter timelines of women that announced their pregnancy, the goals of the study is to compare the ADR risk of medication usage from each medication category during the pregnancy. In addition, to evaluate the prediction power of this approach, another important aspect of personalized medicine was addressed: the prediction of medication usage through the identification of risk groups. During the prediction process, the health information from Twitter timeline, such as diseases, symptoms, treatments, effects, and etc., is summarized by the topic modelling processes and the summarization results is used for prediction. Dimension reduction and topic similarity measurement are integrated into this framework for timeline classification and prediction. This work could be applied to provide guidelines for FDA drug risk categories. Currently, this process is done based on laboratory results and reported cases. Finally, a multi-dimensional text data warehouse (MTD) to manage the output from the topic modelling is proposed. Some attempts have been also made to incorporate topic structure (ontology) and the MTD hierarchy. Results demonstrate that proposed methods show promise and this system represents a low-cost approach for drug safety early warning.Dissertation/ThesisDoctoral Dissertation Computer Science 201

    Credit risk measurement model for small and medium enterprises : the case of Zimbabwe

    Get PDF
    Abstracts in English, Zulu and Southern SothoThe advent of Basel II Capital Accord has revolutionised credit risk measurement (CRM) to the extent that the once “perceived riskier bank assets” are now accommodated for lending. The Small and Medium Enterprise (SME) sector has been traditionally perceived as a riskier and unprofitable asset for lending activity by Commercial Banks, in particular. But empirical studies on the implementation of the Basel II internal-ratings-based (IRB) framework have demonstrated that SME credit risk is measurable. Banks are still finding it difficult to forecast SME loan default and to provide credit to the sector that meet Basel’s capital requirements. The thesis proposes to construct an empirical credit risk measurement (CRM) model, specifically for SMEs, to ameliorate the adverse effects of SME credit inaccessibility due to high information asymmetry between financial institutions (FI) and SMEs in Zimbabwe. A well-performing and accurate CRM helps FIs to control their risk exposure through selective granting of credit based on a thorough statistical analysis of historical customer data. This thesis develops a CRM model, built on a statistically random sample, known-good-bad (KGB) sample, which is a better representation of the through-the-door (TTD) population of SME loan applicants. The KGB sample incorporates both accepted and rejected applications, through reject inference (RI). A model-based bound and collapse (BC) reject inference methodology was empirically used to correct selectivity bias inherent in CRM domain. The results have shown great improvement in the classification power and aggregate supply of credit supply to the SME portfolio of the case-studied bank, as evidenced by substantial decrease of bad rates across models developed; from the preliminary model to final model designed for the case-studied bank. The final model was validated using both bad rate, confusion matrix metrics and Area under Receiver Operating Characteristic (AUROC) curve to assess the classification power of the model within-sample and out-of-sample. The AUROC for the final model (weak model) was found to be 0.9782 whilst bad rate was found to be 14.69%. There was 28.76% improvement in the bad rate in the final model in comparison with the current CRM model being used by the case-studied bank.Isivumelwano seBasel II Capital Accord sesishintshe indlela yokulinganisa ubungozi bokunikezana ngesikweletu credit risk measurement (CRM) kwaze kwafika ezingeni lapho izimpahla ezazithathwa njengamagugu anobungozi “riskier bank assets” sezimukelwa njengesibambiso sokuboleka imali. Umkhakha wezamaBhizinisi Amancane naSafufusayo, phecelezi, Small and Medium Enterprise (SME) kudala uqondakala njengomkhakha onobungozi obukhulu futhi njengomkhakha ongangenisi inzuzo, ikakhulu njengesibambiso sokubolekwa imali ngamabhange ahwebayo. Kodwa izifundo zocwaningo ezimayelana nokusetshenziswa nokusetshenziswa kwesakhiwo iBasel II internal-ratings-based (IRB) sezikhombisile ukuthi ubungozi bokunikeza isikweletu kumabhizinisi amancane nasafufusayo (SME) sebuyalinganiseka. Yize kunjalo, amabhange asathola ukuthi kusenzima ukubona ngaphambili inkinga yokungabhadeleki kahle kwezikweletu kanye nokunikeza isikweletu imikhakha enemigomo edingekayo yezimali kaBasel. Lolu cwaningo beluphakamisa ukwakha uhlelo imodeli ephathekayo yokulinganisa izinga lobungozi bokubolekisa ngemali (CRM) kwihlelo lokuxhasa ngezimali ama-SME, okuyihlelo elilawulwa yiziko lezimali ezweni laseZimbabwe. Imodeli ye-CRM esebenza kahle futhi eshaya khona inceda amaziko ezimali ukugwema ubungozi bokunikezana ngezikweletu ngokusebenzisa uhlelo lokunikeza isikweletu ababoleki abakhethekile, lokhu kususelwa ohlelweni oluhlaziya amanani edatha engumlando wekhasimende. Imodeli ye-CRM ephakanyisiwe yaqala yakhiwa ngohlelo lwamanani, phecelezi istatistically random sample, okuluphawu olungcono olumele uhlelo lwe through-the-door (TTD) population lokukhetha abafakizicelo zokubolekwa imali bama SME, kanti lokhu kuxuba zona zombili izicelo eziphumelele kanye nezingaphumelelanga. Indlela yokukhetha abafakizicelo, phecelezi model-based bound-and-collapse (BC) reject-inference methodology isetshenzisiwe ukulungisa indlela yokukhetha ngokukhetha ngendlela yokucwasa kwisizinda seCRM. Imiphumela iye yakhombisa intuthuko enkulu mayelana namandla okwehlukanisa kanye nokunikezwa kwezikweletu kuma SME okungamamabhange enziwe ucwaningo lotho., njengoba lokhu kufakazelwa ukuncipha okukhulu kwe-bad rate kuwo wonke amamodeli athuthukisiwe. Imodeli yokuqala kanye neyokugcina zazidizayinelwe ibhange. Imodeli yokugcina yaqinisekiswa ngokusebenzisa zombili indlela isikweletu esingagculisi kanye negrafu ye-Area under Receiver Operating Characteristic (AUROC) ukulinganisa ukwehlukaniswa kwamandla emodeli engaphakathi kwesampuli nangaphandle kwesampuli. Uhlelo lwe-AUROC lwemodeli yokugcina (weak model) lwatholakala ukuthi luyi 0.9782, kanti ibad rate yatholakala ukuthi yenza i-14.69%. Kwaba khona ukuthuthuka nge-28.76% kwi-bad rate kwimodeli yokugcina uma iqhathaniswa nemodeli yamanje iCRM model ukuba isetshenziswe yibhange elithile.Basel II Capital Accord e fetotse tekanyo ya kotsi ya mokitlane (credit risk measurement (CRM)) hoo “thepa e kotsi ya dibanka” ka moo e neng e bonwa ka teng, e seng e fuwa sebaka dikadimong. Lekala la Dikgwebo tse Nyane le tse Mahareng (SME) le bonwa ka tlwaelo jwalo ka lekala le kotsi e hodimo le senang ditswala bakeng sa ditshebetso tsa dikadimo haholo ke dibanka tsa kgwebo. Empa dipatlisiso tse thehilweng hodima se bonweng kapa se etsahetseng tsa tshebetso ya moralo wa Basel II internal-ratings-based (IRB) di supile hore kotsi ya mokitlane ya SME e kgona ho lekanngwa. Leha ho le jwalo, dibanka di ntse di thatafallwa ke ho bonelapele palo ya ditlholeho tsa ho lefa tsa diSME le ho fana ka mokitla lekaleng leo le kgotsofatsang ditlhoko tsa Basel tsa ditjhelete. Phuputso ena e ne sisinya ho etsa tekanyo ya se bonwang ho mmotlolo wa kotsi ya mokitlane (CRM) tshebetsong ya phano ya tjhelete ya diSME e etswang ke setsi sa ditjhelete (FI) ho la Zimbabwe. Mmotlolo o sebetsang hantle hape o fanang ka dipalo tse nepahetseng o dusa diFI hore di laole pepeso ya tsona ho kotsi ka phano e kgethang ya mokitlane, e thehilweng hodima manollo ya dipalopalo ya dintlha tsa histori ya bareki. Mmotlolo o sisingwang wa CRM o hlahisitswe ho tswa ho sampole e sa hlophiswang, e leng pontsho e betere ya setjhaba se ikenelang le monyako (TTD) ya batho bao e kang bakadimi ba tjhelete ho diSME, hobane e kenyelletsa bakopi ba amohetsweng le ba hannweng. Mokgwatshebetso wa bound-and-collapse (BC) reject-inference o kentswe tshebetsong ho nepahatsa tshekamelo ya kgetho e leng teng ho lekala la CRM. Diphetho tsena di bontshitse ntlafalo e kgolo ho matla a tlhophiso le palohare ya phano ya mokitlane ho diSME tsa banka eo ho ithutilweng ka yona, jwalo ka ha ho pakilwe ke ho phokotseho ya direite tse mpe ho pharalla le dimmotlolo tse hlahisitsweng. Mmotlolo wa ho qala le wa ho qetela e ile ya ralwa bakeng sa banka. Mmotlolo wa ho qetela o ile wa netefatswa ka tshebediso ya bobedi reite e mpe le mothinya wa Area under Receiver Operating Characteristic (AUROC) ho lekanya matla a kenyo mekgahlelong a mmotlolo kahare ho sampole le kantle ho yona. AUROC bakeng sa mmotlo wa ho qetela (mmotlolo o fokotseng) e fumanwe e le 0.9782, ha reite e mpe e fumanwe e le 14.69%. Ho bile le ntlafalo ya 28.76% ho reite e mpe bakeng sa mmotlolo wa ho qetela ha ho bapiswa le mmotlolo wa CRM ha o sebediswa bankeng yona eo.Graduate School of Business LeadershipD.B.L

    Constrained Bayesian Estimation of Inverse Probability Weights for Nonmonotone Missing Data

    Get PDF
    Copyright c©2014 by the authors

    Bridging the Global Divide in AI Regulation: A Proposal for a Contextual, Coherent, and Commensurable Framework

    Full text link
    This paper examines the current landscape of AI regulations, highlighting the divergent approaches being taken, and proposes an alternative contextual, coherent, and commensurable (3C) framework. The EU, Canada, South Korea, and Brazil follow a horizontal or lateral approach that postulates the homogeneity of AI systems, seeks to identify common causes of harm, and demands uniform human interventions. In contrast, the U.K., Israel, Switzerland, Japan, and China have pursued a context-specific or modular approach, tailoring regulations to the specific use cases of AI systems. The U.S. is reevaluating its strategy, with growing support for controlling existential risks associated with AI. Addressing such fragmentation of AI regulations is crucial to ensure the interoperability of AI. The present degree of proportionality, granularity, and foreseeability of the EU AI Act is not sufficient to garner consensus. The context-specific approach holds greater promises but requires further development in terms of details, coherency, and commensurability. To strike a balance, this paper proposes a hybrid 3C framework. To ensure contextuality, the framework categorizes AI into distinct types based on their usage and interaction with humans: autonomous, allocative, punitive, cognitive, and generative AI. To ensure coherency, each category is assigned specific regulatory objectives: safety for autonomous AI; fairness and explainability for allocative AI; accuracy and explainability for punitive AI; accuracy, robustness, and privacy for cognitive AI; and the mitigation of infringement and misuse for generative AI. To ensure commensurability, the framework promotes the adoption of international industry standards that convert principles into quantifiable metrics. In doing so, the framework is expected to foster international collaboration and standardization without imposing excessive compliance costs
    • …
    corecore