16 research outputs found

    Exploiting biomedical web resources: a case study

    Get PDF
    An increasing number of web resources continue to be extensively used by healthcare operators to obtain more accurate diagnostic results. In particular, health care is reaping the benefits of technological advances in genomic for facing the demand of genetic tests that allow a better comprehension of diagnostic results. Within this context, Gene Ontology (GO) is a popular and effective mean for extracting knowledge from a list of genes and evaluating their semantic similarity. This paper investigates about the potential and any limits of GO ontology as support for capturing information about a set of genes which are supposed to play a significant role in a pathological condition. In particular, we present a case study that exploits some biomedical web resources for devising several groups of functionally coherent genes and experiments about the evaluation of their semantic similarity over GO. Due to the GO structure and content, results reveal limitations that not affect the evaluation of the semantic similarity when genes exhibit simple correlations but influence the estimation of the relatedness of genes belonging to complex organizations

    Knowledge-based extraction of adverse drug events from biomedical text

    Get PDF
    Background: Many biomedical relation extraction systems are machine-learning based and have to be trained on large annotated corpora that are expensive and cumbersome to construct. We developed a knowledge-based relation extraction system that requires minimal training data, and applied the system for the extraction of adverse drug events from biomedical text. The system consists of a concept recognition module that identifies drugs and adverse effects in sentences, and a knowledg

    Term Extraction and Disambiguation for Semantic Knowledge Enrichment: A Case Study on Initial Public Offering (IPO)

    Get PDF
    Domain knowledge bases are a basis for advanced knowledge-based systems, manually creating a formal knowledge base for a certain domain is both resource consuming and non-trivial. In this paper, we propose an approach that provides support to extract, select, and disambiguate terms embedded in domain specific documents. The extracted terms are later used to enrich existing ontologies/taxonomies, as well as to bridge domain specific knowledge base with a generic knowledge base such as WordNet. The proposed approach addresses two major issues in the term extraction domain, namely quality and efficiency. Also, the proposed approach adopts a feature-based method that assists in topic extraction and integration with existing ontologies in the given domain. The proposed approach is realized in a research prototype, and then a case study is conducted in order to illustrate the feasibility and the efficiency of the proposed method in the finance domain. A preliminary empirical validation by the domain experts is also conducted to determine the accuracy of the proposed approach. The results from the case study indicate the advantages and potential of the proposed approach

    The Study on Automatic Annotation using Structural/Linguistic Characteristics of biomedical documents

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์น˜์˜๊ณผํ•™๊ณผ ์˜๋ฃŒ๊ฒฝ์˜์ •๋ณดํ•™์ „๊ณต, 2015. 8. ๊น€ํ™๊ธฐ.์ž๋™ ์–ด๋…ธํ…Œ์ด์…˜์— ๋Œ€ํ•œ ์—ฐ๊ตฌ๋Š” ๊ธ‰์†๋„๋กœ ์ฆ๊ฐ€ํ•˜๋Š” ์˜์ƒ๋ช… ๋ถ„์•ผ์˜ ๋…ผ๋ฌธ ๊ณผ ์ž„์ƒ ๋ฌธ์„œ๋“ค์„ ๋”์šฑ ์ •ํ™•ํ•˜๊ฒŒ ๊ฒ€์ƒ‰ํ•˜๊ฑฐ๋‚˜ ํ•„์š”ํ•œ ์ •๋ณด๋งŒ์„ ์ถ”์ถœํ•  ์ˆ˜ ์žˆ๊ฒŒ ํ•˜๋Š” ๊ธฐ๋ฐ˜์ด ๋œ๋‹ค๋Š” ์ ์—์„œ ์ค‘์š”ํ•˜๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š”, ๊ทธ ์ค‘ ์—ฐ๊ตฌ ํ™œ ๋™์—์„œ ํ•„์ˆ˜์ ์ธ ๋…ผ๋ฌธ ๊ฒ€์ƒ‰๊ณผ ํ™˜์ž์˜ ์งˆ๋ณ‘์— ๋Œ€ํ•œ ์ง„๋‹จ, ๊ฒ€์‚ฌ, ๊ทธ๋ฆฌ๊ณ  ์ฒ˜ ๋ฐฉ ๋“ฑ์„ ๊ธฐ๋กํ•˜๋Š”๋ฐ ํ•„์ˆ˜์ ์ธ ์ž„์ƒ์„œ์‹์˜ ์ž‘์„ฑ์— ์ดˆ์ ์„ ๋งž์ถ”์–ด, ์ด์— ํ•„ ์š”ํ•œ ์–ด๋…ธํ…Œ์ด์…˜ ๊ธฐ์ˆ ์„ ์—ฐ๊ตฌํ•˜์˜€๋‹ค. ์ด ๋‘ ๊ฐ€์ง€ ํ™œ๋™์€ ์˜์ƒ๋ช… ๋ถ„์•ผ์˜ ๋Œ€ ํ‘œ ๋ฌธ์„œ์ธ ๋…ผ๋ฌธ๊ณผ ์ž„์ƒ์„œ์‹์„ ๋Œ€์ƒ์œผ๋กœ ์ผ์ƒ์ ์œผ๋กœ ์ผ์–ด๋‚˜๋Š” ๊ฒƒ์ด๋ฉฐ, ์ด ๋Ÿฌํ•œ ํ™œ๋™์ด ํšจ์œจ์ ์œผ๋กœ ๊ฐœ์„ ๋˜๋Š” ๊ฒƒ์€ ์˜์ƒ๋ช… ๋ถ„์•ผ์—์„œ ์ค‘์š”ํ•œ ์˜๋ฏธ๋ฅผ ๊ฐ€์ง„๋‹ค. ๋จผ์ €, ํ…์ŠคํŠธ ํ˜•์‹์˜ ์—ฐ๊ตฌ ๋…ผ๋ฌธ์— ๋Œ€ํ•ด์„œ๋Š” ์—ฐ๊ตฌ ํ™œ๋™์˜ ๋ฐฉํ–ฅ ์„ค์ •์— ์ค‘ ์š”ํ•œ ์—ญํ• ์„ ํ•˜๋Š” ์ดˆ๋ก์„ ๋Œ€์ƒ์œผ๋กœ, ์˜์ƒ๋ช… ๋ถ„์•ผ์—์„œ ์ฃผ๋กœ ์‚ฌ์šฉํ•˜๋Š” IMRAD(Introduction, Methods, Results, and Discussion)๋กœ์˜ ์ž๋™ ํƒœ๊น…์„ ์—ฐ๊ตฌํ•˜์˜€๋‹ค. ์ด ์—ฐ๊ตฌ์—์„œ๋Š”, ๊ธฐ์กด ์–ธ์–ดํ•™ ๋ถ„์•ผ์—์„œ ์˜์ƒ๋ช… ๋ถ„์•ผ์˜ ๋…ผ๋ฌธ์„ ๋Œ€์ƒ์œผ๋กœ ์ด๋ฃฌ ๊ฒฐ๊ณผ์™€ ์ปดํ“จํ„ฐ ๊ณผํ•™ ๋ถ„์•ผ์—์„œ ์ง„ํ–‰๋ผ์˜จ ๊ฒฐ๊ณผ๋ฅผ ๊ธฐ ๋ฐ˜์œผ๋กœ, ๊ณ„์‚ฐ ๋น„์šฉ์ด ์ ์œผ๋ฉด์„œ๋„ ๋†’์€ ์„ฑ๋Šฅ์„ ๋‚ด๋Š” ์ƒˆ๋กœ์šด ์ž๋™ ํƒœ๊น… ์‹œ์Šค ํ…œ์„ ์ œ์•ˆํ•˜๊ณ  ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์•ˆํ•œ ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ, ๋ฌธ ์žฅ์—์„œ ๋ฝ‘์•„๋‚ธ 17๊ฐœ์˜ ํŠน์ง•๋งŒ์œผ๋กœ๋„ ๋น„๊ตฌ์กฐํ™”๋œ ์ดˆ๋ก์„ Accuracy 77.0 ~ 90.3%์˜ ์„ฑ๋Šฅ์œผ๋กœ ๋ถ„๋ฅ˜ํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๋˜ํ•œ, ๊ธฐ์กด ์—ฐ๊ตฌ๋“ค์—์„œ ์‚ฌ์šฉํ•œ ํŠน ์ง•๋“ค๊ณผ ํ•จ๊ป˜ ์‚ฌ์šฉํ–ˆ์„ ๋•Œ๋Š” ์ตœ๋Œ€ Accuracy 91.7%์˜ ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค. ์ž„์ƒ ๋ฌธ์„œ์˜ ๊ฒฝ์šฐ, EMR(Electronic Medical Record)์„ ์‹œ์Šคํ…œ์„ ์‚ฌ์šฉํ•˜๋Š” ํ™˜๊ฒฝ์—์„œ๋Š” ์ž„์ƒ ์„œ์‹์„ ํ†ตํ•ด ์ƒ์„ฑ๋˜๋Š” ๊ฒฝ์šฐ๊ฐ€ ๋Œ€๋ถ€๋ถ„์ด๋ฏ€๋กœ, ์ž„ ์ƒ ์„œ์‹์„ ๋Œ€์ƒ์œผ๋กœ ์ž๋™ ํƒœ๊น…์„ ์‹œ๋„ํ•˜์˜€๋‹ค. ์ž„์ƒ ์„œ์‹์€ ์—ฐ๊ตฌ ์ดˆ๋ก๊ณผ๋Š” ๋‹ฌ๋ฆฌ ์ด๋ฏธ ๊ตฌ์กฐํ™”๋œ ํ˜•์‹์„ ๊ฐ€์ง€๊ณ  ์žˆ์œผ๋ฏ€๋กœ, ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ด ๊ตฌ์กฐ ์•ˆ์— ๋‚ด์žฌ๋œ ์ „๋ฌธ๊ฐ€์˜ ์ง€์‹์„ ํƒœ๊น…ํ•˜๊ณ ์ž ํ•˜์˜€๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์ƒˆ๋กœ์šด ์ง€์‹๋ชจ๋ธ ๊ณผ ์ด๋ฅผ ์ด์šฉํ•œ ์ž„์ƒ ์„œ์‹ ์ž‘์„ฑ ์ง€์› ์‹œ์Šคํ…œ์ธ STEP(Smart Clinical Document Template Editing and Production System)์„ ๊ฐœ๋ฐœํ•˜์˜€๋‹ค. STEP์˜ ์‹œ์Šคํ…œ์˜ ํ™œ์šฉ์„ฑ์„ ๊ฒ€์ฆํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์ž„์ƒ ์„œ์‹ ์ž‘์„ฑ ๋„๊ตฌ๋ฅผ ๊ฐœ ๋ฐœํ•˜์—ฌ, ์ง€์‹ ๋ชจ๋ธ์„ ํ†ตํ•ด ๊ตฌ์ถ•๋œ ์ง€์‹๋ฒ ์ด์Šค๊ฐ€ ์ž„์ƒ ์„œ์‹์˜ ์ž‘์„ฑ์„ ๊ฐœ์„  ์‹œํ‚ฌ ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค. ์—ฐ๊ตฌ ๊ฒฐ๊ณผ๋Š” ์˜์ƒ๋ช… ๋ถ„์•ผ์˜ ์—ฐ๊ตฌ์ž๋“ค์—๊ฒŒ ๋Œ€๊ทœ๋ชจ์˜ ์˜์ƒ๋ช… ๊ด€๋ จ ๋…ผ๋ฌธ๊ณผ ์ž„์ƒ์—์„œ ์ง€์†์ ์œผ๋กœ ์ƒ์‚ฐ๋˜๋Š” ์ž„์ƒ ๋ฌธ์„œ๊ฐ€ ๋”์šฑ ์ •ํ™•ํ•˜๊ฒŒ ๊ฒ€์ƒ‰๋˜๊ณ  ์žฌ์‚ฌ ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์—ฌ์ฃผ๊ณ  ์žˆ๋‹ค. ์ด๋Ÿฌํ•œ ๊ฒฐ๊ณผ๋Š” ์˜์ƒ๋ช… ๋ถ„์•ผ ์ „๋ฐ˜์—์„œ ์—ฐ๊ตฌ ์ž๋“ค์˜ ํ™œ๋™์„ ๊ฐœ์„ ์‹œํ‚ฌ ์ˆ˜ ์žˆ๋‹ค๋Š” ์ ์—์„œ ์ค‘์š”ํ•˜๋‹ค. ๋งˆ์ง€๋ง‰์œผ๋กœ, ๋ณธ ์—ฐ๊ตฌ ์˜ ์„ฑ๊ณผ๊ฐ€ ๋‹ค๋ฅธ ์—ฐ๊ตฌ์ž๋“ค์—๊ฒŒ๋„ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ๋„๋ก, ์—ฐ๊ตฌ ๊ณผ์ •์—์„œ ์ถ”์ถœํ•œ ์–ธ์–ด ์ž์›๊ณผ ๊ฒฐ๊ณผ๋ฅผ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋Š” ์‹œ์Šคํ…œ์„ ์›น์œผ๋กœ ๊ณต๊ฐœํ•˜์˜€๋‹ค.์ดˆ ๋ก....................................................................................................i ๋ชฉ ์ฐจ..................................................................................................iii I. ์„œ๋ก ................................................................................................1 1. ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ......................................................................................1 2. ์—ฐ๊ตฌ ๋ชฉ์  ......................................................................................5 3. ๋…ผ๋ฌธ์˜ ๊ตฌ์„ฑ....................................................................................6 II. ๊ตฌ์กฐํ™”๋œ ์ดˆ๋ก์˜ ์–ธ์–ด์  ํŠน์ง• ์ถ”์ถœ..................................................7 1. ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ .....................................................................................7 2. ์—ฐ๊ตฌ ๋ชฉ์  .....................................................................................9 3. ๊ด€๋ จ ์—ฐ๊ตฌ .....................................................................................9 4. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ• ................................................................................... 12 4.1. ๋ฐ์ดํ„ฐ ์ฝ”ํผ์Šค ......................................................................... 13 4.2. ์„น์…˜ ์ •๊ทœํ™”............................................................................. 14 4.3. ์„น์…˜ ๋งตํ•‘ ............................................................................... 17 4.4. ์–ธ์–ด์  ํŠน์ง• ์ถ”์ถœ ..................................................................... 18 5. ๊ฒฐ๊ณผ ......................................................................................... 20 5.1. ์„น์…˜๋ณ„ ๋™์‚ฌ/๋™์‚ฌ๊ตฌ์˜ ์‚ฌ์šฉ ํŠน์ง• .................................................. 20 5.2. ์„น์…˜๋ณ„ N-gram์˜ ์‚ฌ์šฉ ํŠน์ง• ...................................................... 22 5.3. ์„น์…˜๋ณ„ ๋ช…์‚ฌ(๊ตฌ)์˜ ์‚ฌ์šฉ ํŠน์ง• ....................................................... 24 5.4. ์–ธ์–ด์  ํŠน์ง•๋“ค์˜ ์„น์…˜ ๊ตฌ๋ณ„๋ ฅ ...................................................... 27 6. ๊ฒฐ๋ก  .......................................................................................... 41 III. ์–ธ์–ด์  ํŠน์ง•์„ ์ด์šฉํ•œ ์ดˆ๋ก ๋ฌธ์žฅ ๋ถ„๋ฅ˜................................................. 44 1. ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ................................................................................... 44 2. ์—ฐ๊ตฌ ๋ชฉ์  ................................................................................... 45 3. ๊ด€๋ จ ์—ฐ๊ตฌ ................................................................................... 45 4. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ• ................................................................................... 48 4.1. Feature Set ๊ตฌ์„ฑ ................................................................... 48 4.2. ํ…Œ์ŠคํŠธ ๋ฌธ์„œ ์ง‘ํ•ฉ ...................................................................... 52 4.3. SVM์„ ์ด์šฉํ•œ ํ•™์Šต ๋ฐ ํ‰๊ฐ€ ....................................................... 53 5. ์—ฐ๊ตฌ ๊ฒฐ๊ณผ ................................................................................... 54 5.1. ์–ธ์–ด์  ํŠน์ง•๋ณ„ ์„ฑ๋Šฅ.....................................................................54 5.2. ํŠน์ง• ๊ทธ๋ฃน ์กฐํ•ฉ๋ณ„ ์„ฑ๋Šฅ ............................................................... 56 6. ๋…ผ์˜ .......................................................................................... 65 IV. ์˜์ƒ๋ช… ์ดˆ๋ก ๋ฌธ์žฅ ์ž๋™ ํƒœ๊น… ์‹œ์Šคํ…œ.............................................. 67 1. ์‹œ์Šคํ…œ ์†Œ๊ฐœ ................................................................................ 67 2. ์„œ๋น„์Šค ๊ตฌ์„ฑ ................................................................................ 67 2.1. INTRODUCTION...................................................................67 2.2 LEXICAL FEATURES ............................................................. 69 2.3 RESULTS................................................................................71 2.4 ONLINE DEMO.......................................................................73 3. Use Cases ............................................................................... 76 V. ๊ตฌ์กฐ์  ํŠน์ง•์„ ์ด์šฉํ•œ ์ž„์ƒ ์„œ์‹์˜ ํƒœ๊น… ..................................... 78 1. ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ.................................................................................... 78 2. ์—ฐ๊ตฌ ๋ชฉํ‘œ.................................................................................... 80 3. ์ž„์ƒ ์„œ์‹์˜ ํƒœ๊น…์„ ์œ„ํ•œ ์ง€์‹ ๋ชจ๋ธ ................................................... 80 3.1. ์˜จํ†จ๋กœ์ง€ ................................................................................ 80 3.2. ๊ฐœ๋… ๋ชจ๋ธ ............................................................................... 81 3.3. CDT ์˜จํ†จ๋กœ์ง€......................................................................... 85 4. CDT ์˜จํ†จ๋กœ์ง€๋ฅผ ์ด์šฉํ•œ ์ž„์ƒ์„œ์‹ ํƒœ๊น… ............................................. 90 5. ๊ฒฐ๋ก  .......................................................................................... 93 VI. ์ž„์ƒ ์„œ์‹ ์ง€์‹๋ฒ ์ด์Šค ๊ธฐ๋ฐ˜์˜ ์„œ์‹ ์ž‘์„ฑ ์ง€์› ์‹œ์Šคํ…œ ............... 94 1. ์‹œ์Šคํ…œ ์†Œ๊ฐœ ................................................................................ 94 2. ์‹œ์Šคํ…œ ๊ตฌ์„ฑ ................................................................................ 95 2.1. ์ง€์‹ ๋ฒ ์ด์Šค ๊ด€๋ฆฌ ๋ชจ๋“ˆ ............................................................... 96 2.2. ํ•ต์‹ฌ ๋ชจ๋“ˆ ............................................................................... 96 2.3. ์›น ์‚ฌ์šฉ์ž ์ธํ„ฐํŽ˜์ด์Šค .............................................................. 101 2.4. Web Services ์ธํ„ฐํŽ˜์ด์Šค ..................................................... 106 3. Use Case ...............................................................................108 4. ๊ฒฐ๋ก  ........................................................................................110 VII. ๊ฒฐ๋ก  .......................................................................................113 VIII. ์—ฐ๊ตฌ์˜ ์ œํ•œ์  ๋ฐ ์ œ์–ธ ...............................................................116 ์ฐธ๊ณ ๋ฌธํ—Œ .......................................................................................118 ๋ถ€๋ก ............................................................................................129 Abstract .....................................................................................133Docto

    Text Mining for Chemical Compounds

    Get PDF
    Exploring the chemical and biological space covered by patent and journal publications is crucial in early- stage medicinal chemistry activities. The analysis provides understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents and journals through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. In this book, we addressed the lack of quality measurements for assessing the correctness of structural representation within and across chemical databases; lack of resources to build text-mining systems; lack of high performance systems to extract chemical compounds from journals and patents; and lack of automated systems to identify relevant compounds in patents. The consistency and ambiguity of chemical identifiers was analyzed within and between small- molecule databases in Chapter 2 and Chapter 3. In Chapter 4 and Chapter 7 we developed resources to enable the construction of chemical text-mining systems. In Chapter 5 and Chapter 6, we used community challenges (BioCreative V and BioCreative VI) and their corresponding resources to identify mentions of chemical compounds in journal abstracts and patents. In Chapter 7 we used our findings in previous chapters to extract chemical named entities from patent full text and to classify the relevancy of chemical compounds

    Agile in-litero experiments:how can semi-automated information extraction from neuroscientific literature help neuroscience model building?

    Get PDF
    In neuroscience, as in many other scientific domains, the primary form of knowledge dissemination is through published articles in peer-reviewed journals. One challenge for modern neuroinformatics is to design methods to make the knowledge from the tremendous backlog of publications accessible for search, analysis and its integration into computational models. In this thesis, we introduce novel natural language processing (NLP) models and systems to mine the neuroscientific literature. In addition to in vivo, in vitro or in silico experiments, we coin the NLP methods developed in this thesis as in litero experiments, aiming at analyzing and making accessible the extended body of neuroscientific literature. In particular, we focus on two important neuroscientific entities: brain regions and neural cells. An integrated NLP model is designed to automatically extract brain region connectivity statements from very large corpora. This system is applied to a large corpus of 25M PubMed abstracts and 600K full-text articles. Central to this system is the creation of a searchable database of brain region connectivity statements, allowing neuroscientists to gain an overview of all brain regions connected to a given region of interest. More importantly, the database enables researcher to provide feedback on connectivity results and links back to the original article sentence to provide the relevant context. The database is evaluated by neuroanatomists on real connectomics tasks (targets of Nucleus Accumbens) and results in significant effort reduction in comparison to previous manual methods (from 1 week to 2h). Subsequently, we introduce neuroNER to identify, normalize and compare instances of identify neuronsneurons in the scientific literature. Our method relies on identifying and analyzing each of the domain features used to annotate a specific neuron mention, like the morphological term 'basket' or brain region 'hippocampus'. We apply our method to the same corpus of 25M PubMed abstracts and 600K full-text articles and find over 500K unique neuron type mentions. To demonstrate the utility of our approach, we also apply our method towards cross-comparing the NeuroLex and Human Brain Project (HBP) cell type ontologies. By decoupling a neuron mention's identity into its specific compositional features, our method can successfully identify specific neuron types even if they are not explicitly listed within a predefined neuron type lexicon, thus greatly facilitating cross-laboratory studies. In order to build such large databases, several tools and infrastructureslarge-scale NLP were developed: a robust pipeline to preprocess full-text PDF articles, as well as bluima, an NLP processing pipeline specialized on neuroscience to perform text-mining at PubMed scale. During the development of those two NLP systems, we acknowledged the need for novel NLP approaches to rapidly develop custom text mining solutions. This led to the formalization of the agile text miningagile text-mining methodology to improve the communication and collaboration between subject matter experts and text miners. Agile text mining is characterized by short development cycles, frequent tasks redefinition and continuous performance monitoring through integration tests. To support our approach, we developed Sherlok, an NLP framework designed for the development of agile text mining applications

    A framework for an adaptable and personalised e-learning system based on free web resources

    Get PDF
    An adaptable and personalised E-learning system (APELS) architecture is developed to provide a framework for the development of comprehensive learning environments for learners who cannot follow a conventional programme of study. The system extracts information from freely available resources on the Web taking into consideration the learners' background and requirements to design modules and a planner system to organise the extracted learning material to facilitate the learning process. The process is supported by the development of an ontology to optimise and support the information extraction process. Additionally, natural language processing techniques are utilised to evaluate a topic's content against a set of learning outcomes as defined by standard curricula. An application in the computer science field is used to illustrate the working mechanisms of the proposed framework and its evaluation based on the ACM/IEEE Computing Curriculum.A variety of models are developed and techniques used to support the adaptability and personalisation features of APELS. First, a learnerโ€™s model was designed by incorporating studentsโ€™ details, studentsโ€™ requirements and the domain they wish to study into the system. In addition, learning style theories were adopted as a way of identifying and categorising the individuals, to improve their on-line learning experience and applying it to the learnerโ€™s model. Secondly, the knowledge extraction model is responsible for the extraction of the learning resources from the Web that would satisfy the learnersโ€™ needs and learning outcomes. To support this process, an ontology was developed to retrieve the relevant information as per usersโ€™ needs. In addition, it transforms HTML documents to XHTML to provide the information in an accessible format and easier for extraction and comparison purposes. Moreover, a matching process was implemented to compute the similarity measure between the ontology concepts that are used in the ACM/IEEE Computer Science Curriculum and those extracted from the websites. The website with the highest similarity score is selected as the best matching website that satisfies the learnersโ€™ request. A further step is required to evaluate whether the content extracted by the system is the appropriate learning material of the subject. For this purpose, the learning outcome validation process is added to ensure that the content of the selected websites will enable the appropriate learning based to the learning outcomes set by standard curricula. Finally, the information extracted by the system will be passed to a Planner model that will structure the content into lectures, tutorials and workshops based on some predefined learning constraints. The APELS system provides a novel addition to the field of adaptive E-learning systems by providing more personalized learning material to each user in a time-efficient way saving his/her time looking for the right course from the hugely available resources on the Web or going through the large number of websites and links returned by traditional search engines. The APELS system will adapt better to the learnerโ€™s style based on feedback and assessment once the learning process is initiated by the learner. The APELS system is expected to develop over time with more users
    corecore