Search CORE

12 research outputs found

NERO: a biomedical named-entity (recognition) ontology with a large, annotated corpus reveals meaningful associations through text embedding.

Author: Alachram Halima
Ambite José Luis
Ananiadou Sophia
Beißbarth Tim
Chambers Brendan
Christopoulou Fenia
Evans James A
Galstyan Aram
Gao Xin
Garg Sahil
Hermjakob Ulf
Khomtchouk Bohdan B
King Ross
Li Maolin
Li Yu
Marcu Daniel
Matthew Joel
Pan Weidi
Rzhetsky Andrey
Schoene Annika M
Sheng Emily
Soldatova Larisa
Stevens Robert
Wang Kanix
Wingender Edgar
Publication venue: NPJ Syst Biol Appl
Publication date: 01/01/2021
Field of study

Machine reading (MR) is essential for unlocking valuable knowledge contained in millions of existing biomedical documents. Over the last two decades1,2, the most dramatic advances in MR have followed in the wake of critical corpus development3. Large, well-annotated corpora have been associated with punctuated advances in MR methodology and automated knowledge extraction systems in the same way that ImageNet4 was fundamental for developing machine vision techniques. This study contributes six components to an advanced, named entity analysis tool for biomedicine: (a) a new, Named Entity Recognition Ontology (NERO) developed specifically for describing textual entities in biomedical texts, which accounts for diverse levels of ambiguity, bridging the scientific sublanguages of molecular biology, genetics, biochemistry, and medicine; (b) detailed guidelines for human experts annotating hundreds of named entity classes; (c) pictographs for all named entities, to simplify the burden of annotation for curators; (d) an original, annotated corpus comprising 35,865 sentences, which encapsulate 190,679 named entities and 43,438 events connecting two or more entities; (e) validated, off-the-shelf, named entity recognition (NER) automated extraction, and; (f) embedding models that demonstrate the promise of biomedical associations embedded within this corpus

Goldsmiths Research Online

Directory of Open Access Journals

Chalmers Research

Apollo (Cambridge)

A virtual environment for ultrasound examination learning

Author: Dardenne G
Driscoll J
Gillies DF
Jensen JA
Mantke R
Ourahmoune A
Peters TM
Petrinec K
Sclaverano S
Stallkamp J
Sun B
Yipeng H Eli G, Li-Lin L, Weidi X, Barratt C, Vercauteren T, Alison Noble J.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Recommended from our members

NERO: A biomedical named-entity (recognition) ontology with a large, annotated corpus reveals meaningful associations through text embedding

Author: Alachram Halima
Ambite José Luis
Ananiadou Sophia
Beißbarth Tim
Chambers Brendan
Christopoulou Fenia
Evans James A.
Galstyan Aram
Gao Xin
Garg Sahil
Hermjakob Ulf
Khomtchouk Bohdan B.
King Ross
Li Maolin
Li Yu
Marcu Daniel
Matthew Joel
Pan Weidi
Rzhetsky Andrey
Schoene Annika M.
Sheng Emily
Soldatova Larisa
Stevens Robert
Wang Kanix
Wingender Edgar
Publication venue
Publication date: 24/08/2023
Field of study

Machine reading (MR) is essential for unlocking valuable knowledge contained in millions of existing biomedical documents. Over the last two decades, the most dramatic advances in MR have followed in the wake of critical corpus development. Large, well-annotated corpora have been associated with punctuated advances in MR methodology and automated knowledge extraction systems in the same way that ImageNet4 was fundamental for developing machine vision techniques. This study contributes six components to an advanced, named entity analysis tool for biomedicine: (a) a new, Named Entity Recognition Ontology (NERO) developed specifically for describing textual entities in biomedical texts, which accounts for diverse levels of ambiguity, bridging the scientific sublanguages of molecular biology, genetics, biochemistry, and medicine; (b) detailed guidelines for human experts annotating hundreds of named entity classes; (c) pictographs for all named entities, to simplify the burden of annotation for curators; (d) an original, annotated corpus comprising 35,865 sentences, which encapsulate 190,679 named entities and 43,438 events connecting two or more entities; (e) validated, off-the-shelf, named entity recognition (NER) automated extraction, and; (f) embedding models that demonstrate the promise of biomedical associations embedded within this corpus

Knowledge UChicago