Search CORE

74 research outputs found

MeSH indexing based on automatically generated summaries

Author: Alan R Aronson
Alberto Díaz
Antonio J Jimeno-Yepes
James G Mork
Laura Plaza
Publication venue: Springer Nature
Publication date: 01/01/2013

BACKGROUND: MEDLINE citations are manually indexed at the U.S. National Library of Medicine (NLM) using as reference the Medical Subject Headings (MeSH) controlled vocabulary. For this task, the human indexers read the full text of the article. Due to the growth of MEDLINE, the NLM Indexing Initiative explores indexing methodologies that can support the task of the indexers. Medical Text Indexer (MTI) is a tool developed by the NLM Indexing Initiative to provide MeSH indexing recommendations to indexers. Currently, the input to MTI is MEDLINE citations, title and abstract only. Previous work has shown that using full text as input to MTI increases recall, but decreases precision sharply. We propose using summaries generated automatically from the full text for the input to MTI to use in the task of suggesting MeSH headings to indexers. Summaries distill the most salient information from the full text, which might increase the coverage of automatic indexing approaches based on MEDLINE. We hypothesize that if the results were good enough, manual indexers could possibly use automatic summaries instead of the full texts, along with the recommendations of MTI, to speed up the process while maintaining high quality of indexing results.
RESULTS: We have generated summaries of different lengths using two different summarizers, and evaluated the MTI indexing on the summaries using different algorithms: MTI, individual MTI components, and machine learning. The results are compared to those of full text articles and MEDLINE citations. Our results show that automatically generated summaries achieve similar recall but higher precision compared to full text articles. Compared to MEDLINE citations, summaries achieve higher recall but lower precision.
CONCLUSIONS: Our results show that automatic summaries produce better indexing than full text articles. Summaries produce similar recall to full text but much better precision, which seems to indicate that automatic summaries can efficiently capture the most important contents within the original articles. The combination of MEDLINE citations and automatically generated summaries could improve the recommendations suggested by MTI. On the other hand, indexing performance might be dependent on the MeSH heading being indexed. Summarization techniques could thus be considered as a feature selection algorithm that might have to be tuned individually for each MeSH heading.

Springer - Publisher Connector

PubMed Central

The road from manual to automatic semantic indexing of biomedical literature: a 10 years journey

Author: Anastasia Krithara
Anastasios Nentidis
Georgios Paliouras
James G. Mork
Publication venue: Frontiers Media S.A.
Publication date: 01/09/2023

Biomedical experts are facing challenges in keeping up with the vast amount of biomedical knowledge published daily. With millions of citations added to databases like MEDLINE/PubMed each year, efficiently accessing relevant information becomes crucial. Traditional term-based searches may lead to irrelevant or missed documents due to homonyms, synonyms, abbreviations, or term mismatch. To address this, semantic search approaches employing predefined concepts with associated synonyms and relations have been used to expand query terms and improve information retrieval. The National Library of Medicine (NLM) plays a significant role in this area, indexing citations in the MEDLINE database with topic descriptors from the Medical Subject Headings (MeSH) thesaurus, enabling advanced semantic search strategies to retrieve relevant citations despite the synonymy and polysemy of biomedical terms. Over time, advancements in semantic indexing have been made, with Machine Learning facilitating the transition from manual to automatic semantic indexing in the biomedical literature. The paper highlights the journey of this transition, starting with manual semantic indexing and the initial efforts toward automatic indexing. The BioASQ challenge has served as a catalyst in revolutionizing the domain of semantic indexing, further pushing the boundaries of efficient knowledge retrieval in the biomedical field.

Directory of Open Access Journals

Feature engineering for MEDLINE citation categorization with MeSH

Author: A Jimeno-Yepes
Alan R Aronson
Antonio Jose Jimeno Yepes
AR Aronson
AR Aronson
C Apte
CM Tan
DD Lewis
F Sebastiani
James G Mork
Jorge Carrillo-de-Albornoz
JR Herskovic
L Plaza
L Smith
Laura Plaza
O Bodenreider
P Ruch
S Sohn
WW Cohen
Publication venue: Springer Science and Business Media LLC

Crossref

Field Demonstration of Carbon Dioxide Miscible Flooding in the Lansing-Kansas City Formation, Central Kansas

Author: Byrnes Alan
Cantrell Paul
Daniels James
Doveton John
Flanders William
Green Don
Griend Dave Vander
Guy Willard
Martin Russell
Mork Eric
Murfin Dave
Pancake Richard
Reynolds Rodney
Tsau JyunSyung
Watney W. Lynn
Willhite G. Paul
Publication venue: Office of Scientific and Technical Information (OSTI)
Publication date: 07/03/2010

A pilot carbon dioxide miscible flood was initiated in the Lansing-Kansas City C formation in the Hall-Gurney Field, Russell County, Kansas. The reservoir zone is an oomoldic carbonate located at a depth of about 2900 feet. The pilot consists of one carbon dioxide injection well and three production wells. Continuous carbon dioxide injection began on December 2, 2003. By the end of June 2005, 16.19 MM lb of carbon dioxide had been injected into the pilot area. Injection was converted to water on June 21, 2005 to reduce operating costs to a breakeven level, with the expectation that sufficient carbon dioxide had been injected to displace the oil bank to the production wells by water injection. By March 7, 2010, 8,736 bbl of oil had been produced from the pilot. Production from wells to the northwest of the pilot region indicates that oil displaced by carbon dioxide injection was produced from Colliver A7, Colliver A3, Colliver A14 and Graham A4, located on adjacent leases. About 19,166 bbl of incremental oil were estimated to have been produced from these wells as of March 7, 2010. There is evidence of a directional permeability trend toward the NW through the pilot region. The majority of the injected carbon dioxide remains in the pilot region, which has been maintained at a pressure at or above the minimum miscibility pressure. Estimated oil recovery attributed to the CO2 flood is 27,902 bbl, which is equivalent to a gross CO2 utilization of 4.8 MCF/bbl. The pilot project is not economic.

Crossref

UNT Digital Library

Political Institutions and Government Spending Behavior: Theory and Evidence from Iran

Author: A Naka
Al Alfoneh
Alireza Naghavi
C A Sims
C A Sims
C A Sims
D A Dickey
D Acemoglu
D Brown
D Brown
D L Hoffman
D Rodrik
G Saint-Paul
J A Tijerina-Guajardo
J D Hamilton
J H Lebovic
J H Stock
J R Oneal
J Yildirim
K A Mork
K A Mork
K Hausken
M G Marshall
M H Berument
M H Pesaran
M Olson
M P Clements
M R Farzanegan
M R Farzanegan
M R Farzanegan
Mohammad Reza Farzanegan
P James
R F Engle
R R Kaufman
R Wintrobe
R Wintrobe
S F Dizaji
S Johansen
Sajjad Faraji Dizaji
T Besley
T Doan
T Plümper
T S Aidt
T S Aidt
T Vanhanen
W A Fuller
Publication venue: Elsevier BV
Publication date: 01/01/2015

Crossref

FIELD DEMONSTRATION OF CARBON DIOXIDE MISCIBLE FLOODING IN THE LANSING-KANSAS CITY FORMATION, CENTRAL KANSAS

Author: Byrnes Alan
Cantrell Paul
Carr Timothy
Daniels James
Doveton John
Dubois Martin
Flanders William
Green Don
Griend Dave Vander
Guy Willard
Martin Russell
Mork Eric
Murfin Dave
Pancake Richard
Reynolds Rodney
Watney W. Lynn
Willhite G. Paul
Publication venue: Office of Scientific and Technical Information (OSTI)
Publication date: 31/12/2004

A pilot carbon dioxide miscible flood was initiated in the Lansing-Kansas City C formation in the Hall-Gurney Field, Russell County, Kansas. Continuous carbon dioxide injection began on December 2, 2003. By the end of December 2004, 11.39 MM lb of carbon dioxide had been injected into the pilot area. Carbon dioxide injection rates averaged about 242 MCFD. Vent losses were excessive during June as ambient temperatures increased. Installation of smaller plungers in the carbon dioxide injection pump reduced the recycle and vent loss substantially. Carbon dioxide was detected in one production well near the end of May and in the second production well in August. No channeling of carbon dioxide was observed. The GOR has remained within the range of 3000-4000 for most of the last six months. Wells in the pilot area produced 100% water at the beginning of the flood. Oil production began in February, increasing to an average of about 2.35 B/D for the six-month period between July 1 and December 31. Cumulative oil production was 814 bbls. Neither well has experienced the increased oil production rates expected from the arrival of the oil bank generated by carbon dioxide injection.

UNT Digital Library

Field Demonstration of Carbon Dioxide Miscible Flooding in the Lansing-Kansas City Formation, Central Kansas

Author: Avison Niall
Byrnes Alan
Cantrell Paul
Carr Timothy
Daniels James
Doveton John
Dubois Martin
Flanders William
Green Don
Griend Dave Vander
Guy Willard
Kunjithaya Rajesh
Martin Russell
Mork Eric
Murfin Dave
Pancake Richard
Reynolds Rodney
Watney W. Lynn
Willhite G. Paul
Publication venue: Office of Scientific and Technical Information (OSTI)
Publication date: 31/12/2001

Progress is reported for the period from January 1, 2002 to March 31, 2002. Technical design and budget for a larger (60-acre, 24.3 ha) CO2 demonstration project are being reviewed by the US DOE for approval. While this review process is being conducted, work is proceeding on well testing to obtain reservoir properties and on the VIP reservoir simulation model to improve model prediction and better understand the controls that certain parameters exert on predicted performance. In addition, evaluation of the economics of commercial application in the surrounding area was performed. In a meeting on January 14, 2002 the possibility of staging the demonstration, starting with a 10-acre sub-pattern flood was raised and the decision made to investigate this plan in detail. The influence of carbon dioxide on oil properties and the influence of binary interaction parameters (BIP) used in the VIP simulator were investigated. VIP calculated swelling factors are in good agreement with published values up to 65% mole-fraction CO2. Swelling factor and saturated liquid density are relatively independent of the BIP over the range of BIPs used (0.08-0.15) up to 65% mole-fraction CO2. Assuming a CO2 EOR recovery rate projected as being most likely by current modeling, commercial scale CO2 flooding at $20/BO is possible in the leases in Hall-Gurney field. Relatively small floods (240-320 acres, 4-6 patterns) are economically viable at $20/BO in areas of very high primary and secondary productivity (>14 MBO/net acre recovery). Leases with moderately high primary and secondary productivity (>10 MBO/net acre recovery) can be economic when combined with high productivity leases to form larger floods (>640 acres, 9 or more patterns).

UNT Digital Library

HPV-Related Nonkeratinizing Squamous Cell Carcinoma of the Oropharynx: Utility of Microscopic Features in Predicting Patient Outcome

Author: AR Kreimer
CH Lenselink
Curtis A. Parvin
E Soriano
G D’Souza
H Maier
H Mellin
IB Paz
J Mork
J Piccirillo
James S. Lewis
JKC Chan
JM Boudewijn
JM Ritchie
JP Klussmann
K Lindel
K Strati
KM Applebaum
KR Dahlstrom
L Dahlgren
L Hammarstedt
L Licitra
M Hoffman
ML Gillison
PM Weinberger
Rebecca D. Chernock
S Begum
S Syrjanen
Samir K. El-Mofty
SH Kim
SK El-Mofty
SK El-Mofty
SK El-Mofty
SK El-Mofty
SL Wain
SP Wilczynski
T Sano
Wade L. Thorstad
WC Reeves
Publication venue: Humana Press Inc
Publication date: 01/01/2009

Human papilloma virus (HPV) is an etiologic agent in a subset of oropharyngeal squamous cell carcinomas (SCCs). The aim of this study was to sub-classify SCC of the oropharynx based upon histologic features into nonkeratinizing (NK) SCC, keratinizing (K) SCC, and hybrid SCC, and determine the frequency of HPV and patient survival in each group. Patients with oropharyngeal SCC with a minimum of 2 years of clinical follow-up were identified from radiation oncology databases from 1997 to 2004. All patients received either upfront surgery with postoperative radiation or definitive radiation-based therapy. In situ hybridization (ISH) for high-risk HPV subtypes and immunohistochemistry for p16, a protein frequently up-regulated in HPV-associated carcinomas, were performed. Overall and disease-specific survival were assessed. Of 118 cases, 46.6% were NK SCC, 24.6% K SCC and 28.8% hybrid SCC. NK SCC occurred in slightly younger patients who were more often male. Compared to K SCC, it more frequently presented with lymph node metastases and was more often surgically resected. NK SCC was significantly more likely to be HPV and p16 positive than K SCC (P < 0.001) and to have better overall and disease-specific survival (P = 0.0002; P = 0.0142, respectively). Hybrid SCC was also more likely than K SCC to be HPV and p16 positive (P = 0.003; P = 0.002, respectively) and to have better overall survival (P = 0.0105). Sub-classification of oropharyngeal SCC by histologic type provides useful clinical information. NK SCC histology strongly predicts HPV-association and better patient survival compared to K SCC. Hybrid SCC appears to have an intermediate frequency of HPV-association and patient survival.

Crossref

Springer - Publisher Connector

PubMed Central

Political institutions and government spending behavior: theory and evidence from Iran

Author: A Alfoneh
A Naka
Alireza Naghavi
BE Goldsmith
BO Fordham
CA Sims
CA Sims
CA Sims
D Acemoglu
D Acemoglu
D Acemoglu
D Brown
D Hewitt
D Rodrik
DA Dickey
DL Hoffman
G Palmer
G Saint-Paul
J Falkinger
J Yildirim
JA Tijerina-Guajardo
JD Hamilton
JH Lebovic
JH Stock
JR Oneal
K Hausken
KA Mork
KA Mork
M Olson
M Russett
MH Berument
MH Pesaran
Mohammad Reza Farzanegan
MP Clements
MR Farzanegan
MR Farzanegan
MR Farzanegan
MS Kimenyi
P James
R Wintrobe
R Wintrobe
RF Engle
S Deger
S Johansen
Sajjad F. Dizaji
SF Dizaji
T Besley
T Besley
T Besley
T Doan
T Plümper
TS Aidt
TS Aidt
WA Fuller
Publication venue: Springer Science and Business Media LLC

Crossref

Automatic Indexing of Specialized Documents: Using Generic vs. Domain-Specific Document Representations

Author: Aurélie Névéol
James G. Mork

The shift from paper to electronic documents has caused the curation of information sources in large electronic databases to become more generalized. In the biomedical domain, continuing efforts aim at refining indexing tools to assist with the update and maintenance of databases such as MEDLINE®. In this paper, we evaluate two statistical methods of producing MeSH® indexing recommendations for the genetics literature, including recommendations involving subheadings, which is a novel application for the methods. We show that a generic representation of the documents yields both better precision and recall. We also find that a domain-specific representation of the documents can contribute to enhancing recall.

CiteSeerX