Automation of a problem list using natural language processing
BACKGROUND: The medical problem list is an important part of the electronic medical record in development in our institution. To serve the functions it is designed for, the problem list has to be as accurate and timely as possible. However, the current problem list is usually incomplete and inaccurate, and is often totally unused. To alleviate this issue, we are building an environment where the problem list can be easily and effectively maintained. METHODS: For this project, 80 medical problems were selected for their frequency of use in our future clinical field of evaluation (cardiovascular). We have developed an Automated Problem List system composed of two main components: a background and a foreground application. The background application uses Natural Language Processing (NLP) to harvest potential problem list entries from the list of 80 targeted problems detected in the multiple free-text electronic documents available in our electronic medical record. These proposed medical problems drive the foreground application designed for management of the problem list. Within this application, the extracted problems are proposed to the physicians for addition to the official problem list. RESULTS: The set of 80 targeted medical problems selected for this project covered about 5% of all possible diagnoses coded in ICD-9-CM in our study population (cardiovascular adult inpatients), but about 64% of all instances of these coded diagnoses. The system contains algorithms to detect first document sections, then sentences within these sections, and finally potential problems within the sentences. The initial evaluation of the section and sentence detection algorithms demonstrated a sensitivity and positive predictive value of 100% when detecting sections, and a sensitivity of 89% and a positive predictive value of 94% when detecting sentences. 
CONCLUSION: The global aim of our project is to automate the process of creating and maintaining a problem list for hospitalized patients and thereby help to guarantee the timeliness, accuracy and completeness of this information.
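The sensitivity and positive predictive value reported above follow the standard definitions, which can be sketched as below. This is a minimal illustration, not the authors' evaluation code; the function names and the counts in the usage note are hypothetical and are not taken from the study.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of real items (e.g. true sentences) that the detector found."""
    return true_pos / (true_pos + false_neg)


def positive_predictive_value(true_pos: int, false_pos: int) -> float:
    """Fraction of detected items that were real."""
    return true_pos / (true_pos + false_pos)
```

For example, with hypothetical counts of 90 true positives, 10 false negatives and 10 false positives, both functions return 0.9.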
A UMLS-based spell checker for natural language processing in vaccine safety
BACKGROUND: The Institute of Medicine has identified patient safety as a key goal for health care in the United States. Detecting vaccine adverse events is an important public health activity that contributes to patient safety. Reports about adverse events following immunization (AEFI) from surveillance systems contain free-text components that can be analyzed using natural language processing. To extract Unified Medical Language System (UMLS) concepts from free text and classify AEFI reports based on concepts they contain, we first needed to clean the text by expanding abbreviations and shortcuts and correcting spelling errors. Our objective in this paper was to create a UMLS-based spelling error correction tool as a first step in the natural language processing (NLP) pipeline for AEFI reports. METHODS: We developed spell checking algorithms using open source tools. We used de-identified AEFI surveillance reports to create free-text data sets for analysis. After expansion of abbreviated clinical terms and shortcuts, we performed spelling correction in four steps: (1) error detection, (2) word list generation, (3) word list disambiguation and (4) error correction. We then measured the performance of the resulting spell checker by comparing it to manual correction. RESULTS: We used 12,056 words to train the spell checker and tested its performance on 8,131 words. During testing, sensitivity, specificity, and positive predictive value (PPV) for the spell checker were 74% (95% CI: 74–75), 100% (95% CI: 100–100), and 47% (95% CI: 46%–48%), respectively. CONCLUSION: We created a prototype spell checker that can be used to process AEFI reports. We used the UMLS Specialist Lexicon as the primary source of dictionary terms and the WordNet lexicon as a secondary source. We used the UMLS as a domain-specific source of dictionary terms to compare potentially misspelled words in the corpus. 
The prototype's sensitivity was comparable to that of currently available tools, but its specificity was much higher. The slow processing speed may be improved by trimming the tool down to its most useful component algorithms. Other investigators may find the methods we developed useful for cleaning text using lexicons specific to their area of interest.
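The four correction steps named in the abstract (error detection, word list generation, word list disambiguation, error correction) can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' tool: the tiny `LEXICON` set stands in for the UMLS Specialist Lexicon and WordNet, candidate generation uses Python's `difflib` string similarity, and disambiguation simply takes the closest match. All function names are hypothetical.

```python
import difflib
import re

# Hypothetical toy lexicon; the paper used the UMLS Specialist Lexicon and WordNet.
LEXICON = {"patient", "developed", "fever", "and", "rash",
           "after", "vaccination", "swelling"}


def detect_errors(tokens, lexicon):
    """Step 1: flag tokens that do not appear in the lexicon."""
    return [t for t in tokens if t not in lexicon]


def generate_candidates(word, lexicon, n=3):
    """Step 2: build a ranked list of similar lexicon words."""
    return difflib.get_close_matches(word, sorted(lexicon), n=n, cutoff=0.7)


def disambiguate(candidates):
    """Step 3: choose one candidate; here simply the closest match, if any."""
    return candidates[0] if candidates else None


def correct(text, lexicon=LEXICON):
    """Step 4: replace each detected error with its chosen candidate."""
    tokens = re.findall(r"[a-z]+", text.lower())
    errors = set(detect_errors(tokens, lexicon))
    corrected = []
    for token in tokens:
        choice = disambiguate(generate_candidates(token, lexicon)) if token in errors else None
        # Keep the original token when it is correct or no candidate is close enough.
        corrected.append(choice if choice is not None else token)
    return " ".join(corrected)
```

Calling `correct("Patient developed feever and rash after vacination")` replaces the two misspelled words with their nearest lexicon entries; a token with no sufficiently similar entry is left unchanged. A real system would need context-aware disambiguation rather than the closest-match shortcut used here.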
Double Diffraction Dissociation at the Fermilab Tevatron Collider
We present results from a measurement of double diffraction dissociation in
proton-antiproton collisions at the Fermilab Tevatron collider. The production cross
section for events with a central pseudorapidity gap of width
(overlapping ) is found to be [] at [630]
GeV. Our results are compared with previous measurements and with predictions
based on Regge theory and factorization. Comment: 10 pages, 4 figures, using RevTeX. Submitted to Physical Review
Letters
Search for a Technicolor omega_T Particle in Events with a Photon and a b-quark Jet at CDF
If the Technicolor omega_T particle exists, a likely decay mode is omega_T ->
gamma pi_T, followed by pi_T -> bb-bar, yielding the signature gamma bb-bar. We
have searched 85 pb^-1 of data collected by the CDF experiment at the Fermilab
Tevatron for events with a photon and two jets, where one of the jets must
contain a secondary vertex implying the presence of a b quark. We find no
excess of events above standard model expectations. We express the result as an
exclusion region in the M_omega_T - M_pi_T mass plane. Comment: 14 pages, 2 figures. Available from the CDF server (PS with figs):
http://www-cdf.fnal.gov/physics/pub98/cdf4674_omega_t_prl_4.ps
FERMILAB-PUB-98/321-
Measurement of the B0 anti-B0 oscillation frequency using l- D*+ pairs and lepton flavor tags
The oscillation frequency Delta-md of B0 anti-B0 mixing is measured using the
partially reconstructed semileptonic decay anti-B0 -> l- nubar D*+ X. The data
sample was collected with the CDF detector at the Fermilab Tevatron collider
during 1992 - 1995 by triggering on the existence of two lepton candidates in
an event, and corresponds to about 110 pb^-1 of pbar p collisions at sqrt(s) =
1.8 TeV. We estimate the proper decay time of the anti-B0 meson from the
measured decay length and reconstructed momentum of the l- D*+ system. The
charge of the lepton in the final state identifies the flavor of the anti-B0
meson at its decay. The second lepton in the event is used to infer the flavor
of the anti-B0 meson at production. We measure the oscillation frequency to be
Delta-md = 0.516 +/- 0.099 +0.029 -0.035 ps^-1, where the first uncertainty is
statistical and the second is systematic. Comment: 30 pages, 7 figures. Submitted to Physical Review
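The relation between decay length, momentum and proper decay time mentioned in the abstract, and the way an oscillation frequency turns that time into a mixing probability, can be sketched as below. This is a textbook illustration, not the analysis code: it assumes a fully reconstructed momentum (the paper's partially reconstructed decay would need a correction for missing particles, which is not modelled here), a B0 mass of about 5.2797 GeV/c^2, and it ignores detector resolution and flavor mistagging. The function names are hypothetical.

```python
import math

C_CM_PER_PS = 0.0299792458  # speed of light in cm per picosecond
B0_MASS_GEV = 5.2797        # assumed B0 mass in GeV/c^2


def proper_decay_time_ps(decay_length_cm, momentum_gev, mass_gev=B0_MASS_GEV):
    """Proper time t = L * m / (p * c), with L in cm and p in GeV/c."""
    return decay_length_cm * mass_gev / (momentum_gev * C_CM_PER_PS)


def mixed_fraction(t_ps, delta_m_per_ps=0.516):
    """Probability that a meson decaying at proper time t has oscillated."""
    return 0.5 * (1.0 - math.cos(delta_m_per_ps * t_ps))
```

The default `delta_m_per_ps` is the central value quoted in the abstract; a decay length of 0.1 cm at a momentum of 10 GeV/c corresponds to a proper time of roughly 1.76 ps under these assumptions.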
Stringency of the 2-His–1-Asp Active-Site Motif in Prolyl 4-Hydroxylase
The non-heme iron(II) dioxygenase family of enzymes contains a common 2-His–1-carboxylate iron-binding motif. These enzymes catalyze a wide variety of oxidative reactions, such as the hydroxylation of aliphatic C–H bonds. Prolyl 4-hydroxylase (P4H) is an α-ketoglutarate-dependent iron(II) dioxygenase that catalyzes the post-translational hydroxylation of proline residues in protocollagen strands, stabilizing the ensuing triple helix. Human P4H residues His412, Asp414, and His483 have been identified as an iron-coordinating 2-His–1-carboxylate motif. Enzymes that catalyze oxidative halogenation do so by a mechanism similar to that of P4H. These halogenases retain the active-site histidine residues, but the carboxylate ligand is replaced with a halide ion. We replaced Asp414 of P4H with alanine (to mimic the active site of a halogenase) and with glycine. These substitutions do not, however, convert P4H into a halogenase. Moreover, the hydroxylase activity of D414A P4H cannot be rescued with small molecules. In addition, rearranging the two His and one Asp residues in the active site eliminates hydroxylase activity. Our results demonstrate a high stringency for the iron-binding residues in the P4H active site. We conclude that P4H, which catalyzes an especially demanding chemical transformation, is recalcitrant to change.
Search for Gluinos and Scalar Quarks in p-pbar Collisions at sqrt(s) = 1.8 TeV using the Missing Energy plus Multijets Signature
We have performed a search for gluinos and squarks in a data
sample of 84 pb^-1 of p-pbar collisions at sqrt(s) = 1.8 TeV, recorded by
the Collider Detector at Fermilab, by investigating the final state of large
missing transverse energy and 3 or more jets, a characteristic signature in
R-parity-conserving supersymmetric models. The analysis has been performed
`blind', in that the inspection of the signal region is made only after the
predictions from Standard Model backgrounds have been calculated. Comparing the
data with predictions of constrained supersymmetric models, we exclude gluino
masses below 195 GeV (95% C.L.), independent of the squark mass. For the case
m_squark ≈ m_gluino, gluino masses below 300 GeV are excluded. Comment: 7 pages, 3 figures
A Measurement of the Differential Dijet Mass Cross Section in p-pbar Collisions at sqrt{s}=1.8 TeV
We present a measurement of the cross section for production of two or more
jets as a function of dijet mass, based on an integrated luminosity of 86 pb^-1
collected with the Collider Detector at Fermilab. Our dijet mass spectrum is
described within errors by next-to-leading order QCD predictions using CTEQ4HJ
parton distributions, and is in good agreement with a similar measurement from
the D0 experiment. Comment: 18 pages including 2 figures and 3 tables. Submitted to Phys. Rev. D
Rapid Communications
Search for New Particles Decaying to top-antitop in proton-antiproton collisions at sqrt(s) = 1.8 TeV
We use 106 pb^-1 of data collected with the Collider Detector at Fermilab to
search for narrow-width, vector particles decaying to a top and an anti-top
quark. Model independent upper limits on the cross section for narrow, vector
resonances decaying to t-tbar are presented. At the 95% confidence level, we
exclude the existence of a leptophobic Z' boson in a model of
topcolor-assisted technicolor with mass M_{Z'} < 480 GeV for natural
width Gamma = 0.012 M_{Z'}, and M_{Z'} < 780 GeV for Gamma =
0.04 M_{Z'}. Comment: The CDF Collaboration, submitted to PRL 25-Feb-200
Diffractive Dijet Production at sqrt(s)=630 and 1800 GeV at the Fermilab Tevatron
We report a measurement of the diffractive structure function of
the antiproton obtained from a study of dijet events produced in association
with a leading antiproton in proton-antiproton collisions at GeV at the
Fermilab Tevatron. The ratio of at GeV to
obtained from a similar measurement at GeV is compared with
expectations from QCD factorization and with theoretical predictions. We also
report a measurement of the (x-Pomeron) and (x of parton in
Pomeron) dependence of at GeV. In the region
, GeV and , is
found to be of the form , which obeys
- factorization. Comment: LaTeX, 9 pages, Submitted to Phys. Rev. Letters