413 research outputs found
Learning and Long-Term Retention of Large-Scale Artificial Languages
Recovering discrete words from continuous speech is one of the first challenges facing language learners. Infants and adults can make use of the statistical structure of utterances to learn the forms of words from unsegmented input, suggesting that this ability may be useful for bootstrapping language-specific cues to segmentation. It is unknown, however, whether performance shown in small-scale laboratory demonstrations of “statistical learning” can scale up to allow learning of the lexicons of natural languages, which are orders of magnitude larger. Artificial language experiments with adults can be used to test whether the mechanisms of statistical learning are in principle scalable to larger lexicons. We report data from a large-scale learning experiment that demonstrates that adults can learn words from unsegmented input in much larger languages than previously documented and that they retain the words they learn for years. These results suggest that statistical word segmentation could be scalable to the challenges of lexical acquisition in natural language learning.National Science Foundation (U.S.) (NSF DDRIG #0746251
Reynolds Number Effects at High Angles of Attack
Lessons learned from comparisons between ground-based tests and flight measurements for the high-angle-of-attack programs on the F-18 High Alpha Research Vehicle (HARV), the X-29 forward-swept wing aircraft, and the X-31 enhanced fighter maneuverability aircraft are presented. On all three vehicles, Reynolds number effects were evident on the forebodies at high angles of attack. The correlation between flight and wind tunnel forebody pressure distributions for the F-18 HARV were improved by using twin longitudinal grit strips on the forebody of the wind-tunnel model. Pressure distributions obtained on the X-29 wind-tunnel model at flight Reynolds numbers showed excellent correlation with the flight data up to alpha = 50 deg. Above (alpha = 50 deg. the pressure distributions for both flight and wind tunnel became asymmetric and showed poorer agreement, possibly because of the different surface finish of the model and aircraft. The detrimental effect of a very sharp nose apex was demonstrated on the X-31 aircraft. Grit strips on the forebody of the X-31 reduced the randomness but increased the magnitude of the asymmetry. Nose strakes were required to reduce the forebody yawing moment asymmetries and the grit strips on the flight test noseboom improved the aircraft handling qualities
Using ESTs to improve the accuracy of de novo gene prediction
BACKGROUND: ESTs are a tremendous resource for determining the exon-intron structures of genes, but even extensive EST sequencing tends to leave many exons and genes untouched. Gene prediction systems based exclusively on EST alignments miss these exons and genes, leading to poor sensitivity. De novo gene prediction systems, which ignore ESTs in favor of genomic sequence, can predict such "untouched" exons, but they are less accurate when predicting exons to which ESTs align. TWINSCAN is the most accurate de novo gene finder available for nematodes and N-SCAN is the most accurate for mammals, as measured by exact CDS gene prediction and exact exon prediction. RESULTS: TWINSCAN_EST is a new system that successfully combines EST alignments with TWINSCAN. On the whole C. elegans genome TWINSCAN_EST shows 14% improvement in sensitivity and 13% in specificity in predicting exact gene structures compared to TWINSCAN without EST alignments. Not only are the structures revealed by EST alignments predicted correctly, but these also constrain the predictions without alignments, improving their accuracy. For the human genome, we used the same approach with N-SCAN, creating N-SCAN_EST. On the whole genome, N-SCAN_EST produced a 6% improvement in sensitivity and 1% in specificity of exact gene structure predictions compared to N-SCAN. CONCLUSION: TWINSCAN_EST and N-SCAN_EST are more accurate than TWINSCAN and N-SCAN, while retaining their ability to discover novel genes to which no ESTs align. Thus, we recommend using the EST versions of these programs to annotate any genome for which EST information is available. TWINSCAN_EST and N-SCAN_EST are part of the TWINSCAN open source software package
GeneWaltz--A new method for reducing the false positives of gene finding
<p>Abstract</p> <p>Background</p> <p>Identifying protein-coding regions in genomic sequences is an essential step in genome analysis. It is well known that the proportion of false positives among genes predicted by current methods is high, especially when the exons are short. These false positives are problematic because they waste time and resources of experimental studies.</p> <p>Methods</p> <p>We developed GeneWaltz, a new filtering method that reduces the risk of false positives in gene finding. GeneWaltz utilizes a codon-to-codon substitution matrix that was constructed by comparing protein-coding regions from orthologous gene pairs between mouse and human genomes. Using this matrix, a scoring scheme was developed; it assigned higher scores to coding regions and lower scores to non-coding regions. The regions with high scores were considered candidate coding regions. One-dimensional Karlin-Altschul statistics was used to test the significance of the coding regions identified by GeneWaltz.</p> <p>Results</p> <p>The proportion of false positives among genes predicted by GENSCAN and Twinscan were high, especially when the exons were short. GeneWaltz significantly reduced the ratio of false positives to all positives predicted by GENSCAN and Twinscan, especially when the exons were short.</p> <p>Conclusions</p> <p>GeneWaltz will be helpful in experimental genomic studies. GeneWaltz binaries and the matrix are available online at <url>http://en.sourceforge.jp/projects/genewaltz/</url>.</p
Suicide among adults aged 30–49: A psychological autopsy study in Hong Kong
<p>Abstract</p> <p>Background</p> <p>A surge in suicide rates in middle age people in Hong Kong and many Asian countries was recently observed. However, there is a paucity of suicide research on this subgroup of people in Asia.</p> <p>Methods</p> <p>The next-of-kin of 85 suicide cases and 85 community subjects aged 30–49 years were interviewed by a psychological autopsy approach. Information was triangulated by interview notes, coroner's court files, and police investigation reports.</p> <p>Results</p> <p>A multiple logistic regression analysis identified the following risk factors for suicide among the middle age people in Hong Kong: the presence of at least one psychiatric disorder (OR = 37.5, 95% CI 11.5–121.9, p < 0.001), indebtedness (OR = 9.4, 95% CI 2.2–40.8, p < 0.01), unemployment (OR = 4.8, 95% CI 1.3–17.5, p < 0.05), never married (OR = 4.2, 95% CI 1.1–16.3, p < 0.05), and lived alone (OR = 3.9, 95% CI 1.2–13.4, p < 0.05).</p> <p>Conclusion</p> <p>The data show that socio-economical factors had a strong impact on suicide in the target group. Further research is needed to explore any positive qualities that protect the middle-aged from suicide. The prevention of suicide in the middle-aged requires multiple strategies.</p
A PATO-compliant zebrafish screening database (MODB): management of morpholino knockdown screen information
<p>Abstract</p> <p>Background</p> <p>The zebrafish is a powerful model vertebrate amenable to high throughput <it>in vivo </it>genetic analyses. Examples include reverse genetic screens using morpholino knockdown, expression-based screening using enhancer trapping and forward genetic screening using transposon insertional mutagenesis. We have created a database to facilitate web-based distribution of data from such genetic studies.</p> <p>Description</p> <p>The MOrpholino DataBase is a MySQL relational database with an online, PHP interface. Multiple quality control levels allow differential access to data in raw and finished formats. MODBv1 includes sequence information relating to almost 800 morpholinos and their targets and phenotypic data regarding the dose effect of each morpholino (mortality, toxicity and defects). To improve the searchability of this database, we have incorporated a fixed-vocabulary defect ontology that allows for the organization of morpholino affects based on anatomical structure affected and defect produced. This also allows comparison between species utilizing Phenotypic Attribute Trait Ontology (PATO) designated terminology. MODB is also cross-linked with ZFIN, allowing full searches between the two databases. MODB offers users the ability to retrieve morpholino data by sequence of morpholino or target, name of target, anatomical structure affected and defect produced.</p> <p>Conclusion</p> <p>MODB data can be used for functional genomic analysis of morpholino design to maximize efficacy and minimize toxicity. MODB also serves as a template for future sequence-based functional genetic screen databases, and it is currently being used as a model for the creation of a mutagenic insertional transposon database.</p
Genome Majority Vote Improves Gene Predictions
Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps
Combining regenerative medicine strategies to provide durable reconstructive options: auricular cartilage tissue engineering
Recent advances in regenerative medicine place us in a unique position to improve the quality of engineered tissue. We use auricular cartilage as an exemplar to illustrate how the use of tissue-specific adult stem cells, assembly through additive manufacturing and improved understanding of postnatal tissue maturation will allow us to more accurately replicate native tissue anisotropy. This review highlights the limitations of autologous auricular reconstruction, including donor site morbidity, technical considerations and long-term complications. Current tissue-engineered auricular constructs implanted into immune-competent animal models have been observed to undergo inflammation, fibrosis, foreign body reaction, calcification and degradation. Combining biomimetic regenerative medicine strategies will allow us to improve tissue-engineered auricular cartilage with respect to biochemical composition and functionality, as well as microstructural organization and overall shape. Creating functional and durable tissue has the potential to shift the paradigm in reconstructive surgery by obviating the need for donor sites
“Genes”
In order to describe a cell at molecular level, a notion of a “gene” is neither necessary nor helpful. It is sufficient to consider the molecules (i.e., chromosomes, transcripts, proteins) and their interactions to describe cellular processes. The downside of the resulting high resolution is that it becomes very tedious to address features on the organismal and phenotypic levels with a language based on molecular terms. Looking for the missing link between biological disciplines dealing with different levels of biological organization, we suggest to return to the original intent behind the term “gene”. To this end, we propose to investigate whether a useful notion of “gene” can be constructed based on an underlying notion of function, and whether this can serve as the necessary link and embed the various distinct gene concepts of biological (sub)disciplines in a coherent theoretical framework. In reply to the Genon Theory recently put forward by Klaus Scherrer and Jürgen Jost in this journal, we shall discuss a general approach to assess a gene definition that should then be tested for its expressiveness and potential cross-disciplinary relevance
- …