357 research outputs found
Are Household Surveys Like Tax Forms: Evidence from Income Underreporting of the Self Employed
There is a large literature showing that the self employed underreport their income to tax authorities. In this paper, we quantify the extent to which the self employed systematically underreport their income to U.S. household surveys. To do so, we use the Engel curve describing the relationship between income and expenditures of wage and salary workers to infer the actual income, and thus the reporting gap, of the self employed based on their reported expenditures. We find that the self employed underreport their income by about 30 percent. This result is remarkably robust across data sources and alternative model specifications. Aside from transportation expenditures, we find little evidence that the self employed misreport their expenditures to household surveys. We show that failing to account for such income underreporting leads to biased conclusions when comparing the earnings and saving behavior between the self employed and other workers as well as biased estimates of the importance of precautionary savings, the shape of lifecycle earnings profiles, and the magnitude of earnings differences across MSAs. Finally, our results show that it is naive for researchers to take it for granted that individuals will provide unbiased information to household surveys when they are simultaneously providing distorted information to other administrative sources.
Evaluation of signal peptide prediction algorithms for identification of mycobacterial signal peptides using sequence data from proteomic methods
Secreted proteins play an important part in the pathogenicity of Mycobacterium tuberculosis, and are the primary source of vaccine and diagnostic candidates. A majority of these proteins are exported via the signal peptidase I-dependent pathway, and have a signal peptide that is cleaved off during the secretion process. Sequence similarities within signal peptides have spurred the development of several algorithms for predicting their presence as well as the respective cleavage sites. For proteins exported via this pathway, algorithms exist for eukaryotes, and for Gram-negative and Gram-positive bacteria. However, the unique structure of the mycobacterial membrane raises the question of whether the existing algorithms are suitable for predicting signal peptides within mycobacterial proteins. In this work, we have evaluated the performance of nine signal peptide prediction algorithms on a positive validation set, consisting of 57 proteins with a verified signal peptide and cleavage site, and a negative set, consisting of 61 proteins that have an N-terminal sequence that confirms the annotated translational start site. We found the hidden Markov model of SignalP v3.0 to be the best-performing algorithm for predicting the presence of a signal peptide in mycobacterial proteins. It predicted no false positives or false negatives, and predicted a correct cleavage site for 45 of the 57 proteins in the positive set. Based on these results, we used the hidden Markov model of SignalP v3.0 to analyse the 10 available annotated proteomes of mycobacterial species, including annotations of M. tuberculosis H37Rv from the Wellcome Trust Sanger Institute and the J. Craig Venter Institute (JCVI). When excluding proteins with transmembrane regions among the proteins predicted to harbour a signal peptide, we found between 7.8 and 10.5 % of the proteins in the proteomes to be putative secreted proteins. Interestingly, we observed a consistent difference in the percentage of predicted proteins between the Sanger Institute and JCVI. We have determined the most valuable algorithm for predicting signal peptidase I-processed proteins of M. tuberculosis, and used this algorithm to estimate the number of mycobacterial proteins with the potential to be exported via this pathway
Genetic basis of the very short life cycle of ‘Apogee’ wheat
Background: ‘Apogee’ has a very short life cycle among wheat cultivars (flowering 25 days after planting under a long day and without vernalization), and it is a unique genetic material that can be used to accelerate cycling breeding lines. However, little is known about the genetic basis of the super-short life of Apogee wheat.
Results: In this study, Apogee was crossed with a strong winter wheat cultivar ‘Overland’, and 858 F2 plants were generated and tested in a greenhouse under constant warm temperature and long days. Apogee wheat was found to have the early alleles for four flowering time genes, which were ranked in the order of vrn-A1 \u3e VRN-B1 \u3e vrn- D3 \u3e PPD-D1 according to their effect intensity. All these Apogee alleles for early flowering showed complete or partial dominance effects in the F2 population. Surprisingly, Apogee was found to have the same alleles at vrn-A1a and vrn-D3a for early flowering as observed in winter wheat cultivar ‘Jagger.’ It was also found that the vrn-A1a gene was epistatic to VRN-B1 and vrn-D3. The dominant vrn-D3a alone was not sufficient to cause the transition from vegetative to reproductive development in winter plants without vernalization but was able to accelerate flowering in those plants that carry the vrn-A1a or Vrn-B1 alleles. The genetic effects of the vernalization and photoperiod genes were validated in Apogee x Overland F3 populations.
Conclusion: VRN-A1, VRN-B1, VRN-D3, and PPD-D1 are the major genes that enabled Apogee to produce the very short life cycle. This study greatly advanced the molecular understanding of the multiple flowering genes under different genetic backgrounds and provided useful molecular tools that can be used to accelerate winter wheat breeding schemes
Cure and Curse: E. coli Heat-Stable Enterotoxin and Its Receptor Guanylyl Cyclase C
Enterotoxigenic Escherichia coli (ETEC) associated diarrhea is responsible for roughly half a million deaths per year, the majority taking place in developing countries. The main agent responsible for these diseases is the bacterial heat-stable enterotoxin STa. STa is secreted by ETEC and after secretion binds to the intestinal receptor guanylyl cyclase C (GC-C), thus triggering a signaling cascade that eventually leads to the release of electrolytes and water in the intestine. Additionally, GC-C is a specific marker for colorectal carcinoma and STa is suggested to have an inhibitory effect on intestinal carcinogenesis. To understand the conformational events involved in ligand binding to GC-C and to devise therapeutic strategies to treat both diarrheal diseases and colorectal cancer, it is paramount to obtain structural information on the receptor ligand system. Here we summarize the currently available structural data and report on physiological consequences of STa binding to GC-C in intestinal epithelia and colorectal carcinoma cells
Flanking signal and mature peptide residues influence signal peptide cleavage
<p>Abstract</p> <p>Background</p> <p>Signal peptides (SPs) mediate the targeting of secretory precursor proteins to the correct subcellular compartments in prokaryotes and eukaryotes. Identifying these transient peptides is crucial to the medical, food and beverage and biotechnology industries yet our understanding of these peptides remains limited. This paper examines the most common type of signal peptides cleavable by the endoprotease signal peptidase I (SPase I), and the residues flanking the cleavage sites of three groups of signal peptide sequences, namely (i) eukaryotes (Euk) (ii) Gram-positive (Gram+) bacteria, and (iii) Gram-negative (Gram-) bacteria.</p> <p>Results</p> <p>In this study, 2352 secretory peptide sequences from a variety of organisms with amino-terminal SPs are extracted from the manually curated SPdb database for analysis based on physicochemical properties such as p<it>I</it>, aliphatic index, GRAVY score, hydrophobicity, net charge and position-specific residue preferences. Our findings show that the three groups share several similarities in general, but they display distinctive features upon examination in terms of their amino acid compositions and frequencies, and various physico-chemical properties. Thus, analysis or prediction of their sequences should be separated and treated as distinct groups.</p> <p>Conclusion</p> <p>We conclude that the peptide segment recognized by SPase I extends to the start of the mature protein to a limited extent, upon our survey of the amino acid residues surrounding the cleavage processing site. These flanking residues possibly influence the cleavage processing and contribute to non-canonical cleavage sites. Our findings are applicable in defining more accurate prediction tools for recognition and identification of cleavage site of SPs.</p
- …