    Search for Human-Specific Proteins Based on Availability Scores of Short Constituent Sequences: Identification of a WRWSH Protein in Human Testis

    Little is known about protein sequences unique to humans. Here, we performed alignment-free sequence comparisons based on the availability (frequency bias) of short constituent amino acid (aa) sequences (SCSs) in proteins to search for human-specific proteins. Focusing on 5-aa SCSs (pentats), exhaustive comparisons of availability scores among the human proteome and nine other mammalian proteomes in the nonredundant (nr) database identified a candidate protein containing WRWSH, here called FAM75, as human-specific. Examination of various human genome sequences revealed that FAM75 had genomic DNA sequences for either WRWSH or WRWSR due to a single nucleotide polymorphism (SNP). FAM75 and its related protein FAM205A were found to be produced through alternative splicing. The FAM75 transcript was found only in humans, but the FAM205A transcript was also present in other mammals. In humans, both FAM75 and FAM205A were expressed specifically in testis at the mRNA level, and they were immunohistochemically located in cells in seminiferous ducts and in acrosomes in spermatids at the protein level, suggesting their possible function in sperm development and fertilization. This study highlights a practical application of SCS-based methods for protein searches and suggests possible contributions of SNP variants and alternative splicing of FAM75 to human evolution.

    Ramucirumab in elderly patients with hepatocellular carcinoma and elevated alpha-fetoprotein after sorafenib in REACH and REACH-2

    Background & Aims: Limited data on treatment of elderly patients with hepatocellular carcinoma (HCC) increase the unmet need. REACH and REACH‐2 were global phase III studies of ramucirumab in patients with HCC after prior sorafenib, where patients with alpha‐fetoprotein (AFP) ≥400 ng/mL showed an overall survival (OS) benefit for ramucirumab. These post‐hoc analyses examined efficacy and safety of ramucirumab in patients with HCC and baseline AFP ≥400 ng/mL by three prespecified age subgroups (<65, ≥65 to <75 and ≥75 years). Methods: Individual patient data were pooled from REACH (baseline AFP ≥400 ng/mL) and REACH‐2. Kaplan‐Meier and Cox proportional hazards regression methods (stratified by study) assessed OS, progression‐free survival (PFS), time to progression (TTP) and patient‐reported outcomes (Functional Hepatobiliary System Index‐8 [FHSI‐8] score). Results: A total of 542 patients (<65 years: n = 302; ≥65 to <75 years: n = 160; ≥75 years: n = 80) showed similar baseline characteristics between ramucirumab and placebo. Older subgroups had higher hepatitis C and steatohepatitis incidences, and lower AFP levels, than the <65 years subgroup. Ramucirumab prolonged OS in patients <65 years (hazard ratio [HR], 0.753; 95% CI 0.581‐0.975), ≥65 to <75 years (0.602; 0.419‐0.866) and ≥75 years (0.709; 0.420‐1.199), PFS and TTP irrespective of age. Ramucirumab showed similar overall safety profiles across subgroups, with a consistent median relative dose intensity ≥97.8%. A trend towards a delay in symptom deterioration in FHSI‐8 with ramucirumab was observed in all subgroups. Conclusions: In this post‐hoc analysis, ramucirumab showed a survival benefit across age subgroups with a tolerable safety profile, supporting its use in advanced HCC with elevated AFP, irrespective of age, including ≥75 years.

    A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package

    Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs
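The availability score described in this abstract (observed SCS frequency relative to its probabilistically expected frequency) can be illustrated with a minimal Python sketch. Two points here are assumptions, not details taken from the abstract: the score is computed as a plain ratio, and the expected frequency is taken to be the product of the single-residue frequencies (residues treated as independent). The function name `availability_scores` and the toy input are hypothetical, and this is not the SCS Package's implementation.

```python
from collections import Counter
from math import prod


def availability_scores(sequences, k=5):
    """Return {SCS: observed frequency / expected frequency} for all k-residue SCSs.

    Assumption: "expected" means the product of single-residue frequencies,
    i.e. what we would see if residues occurred independently.
    """
    residue_counts = Counter()
    scs_counts = Counter()
    for seq in sequences:
        residue_counts.update(seq)
        # Slide a window of width k over the sequence to collect overlapping SCSs.
        scs_counts.update(seq[i:i + k] for i in range(len(seq) - k + 1))

    total_residues = sum(residue_counts.values())
    total_scs = sum(scs_counts.values())
    residue_freq = {aa: n / total_residues for aa, n in residue_counts.items()}

    scores = {}
    for scs, n in scs_counts.items():
        observed = n / total_scs
        expected = prod(residue_freq[aa] for aa in scs)
        scores[scs] = observed / expected
    return scores


# Toy example with 2-residue SCSs; a real analysis would use a proteome and k=5.
toy_scores = availability_scores(["ABAB"], k=2)
```

Under these assumptions, a score above 1 marks an SCS that occurs more often than its composition alone would predict, and a score below 1 marks one that is under-used.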

    Retinoic acid suppresses liver injury induced by Propionibacterium acnes and lipopolysaccharide in rats

    Made available in DSpace on 2012-09-04T05:10:13Z (GMT). No. of bitstreams: 1 motomura.pdf: 2635407 bytes, checksum: 5c966bdd56880f7ab53dbec1d17f97cd (MD5) Previous issue date: 1999-03-2

    Immediate virological response predicts the success of short-term peg-interferon monotherapy for chronic hepatitis C

    AIM: To investigate the efficacy of short-term peg-interferon (PEG-IFN) monotherapy for chronic hepatitis C patients who achieved an immediate virological response

    Word decoding of protein amino acid sequences with availability analysis: a linguistic approach

    The amino acid sequences of proteins determine their three-dimensional structures and functions. However, how sequence information is related to structures and functions is still enigmatic. In this study, we show that at least a part of the sequence information can be extracted by treating amino acid sequences of proteins as a collection of English words, based on a working hypothesis that amino acid sequences of proteins are composed of short constituent amino acid sequences (SCSs) or "words". We first confirmed that the English language very likely follows Zipf's law, a special case of a power law. We found that the rank-frequency plot of SCSs in proteins exhibits a similar distribution when low-rank tails are excluded. In comparison with natural English and "compressed" English without spaces between words, amino acid sequences of proteins show larger linear ranges and smaller exponents with heavier low-rank tails, demonstrating that the SCS distribution in proteins is largely scale-free. The distribution pattern of SCSs in proteins is similar among species, but species-specific features are also present. Based on the availability scores of SCSs, we found that sequence motifs are enriched in high-availability sites (i.e., "key words") and vice versa. In fact, the highest availability peak within a given protein sequence often directly corresponds to a sequence motif. The amino acid composition of high-availability sites within motifs is different from that of entire motifs and all protein sequences, suggesting the possible functional importance of specific SCSs and their compositional amino acids within motifs. We anticipate that our availability-based word decoding approach is complementary to sequence alignment approaches in predicting functionally important sites of unknown proteins from their amino acid sequences.
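The rank-frequency comparison mentioned in this abstract can be sketched as follows. Zipf's law says that frequency falls off roughly as a power of rank, so the points lie near a straight line on log-log axes and the slope of that line estimates the exponent. The helper names `rank_frequency` and `loglog_slope` are hypothetical, and the ordinary least-squares fit over all ranks is an assumption made for illustration; the abstract does not state how the authors fitted their exponents or which ranks they excluded.

```python
from collections import Counter
from math import log


def rank_frequency(tokens):
    """Return [(rank, count), ...] with rank 1 assigned to the most frequent token."""
    counts = sorted(Counter(tokens).values(), reverse=True)
    return list(enumerate(counts, start=1))


def loglog_slope(rank_freq):
    """Least-squares slope of log(count) against log(rank).

    A slope near -1 is the classic Zipf case. Needs at least two ranks,
    otherwise the variance of log(rank) is zero.
    """
    xs = [log(rank) for rank, _ in rank_freq]
    ys = [log(count) for _, count in rank_freq]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy tokens whose counts (60, 30, 20) are exactly proportional to 1/rank.
toy_tokens = ["a"] * 60 + ["b"] * 30 + ["c"] * 20
toy_slope = loglog_slope(rank_frequency(toy_tokens))
```

For protein data, the tokens would be the overlapping SCSs of a chosen length rather than space-delimited words, and the fit would be restricted to whatever rank range is treated as linear.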