Search CORE

23 research outputs found

Comparison of exon 5 sequences from 35 class I genes of the BALB/c mouse

Author: Brorson Kurt A.
Cheroutre Hilde
Hood Leroy
Hunkapiller Tim
Hunt Stephen W., III
Nickerson Deborah A.
Sun Y. Henry
Publication venue: 'Rockefeller University Press'
Publication date: 01/12/1989
Field of study

DNA sequences of the fifth exon, which encodes the transmembrane domain, were determined for the BALB/c mouse class I MHC genes and used to study the relationships between them. Based on nucleotide sequence similarity, the exon 5 sequences can be divided into seven groups. Although most members within each group are at least 80% similar to each other, comparison between groups reveals that the groups share little similarity. However, in spite of the extensive variation of the fifth exon sequences, analysis of their predicted amino acid translations reveals that only four class I gene fifth exons have frameshifts or stop codons that terminate their translation and prevent them from encoding a domain that is both hydrophobic and long enough to span a lipid bilayer. Exactly 27 of the remaining fifth exons could encode a domain that is similar to those of the transplantation antigens in that it consists of a proline-rich connecting peptide, a transmembrane segment, and a cytoplasmic portion with membrane-anchoring basic residues. The conservation of this motif in the majority of the fifth exon translations in spite of extensive variation suggests that selective pressure exists for these exons to maintain their ability to encode a functional transmembrane domain, raising the possibility that many of the nonclassical class I genes encode functionally important products

Caltech Authors

Diversity and evolution of the immunoglobulin gene superfamily

Author: Hunkapiller Tim
Publication venue
Publication date: 01/01/1993
Field of study

The Immunoglobulin Gene Superfamily is characterized by a common protein homology unit that is present in arguably the largest and most diverse set of genes and gene families of any protein motif. This distribution indicates that the homology unit is a remarkably versatile functional unit. Its central role in defining the complex phenotypes of the immune and nervous systems, likewise, is testament to the ability of the motif to support an amazing and unique degree of diversification. Understanding more about the function, structure and evolution of the Immunoglobulin Gene Superfamily can provide insights into both the general issues of complex system evolution as well as the specific nature of the various systems the superfamily plays a central role in. This thesis is a collection of work aimed at a more thorough understanding of these elements. Particularly, these works summarize much of our current understanding of the members of the Immunoglobulin Gene Superfamily along with speculations on their evolutionary history as well as both the evolutionary and somatic mechanisms responsible for their diversity. This work includes initial descriptions of several features relevant to somatic diversification of rearranging immune receptors, including: l) the role of joining imprecision in the generation of junctional diversity in immunoglobulin kappa chain; 2) the initial description of the T-cell beta chain J/C locus; 3) the translation of T-cell beta chain D gene segments in all three reading frames; 4) the occurrence of a cryptic rearrangement signal in most rearranging V families; 5) the first description of the mechanisms of class switching between heavy chain mu and delta genes; 6) the limited diversity of germline T-cell beta chains; 7) the shared complementary determining region structure of T-cell beta chains and immunoglobulin heavy chains. Also, from these efforts, new members of the superfamily have been identified including MHC class I molecules, L3T4 and Myelin Associated Glycoprotein. Various observations concerning the evolutionary relationships of these molecules and motifs have been made. Particularly, a variation on the basic homology unit motif has been proposed that probably more nearly represents the primordial sequence and function. As a result of these discoveries, a new, comprehensive picture of the immunoglobulin superfamily is emerging that has implications for interpreting current functional relationships in the context of the evolutionary history of the members. Particularly, it is suggested from this work that the ability of the homology unit to accommodate diversity has made possible the evolution of the superfamily. Given the tremendous diversity within the superfamily, it might be assumed that selective pressures favoring diversity have driven its evolution. However, much of the analysis within this collection suggests that, on the contrary, diversity is an inherent feature of the conserved protein and gene structure of the homology unit and that it was the a priori diversity itself that drove and shaped the evolution of the complex systems that employ the homology unit today. This basic diversity is the consequence of three characteristics of the homology unit. First, the tertiary structure of the protein motif is such that homology units tend to interact preferentially to form homo- or heterodimers, forming the basis of many of the receptors and the receptor/ligand interactions common within the superfamily. These combinatorial associations increase both the somatic and evolutionary potential for diversification. This can lead to the rather sudden appearance of new functional associations between existing members of the superfamily preadapted for otherwise unrelated functions. Second, except for a minimal number of amino acid residues involved in critical intra- and interchain interactions, the primary structure of these units can vary dramatically and still provide for essentially the same tertiary structure. This has been borne out by various crystallographic studies. The variability is particularly true of the loop structures normally identified with antigen specificity, but seen in other extended families as well. Reduced constraints on structural sequences inherently promote the establishment of variation within populations. Third, with very few exceptions the genes of the superfamily, the homology units are not only encoded by discrete exons, but these exons have a shared 1/2 splicing rule. That is, each is begun with the second 2 bases of a codon and ended with the first base. This allows the in-frame splicing of any number of tandem homology units, while maintaining functional protein domains. This rule generally applies to the non-homology unit exons of member genes as well. This allows, through relatively simple genetic events, the development of new contexts for homology unit expression, both by simple expansion and contraction of homology unit number and exon shuffling. This is probably at work, as well, in the frequent occurrence and utilization of alternative transcripts seen throughout the superfamily. Many of the recognized occurrences of alternative splicing, such as that between membrane-bound and secreted forms, indicate that this gene structure provides for a further level of functional diversity and the expansion of the virtual genetic information. Beyond the explicit discussion of the superfamily members, this work also speaks to various issues of evolution in general. In particular, the history of the superfamily suggests the importance of canalization and non-gradual episodes of evolutionary change. It can contribute, as well, to the discussion of adaptive versus neutral change.</p

Caltech Theses and Dissertations

Nucleotide Sequence of a Light Chain Gene of the Mouse I-A Subregion: Aβ^d

Author: Hood Leroy
Hunkapiller Tim
Malissen Marie
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 19/08/1983
Field of study

Iɑ (I region-associated) antigens are cell-surface glycoproteins involved in the regulation of immune responsiveness. They are composed of one heavy (ɑ) and one light (β) polypeptide chain. We have sequenced the gene encoding the Aβ^d chain of the BALB/c mouse. The presence of six exons is predicted by comparison with the complementary DNA sequences of human β chains and with partial protein sequence data for the Aβ^d polypeptide. Sequence comparisons have been made to other proteins involved in immune responses and the consequent implications for the evolutionary relationships of these genes are discussed

Caltech Authors

DNA Sequence of the Gene Encoding the E_ɑ Ia Polypeptide of the BALB/c Mouse

Author: Hood Leroy
Hunkapiller Tim
Jones Patricia
McNicholas Janet
Steinmetz Michael
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 17/12/1982
Field of study

A 3.4-kilobase DNA fragment containing the gene coding for the Eɑ chain of an Ia (I region-associated) antigen from the BALB/c mouse has been sequenced. It contains at least three exons, which correlate with the major structural domains of the Eɑ chain-the two external domains al and ɑ2, and the transmembrane-cytoplasmic domain. The coding sequence of the mouse Eɑ gene shows striking homology to its human counterpart at the DNA and protein levels. The translated ɑ2 exon demonstrates significant similarity to β_2-microglohulin, to immunoglobulin constant region domains, and to certain domains of transplantation antigens. These observations and those of others suggest that the Ia antigen, transplantation antigen, and immunoglohulin gene families share a common ancestor

Caltech Authors

Hidden Markov Models in Molecular Biology: New Algorithms and Applications

Author: Cathy Place
Marcella A. Mcclure
Pierre Baldi
Tim Hunkapiller
Yves Chauvin
Publication venue: Morgan Kaufmann
Publication date: 01/01/1993
Field of study

Hidden Markov Models (HMMs) can be applied to several important problems in molecular biology. We introduce a new convergent learning algorithm for HMMs that, unlike the classical Baum-Welch algorithm is smooth and can be applied on-line or in batch mode, with or without the usual Viterbi most likely path approximation. Left-right HMMs with insertion and deletion states are then trained to represent several protein families including immunoglobulins and kinases. In all cases, the models derived capture all the important statistical properties of the families and can be used efficiently in a number of important tasks such as multiple alignment, motif detection, and classification. 3 and Division of Biology, California Institute of Technology. y and Department of Psychology, Stanford University. 1 INTRODUCTION Hidden Markov Models (e.g., Rabiner, 1989) and the more general EM algorithm in statistics can be applied to the modeling and analysis of biological primary sequenc..

CiteSeerX

Caltech Authors

Light on molecular recognition

Author: C Tuerk
KM Shokat
KR Khrapko
Leroy Hood
MM Davis
Robert Kaiser
SPA Fodor
T Hunkapiller
Tim Hunkapiller
WD Huse
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Isolation and sequence of L3T4 complementary DNA clones: expression in T cells and brain

Author: Field Elizabeth H.
Gorman Scott D.
Hunkapiller Tim
Parnes Jane R.
Tourvielle Béatrice
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 31/10/1986
Field of study

T lymphocytes express on their surface not only a specific receptor for antigen and major histocompatibility complex proteins, but also a number of additional glycoproteins that are thought to play accessory roles in the processes of recognition and signal transduction. L3T4 is one such T-cell surface protein that is expressed on most mouse thymocytes and on mature mouse T cells that recognize class II (Ia) major histocompatibility complex proteins. Such cells are predominantly of the helper/inducer phenotype. In this study, complementary DNA clones encoding L3T4 were isolated and sequenced. The predicted protein sequence shows that L3T4 is a member of the immunoglobulin gene superfamily. It is encoded by a single gene that does not require rearrangement prior to expression. Although the protein has not previously been demonstrated on nonhematopoietic cells, two messenger RNA species specific for L3T4 are found in brain. The minor species comigrates with the L3T4 transcript in T cells, whereas the major species is 1 kilobase smaller

Caltech Authors

A systolic array processor for biological information signal processing

Author: Chow E. T.
Hunkapiller Tim
Peterson J. C.
Waterman M. S.
Zimmerman B. A.
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/06/1991
Field of study

The Biological Information Signal Processing (BISP) is a system for high speed sequence comparisons designed to support the computation requirements for mapping and sequencing the human and other genomes. The heart of a BISP system is a versatile processor chip that can conduct the most time consuming sequence comparison functions, establishing both global and local relationships between two DNA or protein sequences. Because of the application’s strong computation and communication requirements, a programmable systolic array architecture was developed. A BISP system can include a large number of processing elements; the initial BISP demonstration system consists of 768 BISP elements, capable of delivering more than 6.25 x 10^9 integer operations per second. The system can be expanded to include over 4,000 elements, This paper describes the comparison algorithm and outlines the BISP chip and system designs. Estimated performance of the BISP system is compared with several different computer architectures

Caltech Authors

A systolic array processor for biological information signal processing

Author: Chow E. T.
Hunkapiller Tim
Peterson J. C.
Waterman M. S.
Zimmerman B. A.
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/06/1991
Field of study