2 research outputs found
Methods for identifying regulatory grammars
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2013.Cataloged from PDF version of thesis.Includes bibliographical references (p. [37]-40).Recent advancements in sequencing technology have made it possible to study the mechanisms of gene regulation, such as protein-DNA binding, at greater resolution and on a greater scale than was previously possible. We present an expectation-maximization learning algorithm that identifies enriched spatial relationships between motifs in sets of DNA sequences. For example, the method will identify spatially constrained motifs colocated in the same regulatory region. We apply our method to biological sequence data and recover previously known prokaryotic promoter spacing constraints demonstrating that joint learning of motifs and spacing constraints is superior to other methods for this task.by Tahin Fahmid Syed.S.M
Predicting genomic interactions using deep learning
Thesis: E.C.S., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019Cataloged from PDF version of thesis.Includes bibliographical references (pages 39-43).Physical promoter-enhancer and CTCF-CTCF interactions organize the human genome in 3-dimensions, and contribute to the regulation of gene expression. Hi-C and related approaches have enabled profiling of these interactions, though how the instructions for these interactions are encoded in the genome is still largely not understood. We develop a deep learning model, Deep3DGenome, to predict genomic interactions using both genomic sequence data and chromatin features. We find that a machine learning model that has anchor specific modules and uses rich chromatin features outperforms previous approaches at predicting 3D interactions.by Tahin Fahmid Syed.E.C.S.E.C.S. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Scienc