Sangam: A Confluence of Knowledge Streams

Design of Nuclear-Targeting Peptides for Macromolecule Delivery via Machine Learning

Show simple item record

dc.contributor Pentelute, Bradley L
dc.contributor Massachusetts Institute of Technology. Department of Chemistry
dc.creator Schissel, Carly Katherine
dc.date 2022-06-15T13:06:28Z
dc.date 2022-06-15T13:06:28Z
dc.date 2022-02
dc.date 2022-03-03T18:35:32.041Z
dc.date.accessioned 2023-02-17T20:05:10Z
dc.date.available 2023-02-17T20:05:10Z
dc.identifier https://hdl.handle.net/1721.1/143244
dc.identifier https://orcid.org/ 0000-0003-0773-5168
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/242099
dc.description The effective design of functional peptide sequences remains a fundamental challenge in biomedicine. For example, cell-penetrating peptides (CPPs) are capable of delivering macromolecular cargo to intracellular targets that are otherwise inaccessible. However, design of novel CPPs with high activity and unique structure remains challenging. In this thesis, methods to design and characterize highly active CPPs for antisense oligonucleotide delivery were explored. Machine learning is a promising method for de novo design of functional peptide sequences. A deep learning model inspired by directed evolution was used to optimize abiotic sequences that traffic antisense oligomers to the nucleus of cells. The model was able to predict activities beyond those in the training dataset, and simultaneously decipher and visualize sequence-activity predictions. The validated miniproteins (40-80 residues) were more effective than any previously known variant in cells. By augmenting the machine learning model to over-represent shorter sequence space, the model also predicted a short peptide (18-residues) with comparable activity to a positive control peptide. Empirical sequence-activity studies demonstrated reliance on the cationic residues as well as the C-terminal cysteine residue. These sequences were nontoxic, able to deliver other biomacromolecules to the cytosol, and efficiently delivered antisense cargo in mice. A different approach to discover and characterize CPP sequences was also taken, by extracting peptides taken up into cells and analyzing their relative quantities or identifying their sequences by mass spectrometry. First, several mirror-image D-peptides had similar delivery activity to their native forms, while demonstrating complete proteolytic stability. Mixtures of fully intact antisense-peptide conjugates could be recovered from whole cell and cytosolic lysates, and relative concentrations were quantified by MALDI-TOF. This method was then extended to the discovery of de novo sequences from a combinatorial library of antisense-peptide conjugates containing unnatural residues. Following cell treatment with the biotinylated antisense-peptide library, the cytosol of cells was extracted and internalized peptides recovered via affinity capture. De novo sequencing was achieved by Orbitrap tandem mass spectrometry, and several unique, unnatural sequences were identified that could effectively deliver the antisense oligomer to the nucleus. In summary, machine learning and mass spectrometry-based strategies to discover and characterize novel CPP sequences for antisense delivery were explored. In the future, we envision combining these methods in order to use lists of library hits to train a machine learning model to design sequences composed of fully unnatural amino acids.
dc.description Ph.D.
dc.format application/pdf
dc.publisher Massachusetts Institute of Technology
dc.rights In Copyright - Educational Use Permitted
dc.rights Copyright MIT
dc.rights http://rightsstatements.org/page/InC-EDU/1.0/
dc.title Design of Nuclear-Targeting Peptides for Macromolecule Delivery via Machine Learning
dc.type Thesis


Files in this item

Files Size Format View
Schissel-schissel-phd-chemistry-2022-thesis.pdf 83.43Mb application/pdf View/Open

This item appears in the following Collection(s)

  • DSpace@MIT [2699]
    DSpace@MIT is a digital repository for MIT's research, including peer-reviewed articles, technical reports, working papers, theses, and more.

Show simple item record

Search DSpace


Advanced Search

Browse