stat940W25-presentation: Difference between revisions

From statwiki
Jump to navigation Jump to search
(Created page with "a")
 
No edit summary
Line 1: Line 1:
a
<div style="border: 2px solid #0073e6; background-color: #f0f8ff; padding: 10px; margin: 10px 0; border-radius: 5px;">
 
== Group 24 Presentation: Mitigating the Missing Fragmentation Problem in De Novo Peptide Sequencing With A Two-Stage Graph-Based Deep Learning Model ==
 
=== Background ===
 
- Proteins are crucial for biological functions
 
- Proteins are formed from peptides which are sequences of amino acids
 
- Mass spectrometry is used to analyze peptide sequences
 
- De Novo sequencing is used to piece together peptide sequences when the sequences are missing from existing established protein databases
 
- Deep learning has become commonly implimented to solve the problem of de-novo peptide sequencing
 
- When a peptide fails to fragment in the expected manner, it can make protein reconstruction difficult due to missing data
 
- One error in the protein can propogate to errors throughout the entire sequence
 
=== Paper Contributions===
 
- Graph Novo was developed to handle incomplete segments
 
- '''GraphNovo-PathSearcher''' instead of directly predicting, does a path search method to predict the next peptide in a sequence
 
- '''GraphNovo-SeqFiller''' instead of directly predicting, does a path search method to predict the next peptide in a sequence.
 
- Input is mass spectrum from mass spectrometry
 
- Graph construction is done where nodes represent possible fragments, and edges represent possible peptides (PathSearcher module)
 
- PathSearcher uses machine learning to find the optimal path on the generated graph
 
- SeqFiller fills in missing amino acids that may have not been included in the PathSearcher module due to lacking data from the mass spectrometry inputs
 
 
</div>

Revision as of 11:57, 24 March 2025

Group 24 Presentation: Mitigating the Missing Fragmentation Problem in De Novo Peptide Sequencing With A Two-Stage Graph-Based Deep Learning Model

Background

- Proteins are crucial for biological functions

- Proteins are formed from peptides which are sequences of amino acids

- Mass spectrometry is used to analyze peptide sequences

- De Novo sequencing is used to piece together peptide sequences when the sequences are missing from existing established protein databases

- Deep learning has become commonly implimented to solve the problem of de-novo peptide sequencing

- When a peptide fails to fragment in the expected manner, it can make protein reconstruction difficult due to missing data

- One error in the protein can propogate to errors throughout the entire sequence

Paper Contributions

- Graph Novo was developed to handle incomplete segments

- GraphNovo-PathSearcher instead of directly predicting, does a path search method to predict the next peptide in a sequence

- GraphNovo-SeqFiller instead of directly predicting, does a path search method to predict the next peptide in a sequence.

- Input is mass spectrum from mass spectrometry

- Graph construction is done where nodes represent possible fragments, and edges represent possible peptides (PathSearcher module)

- PathSearcher uses machine learning to find the optimal path on the generated graph

- SeqFiller fills in missing amino acids that may have not been included in the PathSearcher module due to lacking data from the mass spectrometry inputs