Neki statistički aspekti prepoznavanja motiva

Relja, Ajka (2014) Neki statistički aspekti prepoznavanja motiva. Diploma thesis, Faculty of Science > Department of Mathematics.

[img]
Preview
PDF
Language: Croatian

Download (2MB) | Preview
[img] Archive (dodatni materijali)
Language: Croatian

Download (59MB)

Abstract

In this thesis we have analysed the distribution of the maximum scores obtained for an actual enzyme from GDSL family of hydrolases on the plant A. thaliana. This is an intriguing topic, since such distribution does not match Gumbel distribution, as it should, according to relevant theoretical results. For the purpose of analysing such distribution, we have calculated scores for query using the sliding window protocol and PSSM matrix. What we have noticed is that a certain correction of scores with respect to the length of the protein, gives Gumbel distribution. Finally we check how the correction affected scores of positive matches, i.e. those enzymes that are in functional relation with our query whose variants we wanted to find in the proteome. That is rather important, primarily since we want enzymes which are related to the query to keep higher scores than those that are not. We show that our correction keeps such a relationship.

Item Type: Thesis (Diploma thesis)
Supervisor: Goldstein, Pavle
Date: 2014
Number of Pages: 45
Subjects: NATURAL SCIENCES > Mathematics
Divisions: Faculty of Science > Department of Mathematics
Depositing User: Iva Prah
Date Deposited: 07 Jul 2015 12:39
Last Modified: 07 Jul 2015 12:39
URI: http://digre.pmf.unizg.hr/id/eprint/4100

Actions (login required)

View Item View Item