Abstract
Remote homology detction among proteins in an unsupervised approach from sequences is an important problem in computational biology. The existing neighborhood cluster kernel methods and Markov clustering algorithms are most efficient for homolog detection. Yet they deviate from random walks with inflation or similarity depending on hard thresholds. Our spectral clustering approach with new combined local alignment kernels more effectively exploits state-ofthe- art neighborhood vectors globally. This appoarch combined with Markov clustering similarity after modified symmetry based corrections outperforms other six cluster kernels for unsupervised remote homolog detection even in multi-domain and promiscuous proteins from Genolevures database with better biological relevance. Source code available upon request.
Original language | English |
---|---|
Title of host publication | Proceedings - 2nd International Conference on Emerging Applications of Information Technology, EAIT 2011 |
Pages | 269-272 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 2011 Apr 18 |
Externally published | Yes |
Event | 2nd International Conference on Emerging Applications of Information Technology, EAIT 2011 - Kolkata, India Duration: 2011 Feb 19 → 2011 Feb 20 |
Conference
Conference | 2nd International Conference on Emerging Applications of Information Technology, EAIT 2011 |
---|---|
Country/Territory | India |
City | Kolkata |
Period | 2011/02/19 → 2011/02/20 |
Free keywords
- Kernel matrix
- Modified symmetry distance measure
- Remote homology detection
- Spectral clustering