Date: June 18, 2015, 2 p.m. - Room: Salle du conseil

A multi-objective optimisation approach accurately resolves protein domain architectures

Juliana Silva Bernardes - Université Paris 6

Videoconference available
@IP:
Technical contact: Nicolas CHAMPEIL
Technical contact phone: 04 73 40 50 15 / 06 78 34 55 26
Videoconference room phone: 04 73 40 50 47

Given a protein sequence and a number of potential domains matching it, what are the domain content and the most likely domain architecture of the sequence? This problem is of fundamental importance in protein annotation and constitutes one of the main steps of all predictive annotation strategies. However, when several potential domains conflict because their boundaries overlap, finding a solution can become difficult. An accurate prediction of the domain architecture of a multi-domain protein provides important information for function prediction, comparative genomics and molecular evolution.

We developed DAMA (Domain Annotation by a Multi-objective Approach), a novel approach that identifies architectures through a multi-objective optimisation algorithm combining the scores of domain matches, previously observed multi-domain co-occurrence, and domain overlap. DAMA has been validated on a known benchmark data set based on CATH structural domain assignments and on the Plasmodium falciparum proteome. When compared with existing tools on both data sets, it outperforms all of them.
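To make the idea of combining several criteria concrete, here is a minimal Python sketch of multi-objective selection over candidate architectures. It is not DAMA's actual algorithm, which the abstract does not detail: the hits, coordinates, scores, the `KNOWN_PAIRS` set and all function names are hypothetical, and brute-force enumeration of subsets is assumed only because the toy input is tiny. The sketch keeps every architecture that is not dominated on three objectives: total match score, number of previously observed domain pairs, and total overlap (minimised).

```python
from itertools import combinations

# Hypothetical candidate hits: (domain name, start, end, match score).
HITS = [
    ("A", 1, 100, 50.0),
    ("B", 90, 200, 40.0),
    ("C", 110, 220, 35.0),
]
# Hypothetical domain pairs previously seen together in known architectures.
KNOWN_PAIRS = {frozenset(("A", "C"))}


def overlap(h1, h2):
    """Number of residues shared by two hits (inclusive coordinates)."""
    return max(0, min(h1[2], h2[2]) - max(h1[1], h2[1]) + 1)


def objectives(arch):
    """Return (total score, known pairs, -total overlap); all are maximised."""
    score = sum(h[3] for h in arch)
    pairs = list(combinations(arch, 2))
    cooc = sum(frozenset((a[0], b[0])) in KNOWN_PAIRS for a, b in pairs)
    ovl = sum(overlap(a, b) for a, b in pairs)
    return (score, cooc, -ovl)


def dominates(u, v):
    """True if u is at least as good as v everywhere and differs from it."""
    return all(x >= y for x, y in zip(u, v)) and u != v


def pareto_architectures(hits):
    """Enumerate all non-empty subsets of hits; keep the non-dominated ones."""
    candidates = [
        subset
        for r in range(1, len(hits) + 1)
        for subset in combinations(hits, r)
    ]
    scored = [(arch, objectives(arch)) for arch in candidates]
    return [
        (arch, obj)
        for arch, obj in scored
        if not any(dominates(other, obj) for _, other in scored)
    ]


if __name__ == "__main__":
    for arch, obj in pareto_architectures(HITS):
        print([h[0] for h in arch], obj)
```

With these toy hits, three architectures remain: A+B (high score, small overlap), A+C (no overlap, one known pair) and A+B+C (highest score, large overlap). A real tool would still need a rule for picking one architecture from such a set, and a search strategy that scales beyond exhaustive enumeration.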