Biological sequence analysis
The Gibson team investigates protein sequence interactions, undertakes computational analyses of macromolecules, and develops tools to enhance sequence analysis research.
Previous and current research
Regulatory decisions during eukaryotic cell signalling are made within large dynamic protein complexes (see Gibson, 2009). Cell regulation is networked, redundant and - above all - cooperative. Decisions are made by in-complex molecular switching. e deeply misleading ‘kinase cascade’ metaphor needs to be retired and the sooner, the better. Regulatory proteins make remarkable numbers of interactions, with the corollary that they also have highly modular architectures.
We and collaborators develop and deploy the Eukaryotic Linear Motif (ELM) resource for investigating functional sites in modular protein sequences. Linear motifs (LMs) are short functional sites used for the dynamic assembly and regulation of large cellular protein complexes and their characterisation is essential if we are to understand cell signalling. So-called ‘hub’ proteins that make many contacts in interaction networks are being found to have abundant LMs in large segments of IUP (intrinsically unstructured protein segments). Viral proteomes are rich in LMs that are used for hijacking cell systems required for viral production (see figure). ELM data are now being used by many bioinformatics groups to develop and benchmark LM predictors. We are now actively hunting for new LM candidates. Recently, we proposed new candidate KEN anaphase destruction motifs, as well as KEPE, a motif of unknown function that is superposed on many sumoylation sites. We look to collaborate with experimental groups undertaking validation experiments.
We also undertake more general computational analyses of biological macromolecules. Where possible, we contribute to multidisciplinary projects involving structural and experimental groups at EMBL and elsewhere. Our collaborators Des Higgins (Dublin) and Julie Thompson (Strasbourg) have released Clustal Omega, a major update to the widely used multiple sequence alignment software. Additional public web servers include Phospho.ELM, a collection of some 42,000 reported phosphorylation sites and EpiC, a tool to aid in targeting epitopes for antibody selection.
Future projects and goals
We will continue to hunt for regulatory motifs and may survey individual gene families in depth, and we will undertake proteome surveys when we have specific questions to answer. Protein interaction networks are anticipated to become increasingly important to our work. Molecular evolution is also one of the group’s interests, especially when it has practical applications.
With our collaborators, we will look to build up the protein architecture tools, especially the unique ELM resource, taking them to a new level of power and applicability. We will apply the tools in the investigation of modular protein function and may deploy them in proteome and protein network analysis pipelines. We are now working to improve the way that bioinformatics standards represent cooperative molecular interactions. As part of the consortia DiGtoP, SyBoSS and SYSCILIA we are looking at interaction networks and systems in stem cells and primary cilia.