Toby Gibson

Team Leader at
European Molecular Biology Laboratory
Meyerhofstraße 1, 69012
Heidelberg,
Germany
Tel: +49 6221 387398
Fax: +49 6221 387517
Email: Toby.Gibson@EMBL.DE

As from 1996, I have been responsible for establishing an in-house Sequence Analysis Service to support the EMBL research groups. Aidan Budd, Sophie Chabanis-Davidson, Chenna Ramu , Christine Gemünd, José Castresana, Rune Linding work with me. We maintain several web servers, including SRS (Sequence Retrieval Service), Gene2EST and a fast and sensitive Bioccelerator database search facility. We also run in-house courses on using EMBL's computational facilities. If you have some sequence analysis needs, come and talk to us.

My research interests involve the application of computers in molecular biology. In recent years, this has centred primarily on amino acid sequence analysis. I have worked on a number of domains and proteins where divergent sequence relationships have been informative, often in collaboration with structural groups who have solved several of the domain structures. Annalisa Pastore's group at Mill Hill have solved the structure of Frataxin, which is involved in the Inherited disease Friedreich's ataxia. With Rein Aasland, I have been interested in looking at chromatin-associated domains, such as the PHD finger and the SAND domain. Michael Sattler's group at EMBL have solved the DNA-binding SAND domain structure. With Jürg Spring, I have been looking at redundant genes in vertebrates, which seem to be differentially retained, depending on whether the phenotypes are dominant or recessive.

I am coordinator of an EU grant QLRI-CT2000-00127 which sets out our plans to develop a new protein motif database, the Eukaryotic Linear Motif resource, ELM (elm.eu.org). As well as EMBL, the other partners are Rein Aasland (Bergen), Bill Hunter (Dundee), Manuela Helmer-Citterich (Rome), Giulio Superti-Furga (CellZome) and Leszek Rychlewski (BioInfobank). There are many short functional motifs in modular proteins but it is difficult to provide good tools for their detection in query sequences due to a severe overprediction problem. The aim of the ELM project is to provide software filtering tools as well as the motif database, in order to reduce this problem. Filters currently include globular domain, cell compartment and taxonomy. The project is ongoing but you can use ELM now to search for functional sites in your protein or simply to browse the entries. Other servers from our group that are useful in conjunction with ELM are GlobPlot (find non-globular regions that may be rich in short motifs) and SIRW (select sets of proteins by kyword search to query with motif patterns).

Recently, I have been thinking about a nucleic acid agent for TSEs (spongiform encephalopathies). I have put forward a model, termed RuNAway Disease, for SINE retroposons as the transmissible agent with a mechanism to overproduce PrP, leading to amyloid plaque formation. Evidence for or against the model is scanty, but I hope to persuade TSE labs to do some testing. the RuNAway model has been posted on the GenomeBiology web set.

I have also been involved in some methods development projects. The multiple alignment programs Clustal W and Clustal X were developed together with Des Higgins (now at Cork) and Julie Thompson and colleagues (now in Strasbourg): we continue to maintain and develop Clustal W and X. Julie Thompson and I have also worked closely with Ewan Birney (now at the Sanger Centre) in the development of the WiseTools package of sequence comparison programs. WiseTools is a generalised sequence analysis system in which the user can compare combinations of proteins, profiles (from multiple alignments) and DNA translation frames. A frameshifting capability allows the user to match homologous proteins despite frameshifts caused by introns or sequencing errors.

More details may be found at our Official EMBL Team Page.


Recent Publications

 2003

Bimpikis, K., Budd, A., Linding, R. and Gibson, T.J. (2003) BLAST2SRS, a web server for flexible retrieval of related protein sequences in the SWISS-PROT and SPTrEMBL databases. Nucleic Acids Res., 31, 3792-3794. Full Article at NAR

Chenna, R., Sugawara, H., Koike, T., Lopez, R., Gibson, T.J., Higgins, D.G. and Thompson, J.D. (2003) Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res., 31, 3497-3500.

Linding, R., Russell, R.B., Neduva, V. and Gibson, T.J. (2003) GlobPlot: exploring protein sequences for globularity and disorder. Nucleic Acids Res., 31, 3701-3708.

Puntervoll, P., Linding, R., Gemund, C., Chabanis-Davidson, S., Mattingsda,l M., Cameron, S., Martin, D.M., Ausiello, G., Brannetti, B., Costantini, A., Ferre, F., Maselli, V., Via, A., Cesareni, G., Diella, F., Superti-Furga, G., Wyrwicz, L., Ramu, C., McGuigan, C., Gudavalli, R., Letunic, I., Bork, P., Rychlewski, L., Kuster, B., Helmer-Citterich, M., Hunter, W.N., Aasland, R., Gibson, T.J. (2003) ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins. Nucleic Acids Res., 31, 3625-3630. Full Article at NAR

2001

Bottomley, M. J., Collard, M. W., Huggenvik, J. I., Liu, Z., Gibson, T. J. and Sattler, M. (2001) The SAND domain structure defines a novel DNA binding fold in transcriptional regulation. Nature Struct., 8, 626-633.

Gemünd, C., Ramu, C., Altenberg-Greulich, B. and Gibson, T. J. (2001) Gene2EST - a BLAST2 server for searching Expressed Sequence Tag (EST) databases with eukaryotic gene-sized queries. Nucleic Acids Res., 29, 1272-1277.

Gibson, T. J. (2001) RuNAway Disease: A two cycle model for transmissible spongiform encephalopathies (TSEs) wherein SINE proliferation drives PrP overproduction. Genome Biology, 2(7), preprint0006.1-0006.17.

Huynen, M. A. Snel, B., Bork, P. and Gibson, T. J. (2001) The phylogenetic distribution of frataxin indicates a role in iron-sulfur cluster assembly. Human Mol. Genet., 10, 2463-2468.

Moreira, M.-C., Barbot, C., Tachi, N., Kozuka, N., Uchida, E., Gibson, T., Mendonça, P., Costa, M., Barros, J., Yanagisawa, T., Watanabe, M., Ikeda, Y., Aoki, M., Nagata, T. Coutinho, P., Sequeiros, J. and Koenig, M. (2001) The gene mutated in ataxia-ocular apraxia 1 encodes the new HIT/Zn-finger protein aprataxin. Nature Genet., 29, 189-193.

 2000

Andrade, M. A., Ponting, C. P., Gibson, T. J. and Bork, P. (2000) Homology-based method for identification of protein repeats usingstatistical significance estimates. J. Mol. Biol., 298, 521-537.

Gibson, T. J. and Spring J. (2000) Evidence in favour of ancient octaploidy in the vertebrate genome. Biochem. Soc. Trans.,28, 259-264.

Hill, F., Gemuend, C., Benes, V., Ansorge, W. and Gibson, T. J. (2000) An estimate of large-scale sequencing accuracy. EMBO Reports, 1, 29-31.

Musco, G., Stier, G., Kolmerer, B., Adinolfi, S., Martin, S., Frenkiel, T.,Gibson, T. and Pastore, A. (2000) Towards a structural understanding ofFriedreich's ataxia: the solution structure of frataxin. Structure, 8,695-707.

Ramu, C., Gemünd, C. and Gibson, T. J. (2000) Object-oriented parsing of biological databases with Python. Bioinformatics, 16, 628-638.

1999

Adinolfi, S., Bagni, C., Musco, G., Gibson, T., Mazzarella, L. and Pastore, A. (1999) Dissecting FMR1, the protein responsible for Fragile X syndrome, in its structural and functional domains. RNA, 5, 1248-1258.

Gibson, T. J. and Bork, P. (1999) "Encyclopedia of Molecular Biology" (Ed. T. Creighton) Wiley & Son Inc., New York, pp 86-90; 765-766; 961-962; 1252, 2320-2324.

Musco, G., de Tommasi, T., Stier, G., Kolmerer, B., Bottomley, M., Adinolfi, S., Muskett, F. W., Gibson, T. J., Frenkiel, T. A. and Pastore, A. (1999) Assignment of the 1H, 15N, and 13C resonances of the C-terminal domain of frataxin, the protein responsible for Friedreich ataxia. J. Biomol. NMR, 15, 87-88.

Zeller, R., Haramis, A. G., Zuniga, A., McGuigan, C., Dono, R., Davidson, G., Chabanis, S. and Gibson, T. (1999) Formin defines a large family of morphoregulatory genes and functions in establishment of the polarising region. Cell Tissue Res., 296, 85-93.

1998

Gibson, T. J. and Spring. J. (1998) Genetic redundancy in vertebrates: polyploidy and persistence of genes encoding multidomain proteins. Trends Genet., 14, 46-49.

Gibson, T. J., Chenna, R., Gemuend, C. and Aasland, R. (1998) The APECED polyglandular autoimmune syndrome protein, AIRE-1, contains the SAND domain and is probably a transcription factor. (1998) Trends Biochem. Sci., 23, 242-244.

Jeanmougin, F., Thompson, J. D., Gibson, T. J., Gouy, M. and Higgins, D. G. (1998) Multiple sequence alignment with Clustal X. Trends Biochem. Sci., 23, 403-405.

Lock, P., Abram, C. L., Gibson, T. and Courtneidge, S. A. (1998) A new method for isolating tyrosine kinase substrates used to identify Fish, an SH3 and PX domain-containing protein, and Src substrate. EMBO J., 17, 4346-4357.

1997

Musco, G., Kharrat, A., Stier, G., Fraternali, F., Gibson, T. J., Nilges, M. and Pastore, A. (1997) The solution structure of the first KH domain of FMR1, the protein responsible for the fragile X syndrome. Nature Structure, 4, 712-716.

Thompson, J. D., Gibson, T. J., Plewniak, F. Jeanmougin, F. and Higgins, D. G. (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res., 25, 4876-4882.