Data Scientist (text miner)
Location:EMBL-EBI Hinxton near Cambridge, UK
Staff Category:Staff Member
Contract Duration:3 years
Grading:5 or 6 (monthly salary starting at £2,552 or £2,856 after tax). For more information about pay and benefits click here
Closing Date:24th September 2017
Reference Number:EBI_01009

Job Description

We are seeking to recruit a senior data scientist with a background in text mining to join the Literature Services Team at the European Bioinformatics Institute (EMBL-EBI) located on the Wellcome Trust Genome Campus near Cambridge in the UK.

Our goal is to integrate the open research literature with public data resources, supporting better search technologies, browse experiences, and database curation workflows. The successful candidate will be responsible for delivering text and data mining projects that impact on these goals. Our team is involved in a variety of projects in the area of Literature Services, and at the heart of these is our commitment to run Europe PMC, the database of life sciences abstracts and full text articles that incorporates both PubMed and PMC content. Based on Europe PMC content, we enable text mining groups to extract named entities, relationships, or events extracted from the content and highlight these on articles. In addition, we run daily text-mining pipelines to extract entities such as genes/proteins, organism names, chemicals, Gene Ontology terms, diseases, and data citations. Finally, we participate in projects that address wider EBI text mining requirements such as support for curation activities. We are therefore looking for a versatile data scientist who is ambitious to use their skills to build text and data mining solutions both directly and collaboratively, embedded in a service-driven team.

Specific job responsibilities include:

  • Develop and improve core text mining services on full text articles and abstracts
  • Data analysis and evaluation of extraction results
  • Iterative improvement of solutions, with key stakeholders

At EMBL-EBI, we help scientists realise the potential of ‘big data’ in biology by enabling them to exploit complex information to make discoveries that benefit mankind. Working for EMBL-EBI gives you an opportunity to apply your skills and energy for the greater good. As part of the European Molecular Biology Laboratory (EMBL), we are a non-profit, intergovernmental organisation funded by 23 member states and two associate member states and proud to be an equal-opportunity employer. Our 600 staff are engineers, technicians, scientists and other professionals from all over the world.

Qualifications and Experience

The successful candidate must be able to demonstrate the following:

  • Proven experience of a range of methodologies such as NLP, pattern mining, dictionary-based techniques, deep learning.
  • Experience of text-mining as applied to biological data resources in an academic, industrial or publishing setting;
  • Technical ability e.g. Perl, Java, R, XML parsing;
  • Flexible approach and ability to take on new skills;
  • Self starter and able to manage multiple projects;
  • Team player and good communicator


EMBL is an inclusive, equal opportunity employer offering attractive conditions and benefits appropriate to an international research organisation. The remuneration package comprises a competitive salary, a comprehensive pension scheme and health insurance, educational and other family related benefits where applicable, as well as financial support for relocation and installation.

We have an informal culture, international working environment and excellent professional development opportunities but one of the really amazing things about us is the concentration of technical and scientific expertise – something you probably won’t find anywhere else.

If you’ve ever visited the campus you’ll have experienced first-hand our friendly, collegial and supportive atmosphere, set in the beautiful Cambridgeshire countryside. Our staff also enjoy excellent sports facilities including a gym, a free shuttle bus, an on-site nursery, cafés and restaurant and a library.

Application Instructions

To apply please submit a covering letter and CV, with two referees, through our online system.

Additional Information

Applications are welcome from all nationalities - visa information will be discussed in more depth with applicants selected for interview.

EMBL-EBI is committed to achieving gender balance and strongly encourages applications from women, who are currently under-represented at all levels. Appointment will be based on merit alone.

The initial contract is for a period of three years with the possibility of a fixed-term extension.

Applications will close at 23:00 GMT on the date listed above.