Figure 1: Strand-seq preserves the identity and structure of each homologue in a cell (Porubsky et al. Nat Commun in press).

Figure 1: Strand-seq preserves the identity and structure of each homologue in a cell (Porubsky et al., Nat Commun 2017).

Figure 2: Weischenfeldt et al. Nat Genet 2017; Northcott et al. Nature 2017.

Figure 2: (a) Identification of enhancer hijacking mediating oncogenic overexpression in cancer genomes; (b) IGF2 overexpression via de novo 3D contact domain (‘neo-TAD’) formation in colon cancer mediated by recurrent somatic tandem duplications (Weischenfeldt et al., Nat Genet 2017; Northcott et al., Nature 2017).

The Korbel group combines computational and experimental approaches, including in single cells, to unravel determinants and consequences of germline and somatic genetic variation with a special focus on disease mechanisms.

Previous and current research

Our group is using bulk as well as single cell-based omics approaches for investigating mechanisms behind complex phenotypes in humans, ranging from common diseases, including cancer, to ageing. An overarching theme centres on the formation and selection of germline and somatic genetic variation in health and disease states, in particular genomic structural variation (SV). Bioinformatics approaches used encompass deep learning and statistical methodology for processing high-dimensional big datasets. Omics techniques employed in our group range from whole genome sequencing and epigenomic techniques to single-cell/single-strand DNA sequencing (strand-seq; see Figure 1), the latter of which enables haplotype-resolved studies of genetic variation and genome instability. Scientists in our group combine methods development, data generation and analysis with hypothesis generation and experimental testing to obtain insights into biological mechanisms of disease.

Our laboratory has further been among the pioneers in the utilisation of cloud computing to enable sharing and processing of large-scale omics data. As an example, the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) project, co-led by our group, is leveraging cloud solutions to globally standardise and analyse cancer genomics data, with the aim of uncovering commonalities and differences between molecular disease mechanisms in disparate cancer entities. By studying recurrent somatic SVs affecting intergenic regions, we recently demonstrated that 'enhancer hijacking' – the juxtaposition of active enhancers near proto-oncogenes, and often across topologically associating domain (TAD) boundaries – is a frequent oncogene activation mechanism in solid tumors (Figure 2).

Future projects and goals

  • Identifying determinants for the formation and selection of genetic variation in cancer and during ageing (which includes the development of deep learning and statistical methodologies).
  • Development of methodology to facilitate single-cell studies of structural variation, and integrating state-of-the-art microscopy methods with single-cell sequencing.
  • Completion of human genome variation maps using strand-specific and single-molecule DNA sequencing techniques.
  • Deciphering the basis of genomic instability using cell-based models.