Specification of DAF format

Purpose:

This document gives a commented version of the DAF format, including all compulsory, recommended, and optional elements. For simplicity, example DAF files containing only the compulsory and recommended information are available for:

  1. one pair,
  2. a list of pairs (search with one sequence U against a database),
  3. a list of lists of pairs (search with a number of test set sequences {U} against a database).

Overview of this document:

  1. compulsory elements,
  2. recommended elements,
  3. optional elements,
  4. full file with all elements.

Features of DAF format:


Possible DAF file with explanations

All format elements (keywords and entire lines) are in boldface.
# DAF           	(dirty alignment format)
#
#	information:	
#			1) in all lines not beginning with '#'
#			2) in lines beginning with '#' and a keyword 
#			   (e.g.: ALISYM, NPAIRS, NSEARCH, ALIGNMENTS)
#	comments:	lines beginning with '#'
#	rows:		one line per pair alignment
#	columns:	separated by tabs or blanks (succession not important)
#	
#	purpose:	communicating pair alignments to AQUA (Alignment QUality Assessment)
#       
#	note 1:		If you give only ids (no alignments), AQUA will perform
#			an analysis of rank correlations only.
#	note 2:		Please use the keywords given in this example
# 
# 
# SOURCE:	/data/hssp/1atf.hssp
# ALISYM:	ACDEFGHIKLMNPQRSTVWY.
# NPAIRS:	5 (or, for test sets: 2,3)
# NSEARCH:	1
# ADDKEYS:	addMy1,addMy2 
# 
#	note 3:		Additional column names that can be interpreted
#			by AQUA are: 'conf', 'zDali', 'rmsDali' for the presentation
#			of the true alignments.
# 
# 
# RELKEYS:	zMy1,zMy2
# RELHISTO:	3,2,1
# 
# ALIGNMENTS

idSeq idStr rank conf lenSeq lenStr lenAli pide seq                 str                  weightSeq
1acf  2btfP 1    9    125    513    19     17   SWQTYVDTNLVGTGAVTQ  GWNAYID.NLMADGTCQD   1111111111111111111111
1acf  2btf  2    9    125    513    21     17   SWQTYVD....GTGAVTQ  GG..NKKCYEMASHLRRSQY 111111222211111111111222
1acf  1asu  3    9    125    513    19     17   SWQTYVDTNLVGTGAVTQ  PLQIWQTDFTLAVTVDTA
9rnt  1gmp  1    9    104    193    20     17   GSNaYSSSDVSTAQAAGYK VSGTVCLSALPPEATDTLN
9rnt  1srp  2    9    104    468    20     15   GSNaYSSSDVSTAQAAGYK GGH.YAAAPLLDDIAAIQH
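
The layout rules stated in the comment header (keyword lines begin with '#', all other lines carry data, columns are separated by tabs or blanks) can be illustrated with a minimal reader sketch in Python. This is not AQUA's own parser. The function name `parse_daf`, the assumption that the `KEYWORDS` set below is complete, and the assumption that the first line not beginning with '#' holds the column names are all illustrative choices, not part of the specification.

```python
import re

# Keywords seen in the example file; treating this set as complete is an assumption.
KEYWORDS = {"SOURCE", "ALISYM", "NPAIRS", "NSEARCH", "ADDKEYS", "RELKEYS", "RELHISTO"}


def parse_daf(text):
    """Return (header, columns, rows) for the text of a DAF file.

    header  : dict mapping keyword -> raw value string
    columns : list of column names (first line not beginning with '#')
    rows    : list of dicts, one per pair alignment
    """
    header, columns, rows = {}, None, []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        if line.startswith("#"):
            # A '#' line carries information only if it starts with a keyword;
            # the colon after the keyword is treated as optional.
            m = re.match(r"#\s*([A-Z]+):?\s*(.*)", line)
            if m and m.group(1) in KEYWORDS:
                header[m.group(1)] = m.group(2).strip()
            continue  # everything else beginning with '#' is a comment
        fields = line.split()  # columns are separated by tabs or blanks
        if columns is None:
            columns = fields  # assumed: first data line names the columns
        else:
            rows.append(dict(zip(columns, fields)))
    return header, columns, rows
```

Because values are matched to column names by position, a row that omits trailing columns (such as weightSeq) still parses, but a row that omits a column in the middle would be misaligned; a real reader would need a rule for that case.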


Compulsory elements

# DAF
# ALIGNMENTS
idSeq idStr
1acf  2btfP
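
As a rough illustration, the compulsory elements listed above can be checked with a few lines of Python. The function name `check_compulsory` is hypothetical, and requiring at least one alignment row is an assumption drawn from the example rather than a stated rule.

```python
def check_compulsory(text):
    """Return a list of missing compulsory elements (empty if none are missing)."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    # First word of every '#' line, e.g. 'DAF' or 'ALIGNMENTS'.
    comment_words = [ln[1:].split() for ln in lines if ln.startswith("#")]
    first_words = {words[0] for words in comment_words if words}
    # Lines not beginning with '#': column names first, then the rows.
    data = [ln.split() for ln in lines if not ln.startswith("#")]

    missing = []
    for marker in ("DAF", "ALIGNMENTS"):
        if marker not in first_words:
            missing.append("# " + marker)
    columns = data[0] if data else []
    for name in ("idSeq", "idStr"):
        if name not in columns:
            missing.append(name)
    if len(data) < 2:  # assumption: a usable file has at least one pair
        missing.append("at least one alignment row")
    return missing
```

An empty list means the file carries everything shown in the compulsory example; any other result names the pieces to add.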


Recommended elements

# ALISYM	ACDEFGHIKLMNPQRSTVWY.acdefghiklmnpqrstvwy
# NPAIRS	1
lenSeq lenStr lenAli pide seq                str
125    513    19     17   SWQTYVDTNL...GAVTQ GWNAYID.NLMADGTCQD
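
If ALISYM is read as the list of symbols permitted in the seq and str columns (an interpretation of the keyword, not something the specification states outright), a consistency check could look like the following sketch. The helper name `invalid_symbols` is hypothetical.

```python
def invalid_symbols(alisym, sequence):
    """Return the sorted characters of `sequence` that do not occur in `alisym`."""
    return sorted(set(sequence) - set(alisym))


# ALISYM value from the recommended elements above.
alisym = "ACDEFGHIKLMNPQRSTVWY.acdefghiklmnpqrstvwy"
print(invalid_symbols(alisym, "SWQTYVDTNL...GAVTQ"))               # []
# With an upper-case-only ALISYM, a lower-case residue is reported.
print(invalid_symbols("ACDEFGHIKLMNPQRSTVWY.", "GSNaYSSSDVSTAQAAGYK"))  # ['a']
```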


Optional elements

# SOURCE:	/data/hssp/1atf.hssp
# NSEARCH:	1
# ADDKEYS:	addMy1,addMy2 
# RELKEYS:	zMy1,zMy2
# RELHISTO:	3,2,1
rank conf zscore zDali rmsDali weightSeq
193.216.82.1111111112222111511111
322.23.86.1111199111111

