avdist 1.0 review
Downloadavdist is a simple tool for bootstrap analysis of haplotype differences. It computes hamming distance between pairs of sequences s
|
|
avdist is a simple tool for bootstrap analysis of haplotype differences.
It computes hamming distance between pairs of sequences sampled from the input sequences and presents average difference and standard devitation of the results after some number of iterations.
A previous version of this tool was used for Savolainen et al in Science 2003.
Usage:
avdist [< options >] < iter > < samplesize > < ancient > < seqfile >
Arguments
iter
The number of times the sampling is performed.
samplesize
The number of sequences in each sample. Must be less than the number of sequences in the file.
seqfile
Name of file with sequences on simplified GDE or Fasta format. First a sequence identifier prefixed by '#' or '>', then a new line with the whole sequence on it. Note! The sequence may not wrap over several lines!
ancient
A number between 1 and number of sequences in the file pointing out which sequence the distance is measured to. If 0, we just report the average difference between any pair of sequences in the sample.
Options:
-v
Verbose mode. Presents more information on STDERR.
-m
Don't use multiplicity! This will also make your sequence to be sampled regardless of a zero multiplicity.
-i
Verbose mode. Presents more information on STDERR.
The multiplicity, that is the number of times it is counted, of a sequence is taken from the id line. For example, in the following sequence file,
>name1 3
ACGTACGT...
>name2 50
AAGTAAGT...
>ancient 0
AAGTACGT...
the first sequence is registered three times and the second is registered 50 times.
If you don't want your ancient sequence to be sampled, that is, if it is actually not in the observed sequences, be sure to set the multiplicity of it to zero! See example above.
avdist 1.0 keywords