Description of output civet directory
An output prefix can be specified with -o /--output-prefix
. If no output directory is specified, by default the directory will be a timstamped directory beginning with the output_prefix
and the report files will be called output_prefix
.md (Default: civet) The output directory can be specified with --outdir
, which overwrites the --output_prefix
directory name. See example in the figure below:
### Output files
catchment_trees
A directory of the catchment trees, with corresponding text files of the content of each collapsed node (will not have the additional fasta input added in at this stage, these are the trees clipped directly out of the global tree)
local_trees
A directory of the collapsed local trees, with corresponding text files of the content of each collapsed node.
report
A directory containing a report in markdown and html formats. The report has summaries of lineages and genetic diversity present in the sequences of interest. Trees are visualised in the report and compared to the diversity of lineages present in the community. Optional figures include summary barcharts of collapsed-node composition, genome ‘snipit’ graphs and maps of the local community diversity. Full report options described here.
This directory also contains a figures
directory that contains the figures linked to the report.
civet_metadata.csv
A csv containing all the metadata for the queries provided that were found in the COG database, drawn from any query metadata provided and supplemented with any found in the background metadata. It also includes any sequences provided in a fasta file which were not found in the COG database, with accompanying metadata. Finally, it also includes sequences which are not in the query, but are present in non-collapsed nodes in the trees.
A filtered version of this table is provided in the report, and can be customised using --table-fields
(see here)
Config yaml file
The final config file that gets passed to the report (can be used as a template to run additional analyses).
query.failed_qc.csv
If a fasta file is input, this file will show details of any sequences that didn’t pass the quality control cut offs and why. For example:
name | reason_for_failure |
---|---|
seq1 | fail=seq_len:5052 |
seq2 | fail=N_content:0.98 |
seq3 | fail=N_content:1.0 |
seq4 | fail=not_in_query_csv |
query.post_qc.fasta
A file of input fasta sequences that passed the quality control cut offs.
With the --no-temp
flag, many more intermediate files are output, including the local tree files prior to collapse.