Annotating Patterns of Inheritance

Basic command


$ genmod models --help
Usage: genmod models [OPTIONS] <vcf_file> or -

  Annotate genetic models for vcf variants.

  Checks what patterns of inheritance that are followed in a VCF file. The
  analysis is family based so each family that are specified in the family
  file and exists in the variant file will get it's own annotation.

Options:
 Options:
   -f, --family_file <ped_file>
   -t, --family_type [ped|alt|cmms|mip]
                                   If the analysis use one of the known setups,
                                   please specify which one.
   -r, --reduced_penetrance <tsv_file>
                                   File with gene ids that have reduced
                                   penetrance.
   --vep                           If variants are annotated with the Variant
                                   Effect Predictor.
   --phased                        If data is phased use this flag.
   -s, --strict                    If strict model annotations should be
                                   used(see documentation).
   -p, --processes INTEGER         Define how many processes that should be use
                                   for annotation.
   --silent                        Do not print the variants.
   -w, --whole_gene                If compounds should be checked over the
                                   whole gene.
   -k, --keyword TEXT              What annotation keyword that should be used
                                   when
                                   searching for features.
   -o, --outfile FILENAME          Specify the path to a file where results
                                   should be stored.
   --help                          Show this message and exit.

Overview

The genmod models command is used to annotate patterns of inheritance in vcf files. genmod can handle multiple families of arbitrary sizes in the same vcf.
The individuals and families to be included in the analysis is specified in the family file that is given with the -f/--family_file option.

Options and Arguments

vcf_file

This is the only mandatory argument. A proper vcf file or a stream with a vcf file is allways input

family_file

This is a file in ped like format that describes which individuals that should be considered in the analysis and how they are related. The default file format is ped but there are some alternatives. Please see -t/--family_type.

family_type

Family files are usually in the ped format but there are possibilities to use ped like files. If a ped file includes more columns than the first six mandatory (described here) one may use the alt option for this parameter.

reduced_penetrance

Some phenotypes are known to have reduced penetrance. This means that healthy individuals can carry a heterozygous variant without being affected, while others that have the same variant are affected. When annotating inheritance patterns with genmod the user can provide a tab separated file where the first column describes a gene id per row. Use the flag -r/--reduced_penetrance to provide this file to genmod. Genmod will then allow healthy carriers for variants that resides in these genes when annotating the Autosomal Dominant inheritance pattern.

vep

This flag tell genmod that the variants are annotated with the Variant Effect Predictor. In this case the vep annotations will be used to determine if a compound pair is in the same gene.

strict

Only annotate that a genetic model is followed if there is "proof", which means that the variants are called in all individuals.

phased

This flag tell genmod that the variant calls are phased. This will make it possible for more accurate compound checks. For all autosomal chromosomes humans have two copies. If two variants resides on the same copy there will still be one copy that should produce "healthy" transcripts.

With phased variants genmod can take this information into account.

processes

How many processes should be used during the analysis. If -p/--processes 1 the streaming functionality will work better for genmod since no intermediate file needs to be used, the variants will be processes line by line and printed directly.

If there are several processes running at the same time the variants will be printed unordered to a intermediate file that will be sorted when all variants are analyzed.

silent

If no variants or headers should be printed to screen. This is mainly for testing.

whole_gene

If variants in introns should be considered when checking compound pairs. This will lead to a huge number of compound candidates/pairs when dealing with whole genome data.

keyword

This is the keyword in the vcf info field that genmod will look for when trying to find what genes the variant is annotated with. Default is Annotation since this is the keyword used by genmod annotate.