Creating a coverage report for one or more samples based on a list of genes

An example of this report is shown by the demo report endpoint: http://0.0.0.0:8000/report/demo (requires all genes and gene transcripts in build GRCh37 loaded into the database).

image

The statistics from the report above are computed using the transcripts intervals present in the provided genes.

Given an application running in production settings, with genes, transcripts and exons loaded into the database, a coverage report like the one above can be created taking into account one of these types of intervals from any gene of choice.

The gene list might be provided as a list of Ensembl gene IDs, HGNC gene IDs or HGNC gene symbols.

Here is what the request data of a POST request to the /report endpoint looks like:

{
  "build": "GRCh37",
  "completeness_thresholds": [
    10,
    15,
    20,
    50,
    100
  ],
  "ensembl_gene_ids": [],
  "hgnc_gene_ids": [],
  "hgnc_gene_symbols": [],
  "interval_type": "genes",
  "default_level": 10,
  "panel_name": "Custom panel",
  "case_display_name": "internal_id",
  "samples": [
    {
      "name": "string",
      "coverage_file_path": "string",
      "case_name": "string",
      "analysis_date": "2023-10-04T08:22:06.980106"
    }
  ]
}

Mandatory parameters

  • build either "GRCh37" or "GRCh38"
  • ensembl_gene_ids OR hgnc_gene_ids OR hgnc_gene_symbols either a list of strings (for Ensembl IDs or HGNC symbols or a list of integers for HGNC IDs
  • interval_type either "genes", "transcripts" or "exons", depending on the analysis type (WGS, WES, WTS)
  • default_level default coverage threshold level to take into account for displaying the number of fully covered or partially covered intervals
  • samples mandatory parameters are name and either coverage_file_path or case_name

Optional parameters

  • completeness_thresholds the list of different coverage thresholds that will be used to calculate gene coverage completeness. If not provided, the default threshold values will be: 10, 15, 20, 50, 100
  • panel_name Add a name only if the provided genes are part of a gene panel
  • case_display_name provide this string parameter if the case belong to the same case / group
  • samples analysis_date if not provided with be set to current date

Creating a genes coverage overview report to show incompletely covered genomic intervals

This type of report contains stats from incompletely covered genomic intervals at different coverage thresholds and is basically the same as the genes overview report provided by chanjo-report.

A demo genes overview report based on genes transcripts from the demo PanelApp genes is available at the demo overview endpoint: http://0.0.0.0:8000/overview/demo (requires all genes and gene transcripts in build GRCh37 loaded into the database).

image

To create a custom genes coverage overview report, send a POST request to the /overview endpoint containing the same request data described above for the /report endpoint.

Creating coverage overview over MANE transcripts for a list of genes

image

This report contains statistics over all MANE Select and Mane Plus Clinical transcripts for a list of genes provided by the user.

The /mane_overview endpoint accepts POST request with the same data described for the 2 reports above.

Note that MANE overview reports are available only for analyses run with genome build GRCh38.