• Date: 2017-04-10
  • Scanned QC directory (qc.path): /Users/kassambara/Documents/R/MyPackages/fastqcr/inst/fastqc_results
  • R packages: Report generated with the R package fastqcr version 0.1.0
  • Experiment description: Sequencing data

Required R packages

library(fastqcr)
library(dplyr)

Aggregating Reports

qc <- qc_aggregate(qc.path, progressbar = FALSE)
qc

Column names:

  • sample: sample names
  • module: fastqc modules
  • status: fastqc module status for each sample
  • tot.seq: total sequences (i.e.: the number of reads)
  • seq.length: sequence length
  • pct.gc: percentage of GC content
  • pct.dup: percentage of duplicate reads

The table shows, for each sample, the names of tested FastQC modules, the status of the test, as well as, some general statistics including the number of reads, the length of reads, the percentage of GC content and the percentage of duplicate reads.

Summary

summary(qc)

Column names:

  • module: fastqc modules
  • nb_samples: the number of samples tested
  • nb_pass, nb_fail, nb_warn: the number of samples that passed, failed and warned, respectively.
  • failed, warned: the name of samples that failed and warned, respectively.

The table shows, for each FastQC module, the number and the name of samples that failed or warned.

General statistics

qc_stats(qc)

Column names:

  • pct.dup: the percentage of duplicate reads,
  • pct.gc: the percentage of GC content,
  • tot.seq: total sequences or the number of reads and
  • seq.length: sequence length or the length of reads.

The table shows, for each sample, some genral statistics such as the total number of reads, the length of reads, the percentage of GC content and the percentage of duplicate reads

Inspecting Problems

Failed modules in the most samples

qc_fails(qc, "module")

For each module, the number of problems (failures) and the name of samples, that failed, are shown.

Warned module in the most samples

qc_warns(qc, "module")

Samples with Problems

  • Compact format
qc_problems(qc, "sample")
  • Stretched format with details
qc_problems(qc, "sample", compact = FALSE)