library(fastqcr)
library(dplyr)
# Read all modules
qc <- qc_read(qc.path)
# Elements contained in the qc object
names(qc)
## [1] "summary" "basic_statistics" "per_base_sequence_quality" "per_tile_sequence_quality"
## [5] "per_sequence_quality_scores" "per_base_sequence_content" "per_sequence_gc_content" "per_base_n_content"
## [9] "sequence_length_distribution" "sequence_duplication_levels" "overrepresented_sequences" "adapter_content"
## [13] "kmer_content" "total_deduplicated_percentage"
The summary lists the modules that were tested and the status of each test: PASS (normal), WARN (slightly abnormal), or FAIL (very unusual).
Some experiments are expected to produce libraries that are biased in particular ways. Treat the summary evaluations as pointers to where you should concentrate your attention, and work out why your library may not look normal.
qc_plot(qc, "summary")
status <chr> | module <chr> | sample <chr>
---|---|---
PASS | Basic Statistics | S1.fastq
PASS | Per base sequence quality | S1.fastq
PASS | Per tile sequence quality | S1.fastq
PASS | Per sequence quality scores | S1.fastq
FAIL | Per base sequence content | S1.fastq
WARN | Per sequence GC content | S1.fastq
PASS | Per base N content | S1.fastq
WARN | Sequence Length Distribution | S1.fastq
PASS | Sequence Duplication Levels | S1.fastq
PASS | Overrepresented sequences | S1.fastq
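To turn the summary into a short list of things to inspect, one option is to keep only the modules that did not pass. The sketch below rebuilds a few rows of the table above as a plain data frame so that it runs on its own. The name `qc_summary` is a hypothetical stand-in; it assumes, based on the `names(qc)` output and the table above, that the real table is available as `qc$summary` with the same three columns.

```r
# Hypothetical stand-in for qc$summary, copied from a few rows of the table above
qc_summary <- data.frame(
  status = c("PASS", "FAIL", "WARN", "WARN", "PASS"),
  module = c("Basic Statistics",
             "Per base sequence content",
             "Per sequence GC content",
             "Sequence Length Distribution",
             "Sequence Duplication Levels"),
  sample = "S1.fastq",
  stringsAsFactors = FALSE
)

# Keep only the modules that need a closer look
flagged <- qc_summary[qc_summary$status != "PASS", ]

# Put failures ahead of warnings
flagged <- flagged[order(match(flagged$status, c("FAIL", "WARN"))), ]

print(flagged, row.names = FALSE)
```

Since `dplyr` is already loaded, `filter(qc_summary, status != "PASS")` should give the same subset; base R is used here only to keep the sketch free of dependencies.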
Basic statistics shows basic data metrics such as the total number of sequences, the sequence length, and the GC content:
qc_plot(qc, "Basic statistics")
Measure <chr> | Value <chr>
---|---
Filename | S1.fastq
File type | Conventional base calls
Encoding | Sanger / Illumina 1.9
Total Sequences | 50299587
Sequences flagged as poor quality | 0
Sequence length | 35-76
%GC | 48
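Because the Value column mixes text and numbers, its entries are character strings and need converting before any arithmetic. The following is a minimal sketch of that conversion, using a hand-built copy of three rows from the table above. The names `basic_stats` and `get_value` are hypothetical; the sketch assumes the real table (presumably `qc$basic_statistics`, going by the `names(qc)` output) has the same `Measure` and `Value` columns.

```r
# Hypothetical stand-in for the basic statistics table shown above
basic_stats <- data.frame(
  Measure = c("Total Sequences", "Sequence length", "%GC"),
  Value   = c("50299587", "35-76", "48"),
  stringsAsFactors = FALSE
)

# Look up one measure by name and return its value as a string
get_value <- function(tbl, measure) {
  tbl$Value[tbl$Measure == measure]
}

total_reads <- as.numeric(get_value(basic_stats, "Total Sequences"))
gc_percent  <- as.numeric(get_value(basic_stats, "%GC"))

# "35-76" is a range; split it into minimum and maximum read length
len_range <- as.integer(
  strsplit(get_value(basic_stats, "Sequence length"), "-", fixed = TRUE)[[1]]
)

cat("Total reads (millions):", round(total_reads / 1e6, 1), "\n")
cat("Read length range:", min(len_range), "to", max(len_range), "\n")
cat("GC content (%):", gc_percent, "\n")
```

If every read has the same length, the "Sequence length" value is a single number rather than a range, in which case `min(len_range)` and `max(len_range)` simply coincide.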
qc_plot(qc, "Per base sequence quality")
qc_plot(qc, "Per sequence quality scores")
qc_plot(qc, "Per base sequence content")
qc_plot(qc, "Per sequence GC content")
qc_plot(qc, "Per base N content")
qc_plot(qc, "Sequence length distribution")
qc_plot(qc, "Sequence duplication levels")
qc_plot(qc, "Overrepresented sequences")
qc_plot(qc, "Adapter content")
qc_plot(qc, "Kmer content")