Could not read base quality scores

Describe the issue: The base quality scores of the reads cannot be read and, as a result, no variants are reported. However, I can see these values in the reads in IGV. How is it that these values cannot be read? This is the line with the error, which repeats one after another.

2021-03-26 19:12:43.550815: W third_party/nucleus/io/sam_reader.cc:534] Could not read base quality scores m64036_210113_122249/147655225/ccs: Not found: Could not read base quality scores
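
A possible first diagnostic for this warning is to check whether the alignment records actually store base qualities. In SAM, the QUAL field is column 11 and holds `*` when no qualities are stored. The sketch below counts such records from SAM text on stdin. The helper name `count_missing_qual` is hypothetical and not part of DeepVariant or samtools, and the thread shown here does not confirm that missing qualities were the cause in this case.

```shell
# Hypothetical helper (not from the issue thread): count SAM records whose
# QUAL field (column 11) is "*", i.e. records that store no base qualities.
# Reads SAM text on stdin, skips header lines starting with "@", and prints
# "<records without QUAL> <total records>".
count_missing_qual() {
  awk -F'\t' '!/^@/ { total++; if ($11 == "*") missing++ }
              END { printf "%d %d\n", missing + 0, total + 0 }'
}

# Example use, assuming samtools is installed and file.bam is your input:
#   samtools view -h file.bam | head -n 10000 | count_missing_qual
```

If the first number equals the second, none of the sampled records carries base qualities.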

Setup

  • Operating system: Ubuntu 20.04
  • DeepVariant version: 1.1.0 (latest)
  • Installation method: docker
  • Type of data: PacBio HiFi. BAM files aligned to the reference with minimap2 -ax map-pb.

Steps to reproduce:

  • Command:
  docker run \
    -v /home/user/working_directory:/input \
    -v /home/user/working_directory:/output \
    google/deepvariant:1.1.0 \
    /opt/deepvariant/bin/run_deepvariant \
    --model_type=PACBIO \
    --ref=/input/reference.fa \
    --reads=/input/file.bam \
    --output_vcf=/output/file.vcf \
    --call_variants_extra_args="use_openvino=true" \
    --num_shards=4 \
    --logging_dir=/output/logs

Does the quick start test work on your system? Yes. The test works without problem.

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 14

Top GitHub Comments

2 reactions
elcortegano commented, Apr 8, 2021

I see! I am very grateful for the support. Not only is the problem solved, but everything is much clearer now, and I have learned a lot from your feedback. Thank you.

2 reactions
AndrewCarroll commented, Apr 9, 2021

Hi @elcortegano

Billy has provided some excellent, detailed answers. I’ll just add a little bit of context to your question:

“Our understanding deepvariant is designed for read alignments (and not assembly-to-reference alignments as achieved with minimap2 -ax asm

This is correct: DeepVariant is for read alignments, not assembly-to-reference alignments. Because minimap2 was originally written when most PacBio data was CLR, the name map-pb was chosen for that preset. When CCS became more common, the minimap2 authors made a new recommendation to use the assembly parameters with a sequence divergence tuned to HiFi data (this is the 20 part of asm20). If you search for the string `ccs` in the minimap2 GitHub repository, this should pull up the line.

So in short, DeepVariant is for read alignments, and the parameter for read mapping HiFi reads happens to manifest in minimap2 as minimap2 -ax asm20.
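
As a rough sketch of what that recommendation looks like on the command line, the snippet below prints (without running) an alignment pipeline that uses the asm20 preset. The function name `hifi_align_cmd` and all file names are hypothetical placeholders. Piping into `samtools sort` is an assumption about the workflow, not something stated in the thread.

```shell
# Hypothetical helper: print, but do not run, a minimap2 + samtools pipeline
# that maps HiFi reads with the asm20 preset discussed above.
# Arguments: reference FASTA, reads file, output BAM (all placeholders).
# Note: FASTQ input carries per-base qualities; FASTA input has none.
hifi_align_cmd() {
  ref=$1
  reads=$2
  out=$3
  printf 'minimap2 -ax asm20 %s %s | samtools sort -o %s\n' "$ref" "$reads" "$out"
}

# Show the command for the file names used in the issue's docker invocation.
hifi_align_cmd reference.fa reads.fastq file.bam
```

Printing the command first makes it easy to review the preset and paths before committing to a long alignment run.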
