Unmapped reads from cattle RNAseq data: A source for missing and misassembled sequences in the reference assemblies and for detection of pathogens in the host
Unmapped reads from transcriptome sequencing data can provide information regarding the presence of microorganisms in mammalian samples. We found that several parasite and virus genome reference assemblies in NCBI were contaminated with bovine DNA. We confirmed recombination of bovine genomic DNA into BVD virus strains. De novo assembled contigs of unknown unmapped reads demonstrated incomplete bovine reference genome assemblies.