Info • Scenedesmus obliquus UTEX 3031 haplotype 1

Status

The Scenedesmus obliquus UTEX 3031 (DOE0152z) genome sequence and gene models were not determined by the Joint Genome Institute (JGI), but were provided by collaborators at Los Alamos National Laboratory (LANL). The genome assembly of the previously published S. obliquus UTEX 3031 assembly (Starkenburg et al., 2017) was improved and scaffolded using a combination of PacBio HiFi, Oxford Nanopore, and Illumina Hi-C reads.

This portal contains the genomic data from haplotype 1 of Scenedesmus obliquus UTEX 3031.

All published models from this haplotype are available as ExternalModels. In order to ensure this genome is comparable to those sequenced and annotated by the JGI, we applied standard filters to ExternalModels to produce the initial GeneCatalog. A total of 3,744 external models were excluded based on one of the following classifications: 1) association with repetitive elements, 2) pseudogenes with internal stop codons, 3) alternative isoforms or overlapping transcript models, 4) alleles on secondary scaffolds, and 5) short models lacking functional annotation. Please note that this copy of the genome is not maintained by Starkenburg et al. and is therefore not automatically updated. In order to allow comparative analyses with other algal genomes sequenced by the JGI, a copy of this genome is incorporated into PhycoCosm. The JGI Annotation Pipeline was used to add functional annotation to this genome.

Genome Assembly
Genome Assembly size (Mbp) 101.59
Sequencing read coverage depth
# of contigs 17
# of scaffolds 17
# of scaffolds >= 2Kbp 17
Scaffold N50 7
Scaffold L50 (Mbp) 6.12
# of gaps 0
% of scaffold length in gaps 0.0%
Three largest Scaffolds (Mbp) 11.14, 8.15, 7.69


Gene Models FilteredModels1
length (bp) of: average median
gene 4564 3493
transcript 1496 1188
exon 173 135
intron 404 317
description:
protein length (aa) 499 396
exons per gene 8.64 7
# of gene models 11826


Collaborators

Genome Reference(s)

Links

Funding

This project was not sequenced at the JGI.