AGC archives of human and SARS-CoV-2 genomes (Q12938)

From MaRDI portal
Dataset published at Zenodo repository.
Language Label Description Also known as
English
AGC archives of human and SARS-CoV-2 genomes
Dataset published at Zenodo repository.

    Statements

    0 references
    AGC is a tool to compress a collection of similar genomes. This Zenodo record provides pre-built AGC-3.0 archives of several datasets: File HPRC-yr1.agc contains CHM13 and 94 haploid human assemblies released by HPRCin 2021. The telomere-to-telomereCHM13 v2pluschrY from GRCh38 is used as the reference genome. File sars-cov-2_ncbi-620k.agc contains 619,750 complete SARS-CoV-2 genomes withNC_045512.2 as the reference. It wascreated with AGC command line agc create -cb10000 -s3000.SARS-CoV-2 genomes were downloaded from NCBI at the end of year 2021. The original FASTA is provided as sars-cov-2_ncbi-620k.fa.xz.
    0 references
    2 March 2023
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references