Imputation panel for low-pass whole genome sequencing (GLIMPSE2 format) (Q12289)

From MaRDI portal
Dataset published at Zenodo repository.

    Statements

    This dataset includes autosomal genotypes from the 1000 Genomes + HGDP project (10.1101/2023.01.23.525248) as well as X chromosome genotypes from the NY Genome Center (a comparable dataset that includes HGDP is not yet available for the X chromosome; see 10.1016/j.cell.2022.08.004). The genotypes were down-sampled so as to be appropriate for low-pass imputation: uncertain phase calls (any with PP tags) were removed, as were individuals deemed to be outliers or relatives (based on autosomal data, as per the first citation). Singleton polymorphisms were likewise excluded. Hemizygous genotypes on the X chromosome were converted into (quasi) diploid genotypes. These data were then converted into a binary imputation panel format using GLIMPSE v2 (https://odelaneau.github.io/GLIMPSE/; using the static binaries provided). The "chunk" size was doubled from the defaults (which consider a minimum number of SNPs, genetic length, and physical length) to improve performance.
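
Two of the filtering steps described above (singleton exclusion and conversion of hemizygous X genotypes) can be illustrated with a minimal Python sketch. It is not the pipeline used to build this panel: the function names are hypothetical, a singleton is assumed to mean a site with exactly one non-reference allele across all samples, and "quasi-diploid" is assumed to mean duplicating the single haploid allele into a phased homozygous genotype. Genotypes are represented as VCF-style GT strings.

```python
from typing import List


def alt_allele_count(genotypes: List[str]) -> int:
    """Count non-reference alleles across samples; missing alleles ('.') are ignored."""
    count = 0
    for gt in genotypes:
        # Treat phased ('|') and unphased ('/') separators the same way.
        for allele in gt.replace("/", "|").split("|"):
            if allele not in ("0", "."):
                count += 1
    return count


def is_singleton(genotypes: List[str]) -> bool:
    """Assumption: a singleton site carries exactly one non-reference allele."""
    return alt_allele_count(genotypes) == 1


def to_quasi_diploid(gt: str) -> str:
    """Assumption: a haploid call such as '1' becomes the phased homozygote '1|1'.

    Genotypes that are already diploid are returned unchanged.
    """
    if "|" in gt or "/" in gt:
        return gt
    return f"{gt}|{gt}"


if __name__ == "__main__":
    # Hypothetical X chromosome site: two diploid samples and one hemizygous sample.
    site = ["0|0", "0|1", "0"]
    print(is_singleton(site))
    print([to_quasi_diploid(gt) for gt in site])
```

In this sketch the singleton check is applied before diploidization, following the order in which the description lists the steps; applying it afterwards would count a hemizygous alternate allele twice.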
    15 November 2024
    v0.1
