Imputation panel for low-pass whole genome sequencing (GLIMPSE2 format) (Q12289)
From MaRDI portal
Dataset published at Zenodo repository.
Language | Label | Description | Also known as |
---|---|---|---|
English | Imputation panel for low-pass whole genome sequencing (GLIMPSE2 format) |
Dataset published at Zenodo repository. |
Statements
This dataset includes autosomal genotypes from the 1000 Genomes +HGDP project (10.1101/2023.01.23.525248 ) as well as X chromosome genotypes from the NY Genome Center (as of yet, a comparable dataset that includes HGDP is not available for the X; see 10.1016/j.cell.2022.08.004). The genotypes were down-sampled so as to be appropriate for low-pass imputation; uncertain phase calls were removed (any PP tags), and individuals deemed to be outliers or relatives (based on autosomal data, as per the first citation) were also removed. Similarly, singleton polymorphisms were also excluded. Hemizygous genotypes on the X were converted into (quasi) diploid genotypes. These data were then converted into a binary imputation panel format using glimpse v2 (https://odelaneau.github.io/GLIMPSE/; using the static binaries provided). The "chunk" size was doubled from the defaults (which considers a minimum number of snps, genetic length and physical length) so as to be more performant.
0 references
15 November 2024
0 references
v0.1
0 references