Overview of Admixture Mapping - Daniel Shriner

djoser-xyyman
Vizier

Without data you are just another person with an opinion - Deming

Posts: 3,268

Post by djoser-xyyman on Apr 11, 2018 9:40:45 GMT -5

Overview of Admixture Mapping - Daniel Shriner

Center for Research on Genomics and Global Health, National Human Genome Research
Institute, Bethesda, Maryland

admixture began, or the rate of gene flow.
Replication and Follow-Up
Admixture mapping is fundamentally a statistical procedure, like linkage analysis or
association testing.

Admixture mapping is a powerful method of gene mapping for diseases or traits that show
differential risk by ancestry. Admixture mapping has been applied most often to African
Americans who trace ancestry to Europeans and West Africans. Recent developments in
admixture mapping include improvements in methods to take advantage

There are currently two genetic maps that are useful for
admixture mapping. One map is specific for African Americans (Hinch et al., 2011). The
other map is combined over multiple continental populations and is therefore generic
(http://mathgen.stats.ox.ac.uk/impute/ALL_1000G_phase1integrated_v3_impute.tgz).

Last Edit: Apr 11, 2018 10:05:44 GMT -5 by djoser-xyyman

Post by djoser-xyyman on Apr 11, 2018 10:13:15 GMT -5

www.soph.uab.edu/ssg/linkage/stratification

Post by djoser-xyyman on Apr 11, 2018 10:27:18 GMT -5

A survey of tools for variant analysis of next-generation genome sequencing data
Stephan Pabinger, Andreas Dander, Maria Fischer, Rene Snajder, Michael Sperk, Mirjana Efremova,
Birgit Krabichler, Michael R. Speicher, Johannes Zschocke and Zlatko Trajanoski

Post by djoser-xyyman on Apr 11, 2018 10:31:42 GMT -5

Post by djoser-xyyman on Apr 11, 2018 10:37:54 GMT -5

NGS VARIANT ANALYSIS WORKFLOW
NGS platforms
NGS instruments provide higher throughput at an
unprecedented speed by sequencing millions of short
DNA fragments in parallel [50, 51]. Currently, the
three most commonly used platforms are Roche 454
(introduced in 2005), Illumina (launched in 2006)
and ABI SOLiD (followed in 2008). All three platforms
sequence DNA by measuring and analyzing
signals, which are emitted during the creation of
the second DNA strand, but differ in how the
second strand is generated.
In order to produce detectable signals, template
DNA is fragmented into small pieces, amplified and
immobilized on a glass slide before sequencing.
Roche 454 implements pyrosequencing, which
measures released pyrophosphates allowing the analysis
of read fragments up to a few hundred base
pairs. Since this technique infers the number of
incorporated nucleotides from the signal’s intensity,
the system experiences problems when homopolymer
stretches longer than 8 bp are sequenced [52].
This complicates identification of small insertions and
deletions. Illumina applies a sequencing-by-synthesis
approach where only 1 nt per sequencing cycle is
incorporated using reversible dye terminators.
Thereby, it avoids homopolymer calling problems
at the cost of being capable of sequencing only
shorter fragments. ABI SOLiD analyzes DNA by
ligating fluorescently labeled di-base probes to the
first strand, requiring reading each base twice. Due
to the nature of this approach, identified calls are not
stored in nucleotide but in color space—a property
that needs to be considered in downstream analyses.
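
To make the color-space remark concrete, here is a minimal Python sketch of di-base encoding. It assumes the commonly described scheme in which each color is the XOR of two-bit codes for adjacent bases (A=0, C=1, G=2, T=3); the function names are hypothetical and are not taken from any vendor software.

```python
# Two-bit base codes (an assumption of this sketch); the XOR of two
# adjacent bases gives the color digit 0-3.
BASE_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
CODE_BASE = {code: base for base, code in BASE_CODE.items()}


def encode_color_space(seq):
    """Return the first base followed by one color digit per adjacent base pair."""
    colors = [BASE_CODE[a] ^ BASE_CODE[b] for a, b in zip(seq, seq[1:])]
    return seq[0] + "".join(str(c) for c in colors)


def decode_color_space(encoded):
    """Rebuild the nucleotide sequence from a leading base and its color digits."""
    bases = [encoded[0]]
    for digit in encoded[1:]:
        # Each color, combined with the previous base, determines the next base.
        bases.append(CODE_BASE[BASE_CODE[bases[-1]] ^ int(digit)])
    return "".join(bases)


print(encode_color_space("ATGGC"))   # A3103
print(decode_color_space("A3103"))   # ATGGC
```

Because every decoded base depends on the one before it, a single wrong color changes all later bases after naive decoding (for example, "A3003" decodes to "ATTTA"), which is one reason downstream tools need to be aware of color space.
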
Depending on library preparation and sequencing
technology, it is possible to sequence reads that are of
a known chromosomal distance [26]. These so-called
paired-end or mate-pair reads provide additional information
which can be used for enhancing mapping
accuracy and identifying structural rearrangements
[53].
After completing laboratory work and the actual
sequencing, the researcher is confronted with a huge
amount of raw data. The analysis of the data can be
decomposed into five distinct steps (Figure 1): (i)
quality assessment of the raw data, (ii) read alignment
to a reference genome, (iii) variant identification, (iv)
annotation of the variants and (v) data visualization.
In the following paragraphs, we briefly explain each
of the steps and review available software tools. The
initial list of analysis tools was acquired by performing
multiple PubMed searches. Furthermore, we
conducted additional Internet searches to identify
tools not indexed by PubMed. An overview of the
surveyed tools is given in Supplementary Tables (see
also icbi.at/ngs_survey).
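
As a rough illustration of step (i), quality assessment of the raw data, the Python sketch below computes a per-read mean quality and drops low-quality reads. It assumes Phred+33 encoded quality strings and an arbitrary threshold of 20; the function names and the threshold are hypothetical and do not come from any tool in the survey.

```python
def phred_scores(quality_string, offset=33):
    """Convert a FASTQ quality string to integer Phred scores (Phred+33 assumed)."""
    return [ord(ch) - offset for ch in quality_string]


def mean_quality(quality_string, offset=33):
    """Mean Phred score of one read; the quality string must be non-empty."""
    scores = phred_scores(quality_string, offset)
    return sum(scores) / len(scores)


def filter_reads(records, min_mean_quality=20.0):
    """Keep (name, sequence, quality) records whose mean quality meets the threshold."""
    return [rec for rec in records if mean_quality(rec[2]) >= min_mean_quality]


reads = [
    ("read1", "ACGT", "IIII"),  # 'I' is Phred 40 under the +33 offset
    ("read2", "ACGT", "####"),  # '#' is Phred 2 under the +33 offset
]
print([name for name, _, _ in filter_reads(reads)])  # ['read1']
```

Real quality-assessment tools look at much more than the mean (per-position quality, adapter content, duplication), so this is only a sketch of the idea.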

Post by djoser-xyyman on Apr 11, 2018 13:09:49 GMT -5

2.3 Do I need to thin the marker set for linkage disequilibrium?
We tend to believe this is a good idea, since our model does not explicitly take LD into
consideration, and since enormous data sets take more time to analyze. It is impossible to
"remove" all LD, especially in recently-admixed populations, which have a high degree of
"admixture LD". Two approaches to mitigating the effects of LD are to include markers
that are separated from each other by a certain genetic distance, or to thin the markers
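
The first approach, keeping only markers separated by a minimum genetic distance, could be sketched in Python as below. This is a simple greedy pass, not the procedure of any particular program; the tuple layout, the function name, and the 0.1 cM spacing are assumptions made for illustration.

```python
def thin_by_genetic_distance(markers, min_gap_cm):
    """Greedily keep markers at least min_gap_cm apart on each chromosome.

    markers: iterable of (marker_id, chromosome, position_in_cM) tuples.
    Returns the kept marker ids, ordered by chromosome and position.
    """
    kept = []
    last_kept_pos = {}  # chromosome -> position of the last marker kept
    for marker_id, chrom, pos in sorted(markers, key=lambda m: (m[1], m[2])):
        if chrom not in last_kept_pos or pos - last_kept_pos[chrom] >= min_gap_cm:
            kept.append(marker_id)
            last_kept_pos[chrom] = pos
    return kept


example = [
    ("rs1", "1", 0.00),
    ("rs2", "1", 0.05),
    ("rs3", "1", 0.12),
    ("rs4", "1", 0.15),
    ("rs5", "2", 0.01),
]
print(thin_by_genetic_distance(example, min_gap_cm=0.1))  # ['rs1', 'rs3', 'rs5']
```

A greedy pass always keeps the first marker on each chromosome, so the result depends on where the scan starts; that is usually acceptable when the goal is only to reduce LD.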

2.4 How many markers do I need to supply to ADMIXTURE?
This depends on how genetically differentiated your populations are, and on what you
plan to do with the estimates. It has been noted elsewhere [4] that the number of markers
needed to resolve populations in this kind of analysis is inversely proportional to the genetic
distance (FST) between the populations.
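
The inverse proportionality can be turned into a back-of-the-envelope scaling rule. The Python sketch below assumes you already know a marker count that worked at some reference FST; the constant of proportionality is not given in the excerpt, so the numbers in the example are purely hypothetical.

```python
def scaled_marker_count(reference_markers, reference_fst, target_fst):
    """Scale a known-adequate marker count to a different FST.

    Assumes markers_needed is proportional to 1 / FST, so halving FST
    doubles the number of markers required.
    """
    if reference_fst <= 0 or target_fst <= 0:
        raise ValueError("FST values must be positive")
    return reference_markers * reference_fst / target_fst


# Hypothetical numbers: if 10,000 markers sufficed at FST = 0.25, then
# populations at FST = 0.125 would need roughly twice as many.
print(scaled_marker_count(10_000, 0.25, 0.125))  # 20000.0
```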

Viewing
these reference individuals as training samples, the problem is transformed into a supervised
learning problem.
Supervised learning mode is enabled with the flag --supervised and requires an additional
file with a .pop suffix, specifying the ancestries of the reference individuals. It is assumed
that all reference samples have 100% ancestry from some ancestral population. Each line
of the .pop file corresponds to the individual listed on the same line number in the .fam or
.ped file.
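
A .pop file that lines up with a .fam file could be generated along these lines. This Python sketch makes several assumptions: that the individual ID is the second whitespace-separated column of each .fam line (the usual PLINK layout), that non-reference individuals are marked with "-", and that the population labels and IDs shown are made-up examples.

```python
def build_pop_lines(fam_lines, reference_labels, unknown="-"):
    """Return one .pop entry per .fam line, in the same order.

    fam_lines: lines of a .fam file (individual ID assumed in column 2).
    reference_labels: dict mapping individual ID -> population label
                      for the reference (training) individuals.
    unknown: marker for individuals whose ancestry is to be estimated
             (the "-" default is an assumption of this sketch).
    """
    pop_lines = []
    for line in fam_lines:
        fields = line.split()
        if len(fields) < 2:
            raise ValueError(f"malformed .fam line: {line!r}")
        individual_id = fields[1]
        pop_lines.append(reference_labels.get(individual_id, unknown))
    return pop_lines


fam = [
    "F1 ind1 0 0 1 -9",
    "F2 ind2 0 0 2 -9",
    "F3 ind3 0 0 1 -9",
]
labels = {"ind1": "POP_A", "ind3": "POP_B"}  # hypothetical reference panel
print(build_pop_lines(fam, labels))  # ['POP_A', '-', 'POP_B']
```

Writing the returned lines, one per row, to a file with the same base name as the genotype file and a .pop suffix keeps the line-by-line correspondence described above.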

zarahan
Nomarch
Global Moderator

Posts: 2,098

Post by zarahan on Apr 12, 2018 22:06:10 GMT -5

What are the implications of the above for the study of African diversity?

Note: I am not an "Egyptologist" as claimed by some still bitter, defeated, trolls creating fake profiles and posts elsewhere. You still fail..
