Arlequin(version 3.0) An integrated software package for population genetics data analysis
更新时间:2023-11-15 04:34:01 阅读量: 教育文库 文档下载
Evol Bioinform Online. 2005; 1: 47–50. Published online 2007 February 23. Copyright ? 2005 The authors.
PMCID: PMC2658868
Arlequin (version 3.0): An integrated software package for population genetics data analysis
Laurent Excoffier, Guillaume Laval, and Stefan Schneider
Computational and Molecular Population Genetics Lab, Zoological Institute, University of Berne, Baltzerstrasse 6, 3012 Berne, Switzerland
Correspondence: Laurent Excoffier, Tel: +41 31 631 30 31, Fax: +41 31 631 48 88, Email: laurent.excoffier@zoo.unibe.ch
This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0.
This article has been cited by other articles in PMC.
?
Other Sections▼
o AbstractIntroductionMethods implemented in ArlequinNew features in
Arlequin 3AvailabilityReferencesAbstract
Arlequin ver 3.0 is a software package integrating several basic and advanced methods for population genetics data analysis, like the computation of standard genetic diversity indices, the estimation of allele and haplotype frequencies, tests of departure from linkage equilibrium, departure from selective neutrality and demographic equilibrium, estimation or parameters from past population expansions, and thorough analyses of population subdivision under the AMOVA framework. Arlequin 3 introduces a completely new graphical interface written in C++, a more robust semantic analysis of input files, and two new methods: a Bayesian estimation of gametic phase from multi-locus genotypes, and an estimation of the parameters of an instantaneous spatial expansion from DNA sequence polymorphism. Arlequin can handle several data types like DNA sequences, microsatellite data, or standard multi-locus genotypes. A Windows version of the software is freely available on http://cmpg.unibe.ch/software/arlequin3.
Keywords: Computer package, population genetics, genetic data analysis, AMOVA, EM
algorithm, gametic phase estimation, spatial expansion ?
Other Sections▼
o AbstractIntroductionMethods implemented in ArlequinNew features in
Arlequin 3AvailabilityReferencesIntroduction
Most genetic studies on non-model organisms require a description of the pattern of diversity within and between populations, based on a variety of markers often including mitochondrial DNA (mtDNA) sequences and microsatellites. The genetic data are processed to extract information on the mating system, the extent of population subdivision, the past demography of the population, or on departure from selective neutrality at some loci. A series of computer packages have been developed in the last 10 years to assist researchers in performing basic population genetics analyses like Arlequin2 (Schneider et al. 2000), DNASP (Rozas et al. 2003), FSTAT (Goudet 1995), GENEPOP (Raymond and Rousset 1995b), or GENETIX (Belkhir et al. 2004). These programs have been widely used in the molecular ecology and conservation genetics community (Labate 2000; Luikart and England 1999; Schnabel et al. 1998). Among these, Arlequin is a very versatile (though not universal) program, and complements the other programs listed above. It can handle several data types like RFLPs, DNA sequences, microsatellite data, allele frequencies, or standard multi-locus genotypes, while allowing the user to carry out the same types of analyses irrespective of the data types.
We present here the version 3 of Arlequin with additional methods extending its capacities for the handling of unphased multi-locus genotypes and for the estimation of parameters of a spatial expansion. Note that these new developments are mainly implementations of new methodologies developed in our lab. We believe these methods will be useful to the research community, but we do not claim that alternative
methods implemented by other groups in other programs are inadequate. A new graphical interface has been developed to provide a better integration of the different analyses into a common framework, and an easier exploration of the data by performing a wide variety of analyses with different settings. The tight coupling of Arlequin with the simulation programs SIMCOAL2 (Laval and Excoffier 2004) and SPLATCHE (Currat et al. 2004) should also make it useful to describe patterns of genetic diversity under complex evolutionary scenarios.
?
Other Sections▼
o AbstractIntroductionMethods implemented in ArlequinNew features in
Arlequin 3AvailabilityReferencesMethods implemented in
Arlequin
Arlequin provides methods to analyse patterns of genetic diversity within and between population samples. Intra-population methods
?
Computation of different standard genetic indices, like the number of segregating sites, the number of different alleles, the heterozygosity, the base composition of DNA sequences, gene diversity, or the population effective size Ne scaled by the mutation rate μ as θ = 4Neu.
?
Maximum-likelihood estimation of allele and haplotype frequencies via the EM algorithm (Excoffier and Slatkin 1995).
?
Estimation of the gametic phase from multi-locus genotypes via the Excoffier-Laval-Balding (ELB) algorithm (Excoffier et al. 2003).
?
Estimation of the parameters of a demographic (Rogers and Harpending 1992; Schneider and Excoffier 1999) or a spatial (Excoffier 2004; Ray et al. 2003) expansion, from the mismatch distribution computed on DNA sequences.
?
Calculation of several measures of linkage disequilibrium (LD) like
D, D’, or r2 (Hedrick 1987), and test of non-random association of alleles at different loci when the gametic phase is known (Weir 1996) or unknown (Slatkin and Excoffier 1996).
?
Exact test of departure from Hardy-Weinberg equilibrium (Guo and Thompson 1992).
?
Computation of Tajima’s D (Tajima 1989) and Fu’s FS (Fu 1997) statistics, and test of their significance by coalescent simulations (Hudson 1990; Nordborg 2003) under the infinite-site model.
?
Tests of selective neutrality under the infinite-alleles model, like the Ewens-Watterson test (Slatkin 1996; Watterson 1978), and Chakraborty’s amalgamation test (Chakraborty 1990).
Inter-population methods
? ?
Search for shared haplotypes between populations
Analysis of population subdivision under the AMOVA framework (Excoffier 2003; Excoffier et al. 1992), with three hierarchical levels: genes within individuals, individuals within demes, demes within groups of demes. Computation of F-statistics like the local inbreeding coefficient FIS or the index of population differentiation FST.
?
Computation of genetic distances between populations related to the pairwise FST index (Gaggiotti and Excoffier 2000; Reynolds et al. 1983; Slatkin 1995).
?
Exact test of population differentiation (Goudet et al. 1996; Raymond and Rousset 1995a).
?
A simple assignment test of individual genotypes to populations according to their likelihood (Paetkau et al. 1997).
?
Computation of correlations or partial correlations between a set of 2 or 3 distance matrices (Mantel test: Smouse et al. 1986)
?
Other Sections▼
o AbstractIntroductionMethods implemented in ArlequinNew features in
Arlequin 3AvailabilityReferencesNew features in Arlequin 3 ?
Version 3 of Arlequin integrates the core computational routines and the interface in a single program written in C++ for the Windows environment. The interface has been entirely redesigned to provide better usability.
?
Incorporation of two new methods to estimate gametic phase and haplotype frequencies:
?
The ELB algorithm (Excoffier et al. 2003) is a pseudo-Bayesian approach aiming at reconstructing the gametic phase of multi-locus genotypes, and the estimation of the haplotype frequencies are a by-product of this process. Phase updates are made on the basis of a window of neighbouring loci, and the window size varies according to the local level of linkage disequilibrium.
?
The EM zipper algorithm, which is an extension of the EM algorithm for estimating haplotype frequencies (Excoffier and Slatkin 1995), aims at estimating the haplotype frequencies in unphased multi-locus genotypes. The estimation of the gametic phases are a by-product of this process. It proceeds by adding loci one at a time and progressively extending the length of the reconstructed haplo-types. With this method, Arlequin does not need to build all possible genotypes for each individual like in the conventional EM algorithm, but it only considers the genotypes whose sub-haplotypes have non-null estimated
正在阅读:
Arlequin(version 3.0) An integrated software package for population genetics data analysis11-15
美丽的校园一角作文500字07-08
执业医师基本技能操作试题及答案05-26
Excel表格中如何依据据身份证号码自动填出生日期01-31
食堂粗加工管理制度12-03
液化天然气的储存与运输技术现状探究06-02
英语三年级上册unit 4 we love animals08-13
第4课时 旋转 台儿庄 孙中玲03-06
浅谈小学生孝道教育 10-11
大学生创业与就业调查报告05-23
- 1Arlequin操作说明
- 2rehh——An R Package
- 3Population用法
- 4Module 9 Population 新
- 51.Population Aging
- 6Software Testing in the Cloud
- 7Error Analysis and Contrastive Analysis
- 8A package of Linux scripts for the parallelization of Monte
- 9version2.0
- 10COMP5318 Knowledge Discovery and Data Mining_2011 Semester 1_week3chap6_basic_association_analysis
- exercise2
- 铅锌矿详查地质设计 - 图文
- 厨余垃圾、餐厨垃圾堆肥系统设计方案
- 陈明珠开题报告
- 化工原理精选例题
- 政府形象宣传册营销案例
- 小学一至三年级语文阅读专项练习题
- 2014.民诉 期末考试 复习题
- 巅峰智业 - 做好顶层设计对建设城市的重要意义
- (三起)冀教版三年级英语上册Unit4 Lesson24练习题及答案
- 2017年实心轮胎现状及发展趋势分析(目录)
- 基于GIS的农用地定级技术研究定稿
- 2017-2022年中国医疗保健市场调查与市场前景预测报告(目录) - 图文
- 作业
- OFDM技术仿真(MATLAB代码) - 图文
- Android工程师笔试题及答案
- 生命密码联合密码
- 空间地上权若干法律问题探究
- 江苏学业水平测试《机械基础》模拟试题
- 选课走班实施方案
- population
- integrated
- Arlequin
- software
- genetics
- analysis
- version
- package
- data
- 3.0
- 公共场所卫生题库
- 多进程同步方法解决生产者-消费者问题
- cost accounting test bank chapter 15
- 简约室内毕业设计(论文)任务书
- 碧桂园人力资源信息化项目需求调研报告(e-HR)
- 学校“三自”校本课程实施方案
- 大学生心理健康教育复习提纲及答案,考题
- 新世纪大学英语综合教程2课后翻译题答案(含词组)
- 巢NEST 武汉别墅折页文案(博思堂)
- 安全文明施工方案整改 - 图文
- 秘书应用写作 - 图文
- 2017人大汉语国际教育硕士考研参考书是哪些
- 健康扶贫一点通试卷
- 隧道施工主要危险源的造成和预防措施
- 发展心理学 填空题 名词解释 问答题 总汇(有答案)
- 明实录穆宗实录
- 液态金属 - 图文
- 毕业环节社会实践(樊建坤)DOC - 图文
- 2016社会调查研究方法第2阶段测试题
- 应用写作教案