rehh——An R Package
更新时间:2023-07-26 11:37:01 阅读量: 实用文档 文档下载
- 惹婚推荐度:
- 相关推荐
正向选择,人类遗传学,群体遗传学
rehh : An R Package吴珂皓 2012年3月29日星期四
正向选择,人类遗传学,群体遗传学
rehh An R package to detect footprints of selection in genome-wide SNPdata from haplotype structure
institut national de la recherche agronomique ,France
正向选择,人类遗传学,群体遗传学
the Brief Introduction of rehh An R packageTo detect the footprint of selection Based on SNP data
Using EHH(Extended Homozygosity Haplotype)Including computation: EHH(Extended Homozygosity Haplotype) iHS(within population) Rsb(across pairs of populations) ….
正向选择,人类遗传学,群体遗传学
About positive selection
positive selection purifying selection
balancing selection
正向选择,人类遗传学,群体遗传学
About EHH EHH: Extended Homozygosity HaplotypePresented firstly by Pardis C. Sabeti the probability that two randomly chosen haplotypes carrying the
candidate core haplotype are homozygous for the entire intervalspanning the core region to a given locus (Sabeti P.C et al. 2002).
Sabeti PC, et al. Detecting recent positive selection in the human genome from haplotype structure.
正向选择,人类遗传学,群体遗传学
About EHH12 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 0
A
G
C
T
12 3 4 5 6 7 8 9
正向选择,人类遗传学,群体遗传学
正向选择,人类遗传学,群体遗传学
About REHHRelative EHH is the ratio of the EHH on the tested core haplotype compared with the EHH of the grouped set of core haplotypes at the region not including the core haplotype tested.
正向选择,人类遗传学,群体遗传学
About iHH integrated EHH (iHH) : summed over both directions away from the coreSNP
The expectation and standard deviation of ln(iHHA/iHHD) are estimated from the empirical distribution at SNPs whose derived allele frequency p matches the frequency at the core SNP.
正向选择,人类遗传学,群体遗传学
About EHHS EHHS: decay of EHH of an individual SNP site
正向选择,人类遗传学,群体遗传学
About iES
正向选择,人类遗传学,群体遗传学
About Rsb
正向选择,人类遗传学,群体遗传学
About Input File
SNP information file
SNP rs6718902 ……..
chromosome 2
base position 191838204
ancestral allele 1
derived allele 2
genotype data file
name 1 2 ……….
SNP1 1 1
SNP2 2 2
SNP3 2 1
SNP4 1 1
正向选择,人类遗传学,群体遗传学
About Input File
data2haplohh()CHI<-data2haplohh("CHI.hap","CHI.inp",min_maf=0,min_perc_geno.hap=100,min_perc_geno.snp=100,=NA,popsel=NA,recode.allele=FALSE)
parametero
min_maf
SNPs displaying a MAF<min_maf will be discard
o
min_perc_geno.hapmin_perc_geno.snp popsel recode.allele
Haplotypes with less than min_perc_geno.hapSNPs genotyped on less than min_perc_geno.snp
percent SNPs genotyped will be discardo
percent haplotypes will be discardo o o
name of chromosome code of population considered if true ,alleles will be recoded according to the map file
正向选择,人类遗传学,群体遗传学
About calculation calc_ehh()calc_ehhs()
EHH and iHH computationEHHS and iES computation
正向选择,人类遗传学,群体遗传学
About calculation scan_hh()encode in C 140 individuals , 1424 SNPs 3.2GHz 3.3second
scan_hh(CHI,limhaplo=2,limehh=0.05,limehhs=0.05)o o o
limhaplo limehh
minimal number of haplotypes limit below which EHH stops to be evaluated
limehhs
limit below which EHHS stops to be evaluated
正向选择,人类遗传学,群体遗传学
About calculation ies2rsb(hh_pop1,hh_pop2,popname1=NA,popname2=NA,method="b
ilateral")Compute Rsb ( standardized ratio of iES from two populations ) hh_pop1o
a matrix with nsnps rows and six columns – – – – – – chromosome name position frequency of ancestral alleles iHH of ancestral alleles iHH for the derived allele iES
popname1 method
name of population bilateral or unilateral
正向选择,人类遗传学,群体遗传学
About calculation ihh2ihs(res_ihh,freqbin=0.025,minmaf=0.05)Compute his ( standardized iHH)o
res_ihh
a matrix with nsnps rows and six columns – chromosome name – position – frequency of ancestral alleles – iHH of ancestral alleles – iHH for the derived allele – iES
o
freqbin Size of the bin to standardize log(iHH1/iHH2) according to the underlying Derived
Allele frequency. Allele frequency bins vary from minmaf to 1-minmaf per step of size freqbin. Iffreqbin is set to 0 (e.g. in the case of a large number of SNPs and few haplotypes), standardization is performed considering each observed frequency as a frequency class.o
minmaf SNPs with a MAF lower than minmaf will be discard
正向选择,人类遗传学,群体遗传学
About calculation distribplot(data,col=c("blue","red"),main="his distribution",xlab="iHS")
Plot the observed distribution of standarized iHS or Rsb values together with the expected standard Gaussian distribution
正在阅读:
rehh——An R Package07-26
购销合同(样本)(适用单次或多次采购化肥、食品、等各种消耗品)-修订15012610-11
食品安全与日常饮食网课答案10-25
党员自我批评4篇02-13
中文CorelDRAW 基础教程第4章06-11
中考语文复习模拟试卷 doc(13)03-27
江南三月的雨作文600字06-22
接触网常见故障分析及对策06-03
职业生涯人物访谈报告06-25
- 教学能力大赛决赛获奖-教学实施报告-(完整图文版)
- 互联网+数据中心行业分析报告
- 2017上海杨浦区高三一模数学试题及答案
- 招商部差旅接待管理制度(4-25)
- 学生游玩安全注意事项
- 学生信息管理系统(文档模板供参考)
- 叉车门架有限元分析及系统设计
- 2014帮助残疾人志愿者服务情况记录
- 叶绿体中色素的提取和分离实验
- 中国食物成分表2020年最新权威完整改进版
- 推动国土资源领域生态文明建设
- 给水管道冲洗和消毒记录
- 计算机软件专业自我评价
- 高中数学必修1-5知识点归纳
- 2018-2022年中国第五代移动通信技术(5G)产业深度分析及发展前景研究报告发展趋势(目录)
- 生产车间巡查制度
- 2018版中国光热发电行业深度研究报告目录
- (通用)2019年中考数学总复习 第一章 第四节 数的开方与二次根式课件
- 2017_2018学年高中语文第二单元第4课说数课件粤教版
- 上市新药Lumateperone(卢美哌隆)合成检索总结报告
- Package
- rehh