用户名: 密码: 验证码:
Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data
详细信息    查看全文
  • 作者:Lun Li (1) (2)
    Wei Zheng (3)
    Joon Sang Lee (1)
    Xianghua Zhang (1) (4)
    John Ferguson (1)
    Xiting Yan (1)
    Hongyu Zhao (1)
  • 刊名:BMC Proceedings
  • 出版年:2011
  • 出版时间:December 2011
  • 年:2011
  • 卷:5
  • 期:9-supp
  • 全文大小:263KB
  • 参考文献:1. Wellcome Trust Case Control Consortium: Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. / Nature 2007, 447:661-78. CrossRef
    2. Duerr RH, Taylor KD, Brant SR, Rioux JD, Silverberg MS, Daly MJ, Steinhart AH, Abraham C, Regueiro M, Griffiths A, / et al.: A genome-wide association study identifies IL23R as an inflammatory bowel disease gene. / Science 2006, 314:1461-463. CrossRef
    3. McPherson R, Pertsemlidis A, Kavaslar N, Stewart A, Roberts R, Cox DR, Hinds DA, Pennacchio LA, Tybjaerg-Hansen A, Folsom AR, / et al.: A common allele on chromosome 9 associated with coronary heart disease. / Science 2007, 316:1488-491. CrossRef
    4. Choi M, Scholl UI, Ji W, Liu T, Tikhonova IR, Zumbo P, Nayir A, Bakkaloglu A, Ozen S, Sanjad S, / et al.: Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. / Proc Natl Acad Sci USA 2009, 106:19,096-9,101.
    5. 1000 Genomes Project Consortium: A map of human genome variation from population-scale sequencing. / Nature 2010, 467:1061-073. CrossRef
    6. Almasy LA, Dyer TD, Peralta JM, Kent JW Jr., Charlesworth JC, Curran JE, Blangero J: Genetic Analysis Workshop 17 mini-exome simulation. / BMC Proc 2011,5(suppl 9):S2. CrossRef
    7. Li B, Leal SM: Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. / Am J Hum Genet 2008, 83:311-21. CrossRef
    8. Morris AP, Zeggini E: An evaluation of statistical approaches to rare variant analysis in genetic association studies. / Genet Epidemiol 2010, 34:188-93. CrossRef
    9. Madsen BE, Browning SR: A groupwise association test for rare mutations using a weighted sum statistic. / PLoS Genet 2009, 5:e1000384. CrossRef
    10. Kwee LC, Liu D, Lin X, Ghosh D, Epstein MP: A powerful and flexible multilocus association test for quantitative traits. / Am J Hum Genet 2008, 82:386-97. CrossRef
    11. Wu MC, Kraft P, Epstein MP, Taylor DM, Chanock SJ, Hunter DJ, Lin X: Powerful SNP-set analysis for case-control genome-wide association studies. / Am J Hum Genet 2010, 86:929-42. CrossRef
    12. Liu D, Lin X, Ghosh D: Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. / Biometrics 2007, 63:1079-088. CrossRef
    13. Liu D, Ghosh D, Lin X: Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. / BMC Bioinform 2008, 9:292. CrossRef
    14. Kimeldorf G, Wahba G: Some results on Tchebycheffian spline functions. / J Math Anal Appl 1971, 33:82-5. CrossRef
    15. Zhang D, Lin X: Hypothesis testing in semiparametric additive mixed models. / Biostatistics 2003, 4:57-4. CrossRef
  • 作者单位:Lun Li (1) (2)
    Wei Zheng (3)
    Joon Sang Lee (1)
    Xianghua Zhang (1) (4)
    John Ferguson (1)
    Xiting Yan (1)
    Hongyu Zhao (1)

    1. Division of Biostatistics, Yale School of Public Health, Yale University, 60 College St., PO Box 208034, New Haven, CT, 06520-8034, USA
    2. Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
    3. Keck Biotechnology Resource Laboratory, Yale University, 300 George St., New Haven, CT, 06511, USA
    4. Department of Electronic Science and Technology, University of Science and Technology of China, Hefei, Anhui, 230027, China
文摘
Recently there has been great interest in identifying rare variants associated with common diseases. We apply several collapsing-based and kernel-based single-gene association tests to Genetic Analysis Workshop 17 (GAW17) rare variant association data with unrelated individuals without knowledge of the simulation model. We also implement modified versions of these methods using additional information, such as minor allele frequency (MAF) and functional annotation. For each of four given traits provided in GAW17, we use the Bayesian mixed-effects model to estimate the phenotypic variance explained by the given environmental and genotypic data and to infer an individual-specific genetic effect to use directly in single-gene association tests. After obtaining information on the GAW17 simulation model, we compare the performance of all methods and examine the top genes identified by those methods. We find that collapsing-based methods with weights based on MAFs are sensitive to the “lower MAF, larger effect size-assumption, whereas kernel-based methods are more robust when this assumption is violated. In addition, many false-positive genes identified by multiple methods often contain variants with exactly the same genotype distribution as the causal variants used in the simulation model. When the sample size is much smaller than the number of rare variants, it is more likely that causal and noncausal variants will share the same or similar genotype distribution. This likely contributes to the low power and large number of false-positive results of all methods in detecting causal variants associated with disease in the GAW17 data set.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700