用户名: 密码: 验证码:
Diverse convergent evidence in the genetic analysis of complex disease: coordinating omic, informatic, and experimental evidence to better identify and validate risk factors
详细信息    查看全文
  • 作者:Timothy H Ciesielski (1) (2)
    Sarah A Pendergrass (3) (4)
    Marquitta J White (1) (2) (5)
    Nuri Kodaman (1) (2) (5)
    Rafal S Sobota (1) (2) (5)
    Minjun Huang (1)
    Jacquelaine Bartlett (2)
    Jing Li (1)
    Qinxin Pan (1)
    Jiang Gui (2) (6)
    Scott B Selleck (4)
    Christopher I Amos (2) (6)
    Marylyn D Ritchie (3) (4)
    Jason H Moore (1) (2) (6)
    Scott M Williams (1) (2)

    1. Department of Genetics
    ; Geisel School of Medicine at Dartmouth ; Hanover ; NH ; 03755 ; USA
    2. Institute for Quantitative Biomedical Sciences
    ; Dartmouth College ; Hanover ; NH ; 03755 ; USA
    3. Center for Systems Genomics
    ; Pennsylvania State University ; University Park ; PA ; 16802 ; USA
    4. Department of Biochemistry & Molecular Biology
    ; Pennsylvania State University ; University Park ; PA ; 16802 ; USA
    5. Center for Human Genetics Research
    ; Vanderbilt University ; Nashville ; TN ; 37232-0700 ; USA
    6. Community and Family Medicine
    ; Section of Biostatistics & Epidemiology ; Geisel School of Medicine ; Hanover ; NH ; 03766 ; USA
  • 关键词:Replication ; Validation ; Complex disease ; Heterogeneity ; GWAS ; Omics ; Type 2 error ; Type 1 error ; False negatives ; False positives
  • 刊名:BioData Mining
  • 出版年:2014
  • 出版时间:December 2014
  • 年:2014
  • 卷:7
  • 期:1
  • 全文大小:629 KB
  • 参考文献:1. Chanock, SJ, Manolio, T, Boehnke, M, Boerwinkle, E, Hunter, DJ, Thomas, G, Hirschhorn, JN, Abecasis, G, Altshuler, D, Bailey-Wilson, JE, Brooks, LD, Cardon, LR, Daly, M, Donnelly, P, Fraumeni, JF, Freimer, NB, Gerhard, DS, Gunter, C, Guttmacher, AE, Guyer, MS, Harris, EL, Hoh, J, Hoover, R, Kong, CA, Merikangas, KR, Morton, CC, Palmer, LJ, Phimister, EG, Rice, JP, Roberts, J (2007) Replicating genotype-phenotype associations. Nature 447: pp. 655-660 CrossRef
    2. Igl, BW, Konig, IR, Ziegler, A (2009) What do we mean by 'replication' and 'validation' in genome-wide association studies?. Hum Hered 67: pp. 66-68 CrossRef
    3. Ioannidis, JP (2005) Why most published research findings are false. PLoS Med 2: pp. e124 journal.pmed.0020124" target="_blank" title="It opens in new window">CrossRef
    4. Ioannidis, JP (2005) Microarrays and molecular research: noise discovery?. Lancet 365: pp. 454-455 CrossRef
    5. Tyler, AL, Asselbergs, FW, Williams, SM, Moore, JH (2009) Shadows of complexity: what biological networks reveal about epistasis and pleiotropy. Bioessays 31: pp. 220-227 CrossRef
    6. Greene, CS, Penrod, NM, Williams, SM, Moore, JH (2009) Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS One 4: pp. e5639 journal.pone.0005639" target="_blank" title="It opens in new window">CrossRef
    7. Liu, YJ, Papasian, CJ, Liu, JF, Hamilton, J, Deng, HW (2008) Is replication the gold standard for validating genome-wide association findings?. PLoS One 3: pp. e4037 journal.pone.0004037" target="_blank" title="It opens in new window">CrossRef
    8. Williams, SM, Haines, JL (2011) Correcting away the hidden heritability. Ann Hum Genet 75: pp. 348-350 j.1469-1809.2011.00640.x" target="_blank" title="It opens in new window">CrossRef
    9. Zaykin, DV, Zhivotovsky, LA (2005) Ranks of genuine associations in whole-genome scans. Genetics 171: pp. 813-823 CrossRef
    10. Daumer, M, Held, U, Ickstadt, K, Heinz, M, Schach, S, Ebers, G (2008) Reducing the probability of false positive research findings by pre-publication validation - experience with a large multiple sclerosis database. BMC Med Res Methodol 8: pp. 18 CrossRef
    11. Malley, JD, Dasgupta, A, Moore, JH (2013) The limits of p-values for biological data mining. BioData Min 6: pp. 10 CrossRef
    12. Nuzzo, R (2014) Scientific method: statistical errors. Nature 506: pp. 150-152 CrossRef
    13. Hill, AB (1965) The environment and disease: association or causation?. Proc R Soc Med 58: pp. 295-300
    14. Phillips, CV, Goodman, KJ (2004) The missed lessons of Sir Austin Bradford Hill. Epidemiol Perspect Innov 1: pp. 3 CrossRef
    15. Dudbridge, F, Gusnanto, A (2008) Estimation of significance thresholds for genomewide association scans. Genet Epidemiol 32: pp. 227-234 CrossRef
    16. / PubMed. [http://www.ncbi.nlm.nih.gov/pubmed/]
    17. / GEO: Gene Expression Omnibus. [http://www.ncbi.nlm.nih.gov/geo/]
    18. / NCBI: National Center for Biotechnology Information. [http://www.ncbi.nlm.nih.gov/]
    19. / KEGG: Kyoto Encyclopedia of Genes and Genomes. [jp/kegg/" class="a-plus-plus">http://www.genome.jp/kegg/]
    20. / GO: The Gene Ontology. [http://www.geneontology.org/]
    21. Jallow, M, Teo, YY, Small, KS, Rockett, KA, Deloukas, P, Clark, TG, Kivinen, K, Bojang, KA, Conway, DJ, Pinder, M, Sirugo, G, Sisay-Joof, F, Usen, S, Auburn, S, Bumpstead, SJ, Campino, S, Coffey, A, Dunham, A, Fry, AE, Green, A, Gwilliam, R, Hunt, SE, Inouye, M, Jeffreys, AE, Mendy, A, Palotie, A, Potter, S, Ragoussis, J, Rogers, J, Rowlands, K (2009) Genome-wide and fine-resolution association analysis of malaria in West Africa. Nat Genet 41: pp. 657-665 CrossRef
    22. Timmann, C, Thye, T, Vens, M, Evans, J, May, J, Ehmen, C, Sievertsen, J, Muntau, B, Ruge, G, Loag, W, Ansong, D, Antwi, S, Asafo-Adjei, E, Nguah, SB, Kwakye, KO, Akoto, AO, Sylverken, J, Brendel, M, Schuldt, K, Loley, C, Franke, A, Meyer, CG, Agbenyega, T, Ziegler, A, Horstmann, RD (2012) Genome-wide association study indicates two novel resistance loci for severe malaria. Nature 489: pp. 443-446 CrossRef
    23. Gauci, R, Bennett, D, Clark, IA, Bryant, C (1982) The induction of tyrosine aminotransferase activity and its use as an indirect assay for endotoxin in mice infected with Plasmodium vinckei petteri. Int J Parasitol 12: pp. 279-284 CrossRef
    24. Williams, SM, Canter, JA, Crawford, DC, Moore, JH, Ritchie, MD, Haines, JL (2007) Problems with genome-wide association studies. Science 316: pp. 1840-1842
    25. Lehmann, JM, Moore, LB, Smith-Oliver, TA, Wilkison, WO, Willson, TM, Kliewer, SA (1995) An antidiabetic thiazolidinedione is a high affinity ligand for peroxisome proliferator-activated receptor gamma (PPAR gamma). J Biol Chem 270: pp. 12953-12956 jbc.270.22.12953" target="_blank" title="It opens in new window">CrossRef
    26. Saxena, R, Voight, BF, Lyssenko, V, Burtt, NP, De Bakker, PI, Chen, H, Roix, JJ, Kathiresan, S, Hirschhorn, JN, Daly, MJ, Hughes, TE, Groop, L, Altshuler, D, Almgren, P, Florez, JC, Meyer, J, Ardlie, K, Bengtsson Bostrom, K, Isomaa, B, Lettre, G, Lindblad, U, Lyon, HN, Melander, O, Newton-Cheh, C, Nilsson, P, Orho-Melander, M, Rastam, L, Speliotes, EK, Taskinen, MR, Tuomi, T (2007) Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science 316: pp. 1331-1336 CrossRef
    27. Scott, LJ, Mohlke, KL, Bonnycastle, LL, Willer, CJ, Li, Y, Duren, WL, Erdos, MR, Stringham, HM, Chines, PS, Jackson, AU, Prokunina-Olsson, L, Ding, CJ, Swift, AJ, Narisu, N, Hu, T, Pruim, R, Xiao, R, Li, XY, Conneely, KN, Riebow, NL, Sprau, AG, Tong, M, White, PP, Hetrick, KN, Barnhart, MW, Bark, CW, Goldstein, JL, Watkins, L, Xiang, F, Saramies, J (2007) A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 316: pp. 1341-1345 CrossRef
    28. Zeggini, E, Weedon, MN, Lindgren, CM, Frayling, TM, Elliott, KS, Lango, H, Timpson, NJ, Perry, JR, Rayner, NW, Freathy, RM, Barrett, JC, Shields, B, Morris, AP, Ellard, S, Groves, CJ, Harries, LW, Marchini, JL, Owen, KR, Knight, B, Cardon, LR, Walker, M, Hitman, GA, Morris, AD, Doney, AS, McCarthy, MI, Hattersley, AT (2007) Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science 316: pp. 1336-1341 CrossRef
    29. Consortium, IMSG (2010) Comprehensive follow-up of the first genome-wide association study of multiple sclerosis identifies KIF21B and TMEM39A as susceptibility loci. Hum Mol Genet 19: pp. 953-962 CrossRef
    30. Sterne, JA, Davey Smith, G (2001) Sifting the evidence-what's wrong with significance tests?. BMJ 322: pp. 226-231 j.322.7280.226" target="_blank" title="It opens in new window">CrossRef
    31. Fisher, RA (1926) The arrangement of field experiments. J Min Agric Great Britain 33: pp. 503-513
    32. Fisher, RA (1950) Statistical Methods for Research Workers, Volume 80. Oliver and Boyd, London
    33. Kraft, P (2008) Curses--winner's and otherwise--in genetic epidemiology. Epidemiology 19: pp. 649-651 CrossRef
    34. Ioannidis, JP, Ntzani, EE, Trikalinos, TA, Contopoulos-Ioannidis, DG (2001) Replication validity of genetic association studies. Nat Genet 29: pp. 306-309 CrossRef
    35. Rothman, KJ (1990) No adjustments are needed for multiple comparisons. Epidemiology 1: pp. 43-46 CrossRef
    36. Bender, R, Lange, S (1999) Multiple test procedures other than Bonferroni's deserve wider use. BMJ 318: pp. 600-601 j.318.7183.600a" target="_blank" title="It opens in new window">CrossRef
    37. Panagiotou, OA, Willer, CJ, Hirschhorn, JN, Ioannidis, JP (2013) The power of meta-analysis in genome-wide association studies. Annu Rev Genomics Hum Genet 14: pp. 441-465 CrossRef
    38. Gisev, N, Bell, JS, Chen, TF (2013) Interrater agreement and interrater reliability: key concepts, approaches, and applications. Res Social Adm Pharm 9: pp. 330-338 j.sapharm.2012.04.004" target="_blank" title="It opens in new window">CrossRef
    39. Reif, DM, Sypa, M, Lock, EF, Wright, FA, Wilson, A, Cathey, T, Judson, RR, Rusyn, I (2013) ToxPi GUI: an interactive visualization tool for transparent integration of data from diverse sources of evidence. Bioinformatics 29: pp. 402-403 CrossRef
    40. Reif, DM, Martin, MT, Tan, SW, Houck, KA, Judson, RS, Richard, AM, Knudsen, TB, Dix, DJ, Kavlock, RJ (2010) Endocrine profiling and prioritization of environmental chemicals using ToxCast data. Environ Health Perspect 118: pp. 1714-1720 CrossRef
    41. Hauser, MA, Li, YJ, Takeuchi, S, Walters, R, Noureddine, M, Maready, M, Darden, T, Hulette, C, Martin, E, Hauser, E, Xu, H, Schmechel, D, Stenger, JE, Dietrich, F, Vance, J (2003) Genomic convergence: identifying candidate genes for Parkinson's disease by combining serial analysis of gene expression and genetic linkage. Hum Mol Genet 12: pp. 671-677 CrossRef
    42. Liang, X, Slifer, M, Martin, ER, Schnetz-Boutaud, N, Bartlett, J, Anderson, B, Zuchner, S, Gwirtsman, H, Gilbert, JR, Pericak-Vance, MA, Haines, JL (2009) Genomic convergence to identify candidate genes for Alzheimer disease on chromosome 10. Hum Mutat 30: pp. 463-471 CrossRef
    43. Jia, P, Ewers, JM, Zhao, Z (2011) Prioritization of epilepsy associated candidate genes by convergent analysis. PLoS One 6: pp. e17162 journal.pone.0017162" target="_blank" title="It opens in new window">CrossRef
    44. Okada, Y, Wu, D, Trynka, G, Raj, T, Terao, C, Ikari, K, Kochi, Y, Ohmura, K, Suzuki, A, Yoshida, S, Graham, RR, Manoharan, A, Ortmann, W, Bhangale, T, Denny, JC, Carroll, RJ, Eyler, AE, Greenberg, JD, Kremer, JM, Pappas, DA, Jiang, L, Yin, J, Ye, L, Su, DF, Yang, J, Xie, G, Keystone, E, Westra, HJ, Esko, T, Metspalu, A (2014) Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506: pp. 376-381 CrossRef
    45. Glazier, AM, Nadeau, JH, Aitman, TJ (2002) Finding genes that underlie complex traits. Science 298: pp. 2345-2349 CrossRef
    46. Rothman, KJ (1976) Causes. Am J Epidemiol 104: pp. 587-592
    47. Pendergrass, SA, Hayes, E, Farina, G, Lemaire, R, Farber, HW, Whitfield, ML, Lafyatis, R (2010) Limited systemic sclerosis patients with pulmonary arterial hypertension show biomarkers of inflammation and vascular injury. PLoS One 5: pp. e12106 journal.pone.0012106" target="_blank" title="It opens in new window">CrossRef
    48. Kim, JH, Cheong, HS, Park, JS, Jang, AS, Uh, ST, Kim, YH, Kim, MK, Choi, IS, Cho, SH, Choi, BW, Bae, JS, Park, CS, Shin, HD (2013) A genome-wide association study of total serum and mite-specific IgEs in asthma patients. PLoS One 8: pp. e71958 journal.pone.0071958" target="_blank" title="It opens in new window">CrossRef
    49. Higareda-Almaraz, JC, Valtierra-Gutierrez, IA, Hernandez-Ortiz, M, Contreras, S, Hernandez, E, Encarnacion, S (2013) Analysis and prediction of pathways in HeLa cells by integrating biological levels of organization with systems-biology approaches. PLoS One 8: pp. e65433 journal.pone.0065433" target="_blank" title="It opens in new window">CrossRef
    50. Lian, X, Selekman, J, Bao, X, Hsiao, C, Zhu, K, Palecek, SP (2013) A small molecule inhibitor of SRC family kinases promotes simple epithelial differentiation of human pluripotent stem cells. PLoS One 8: pp. e60016 journal.pone.0060016" target="_blank" title="It opens in new window">CrossRef
    51. Kingsley, EP, Manceau, M, Wiley, CD, Hoekstra, HE (2009) Melanism in peromyscus is caused by independent mutations in agouti. PLoS One 4: pp. e6435 journal.pone.0006435" target="_blank" title="It opens in new window">CrossRef
  • 刊物主题:Computer Appl. in Life Sciences; Computational Biology/Bioinformatics; Data Mining and Knowledge Discovery; Bioinformatics; Algorithms;
  • 出版者:BioMed Central
  • ISSN:1756-0381
文摘
In omic research, such as genome wide association studies, researchers seek to repeat their results in other datasets to reduce false positive findings and thus provide evidence for the existence of true associations. Unfortunately this standard validation approach cannot completely eliminate false positive conclusions, and it can also mask many true associations that might otherwise advance our understanding of pathology. These issues beg the question: How can we increase the amount of knowledge gained from high throughput genetic data? To address this challenge, we present an approach that complements standard statistical validation methods by drawing attention to both potential false negative and false positive conclusions, as well as providing broad information for directing future research. The Diverse Convergent Evidence approach (DiCE) we propose integrates information from multiple sources (omics, informatics, and laboratory experiments) to estimate the strength of the available corroborating evidence supporting a given association. This process is designed to yield an evidence metric that has utility when etiologic heterogeneity, variable risk factor frequencies, and a variety of observational data imperfections might lead to false conclusions. We provide proof of principle examples in which DiCE identified strong evidence for associations that have established biological importance, when standard validation methods alone did not provide support. If used as an adjunct to standard validation methods this approach can leverage multiple distinct data types to improve genetic risk factor discovery/validation, promote effective science communication, and guide future research directions.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700