用户名: 密码: 验证码:
Genomic data assimilation using a higher moment filtering technique for restoration of gene regulatory networks
详细信息    查看全文
  • 作者:Takanori Hasegawa (1)
    Tomoya Mori (1)
    Rui Yamaguchi (2)
    Teppei Shimamura (3)
    Satoru Miyano (2)
    Seiya Imoto (2)
    Tatsuya Akutsu (1)

    1. Bioinformatics Center
    ; Institute for Chemical Research ; Kyoto University ; Gokasho ; Kyoto ; 611-0011 Uji ; Japan
    2. Human Genome Center
    ; The Institute of Medical Science ; The University of Tokyo ; 4-6-1 Shirokanedai ; Tokyo ; 108-8639 Minato-ku ; Japan
    3. Division of Systems Biology
    ; Nagoya University Graduate School of Medicine ; 65 Tsurumai-cho ; Nagoya ; 466-8550 Showa-ku ; Japan
  • 关键词:Gene regulatory networks ; Time series analysis ; Systems biology ; Data assimilation ; Monte Carlo
  • 刊名:BMC Systems Biology
  • 出版年:2015
  • 出版时间:December 2015
  • 年:2015
  • 卷:9
  • 期:1
  • 全文大小:934 KB
  • 参考文献:1. Savageau, MA (1969) Biochemical systems analysis: II. The steady-state solutions for an n-pool system using a power-law approximation. J Theor Biol. 25: pp. 370-9 CrossRef
    2. Savageau, MA, Voit, EO (1987) Recasting nonlinear differential equations as s-systems: a canonical nonlinear form. Math Biosci. 87: pp. 83-115 CrossRef
    3. Elowitz, MB, Leibler, S (2000) A synthetic oscillatory network of transcriptional regulators. Nature 403: pp. 335-8 CrossRef
    4. de Jong, H (2002) Modeling and simulation of genetic regulatory systems: A literature review. J Comput Biol. 9: pp. 67-103 CrossRef
    5. Opper, M, Sanguinetti, G (2010) Learning combinatorial transcriptional dynamics from gene expression data. Bioinformatics 26: pp. 1623-9 CrossRef
    6. Henderson, J, Michailidis, G (2014) Network reconstruction using nonparametric additive ode models. PLoS ONE 9: pp. 94003 CrossRef
    7. Koh, CHH, Nagasaki, M, Saito, A, Wong, L, Miyano, S (2010) DA 1.0: parameter estimation of biological pathways using data assimilation approach. Bioinformatics 26: pp. 1794-6 CrossRef
    8. Matsuno, H, Nagasaki, M, Miyano, S (2011) Hybrid petri net based modeling for biological pathway simulation. Nat Comput. 10: pp. 1099-120 CrossRef
    9. Ramsay, JO, Hooker, G, Campbell, D, Cao, J (2007) Parameter estimation for differential equations: a generalized smoothing approach. J R Stat Soc: Ser B (Stat Methodology) 69: pp. 741-96 CrossRef
    10. Quach, M, Brunel, N, d鈥橝lche-Buc, F (2007) Estimating parameters and hidden variables in non-linear state-space models based on odes for biological networks inference. Bioinformatics 23: pp. 3209-16 CrossRef
    11. Hasegawa, T, Yamaguchi, R, Nagasaki, M, Imoto, S, Miyano, S (2011) Comprehensive pharmacogenomic pathway screening by data assimilation. Proceedings of the 7th International Conference on Bioinformatics Research and Applications. ISBRA鈥?1. Springer, Berlin, Heidelberg
    12. Hasegawa, T, Nagasaki, M, Yamaguchi, R, Imoto, S, Miyano, S (2014) An efficient method of exploring simulation models by assimilating literature and biological observational data. Biosystems 121: pp. 54-66 CrossRef
    13. Friedman, J, Hastie, T, Tibshirani, R (2007) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9: pp. 432-41 CrossRef
    14. Kim, S, Imoto, S, Miyano, S (2004) Dynamic bayesian network and nonparametric regression for nonlinear modeling of gene networks from time series gene expression data. Biosystems 75: pp. 57-65 CrossRef
    15. Young, W, Raftery, A, Yeung, K (2014) Fast Bayesian inference for gene regulatory networks using ScanBMA. BMC Syst Biol. 8: pp. 47 CrossRef
    16. Zacher, B, Abnaof, K, Gade, S, Younesi, E, Tresch, A, Fr枚hlich, H (2012) Joint Bayesian inference of condition-specific miRNA and transcription factor activities from combined gene and microRNA expression data. Bioinformatics 28: pp. 1714-20 CrossRef
    17. Barenco, M, Tomescu, D, Brewer, D, Callard, R, Stark, J, Hubank, M (2006) Ranked prediction of p53 targets using hidden variable dynamic modeling. Genome Biol. 7: pp. 25 CrossRef
    18. Beal, MJ, Falciani, F, Ghahramani, Z, Rangel, C, Wild, DL (2005) A bayesian approach to reconstructing genetic regulatory networks with hidden factors. Bioinformatics 21: pp. 349-56 CrossRef
    19. Hasegawa, T, Yamaguchi, R, Nagasaki, M, Miyano, S, Imoto, S (2014) Inference of gene regulatory networks incorporating multi-source biological knowledge via a state space model with l1 regularization. PLoS ONE 9: pp. 105942 CrossRef
    20. Hirose, O, Yoshida, R, Imoto, S, Yamaguchi, R, Higuchi, T, Charnock-Jones, DS (2008) Statistical inference of transcriptional module-based gene networks from time course gene expression profiles by using state space models. Bioinformatics 24: pp. 932-42 CrossRef
    21. Rangel, C, Angus, J, Ghahramani, Z, Lioumi, M, Sotheran, E, Gaiba, A (2004) Modeling t-cell activation using gene expression profiling and state-space models. Bioinformatics 20: pp. 1361-72 CrossRef
    22. Sabatti, C, James, GM (2006) Bayesian sparse hidden components analysis for transcription regulation networks. Bioinformatics 22: pp. 739-46 CrossRef
    23. Asif, HMS, Sanguinetti, G (2011) Large-scale learning of combinatorial transcriptional dynamics from gene expression. Bioinformatics 27: pp. 1277-83 CrossRef
    24. Eduati, F, De Las Rivas, J, Di Camillo, B, Toffolo, G, Saez-Rodriguez, J (2012) Integrating literature-constrained and data-driven inference of signalling networks. Bioinformatics 28: pp. 2311-7 CrossRef
    25. do Rego, TG, Roider, HG, de Carvalho, FAT, Costa, IG (2012) Inferring epigenetic and transcriptional regulation during blood cell development with a mixture of sparse linear models. Bioinformatics 28: pp. 2297-303 CrossRef
    26. Tian, Y, Zhang, B, Hoffman, E, Clarke, R, Zhang, Z (2014) Knowledge-fused differential dependency network models for detecting significant rewiring in biological networks. BMC Syst. Biol 8: pp. 87 CrossRef
    27. Barzel, B, Barab谩si, A-LL (2013) Network link prediction by global silencing of indirect correlations. Nat Biotechnol. 31: pp. 720-5 CrossRef
    28. Feizi, S, Marbach, D, Medard, M, Kellis, M (2013) Network deconvolution as a general method to distinguish direct dependencies in networks. Nat Biotechnol. 31: pp. 726-33 CrossRef
    29. Nakajima, N, Tamura, T, Yamanishi, Y, Horimoto, K, Akutsu, T (2012) Network completion using dynamic programming and least-squares fitting. Sci World J 2012: pp. 1-8 CrossRef
    30. Wang, W, Cherry, JM, Nochomovitz, Y, Jolly, E, Botstein, D, Li, H (2005) Inference of combinatorial regulation in yeast transcriptional networks: A case study of sporulation. Proc Nat Acad Sci USA. 102: pp. 1998-2003 CrossRef
    31. Kalman, RE (1960) A New Approach to Linear Filtering and Prediction Problems. Trans ASME - J Basic Eng. 82(Series D): pp. 35-45 CrossRef
    32. Shumway, RH, Stoffer, DS (1982) An approach to time series smoothing and forecasting using the em algorithm. J Time Ser Anal. 3: pp. 253-64 CrossRef
    33. Julier SJ, Uhlmann JK. A new extension of the kalman filter to nonlinear systems. In: Proc. of AeroSense: The 11th Int. Symp. on Aerospace/Defense Sensing, Simulations and Controls: 1997. p. 182鈥?93.
    34. Julier, SJ, Uhlmann, JK (2004) Unscented filtering and nonlinear estimation. Proc IEEE 92: pp. 401-22 CrossRef
    35. Chow, S-M, Ferrer, E, Nesselroade, JR (2007) An unscented kalman filter approach to the estimation of nonlinear dynamical systems models. Multivariate Behavioral Res. 42: pp. 283-321 CrossRef
    36. Hasegawa, T, Mori, T, Yamaguchi, R, Imoto, S, Miyano, S, Akutsu, T (2014) An efficient data assimilation schema for restoration and extension of gene regulatory networks using time-course observation data. J Comput Biol. 21: pp. 785-98 CrossRef
    37. Sch盲fer, J, Strimmer, K (2005) An empirical bayes approach to inferring large-scale gene association networks. Bioinformatics 21: pp. 754-64 CrossRef
    38. Opgen-Rhein, R, Strimmer, K (2007) From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data. BMC Syst Biol. 1: pp. 37 CrossRef
    39. L茅bre, S (2009) Inferring dynamic genetic networks with low order independencies. Stat App Genet Mol Biol. 8: pp. 1-38
    40. Kim, S, Li, H, Dougherty, ER, Cao, N, Chen, Y, Bittner, M (2002) Can markov chain models mimic biological regulation. J Biol Syst. 10: pp. 337-28093357 CrossRef
    41. Kanehisa, M, Goto, S, Sato, Y, Furumichi, M, Tanabe, M (2012) Kegg for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40: pp. 109-14 CrossRef
    42. Almon, RR, DuBois, DC, Jin, JY, Jusko, WJ (2005) Temporal profiling of the transcriptional basis for the development of corticosteroid-induced insulin resistance in rat muscle. J Endocrinol. 184: pp. 219-32 CrossRef
    43. Yao, Z, Hoffman, EP, Ghimbovschi, S, DuBois, DC, Almon, RR, Jusko, WJ (2008) Mathematical modeling of corticosteroid pharmacogenomics in rat muscle following acute and chronic methylprednisolone dosing. Mol Pharm. 5: pp. 328-39 CrossRef
    44. Shimizu, N, Yoshikawa, N, Ito, N, Maruyama, T, Suzuki, Y, Takeda, S-I (2011) Crosstalk between Glucocorticoid Receptor and Nutritional Sensor mTOR in Skeletal Muscle. Cell Metab. 13: pp. 170-82 CrossRef
    45. Zheng, G, Tu, K, Yang, Q, Xiong, Y, Wei, C, Xie, L (2008) Itfp: an integrated platform of mammalian transcription factors. Bioinformatics 24: pp. 2416-7 CrossRef
    46. Greenfield, A, Hafemeister, C, Bonneau, R (2013) Robust data-driven incorporation of prior knowledge into the inference of dynamic regulatory networks. Bioinformatics 29: pp. 1060-7 CrossRef
    47. Evensen, G (1994) Sequential data assimilation with a nonlinear quasi-geostrophic model using monte carlo methods to forecast error statistics. J Geophys Res. 99: pp. 10143-62 CrossRef
    48. Gordon, NJ, Salmond, DJ, Smith, AFM (1993) Novel approach to nonlinear/non-gaussian bayesian state estimation. IEEE Proc F, Radar Signal Process. 140: pp. 107-13 CrossRef
    49. Kitagawa, G (1996) Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models. J Comput Graphical Stat. 5: pp. 1-25
    50. Anderson, LJ, Anderson, LS (1999) A monte carlo implementation of the nonlinear filtering problem to produce ensemble assimilations and forecasts. Monthly Weather Rev. 127: pp. 2741-58 CrossRef
    51. Pham, DT (2001) Stochastic methods for sequential data assimilation in strongly nonlinear systems. Monthly Weather Rev. 129: pp. 1194-207 CrossRef
    52. Zhao, Y, Lu, Z (2007) Fourth-moment standardization for structural reliability assessment. J Struct Eng. 133: pp. 916-24 CrossRef
    53. Foti, D, Iuliano, R, Chiefari, E, Brunetti, A (2003) A nucleoprotein complex containing sp1, c/ebpb, and hmgi-y controls human insulin receptor gene transcription. Mol Cell Biol. 23: pp. 2720-32 CrossRef
  • 刊物主题:Bioinformatics; Systems Biology; Simulation and Modeling; Computational Biology/Bioinformatics; Physiological, Cellular and Medical Topics; Algorithms;
  • 出版者:BioMed Central
  • ISSN:1752-0509
文摘
Background As a result of recent advances in biotechnology, many findings related to intracellular systems have been published, e.g., transcription factor (TF) information. Although we can reproduce biological systems by incorporating such findings and describing their dynamics as mathematical equations, simulation results can be inconsistent with data from biological observations if there are inaccurate or unknown parts in the constructed system. For the completion of such systems, relationships among genes have been inferred through several computational approaches, which typically apply several abstractions, e.g., linearization, to handle the heavy computational cost in evaluating biological systems. However, since these approximations can generate false regulations, computational methods that can infer regulatory relationships based on less abstract models incorporating existing knowledge have been strongly required. Results We propose a new data assimilation algorithm that utilizes a simple nonlinear regulatory model and a state space representation to infer gene regulatory networks (GRNs) using time-course observation data. For the estimation of the hidden state variables and the parameter values, we developed a novel method termed a higher moment ensemble particle filter (HMEnPF) that can retain first four moments of the conditional distributions through filtering steps. Starting from the original model, e.g., derived from the literature, the proposed algorithm can sequentially evaluate candidate models, which are generated by partially changing the current best model, to find the model that can best predict the data. For the performance evaluation, we generated six synthetic data based on two real biological networks and evaluated effectiveness of the proposed algorithm by improving the networks inferred by previous methods. We then applied time-course observation data of rat skeletal muscle stimulated with corticosteroid. Since a corticosteroid pharmacogenomic pathway, its kinetic/dynamics and TF candidate genes have been partially elucidated, we incorporated these findings and inferred an extended pathway of rat pharmacogenomics. Conclusions Through the simulation study, the proposed algorithm outperformed previous methods and successfully improved the regulatory structure inferred by the previous methods. Furthermore, the proposed algorithm could extend a corticosteroid related pathway, which has been partially elucidated, with incorporating several information sources.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700