用户名: 密码: 验证码:
Optimization Problems Concerning Tag SNP Selection,Haplotype Inference,and Detection of Horizontal Gene Transfers.
详细信息   
  • 作者:Wang ; Wei Bung.
  • 学历:Ph.D.
  • 年:2011
  • 导师:Jiang, Tao,eadvisorLonardi, Stefanoecommittee memberXu, Shizhongecommittee member
  • 毕业院校:University of California
  • Department:Computer Science
  • ISBN:9781124771908
  • CBH:3465383
  • Country:USA
  • 语种:English
  • FileSize:1237487
  • Pages:134
文摘
In this dissertation, we study several topics in genetics, including tag SNP selection, haplotype inference, error detection, and horizontal gene transfer detection. We formulate these problems as computational optimization problems, discuss the complexity, present our novel algorithms, and demonstrate the experimental results. We first study the genome-wide tag SNP selection problem, propose a new model of multi-marker correlation for the problem, and present a greedy algorithm to select a smallest possible set of tag SNPs according to the model. Our experimental results on several real datasets from the HapMap project demonstrate that the new model yields more succinct tag SNP sets than the previous methods. We then study how to infer haplotypes from genotype data which may contain genotyping errors, de novo mutations and missing alleles. We assume that there are no recombinants in the genotype data, which is usually true for tightly linked markers. We prove the problem is NP-hard, and propose a heuristic algorithm, the core of which is an integer linear program ILP) using the system of linear equations over Galois field GF2). Our experimental results show that the algorithm can infer haplotypes with a very high accuracy, and recover 65%--94% of genotyping errors depending on the pedigree topology. We also study the detection of mutations, sequencing errors, and horizontal gene transfers in a set of closely related microbial genomes which do not align well because of rearrangements. We use a new SNP definition to handle the rearrangement problem, divide the problem into several optimization subproblems, and propose a series of algorithms to tackle each subproblem. Results from simulation experiments show that we can detect 31%--61% of horizontal gene transfer events depending on the mutation and missing rates, and the precision of our detection is about 48%--90%.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700