Spatial Statistics and Ancestral Recombination Graphs with Applications in Gene Mapping and Geostatistics

Research output: ThesisDoctoral Thesis (compilation)

Abstract

This thesis explores models and algorithms in geostatistics and gene mapping. The first part deals with the use of computationally effective lattice models for inference of data with a continuous spatial index. The fundamental idea is to approximate a Gaussian field with a Gaussian Markov random field (GMRF) on a lattice, and then to conduct a bilinear interpolation of this at non-lattice locations. The resulting model is used for spatial interpolation, both in a Bayesian approach using Markov chain Monte Carlo (MCMC), and in kriging.

The second part of the thesis concerns genetic association analysis, particularly multi-locus gene mapping using case-control samples. The algorithms utilize the fact that a population based sample of haplotypes (a collection of alleles at closely linked loci on the same chromosome) mirrors the population history of shared ancestry, mutation, recombination etc. Around the disease locus chromosomes carrying the disease mutation will be more similar than chromosomes that do not carry the disease mutation (on account of increased levels of shared ancestry).

Two models and corresponding algorithms for gene mapping are presented. The first explicitly models the genealogy taking the over-sampling of cases into account. Under certain model approximations, a permutation-based test for genetic association is developed that is computationally feasible, even when haplotype phase is unknown. It contends with arbitrary phenotypes and genetic models, allows for neutral mutations, and adapts to marker allele frequencies.

The second model utilizes concepts and algorithms from both spatial statistics and statistical genetics. A spatial smoothing model is used for haplotypes, such that structurally similar haplotypes have risk parameters with high correlation. The disease locus is then searched as the place where a local similarity measure produces risk parameters that can discriminate between cases and controls. Different covariance structures and similarity metrics are suggested and compared.
Original languageEnglish
QualificationDoctor
Awarding Institution
  • Mathematical Statistics
Supervisors/Advisors
  • Hössjer, Ola, Supervisor
Award date2007 Oct 25
Publisher
ISBN (Print)978-91-628-7266-3
Publication statusPublished - 2007

Bibliographical note

Defence details

Date: 2007-10-25
Time: 09:15
Place: Lecture hall MH:A, Centre for Mathematical Sciences, Sölvegatan 18, Lund University Faculty of Engineering.

External reviewer(s)

Name: De Iorio, Maria
Title: PhD
Affiliation: Department of Epidemiology and Public Health, Imperial College, London, United Kingdom

---

Subject classification (UKÄ)

  • Probability Theory and Statistics

Free keywords

  • Genetik
  • Genetics
  • cytogenetics
  • cytogenetik
  • genetic association analysis
  • ancestral recombination graph Generalized linear mixed models
  • kriging
  • bilinear interpolation
  • Gaussian Markov random fields
  • Statistics

Fingerprint

Dive into the research topics of 'Spatial Statistics and Ancestral Recombination Graphs with Applications in Gene Mapping and Geostatistics'. Together they form a unique fingerprint.

Cite this