• DocumentCode
    589191
  • Title

    Prediction of HLA Genes from SNP Data and HLA Haplotype Frequencies

  • Author

    Paunic, V. ; Steinbach, Michael ; Kumar, Vipin ; Maiers, M.

  • Author_Institution
    Comput. Sci. & Eng., Univ. of Minnesota, Minneapolis, MN, USA
  • fYear
    2012
  • fDate
    10-10 Dec. 2012
  • Firstpage
    964
  • Lastpage
    971
  • Abstract
    Variation in the Human Leukocyte Antigen (HLA) gene system is very important. It is one of the most polymorphic regions of the human genome and one of the most extensively studied regions due to its association with autoimmune, infectious, and inflammatory diseases, such as rheumatoid arthritis, celiac disease, multiple sclerosis and Type I diabetes. The HLA gene system also plays a crucial role in hematopoietic stem cell transplantation, where patients and donors are matched with respect to their HLA genes in order to maximize the chances of a successful transplant. Having complete HLA data is therefore of great use to clinicians and researchers. However, due to its polymorphism, obtaining it is highly time- and cost-prohibitive. Genome-wide association studies finding strong associations within HLA region would ideally like to identify the exact HLA alleles responsible for association in order to determine the causal genes/variants. Here we propose a method to infer HLA alleles from widely available and affordable SNP genotype data. Our method takes into account the high linkage disequilibrium that exists in the region. We demonstrate that this additional information is an important asset in HLA prediction problem.
  • Keywords
    cellular biophysics; diseases; genetics; genomics; medical computing; HLA allele region inference; HLA gene system prediction; HLA haplotype frequencies; SNP genotype data; Type I diabetes; autoimmune diseases; causal gene determination; causal variant determination; celiac disease; hematopoietic stem cell transplantation; human genome; human leukocyte antigen gene system; infectious diseases; inflammatory diseases; linkage disequilibrium; multiple sclerosis; polymorphic regions; rheumatoid arthritis; Bioinformatics; Biological cells; Frequency estimation; Genomics; Humans; Sociology; HLA imputation; Human Leukocyte Antigen; SNP data; multi-label prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • Print_ISBN
    978-1-4673-5164-5
  • Type

    conf

  • DOI
    10.1109/ICDMW.2012.74
  • Filename
    6406554