Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)

AGUILAR, I. - LEGARRA, A. - CARDOSO, F. - MASUDA, Y. - LOURENCO, D. - MISZTAL, I.

Resumen:

ABSTRACT.Background: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. Methods: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. Results: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10-5.9 were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. Conclusions: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. © 2019 The Author(s).


Detalles Bibliográficos
2019
ANIMALIA
Inglés
Instituto Nacional de Investigación Agropecuaria
AINFO
http://www.ainfo.inia.uy/consulta/busca?b=pc&id=59927&biblioteca=vazio&busca=59927&qFacets=59927
Acceso abierto
_version_ 1805580530939330560
author AGUILAR, I.
author2 LEGARRA, A.
CARDOSO, F.
MASUDA, Y.
LOURENCO, D.
MISZTAL, I.
author2_role author
author
author
author
author
author_facet AGUILAR, I.
LEGARRA, A.
CARDOSO, F.
MASUDA, Y.
LOURENCO, D.
MISZTAL, I.
author_role author
bitstream.checksum.fl_str_mv da2794f336ec1598ef9c6b8f4c33cf68
bitstream.checksumAlgorithm.fl_str_mv MD5
bitstream.url.fl_str_mv https://redi.anii.org.uy/jspui/bitstream/20.500.12381/874/1/sword-2022-10-20T22%3a18%3a34.original.xml
collection AINFO
dc.creator.none.fl_str_mv AGUILAR, I.
LEGARRA, A.
CARDOSO, F.
MASUDA, Y.
LOURENCO, D.
MISZTAL, I.
dc.date.accessioned.none.fl_str_mv 2022-10-21T01:18:34Z
dc.date.available.none.fl_str_mv 2022-10-21T01:18:34Z
dc.date.issued.none.fl_str_mv 2019
dc.date.updated.none.fl_str_mv 2022-10-21T01:18:34Z
dc.description.abstract.none.fl_txt_mv ABSTRACT.Background: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. Methods: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. Results: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10-5.9 were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. Conclusions: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. © 2019 The Author(s).
dc.identifier.none.fl_str_mv http://www.ainfo.inia.uy/consulta/busca?b=pc&id=59927&biblioteca=vazio&busca=59927&qFacets=59927
dc.language.iso.none.fl_str_mv en
eng
dc.rights.es.fl_str_mv Acceso abierto
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
dc.source.none.fl_str_mv reponame:AINFO
instname:Instituto Nacional de Investigación Agropecuaria
instacron:Instituto Nacional de Investigación Agropecuaria
dc.subject.none.fl_str_mv ANIMALIA
dc.title.none.fl_str_mv Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
dc.type.none.fl_str_mv Article
PublishedVersion
info:eu-repo/semantics/article
dc.type.version.none.fl_str_mv info:eu-repo/semantics/publishedVersion
description ABSTRACT.Background: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. Methods: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. Results: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10-5.9 were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. Conclusions: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. © 2019 The Author(s).
eu_rights_str_mv openAccess
format article
id INIAOAI_b31a70566ad7f41ea85853bff3126fa7
instacron_str Instituto Nacional de Investigación Agropecuaria
institution Instituto Nacional de Investigación Agropecuaria
instname_str Instituto Nacional de Investigación Agropecuaria
language eng
language_invalid_str_mv en
network_acronym_str INIAOAI
network_name_str AINFO
oai_identifier_str oai:redi.anii.org.uy:20.500.12381/874
publishDate 2019
reponame_str AINFO
repository.mail.fl_str_mv lorrego@inia.org.uy
repository.name.fl_str_mv AINFO - Instituto Nacional de Investigación Agropecuaria
repository_id_str
rights_invalid_str_mv Acceso abierto
spelling 2022-10-21T01:18:34Z2022-10-21T01:18:34Z20192022-10-21T01:18:34Zhttp://www.ainfo.inia.uy/consulta/busca?b=pc&id=59927&biblioteca=vazio&busca=59927&qFacets=59927ABSTRACT.Background: Single-step genomic best linear unbiased prediction (SSGBLUP) is a comprehensive method for genomic prediction. Point estimates of marker effects from SSGBLUP are often used for genome-wide association studies (GWAS) without a formal framework of hypothesis testing. Our objective was to implement p-values for single-marker GWAS studies within the single-step GWAS (SSGWAS) framework by deriving computational algorithms and procedures, and by applying these to a large beef cattle population. Methods: P-values were obtained based on the prediction error (co)variances for single nucleotide polymorphisms (SNPs), which were obtained from the prediction error (co)variances of genomic predictions based on the inverse of the coefficient matrix and formulas to estimate SNP effects. Results: Computation of p-values took a negligible time for a dataset with almost 2 million animals in the pedigree and 1424 genotyped sires, and no inflation of statistics was observed. The SNPs that passed the Bonferroni threshold of 10-5.9 were the same as those that explained the highest proportion of additive genetic variance, but even at the same significance levels and effects, some of them explained less genetic variance due to lower allele frequency. Conclusions: The use of a p-value for SSGWAS is a very general and efficient strategy to identify quantitative trait loci (QTL). It can be used for complex datasets such as those used in animal breeding, where only a proportion of the pedigreed animals are genotyped. © 2019 The Author(s).https://hdl.handle.net/20.500.12381/874enenginfo:eu-repo/semantics/openAccessAcceso abiertoANIMALIAFrequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)ArticlePublishedVersioninfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionreponame:AINFOinstname:Instituto Nacional de Investigación Agropecuariainstacron:Instituto Nacional de Investigación AgropecuariaAGUILAR, I.LEGARRA, A.CARDOSO, F.MASUDA, Y.LOURENCO, D.MISZTAL, I.SWORDsword-2022-10-20T22:18:34.original.xmlOriginal SWORD entry documentapplication/octet-stream2616https://redi.anii.org.uy/jspui/bitstream/20.500.12381/874/1/sword-2022-10-20T22%3a18%3a34.original.xmlda2794f336ec1598ef9c6b8f4c33cf68MD5120.500.12381/8742022-10-20 22:18:34.993oai:redi.anii.org.uy:20.500.12381/874Gobiernohttp://inia.uyhttps://redi.anii.org.uy/oai/requestlorrego@inia.org.uyUruguayopendoar:2022-10-21T01:18:34AINFO - Instituto Nacional de Investigación Agropecuariafalse
spellingShingle Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
AGUILAR, I.
ANIMALIA
status_str publishedVersion
title Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
title_full Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
title_fullStr Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
title_full_unstemmed Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
title_short Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
title_sort Frequentist p-values for large-scale-single step genome-wide association, with an application to birth weight in American Angus cattle. (Short Communication)
topic ANIMALIA
url http://www.ainfo.inia.uy/consulta/busca?b=pc&id=59927&biblioteca=vazio&busca=59927&qFacets=59927