Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.

ZHANG, X. - LOURENCO, D. - AGUILAR, I. - LEGARRA, A. - MISZTAL, I.

Resumen:

ABSTRACT.Genomic Best Linear Unbiased Predictor (GBLUP) assumes equal variance for all single nucleotide polymorphisms (SNP). When traits are influenced by major SNP, Bayesian methods have the advantage of SNP selection. To overcome the limitation of GBLUP, unequal variance or weights for all SNP are applied in a method called weighted GBLUP (WGBLUP). If only a fraction of animals is genotyped, single-step WGBLUP (WssGBLUP) can be used. Default weights in WGBLUP or WssGBLUP are obtained iteratively based on single SNP effect squared (u2) and/or heterozygosity. When the weights are optimal, prediction accuracy, and ability to detect major SNP are maximized. The objective was to develop optimal weights for WGBLUP-based methods. We evaluated 5 new procedures that accounted for locus-specific or windows-specific variance to maximize accuracy of predicting genomic estimated breeding value (GEBV) and SNP effect. Simulated datasets consisted of phenotypes for 13,000 animals, including 1540 animals genotyped for 45,000 SNP. Scenarios with 5, 100, and 500 simulated quantitative trait loci (QTL) were considered. The 5 new procedures for SNP weighting were: (1) u2 plus a constant equal to the weight of the top SNP; (2) from a heavy-tailed distribution (similar to BayesA); (3) for every 20 SNP in a window along the whole genome, the largest effect (u2) among them; (4) the mean effect of every 20 SNP; and (5) the summation of every 20 SNP. Those methods were compared to the default WssGBLUP, GBLUP, BayesB, and BayesC. WssGBLUP methods were evaluated over 10 iterations. The accuracy of predicting GEBV was the correlation between true and estimated genomic breeding values for 300 genotyped animals from the last generation. The ability to detect the simulated QTL was also investigated. For most of the QTL scenarios, the accuracies obtained with all WssGBLUP procedures were higher compared to those from BayesB and BayesC, partly due to automatic inclusion of parent average in the former. Manhattan plots had higher resolution with 5 and 100 QTL. Using a common weight for a window of 20 SNP that sums or averages the SNP variance enhances accuracy of predicting GEBV and provides accurate estimation of marker effects. © 2016 Zhang, Lourenco, Aguilar, Legarra and Misztal.


Detalles Bibliográficos
2016
BayesB
BayesC
GENOME-WIDE ASSOCIATION
SNP WINDOW
WssGBLUP
Inglés
Instituto Nacional de Investigación Agropecuaria
AINFO
http://www.ainfo.inia.uy/consulta/busca?b=pc&id=59369&biblioteca=vazio&busca=59369&qFacets=59369
Acceso abierto
_version_ 1805580530474811392
author ZHANG, X.
author2 LOURENCO, D.
AGUILAR, I.
LEGARRA, A.
MISZTAL, I.
author2_role author
author
author
author
author_facet ZHANG, X.
LOURENCO, D.
AGUILAR, I.
LEGARRA, A.
MISZTAL, I.
author_role author
bitstream.checksum.fl_str_mv 3554274a5c82506fc7dff412ae1bef50
bitstream.checksumAlgorithm.fl_str_mv MD5
bitstream.url.fl_str_mv https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3110/1/sword-2022-12-16T18%3a06%3a35.original.xml
collection AINFO
dc.creator.none.fl_str_mv ZHANG, X.
LOURENCO, D.
AGUILAR, I.
LEGARRA, A.
MISZTAL, I.
dc.date.accessioned.none.fl_str_mv 2022-12-16T21:06:35Z
dc.date.available.none.fl_str_mv 2022-12-16T21:06:35Z
dc.date.issued.none.fl_str_mv 2016
dc.date.updated.none.fl_str_mv 2022-12-16T21:06:35Z
dc.description.abstract.none.fl_txt_mv ABSTRACT.Genomic Best Linear Unbiased Predictor (GBLUP) assumes equal variance for all single nucleotide polymorphisms (SNP). When traits are influenced by major SNP, Bayesian methods have the advantage of SNP selection. To overcome the limitation of GBLUP, unequal variance or weights for all SNP are applied in a method called weighted GBLUP (WGBLUP). If only a fraction of animals is genotyped, single-step WGBLUP (WssGBLUP) can be used. Default weights in WGBLUP or WssGBLUP are obtained iteratively based on single SNP effect squared (u2) and/or heterozygosity. When the weights are optimal, prediction accuracy, and ability to detect major SNP are maximized. The objective was to develop optimal weights for WGBLUP-based methods. We evaluated 5 new procedures that accounted for locus-specific or windows-specific variance to maximize accuracy of predicting genomic estimated breeding value (GEBV) and SNP effect. Simulated datasets consisted of phenotypes for 13,000 animals, including 1540 animals genotyped for 45,000 SNP. Scenarios with 5, 100, and 500 simulated quantitative trait loci (QTL) were considered. The 5 new procedures for SNP weighting were: (1) u2 plus a constant equal to the weight of the top SNP; (2) from a heavy-tailed distribution (similar to BayesA); (3) for every 20 SNP in a window along the whole genome, the largest effect (u2) among them; (4) the mean effect of every 20 SNP; and (5) the summation of every 20 SNP. Those methods were compared to the default WssGBLUP, GBLUP, BayesB, and BayesC. WssGBLUP methods were evaluated over 10 iterations. The accuracy of predicting GEBV was the correlation between true and estimated genomic breeding values for 300 genotyped animals from the last generation. The ability to detect the simulated QTL was also investigated. For most of the QTL scenarios, the accuracies obtained with all WssGBLUP procedures were higher compared to those from BayesB and BayesC, partly due to automatic inclusion of parent average in the former. Manhattan plots had higher resolution with 5 and 100 QTL. Using a common weight for a window of 20 SNP that sums or averages the SNP variance enhances accuracy of predicting GEBV and provides accurate estimation of marker effects. © 2016 Zhang, Lourenco, Aguilar, Legarra and Misztal.
dc.identifier.none.fl_str_mv http://www.ainfo.inia.uy/consulta/busca?b=pc&id=59369&biblioteca=vazio&busca=59369&qFacets=59369
dc.language.iso.none.fl_str_mv en
eng
dc.rights.es.fl_str_mv Acceso abierto
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
dc.source.none.fl_str_mv reponame:AINFO
instname:Instituto Nacional de Investigación Agropecuaria
instacron:Instituto Nacional de Investigación Agropecuaria
dc.subject.none.fl_str_mv BayesB
BayesC
GENOME-WIDE ASSOCIATION
SNP WINDOW
WssGBLUP
dc.title.none.fl_str_mv Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
dc.type.none.fl_str_mv Article
PublishedVersion
info:eu-repo/semantics/article
dc.type.version.none.fl_str_mv info:eu-repo/semantics/publishedVersion
description ABSTRACT.Genomic Best Linear Unbiased Predictor (GBLUP) assumes equal variance for all single nucleotide polymorphisms (SNP). When traits are influenced by major SNP, Bayesian methods have the advantage of SNP selection. To overcome the limitation of GBLUP, unequal variance or weights for all SNP are applied in a method called weighted GBLUP (WGBLUP). If only a fraction of animals is genotyped, single-step WGBLUP (WssGBLUP) can be used. Default weights in WGBLUP or WssGBLUP are obtained iteratively based on single SNP effect squared (u2) and/or heterozygosity. When the weights are optimal, prediction accuracy, and ability to detect major SNP are maximized. The objective was to develop optimal weights for WGBLUP-based methods. We evaluated 5 new procedures that accounted for locus-specific or windows-specific variance to maximize accuracy of predicting genomic estimated breeding value (GEBV) and SNP effect. Simulated datasets consisted of phenotypes for 13,000 animals, including 1540 animals genotyped for 45,000 SNP. Scenarios with 5, 100, and 500 simulated quantitative trait loci (QTL) were considered. The 5 new procedures for SNP weighting were: (1) u2 plus a constant equal to the weight of the top SNP; (2) from a heavy-tailed distribution (similar to BayesA); (3) for every 20 SNP in a window along the whole genome, the largest effect (u2) among them; (4) the mean effect of every 20 SNP; and (5) the summation of every 20 SNP. Those methods were compared to the default WssGBLUP, GBLUP, BayesB, and BayesC. WssGBLUP methods were evaluated over 10 iterations. The accuracy of predicting GEBV was the correlation between true and estimated genomic breeding values for 300 genotyped animals from the last generation. The ability to detect the simulated QTL was also investigated. For most of the QTL scenarios, the accuracies obtained with all WssGBLUP procedures were higher compared to those from BayesB and BayesC, partly due to automatic inclusion of parent average in the former. Manhattan plots had higher resolution with 5 and 100 QTL. Using a common weight for a window of 20 SNP that sums or averages the SNP variance enhances accuracy of predicting GEBV and provides accurate estimation of marker effects. © 2016 Zhang, Lourenco, Aguilar, Legarra and Misztal.
eu_rights_str_mv openAccess
format article
id INIAOAI_bff35f5a84991581f7aa5da6e2cc3485
instacron_str Instituto Nacional de Investigación Agropecuaria
institution Instituto Nacional de Investigación Agropecuaria
instname_str Instituto Nacional de Investigación Agropecuaria
language eng
language_invalid_str_mv en
network_acronym_str INIAOAI
network_name_str AINFO
oai_identifier_str oai:redi.anii.org.uy:20.500.12381/3110
publishDate 2016
reponame_str AINFO
repository.mail.fl_str_mv lorrego@inia.org.uy
repository.name.fl_str_mv AINFO - Instituto Nacional de Investigación Agropecuaria
repository_id_str
rights_invalid_str_mv Acceso abierto
spelling 2022-12-16T21:06:35Z2022-12-16T21:06:35Z20162022-12-16T21:06:35Zhttp://www.ainfo.inia.uy/consulta/busca?b=pc&id=59369&biblioteca=vazio&busca=59369&qFacets=59369ABSTRACT.Genomic Best Linear Unbiased Predictor (GBLUP) assumes equal variance for all single nucleotide polymorphisms (SNP). When traits are influenced by major SNP, Bayesian methods have the advantage of SNP selection. To overcome the limitation of GBLUP, unequal variance or weights for all SNP are applied in a method called weighted GBLUP (WGBLUP). If only a fraction of animals is genotyped, single-step WGBLUP (WssGBLUP) can be used. Default weights in WGBLUP or WssGBLUP are obtained iteratively based on single SNP effect squared (u2) and/or heterozygosity. When the weights are optimal, prediction accuracy, and ability to detect major SNP are maximized. The objective was to develop optimal weights for WGBLUP-based methods. We evaluated 5 new procedures that accounted for locus-specific or windows-specific variance to maximize accuracy of predicting genomic estimated breeding value (GEBV) and SNP effect. Simulated datasets consisted of phenotypes for 13,000 animals, including 1540 animals genotyped for 45,000 SNP. Scenarios with 5, 100, and 500 simulated quantitative trait loci (QTL) were considered. The 5 new procedures for SNP weighting were: (1) u2 plus a constant equal to the weight of the top SNP; (2) from a heavy-tailed distribution (similar to BayesA); (3) for every 20 SNP in a window along the whole genome, the largest effect (u2) among them; (4) the mean effect of every 20 SNP; and (5) the summation of every 20 SNP. Those methods were compared to the default WssGBLUP, GBLUP, BayesB, and BayesC. WssGBLUP methods were evaluated over 10 iterations. The accuracy of predicting GEBV was the correlation between true and estimated genomic breeding values for 300 genotyped animals from the last generation. The ability to detect the simulated QTL was also investigated. For most of the QTL scenarios, the accuracies obtained with all WssGBLUP procedures were higher compared to those from BayesB and BayesC, partly due to automatic inclusion of parent average in the former. Manhattan plots had higher resolution with 5 and 100 QTL. Using a common weight for a window of 20 SNP that sums or averages the SNP variance enhances accuracy of predicting GEBV and provides accurate estimation of marker effects. © 2016 Zhang, Lourenco, Aguilar, Legarra and Misztal.https://hdl.handle.net/20.500.12381/3110enenginfo:eu-repo/semantics/openAccessAcceso abiertoBayesBBayesCGENOME-WIDE ASSOCIATIONSNP WINDOWWssGBLUPWeighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.ArticlePublishedVersioninfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionreponame:AINFOinstname:Instituto Nacional de Investigación Agropecuariainstacron:Instituto Nacional de Investigación AgropecuariaZHANG, X.LOURENCO, D.AGUILAR, I.LEGARRA, A.MISZTAL, I.SWORDsword-2022-12-16T18:06:35.original.xmlOriginal SWORD entry documentapplication/octet-stream3454https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3110/1/sword-2022-12-16T18%3a06%3a35.original.xml3554274a5c82506fc7dff412ae1bef50MD5120.500.12381/31102022-12-16 18:06:35.515oai:redi.anii.org.uy:20.500.12381/3110Gobiernohttp://inia.uyhttps://redi.anii.org.uy/oai/requestlorrego@inia.org.uyUruguayopendoar:2022-12-16T21:06:35AINFO - Instituto Nacional de Investigación Agropecuariafalse
spellingShingle Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
ZHANG, X.
BayesB
BayesC
GENOME-WIDE ASSOCIATION
SNP WINDOW
WssGBLUP
status_str publishedVersion
title Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
title_full Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
title_fullStr Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
title_full_unstemmed Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
title_short Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
title_sort Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.
topic BayesB
BayesC
GENOME-WIDE ASSOCIATION
SNP WINDOW
WssGBLUP
url http://www.ainfo.inia.uy/consulta/busca?b=pc&id=59369&biblioteca=vazio&busca=59369&qFacets=59369