Avoiding synchronization to accelerate a CFD solver in GPU
Resumen:
The caffa3d.MBRi is an open source, GPU-aware, general purpose incompressible flow solver, aimed at providing a useful tool for numerical simulation of real world fluid flow problems that require both geometrical flexibility and parallel computation capabilities to afford tens and hundreds million cells simulations. At the core of this tool there are a number of linear solvers that can be selected according to the characteristics of the problem to solve. For band matrices, the most efficient linear solver included in caffa3d.MBRi is the Strongly Implicit Procedure (SIP) solver. The parallelization of this solver follows the hyper-planes strategy, where the computations in one hyper-plane bare no dependencies and can be executed in parallel, while the hyper-planes have to be processed sequentially. In this work, we analyze this strategy to reach an efficient GPU implementation of the SIP solver for the caffa3d.MBRi. In particular, we design and implement a self-scheduling procedure to avoid the overhead of CPU-GPU synchronization implied by the hyper-planes strategy, outperforming the standard GPU implementation of the SIP by approximately 2x.
2019 | |
Agencia Nacional de Investigación e Innovación | |
GPU CFD Solver SIP Ingeniería y Tecnología Ingeniería Mecánica |
|
Inglés | |
Agencia Nacional de Investigación e Innovación | |
REDI | |
https://hdl.handle.net/20.500.12381/3190 | |
Acceso abierto | |
Reconocimiento 4.0 Internacional. (CC BY) |
_version_ | 1814959262232739840 |
---|---|
author | Dufrechou, Ernesto |
author2 | Ezzatti, Pablo Usera, Gabriel |
author2_role | author author |
author_facet | Dufrechou, Ernesto Ezzatti, Pablo Usera, Gabriel |
author_role | author |
bitstream.checksum.fl_str_mv | 3c9d86d36485746409b4281a0893d729 34ae9b0f4d89423912d80291b124f327 210f6659a1fde4c9023c95b94fc5e318 |
bitstream.checksumAlgorithm.fl_str_mv | MD5 MD5 MD5 |
bitstream.url.fl_str_mv | https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3190/3/license.txt https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3190/1/Anexo_I_Dufrechou_et_al_2019_SBAC_PAD.pdf https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3190/2/Anexo_II_Dufrechou_2019_Award_SBAC_PAD.pdf |
collection | REDI |
dc.creator.none.fl_str_mv | Dufrechou, Ernesto Ezzatti, Pablo Usera, Gabriel |
dc.date.accessioned.none.fl_str_mv | 2023-04-13T13:47:22Z |
dc.date.available.none.fl_str_mv | 2023-04-13T13:47:22Z |
dc.date.issued.none.fl_str_mv | 2019-10-20 |
dc.description.abstract.none.fl_txt_mv | The caffa3d.MBRi is an open source, GPU-aware, general purpose incompressible flow solver, aimed at providing a useful tool for numerical simulation of real world fluid flow problems that require both geometrical flexibility and parallel computation capabilities to afford tens and hundreds million cells simulations. At the core of this tool there are a number of linear solvers that can be selected according to the characteristics of the problem to solve. For band matrices, the most efficient linear solver included in caffa3d.MBRi is the Strongly Implicit Procedure (SIP) solver. The parallelization of this solver follows the hyper-planes strategy, where the computations in one hyper-plane bare no dependencies and can be executed in parallel, while the hyper-planes have to be processed sequentially. In this work, we analyze this strategy to reach an efficient GPU implementation of the SIP solver for the caffa3d.MBRi. In particular, we design and implement a self-scheduling procedure to avoid the overhead of CPU-GPU synchronization implied by the hyper-planes strategy, outperforming the standard GPU implementation of the SIP by approximately 2x. |
dc.description.sponsorship.none.fl_txt_mv | Agencia Nacional de Investigación e Innovación |
dc.identifier.anii.es.fl_str_mv | FSE_1_2016_1_131837 |
dc.identifier.isbn.none.fl_str_mv | 978-1-7281-4194-7 |
dc.identifier.uri.none.fl_str_mv | https://hdl.handle.net/20.500.12381/3190 |
dc.language.iso.none.fl_str_mv | eng |
dc.relation.uri.none.fl_str_mv | https://hdl.handle.net/20.500.12381/3189 https://hdl.handle.net/20.500.12381/3191 https://hdl.handle.net/20.500.12381/3192 https://hdl.handle.net/20.500.12381/3193 https://hdl.handle.net/20.500.12381/3194 |
dc.rights.es.fl_str_mv | Acceso abierto |
dc.rights.license.none.fl_str_mv | Reconocimiento 4.0 Internacional. (CC BY) |
dc.rights.none.fl_str_mv | info:eu-repo/semantics/openAccess |
dc.source.es.fl_str_mv | 31st International Symposium on Computer Architecture and High Performance Computing. Campo Grande – MS, Brasil, 15-18 de octubre de 2019 |
dc.source.none.fl_str_mv | reponame:REDI instname:Agencia Nacional de Investigación e Innovación instacron:Agencia Nacional de Investigación e Innovación |
dc.subject.anii.none.fl_str_mv | Ingeniería y Tecnología Ingeniería Mecánica |
dc.subject.es.fl_str_mv | GPU CFD Solver SIP |
dc.title.none.fl_str_mv | Avoiding synchronization to accelerate a CFD solver in GPU |
dc.type.es.fl_str_mv | Documento de conferencia |
dc.type.none.fl_str_mv | info:eu-repo/semantics/conferenceObject |
dc.type.version.es.fl_str_mv | Publicado |
dc.type.version.none.fl_str_mv | info:eu-repo/semantics/publishedVersion |
description | The caffa3d.MBRi is an open source, GPU-aware, general purpose incompressible flow solver, aimed at providing a useful tool for numerical simulation of real world fluid flow problems that require both geometrical flexibility and parallel computation capabilities to afford tens and hundreds million cells simulations. At the core of this tool there are a number of linear solvers that can be selected according to the characteristics of the problem to solve. For band matrices, the most efficient linear solver included in caffa3d.MBRi is the Strongly Implicit Procedure (SIP) solver. The parallelization of this solver follows the hyper-planes strategy, where the computations in one hyper-plane bare no dependencies and can be executed in parallel, while the hyper-planes have to be processed sequentially. In this work, we analyze this strategy to reach an efficient GPU implementation of the SIP solver for the caffa3d.MBRi. In particular, we design and implement a self-scheduling procedure to avoid the overhead of CPU-GPU synchronization implied by the hyper-planes strategy, outperforming the standard GPU implementation of the SIP by approximately 2x. |
eu_rights_str_mv | openAccess |
format | conferenceObject |
id | REDI_3496679e7d2163ac6a67ff93c501a008 |
identifier_str_mv | 978-1-7281-4194-7 FSE_1_2016_1_131837 |
instacron_str | Agencia Nacional de Investigación e Innovación |
institution | Agencia Nacional de Investigación e Innovación |
instname_str | Agencia Nacional de Investigación e Innovación |
language | eng |
network_acronym_str | REDI |
network_name_str | REDI |
oai_identifier_str | oai:redi.anii.org.uy:20.500.12381/3190 |
publishDate | 2019 |
reponame_str | REDI |
repository.mail.fl_str_mv | jmaldini@anii.org.uy |
repository.name.fl_str_mv | REDI - Agencia Nacional de Investigación e Innovación |
repository_id_str | 9421 |
rights_invalid_str_mv | Reconocimiento 4.0 Internacional. (CC BY) Acceso abierto |
spelling | Reconocimiento 4.0 Internacional. (CC BY)Acceso abiertoinfo:eu-repo/semantics/openAccess2023-04-13T13:47:22Z2023-04-13T13:47:22Z2019-10-20978-1-7281-4194-7https://hdl.handle.net/20.500.12381/3190FSE_1_2016_1_131837The caffa3d.MBRi is an open source, GPU-aware, general purpose incompressible flow solver, aimed at providing a useful tool for numerical simulation of real world fluid flow problems that require both geometrical flexibility and parallel computation capabilities to afford tens and hundreds million cells simulations. At the core of this tool there are a number of linear solvers that can be selected according to the characteristics of the problem to solve. For band matrices, the most efficient linear solver included in caffa3d.MBRi is the Strongly Implicit Procedure (SIP) solver. The parallelization of this solver follows the hyper-planes strategy, where the computations in one hyper-plane bare no dependencies and can be executed in parallel, while the hyper-planes have to be processed sequentially. In this work, we analyze this strategy to reach an efficient GPU implementation of the SIP solver for the caffa3d.MBRi. In particular, we design and implement a self-scheduling procedure to avoid the overhead of CPU-GPU synchronization implied by the hyper-planes strategy, outperforming the standard GPU implementation of the SIP by approximately 2x.Agencia Nacional de Investigación e Innovaciónenghttps://hdl.handle.net/20.500.12381/3189https://hdl.handle.net/20.500.12381/3191https://hdl.handle.net/20.500.12381/3192https://hdl.handle.net/20.500.12381/3193https://hdl.handle.net/20.500.12381/319431st International Symposium on Computer Architecture and High Performance Computing. Campo Grande – MS, Brasil, 15-18 de octubre de 2019reponame:REDIinstname:Agencia Nacional de Investigación e Innovacióninstacron:Agencia Nacional de Investigación e InnovaciónGPUCFDSolver SIPIngeniería y TecnologíaIngeniería MecánicaAvoiding synchronization to accelerate a CFD solver in GPUDocumento de conferenciaPublicadoinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjectUniversidad de la República//Ingeniería y Tecnología/Ingeniería Mecánica/Ingeniería MecánicaDufrechou, ErnestoEzzatti, PabloUsera, GabrielLICENSElicense.txtlicense.txttext/plain; charset=utf-84944https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3190/3/license.txt3c9d86d36485746409b4281a0893d729MD53ORIGINALAnexo_I_Dufrechou_et_al_2019_SBAC_PAD.pdfAnexo_I_Dufrechou_et_al_2019_SBAC_PAD.pdfpaperapplication/pdf361626https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3190/1/Anexo_I_Dufrechou_et_al_2019_SBAC_PAD.pdf34ae9b0f4d89423912d80291b124f327MD51Anexo_II_Dufrechou_2019_Award_SBAC_PAD.pdfAnexo_II_Dufrechou_2019_Award_SBAC_PAD.pdfawardapplication/pdf93132https://redi.anii.org.uy/jspui/bitstream/20.500.12381/3190/2/Anexo_II_Dufrechou_2019_Award_SBAC_PAD.pdf210f6659a1fde4c9023c95b94fc5e318MD5220.500.12381/31902023-04-13 11:19:32.424oai:redi.anii.org.uy:20.500.12381/3190PHA+PGI+QUNVRVJETyBERSBDRVNJT04gTk8gRVhDTFVTSVZBIERFIERFUkVDSE9TPC9iPjwvcD4KCjxwPkFjZXB0YW5kbyBsYSBjZXNpw7NuIGRlIGRlcmVjaG9zIGVsIHVzdWFyaW8gREVDTEFSQSBxdWUgb3N0ZW50YSBsYSBjb25kaWNpw7NuIGRlIGF1dG9yIGVuIGVsIHNlbnRpZG8gcXVlIG90b3JnYSBsYSBsZWdpc2xhY2nDs24gdmlnZW50ZSBzb2JyZSBwcm9waWVkYWQgaW50ZWxlY3R1YWwgZGUgbGEgb2JyYSBvcmlnaW5hbCBxdWUgZXN0w6EgZW52aWFuZG8gKOKAnGxhIG9icmHigJ0pLiBFbiBjYXNvIGRlIHNlciBjb3RpdHVsYXIsIGVsIGF1dG9yIGRlY2xhcmEgcXVlIGN1ZW50YSBjb24gZWwgIGNvbnNlbnRpbWllbnRvIGRlIGxvcyByZXN0YW50ZXMgdGl0dWxhcmVzIHBhcmEgaGFjZXIgbGEgcHJlc2VudGUgY2VzacOzbi4gRW4gY2FzbyBkZSBwcmV2aWEgY2VzacOzbiBkZSBsb3MgZGVyZWNob3MgZGUgZXhwbG90YWNpw7NuIHNvYnJlIGxhIG9icmEgYSB0ZXJjZXJvcywgZWwgYXV0b3IgZGVjbGFyYSBxdWUgdGllbmUgbGEgYXV0b3JpemFjacOzbiBleHByZXNhIGRlIGRpY2hvcyB0aXR1bGFyZXMgZGUgZGVyZWNob3MgYSBsb3MgZmluZXMgZGUgZXN0YSBjZXNpw7NuLCBvIGJpZW4gcXVlIGhhIGNvbnNlcnZhZG8gbGEgZmFjdWx0YWQgZGUgY2VkZXIgZXN0b3MgZGVyZWNob3MgZW4gbGEgZm9ybWEgcHJldmlzdGEgZW4gbGEgcHJlc2VudGUgY2VzacOzbi48L3A+Cgo8cD5Db24gZWwgZmluIGRlIGRhciBsYSBtw6F4aW1hIGRpZnVzacOzbiBhIGxhIG9icmEgYSB0cmF2w6lzIGRlbCByZXBvc2l0b3JpbyBkZSBhY2Nlc28gYWJpZXJ0byBSRURJIChodHRwczovL3JlZGkuYW5paS5vcmcudXkpLCBlbCBBVVRPUiBDRURFIGEgPGI+QWdlbmNpYSBOYWNpb25hbCBkZSBJbnZlc3RpZ2FjacOzbiBlIElubm92YWNpw7NuPC9iPiAoPGI+QU5JSTwvYj4pLCBkZSBmb3JtYSBncmF0dWl0YSB5IE5PIEVYQ0xVU0lWQSwgY29uIGNhcsOhY3RlciBpcnJldm9jYWJsZSBlIGlsaW1pdGFkbyBlbiBlbCB0aWVtcG8geSBjb24gw6FtYml0byBtdW5kaWFsLCBsb3MgZGVyZWNob3MgZGUgcmVwcm9kdWNjacOzbiwgZGUgZGlzdHJpYnVjacOzbiwgZGUgY29tdW5pY2FjacOzbiBww7pibGljYSwgaW5jbHVpZG8gZWwgZGVyZWNobyBkZSBwdWVzdGEgYSBkaXNwb3NpY2nDs24gZWxlY3Ryw7NuaWNhLCBwYXJhIHF1ZSBwdWVkYSBzZXIgdXRpbGl6YWRhIGRlIGZvcm1hIGxpYnJlIHkgZ3JhdHVpdGEgcG9yIHRvZG9zIGxvcyBxdWUgbG8gZGVzZWVuLjwvcD4KCjxwPkxhIGNlc2nDs24gc2UgcmVhbGl6YSBiYWpvIGxhcyBzaWd1aWVudGVzIGNvbmRpY2lvbmVzOjwvcD4KCjxwPkxhIHRpdHVsYXJpZGFkIGRlIGxhIG9icmEgc2VndWlyw6EgY29ycmVzcG9uZGllbmRvIGFsIEF1dG9yIHkgbGEgcHJlc2VudGUgY2VzacOzbiBkZSBkZXJlY2hvcyBwZXJtaXRpcsOhIGEgPGI+QU5JSTwvYj46PC9wPgoKPHVsPgo8bGkgdmFsdWU9KGEpPlRyYW5zZm9ybWFyIGxhIG9icmEgZW4gbGEgbWVkaWRhIGVuIHF1ZSBzZWEgbmVjZXNhcmlvIHBhcmEgYWRhcHRhcmxhIGEgY3VhbHF1aWVyIHRlY25vbG9nw61hIHN1c2NlcHRpYmxlIGRlIGluY29ycG9yYWNpw7NuIGEgSW50ZXJuZXQ7IHJlYWxpemFyIGxhcyBhZGFwdGFjaW9uZXMgbmVjZXNhcmlhcyBwYXJhIGhhY2VyIHBvc2libGUgc3UgYWNjZXNvIHkgdmlzdWFsaXphY2nDs24gcGVybWFuZW50ZSwgYcO6biBwb3IgcGFydGUgZGUgcGVyc29uYXMgY29uIGRpc2NhcGFjaWRhZCwgcmVhbGl6YXIgbGFzIG1pZ3JhY2lvbmVzIGRlIGZvcm1hdG9zIHBhcmEgYXNlZ3VyYXIgbGEgcHJlc2VydmFjacOzbiBhIGxhcmdvIHBsYXpvLCBpbmNvcnBvcmFyIGxvcyBtZXRhZGF0b3MgbmVjZXNhcmlvcyBwYXJhIHJlYWxpemFyIGVsIHJlZ2lzdHJvIGRlIGxhIG9icmEsIGUgaW5jb3Jwb3JhciB0YW1iacOpbiDigJxtYXJjYXMgZGUgYWd1YeKAnSBvIGN1YWxxdWllciBvdHJvIHNpc3RlbWEgZGUgc2VndXJpZGFkIG8gZGUgcHJvdGVjY2nDs24gbyBkZSBpZGVudGlmaWNhY2nDs24gZGUgcHJvY2VkZW5jaWEuIEVuIG5pbmfDum4gY2FzbyBkaWNoYXMgbW9kaWZpY2FjaW9uZXMgaW1wbGljYXLDoW4gYWR1bHRlcmFjaW9uZXMgZW4gZWwgY29udGVuaWRvIGRlIGxhIG9icmEuPC9saT4gCjxsaSB2YWx1ZT0oYik+UmVwcm9kdWNpciBsYSBvYnJhIGVuIHVuIG1lZGlvIGRpZ2l0YWwgcGFyYSBzdSBpbmNvcnBvcmFjacOzbiBhIHNpc3RlbWFzIGRlIGLDunNxdWVkYSB5IHJlY3VwZXJhY2nDs24sIGluY2x1eWVuZG8gZWwgZGVyZWNobyBhIHJlcHJvZHVjaXIgeSBhbG1hY2VuYXJsYSBlbiBzZXJ2aWRvcmVzIHUgb3Ryb3MgbWVkaW9zIGRpZ2l0YWxlcyBhIGxvcyBlZmVjdG9zIGRlIHNlZ3VyaWRhZCB5IHByZXNlcnZhY2nDs24uPC9saT4gCjxsaSB2YWx1ZT0oYyk+UGVybWl0aXIgYSBsb3MgdXN1YXJpb3MgbGEgZGVzY2FyZ2EgZGUgY29waWFzIGVsZWN0csOzbmljYXMgZGUgbGEgb2JyYSBlbiB1biBzb3BvcnRlIGRpZ2l0YWwuPC9saT4gCjxsaSB2YWx1ZT0oZCk+UmVhbGl6YXIgbGEgY29tdW5pY2FjacOzbiBww7pibGljYSB5IHB1ZXN0YSBhIGRpc3Bvc2ljacOzbiBkZSBsYSBvYnJhIGFjY2VzaWJsZSBkZSBtb2RvIGxpYnJlIHkgZ3JhdHVpdG8gYSB0cmF2w6lzIGRlIEludGVybmV0Lgo8L3VsPgoKPHA+RW4gdmlydHVkIGRlbCBjYXLDoWN0ZXIgbm8gZXhjbHVzaXZvIGRlIGxhIGNlc2nDs24sIGVsIEF1dG9yIGNvbnNlcnZhIHRvZG9zIGxvcyBkZXJlY2hvcyBkZSBhdXRvciBzb2JyZSBsYSBvYnJhLCB5IHBvZHLDoSBwb25lcmxhIGEgZGlzcG9zaWNpw7NuIGRlbCBww7pibGljbyBlbiBlc3RhIHkgZW4gcG9zdGVyaW9yZXMgdmVyc2lvbmVzLCBhIHRyYXbDqXMgZGUgbG9zIG1lZGlvcyBxdWUgZXN0aW1lIG9wb3J0dW5vcy48L3A+Cgo8cD5FbCBBdXRvciBkZWNsYXJhIGJham8ganVyYW1lbnRvIHF1ZSBsYSBwcmVzZW50ZSBjZXNpw7NuIG5vIGluZnJpbmdlIG5pbmfDum4gZGVyZWNobyBkZSB0ZXJjZXJvcywgeWEgc2VhbiBkZSBwcm9waWVkYWQgaW5kdXN0cmlhbCwgaW50ZWxlY3R1YWwgbyBjdWFscXVpZXIgb3RybyB5IGdhcmFudGl6YSBxdWUgZWwgY29udGVuaWRvIGRlIGxhIG9icmEgbm8gYXRlbnRhIGNvbnRyYSBsb3MgZGVyZWNob3MgYWwgaG9ub3IsIGEgbGEgaW50aW1pZGFkIHkgYSBsYSBpbWFnZW4gZGUgdGVyY2Vyb3MsIG5pIGVzIGRpc2NyaW1pbmF0b3Jpby4gPGI+QU5JSTwvYj4gZXN0YXLDoSBleGVudGEgZGUgbGEgcmV2aXNpw7NuIGRlbCBjb250ZW5pZG8gZGUgbGEgb2JyYSwgcXVlIGVuIHRvZG8gY2FzbyBwZXJtYW5lY2Vyw6EgYmFqbyBsYSByZXNwb25zYWJpbGlkYWQgZXhjbHVzaXZhIGRlbCBBdXRvci48L3A+Cgo8cD5MYSBvYnJhIHNlIHBvbmRyw6EgYSBkaXNwb3NpY2nDs24gZGUgbG9zIHVzdWFyaW9zIHBhcmEgcXVlIGhhZ2FuIGRlIGVsbGEgdW4gdXNvIGp1c3RvIHkgcmVzcGV0dW9zbyBkZSBsb3MgZGVyZWNob3MgZGVsIGF1dG9yIHkgY29uIGZpbmVzIGRlIGVzdHVkaW8sIGludmVzdGlnYWNpw7NuLCBvIGN1YWxxdWllciBvdHJvIGZpbiBsw61jaXRvLiBFbCBtZW5jaW9uYWRvIHVzbywgbcOhcyBhbGzDoSBkZSBsYSBjb3BpYSBwcml2YWRhLCByZXF1ZXJpcsOhIHF1ZSBzZSBjaXRlIGxhIGZ1ZW50ZSB5IHNlIHJlY29ub3pjYSBsYSBhdXRvcsOtYS4gQSB0YWxlcyBmaW5lcyBlbCBBdXRvciBhY2VwdGEgZWwgdXNvIGRlIGxpY2VuY2lhcyBDcmVhdGl2ZSBDb21tb25zIHkgRUxJR0UgdW5hIGRlIGVzdGFzIGxpY2VuY2lhcyBlc3RhbmRhcml6YWRhcyBhIGxvcyBmaW5lcyBkZSBjb211bmljYXIgc3Ugb2JyYS48L3A+Cgo8cD5FbCBBdXRvciwgY29tbyBnYXJhbnRlIGRlIGxhIGF1dG9yw61hIGRlIGxhIG9icmEgeSBlbiByZWxhY2nDs24gYSBsYSBtaXNtYSwgZGVjbGFyYSBxdWUgPGI+QU5JSTwvYj4gc2UgZW5jdWVudHJhIGxpYnJlIGRlIHRvZG8gdGlwbyBkZSByZXNwb25zYWJpbGlkYWQsIHNlYSDDqXN0YSBjaXZpbCwgYWRtaW5pc3RyYXRpdmEgbyBwZW5hbCwgeSBxdWUgw6lsIG1pc21vIGFzdW1lIGxhIHJlc3BvbnNhYmlsaWRhZCBmcmVudGUgYSBjdWFscXVpZXIgcmVjbGFtbyBvIGRlbWFuZGEgcG9yIHBhcnRlIGRlIHRlcmNlcm9zLiBMYSA8Yj5BTklJPC9iPiBlc3RhcsOhIGV4ZW50YSBkZSBlamVyY2l0YXIgYWNjaW9uZXMgbGVnYWxlcyBlbiBub21icmUgZGVsIEF1dG9yIGVuIGVsIHN1cHVlc3RvIGRlIGluZnJhY2Npb25lcyBhIGRlcmVjaG9zIGRlIHByb3BpZWRhZCBpbnRlbGVjdHVhbCBkZXJpdmFkb3MgZGVsIGRlcMOzc2l0byB5IGFyY2hpdm8gZGUgbGEgb2JyYS48L3A+Cgo8cD48Yj5BTklJPC9iPiBub3RpZmljYXLDoSBhbCBBdXRvciBkZSBjdWFscXVpZXIgcmVjbGFtYWNpw7NuIHF1ZSByZWNpYmEgZGUgdGVyY2Vyb3MgZW4gcmVsYWNpw7NuIGNvbiBsYSBvYnJhIHksIGVuIHBhcnRpY3VsYXIsIGRlIHJlY2xhbWFjaW9uZXMgcmVsYXRpdmFzIGEgbG9zIGRlcmVjaG9zIGRlIHByb3BpZWRhZCBpbnRlbGVjdHVhbCBzb2JyZSBlbGxhLjwvcD4KCjxwPkVsIEF1dG9yIHBvZHLDoSBzb2xpY2l0YXIgZWwgcmV0aXJvIG8gbGEgaW52aXNpYmlsaXphY2nDs24gZGUgbGEgb2JyYSBkZSBSRURJIHPDs2xvIHBvciBjYXVzYSBqdXN0aWZpY2FkYS4gQSB0YWwgZmluIGRlYmVyw6EgbWFuaWZlc3RhciBzdSB2b2x1bnRhZCBlbiBmb3JtYSBmZWhhY2llbnRlIHkgYWNyZWRpdGFyIGRlYmlkYW1lbnRlIGxhIGNhdXNhIGp1c3RpZmljYWRhLiBBc2ltaXNtbyA8Yj5BTklJPC9iPiBwb2Ryw6EgcmV0aXJhciBvIGludmlzaWJpbGl6YXIgbGEgb2JyYSBkZSBSRURJLCBwcmV2aWEgbm90aWZpY2FjacOzbiBhbCBBdXRvciwgZW4gc3VwdWVzdG9zIHN1ZmljaWVudGVtZW50ZSBqdXN0aWZpY2Fkb3MsIG8gZW4gY2FzbyBkZSByZWNsYW1hY2lvbmVzIGRlIHRlcmNlcm9zLjwvcD4KGobiernohttps://www.anii.org.uy/https://redi.anii.org.uy/oai/requestjmaldini@anii.org.uyUruguayopendoar:94212023-04-13T14:19:32REDI - Agencia Nacional de Investigación e Innovaciónfalse |
spellingShingle | Avoiding synchronization to accelerate a CFD solver in GPU Dufrechou, Ernesto GPU CFD Solver SIP Ingeniería y Tecnología Ingeniería Mecánica |
status_str | publishedVersion |
title | Avoiding synchronization to accelerate a CFD solver in GPU |
title_full | Avoiding synchronization to accelerate a CFD solver in GPU |
title_fullStr | Avoiding synchronization to accelerate a CFD solver in GPU |
title_full_unstemmed | Avoiding synchronization to accelerate a CFD solver in GPU |
title_short | Avoiding synchronization to accelerate a CFD solver in GPU |
title_sort | Avoiding synchronization to accelerate a CFD solver in GPU |
topic | GPU CFD Solver SIP Ingeniería y Tecnología Ingeniería Mecánica |
url | https://hdl.handle.net/20.500.12381/3190 |