Advanced HIP Workshop | Optimising compute with concurrent IO | Converting output to pinned memory

แชร์
ฝัง
  • เผยแพร่เมื่อ 3 ก.ค. 2024
  • This video is part of the HIP workshop playlist. View full playlist here: • Advanced HIP Workshop
    The 3D output array out_h on the host is allocated as pinned memory using hipHostMalloc to enable asynchronous copies. Silent failures were encountered when trying to use addresses within the pinned memory allocation as inputs to hipMemcpyAsync. Therefore I had to use hipMemcpy3DAsync to perform the copies asynchronously and pass in the host pointer returned by hipHostmalloc.
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น •