OpenCL仅在循环呼叫时停止运行

OpenCL stops running only while call in a loop

本文关键字:运行 呼叫 循环 OpenCL      更新时间:2023-10-16

当我尝试将数据循环回内核函数时,我的代码断开,几次迭代后,它停止工作,仅给出0作为答案,有人知道为什么吗?如果我循环整个调用内核的方法,但速度较慢

cl_mem *ptrInput = &Pressure_BUFF;
cl_mem *ptrOutput = &Pressure_OUT_BUFF;
for(int i = 0; i<Interaction_per_frame; i++){
    clSetKernelArg(kernel_2, 4, sizeof(Pressure_BUFF), ptrInput);
    clEnqueueNDRangeKernel(queue_2, kernel_2, 1, NULL,&work_units_per_kernel, NULL, 0, NULL, NULL);
    clFinish(queue_2);//Terminar de calcular
    cl_mem *ptrTpm = ptrInput;
    ptrInput = ptrOutput;
    ptrOutput = ptrTpm;
}
clEnqueueReadBuffer(queue_2, Pressure_OUT_BUFF, CL_TRUE, 0,sizeof(Pressure), Pressure, 0, NULL, NULL);

您不能仅仅更改输入存储器缓冲区,使输出未触及。否则数据的输入与输出相同。

最干净的方法是使用2个内核,因此您无需每次致电Setargs并完成。

//Create 2 buffers, A and B
bufA = clCreateBuffer(...);
bufB = clCreateBuffer(...);
//Create 2 kernels with same parameters
kernelAB = clCreateKernel(...);
kernelBA = clCreateKernel(...);
//Set one to input A output B, and the other in reverse
clSetKernelArgs(kernelAB, in, bufferA);
clSetKernelArgs(kernelAB, out, bufferB);
clSetKernelArgs(kernelBA, in, bufferB);
clSetKernelArgs(kernelBA, out, bufferA);
for(int i = 0; i<Interaction_per_frame; i++){
    clEnqueueNDRangeKernel(queue_2, i%2 ? kernelBA : kernelAB, 1, NULL,&work_units_per_kernel, NULL, 0, NULL, NULL);    
}
clEnqueueReadBuffer(queue_2, Interaction_per_frame%2 ? bufferB : bufferA, CL_TRUE, 0,sizeof(Pressure), Pressure, 0, NULL, NULL);