美文网首页
2021-04-07

2021-04-07

作者: 麦穗_4feb | 来源:发表于2021-04-07 11:17 被阅读0次
__global void MatAdd(float A[N][N], float B[N][N], float C[N][N])  
{  
    int i = blockIdx.x ∗ blockDim.x + threadIdx.x;  
    int j = blockIdx.y ∗ blockDim.y + threadIdx.y;  
    if (i < N && j < N)  
        C[i ][ j ] = A[i][ j ] + B[i][ j ];  
}  
int main()  
{  
    ...  
    // Kernel invocation  
    dim3 threadsPerBlock(16, 16);  
    dim3 numBlocks(N / threadsPerBlock.x, N / threadsPerBlock.y);  
    MatAdd<<<numBlocks, threadsPerBlock>>>(A, B, C);  
    ...  
}  

相关文章

网友评论

      本文标题:2021-04-07

      本文链接:https://www.haomeiwen.com/subject/eroykltx.html