jeudi 1 novembre 2018

CUDA: Unresolved extern function 'cudaMemcpyAsync'

I keep getting compilation error:

ptxas fatal   : Unresolved extern function 'cudaMemcpyAsync'

in file buffer.cuh:

__device__ void markBuffer(volatile bool* is_ready_for_write_list, void* volatile * data_list, void* data, size_t num_samples_per_read, size_t sample_offset, void** tensor_list) {
    size_t index = getIndex(num_samples_per_read, sample_offset);

    data_list[index] = data;
    cudaMemcpyAsync((tensor_list)[index], ((int*)(data_list[index])) - (num_samples_per_read - 1), num_samples_per_read * 4, cudaMemcpyDeviceToDevice, NULL);
    is_ready_for_write_list[index] = true;
}

buffer.cuh is inculuded in nv_wavenet.cuh, and nv_wavenet.cuh is included in nv_wavenet_test.cu.

I am compiling with :

nvcc -arch=sm_61 -std=c++11  -g --use_fast_math -G -g -maxrregcount 128 nv_wavenet_test.cu matrix.cpp nv_wavenet_reference.cpp -o nv_wavenet_test

Aucun commentaire:

Enregistrer un commentaire