[Documentation] CudaContext::AllocDeferredCpuMem #23485

axbycc-mark · 2025-01-24T20:02:40Z

Describe the documentation issue

Official documentation on implementing custom ops links to the repo code below.

onnxruntime/onnxruntime/test/testdata/custom_op_library/cuda/cuda_ops.cc

Lines 27 to 40 in 5f0b62c

    
           void KernelOne(const Ort::Custom::CudaContext& cuda_ctx, 
        
                          const Ort::Custom::Tensor<float>& X, 
        
                          const Ort::Custom::Tensor<float>& Y, 
        
                          Ort::Custom::Tensor<float>& Z) { 
        
             CUSTOM_ENFORCE(cuda_ctx.cuda_stream, "failed to fetch cuda stream"); 
        
             CUSTOM_ENFORCE(cuda_ctx.cudnn_handle, "failed to fetch cudnn handle"); 
        
             CUSTOM_ENFORCE(cuda_ctx.cublas_handle, "failed to fetch cublas handle"); 
        
             CUSTOM_ENFORCE(cuda_ctx.arena_extend_strategy == 0, "arena_extend_strategy mismatch"); 
        
             void* deferred_cpu_mem = cuda_ctx.AllocDeferredCpuMem(sizeof(int32_t)); 
        
             CUSTOM_ENFORCE(deferred_cpu_mem, "failed to allocate deferred cpu allocator"); 
        
             cuda_ctx.FreeDeferredCpuMem(deferred_cpu_mem); 
        
             auto z_raw = Z.Allocate(X.Shape()); 
        
             cuda_add(Z.NumberOfElement(), z_raw, X.Data(), Y.Data(), cuda_ctx.cuda_stream); 
        
           }

On line 35, we see a call to cuda_ctx.AllocDeferredCpuMem. This memory is then immediately freed. This raises some questions.

Is that line just 35 dead code?
Why would we use the CudaContext::deferred_cpu_allocator over the default standard allocator (malloc, free)?
What is the meaning of deferred? Do we have to wait for some condition before the memory becomes usable?
Are there any cases within a custom op kernel where the deferred_cpu_allocator will be empty?

Page / URL

https://github.com/microsoft/onnxruntime/blob/rel-1.17.0/onnxruntime/test/testdata/custom_op_library/cuda/cuda_ops.cc#L35

The text was updated successfully, but these errors were encountered:

axbycc-mark added the documentation improvements or additions to documentation; typically submitted using template label Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Documentation] CudaContext::AllocDeferredCpuMem #23485

[Documentation] CudaContext::AllocDeferredCpuMem #23485

axbycc-mark commented Jan 24, 2025 •

edited

Loading

[Documentation] CudaContext::AllocDeferredCpuMem #23485

[Documentation] CudaContext::AllocDeferredCpuMem #23485

Comments

axbycc-mark commented Jan 24, 2025 • edited Loading

Describe the documentation issue

Page / URL

axbycc-mark commented Jan 24, 2025 •

edited

Loading