/normxcorr/trunk : revision 31

To get this branch, use:

bzr branch
http://suren.me/webbzr/normxcorr/trunk

Viewing changes to dict_hw/README

Tags: single_gpu

CUDAfication of real-time module

added added

removed removed

transfer is interleaved with computations. Unfortunatelly, in image mode

the memory transfer is handled as computations and there is no interleave

is possible. Therefore, in most cases the fragment mode is faster compared

to image mode.

b'\\ No newline at end of file'

to image mode.

4. We probably can use the same buffer for cuda_base_buffer and cuda_data_buffer,

the problem the extra space should be zeroed, and in the base buffer more

data is filled. Another option is to unblock computations in load base (3D

copy?) and then we would no need it CP_BLOCK times, but just ones.