Pinned memory (page-locked memory) is host memory that is locked in physical RAM and cannot be paged out by the OS. Because its physical address stays fixed, the GPU can access it directly via DMA, which enables faster and truly asynchronous transfers between host and device. Pageable memory, by contrast, must first be copied by the driver into a temporary pinned staging buffer before the transfer can take place.
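It is allocated with cudaMallocHost() (or cudaHostAlloc()) and released with cudaFreeHost(). The tradeoff is that pinned pages reduce the physical memory available to the OS and other processes, so pinning too much can degrade overall system performance.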
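As a rough illustration, the sketch below allocates pinned host buffers with cudaMallocHost(), issues asynchronous copies on a stream, and checks that the data survives a round trip through the device. The helper name pinned_roundtrip, the CUDA_CHECK macro, and the buffer size are assumptions made for this example and are not part of the CUDA API. It is one possible usage pattern, not a tuned implementation.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Illustrative macro (not part of CUDA): abort if a runtime call fails.
#define CUDA_CHECK(call)                                                  \
    do {                                                                  \
        cudaError_t err_ = (call);                                        \
        if (err_ != cudaSuccess) {                                        \
            std::fprintf(stderr, "CUDA error: %s (%s:%d)\n",              \
                         cudaGetErrorString(err_), __FILE__, __LINE__);   \
            std::exit(EXIT_FAILURE);                                      \
        }                                                                 \
    } while (0)

// Hypothetical helper: copies n floats host -> device -> host through
// pinned buffers and returns true if the data comes back unchanged.
bool pinned_roundtrip(size_t n) {
    const size_t bytes = n * sizeof(float);
    float *h_src = nullptr, *h_dst = nullptr, *d_buf = nullptr;

    // Page-locked host allocations; the GPU can DMA to and from these.
    CUDA_CHECK(cudaMallocHost((void**)&h_src, bytes));
    CUDA_CHECK(cudaMallocHost((void**)&h_dst, bytes));
    CUDA_CHECK(cudaMalloc((void**)&d_buf, bytes));

    for (size_t i = 0; i < n; ++i) h_src[i] = static_cast<float>(i);

    cudaStream_t stream;
    CUDA_CHECK(cudaStreamCreate(&stream));

    // With pinned host memory these calls can return before the copies
    // finish, so the CPU is free to do other work in the meantime.
    CUDA_CHECK(cudaMemcpyAsync(d_buf, h_src, bytes,
                               cudaMemcpyHostToDevice, stream));
    CUDA_CHECK(cudaMemcpyAsync(h_dst, d_buf, bytes,
                               cudaMemcpyDeviceToHost, stream));

    // Wait for both copies before reading h_dst on the host.
    CUDA_CHECK(cudaStreamSynchronize(stream));

    bool ok = true;
    for (size_t i = 0; i < n; ++i) {
        if (h_dst[i] != h_src[i]) { ok = false; break; }
    }

    CUDA_CHECK(cudaStreamDestroy(stream));
    CUDA_CHECK(cudaFree(d_buf));
    // Pinned memory must be released with cudaFreeHost(), not free().
    CUDA_CHECK(cudaFreeHost(h_dst));
    CUDA_CHECK(cudaFreeHost(h_src));
    return ok;
}

int main() {
    const bool ok = pinned_roundtrip(1 << 20);  // 4 MiB per buffer (arbitrary)
    std::printf("round trip %s\n", ok ? "succeeded" : "failed");
    return ok ? 0 : 1;
}
```

Assuming a CUDA-capable device and toolkit are available, this should build with something like nvcc pinned.cu -o pinned. If the host buffers came from malloc() instead, the cudaMemcpyAsync calls would still work, but they would go through the driver's staging buffer and generally would not overlap with host work.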