GPU Pooling Algorithm

3
Window Size
8
Threads
0
Current Step
Input Array (size=8)
Block 0
Shared Memory (TPB=8) • Parallel Processing
Sum Pooling (Window=3)
Output Array (size=8)