GPU Pooling Algorithm
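Figure: interactive animation of sum pooling on the GPU, shown at step 0.
Parameters: window size 3; 8 threads in a single block (Block 0).
Panels: Input Array (size=8), Shared Memory (TPB=8), Output Array (size=8).
Stage labels: Parallel Processing, Sum Pooling (Window=3).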
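
The figure's labels describe a single block of 8 threads that stages an 8-element input in shared memory and then produces an 8-element sum-pooled output with a window of 3. A minimal sketch of that pattern follows, written as a sequential Python simulation rather than a real GPU kernel. The function name `sum_pool_block` is hypothetical, and the window alignment is an assumption: the source does not say how the window is positioned, so this sketch uses a trailing window (each output sums the current element and up to two elements before it), which keeps the output the same size as the input.

```python
def sum_pool_block(a, window=3, tpb=8):
    """Simulate one thread block computing sum pooling via shared memory.

    Assumption: a trailing window, so out[i] sums a[max(0, i-window+1) .. i].
    """
    size = len(a)
    assert size <= tpb, "this sketch models a single block only"

    # Phase 1: each thread copies one input element into shared memory.
    shared = [0] * tpb
    for tid in range(tpb):          # stands in for tpb parallel threads
        if tid < size:
            shared[tid] = a[tid]

    # A real kernel needs a block-wide barrier here so that every thread
    # sees the fully populated shared buffer before reading neighbors.

    # Phase 2: each thread sums its window from shared memory.
    out = [0] * size
    for tid in range(tpb):
        if tid < size:
            start = max(0, tid - window + 1)   # clamp at the left edge
            out[tid] = sum(shared[start:tid + 1])
    return out


if __name__ == "__main__":
    print(sum_pool_block(list(range(8))))  # [0, 1, 3, 6, 9, 12, 15, 18]
```

The two loops over `tid` stand in for threads that would run concurrently on the device. Splitting the work into a load phase and a compute phase mirrors why a barrier is needed between them: thread `i` reads elements written by threads `i-1` and `i-2`.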