Efficient Tensor Decomposition-Based Filter Pruning

Background Introduction Network Pruning is a crucial technique for designing efficient Convolutional Neural Network (CNN) models. By reducing memory footprint and computational demands, while maintaining or improving overall performance, it makes deploying CNNs on resource-constrained devices (such as mobile phones or embedded systems) feasible. Th...