vimarsana.com

Hree Dimensional Convolutional Neural Networks 3d Cnns News Today : Breaking News, Live Updates & Top Stories | Vimarsana

D3D: Dual 3-D Convolutional Network for Real-Time Action Recognition by Shengqin Jiang, Yuankai Qi et al

Abstract Three-dimensional convolutional neural networks (3D CNNs) have been explored to learn spatio-temporal information for video-based human action recognition. Expensive computational cost and memory demand resulted from standard 3D CNNs, however, hinder their application in practical scenarios. In this article, we address the aforementioned limitations by proposing a novel dual 3-D convolutional network (D3DNet) with two complementary lightweight branches. A coarse branch maintains large temporal receptive field by a fast temporal downsampling strategy and simulates the expensive 3-D convolutions using a combination of more efficient spatial convolutions and temporal convolutions. Meanwhile, a fine branch progressively downsamples the video in the temporal domain and adopts 3-D convolutional units with reduced channel capacities to capture multiresolution spatio-temporal information. Instead of learning these two branches independently, a shallow spatiotemporal downsampling mo

© 2025 Vimarsana

vimarsana © 2020. All Rights Reserved.