Dilated, Residual, Gated CNN

This is a PyTorch implementation of the network presented in Chang et al "Temporal Modeling Using Dilated Convolution and Gating for Voice-Activity-Detection" 2018 Link to paper

The network is used for Voice Activity Detection (VAD) in the paper

Network Architecture

The core network arcitecture can be seen in the drawing below

The original paper does not state how they do the dimension matching and flattening to the fully connected layer in the end of the network. For the dimension matching, simple 2D convolutions were used. For the flattening, two consecutive 1x1 convolutions were used before flattening to the fully connected layer.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
images		images
README.md		README.md
netModules.py		netModules.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dilated, Residual, Gated CNN

Network Architecture

About

Releases

Packages

Languages

lhl1001/residual-gated-dilated-CNN

Folders and files

Latest commit

History

Repository files navigation

Dilated, Residual, Gated CNN

Network Architecture

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages