Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection

February 21, 2017 ยท Declared Dead ยท ๐Ÿ› IEEE/ACM Transactions on Audio Speech and Language Processing

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Emre ร‡akฤฑr, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, Tuomas Virtanen arXiv ID 1702.06286 Category cs.LG: Machine Learning Cross-listed cs.SD Citations 589 Venue IEEE/ACM Transactions on Audio Speech and Language Processing Last Checked 4 months ago
Abstract
Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure. Convolutional neural networks (CNN) are able to extract higher level features that are invariant to local spectral and temporal variations. Recurrent neural networks (RNNs) are powerful in learning the longer term temporal context in the audio signals. CNNs and RNNs as classifiers have recently shown improved performances over established methods in various sound recognition tasks. We combine these two approaches in a Convolutional Recurrent Neural Network (CRNN) and apply it on a polyphonic sound event detection task. We compare the performance of the proposed CRNN method with CNN, RNN, and other established methods, and observe a considerable improvement for four different datasets consisting of everyday sound events.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning

Died the same way โ€” ๐Ÿ‘ป Ghosted