Multi-Temporal Resolution Convolutional Neural Networks for Acoustic Scene Classification

November 11, 2018 ยท Declared Dead ยท ๐Ÿ› Workshop on Detection and Classification of Acoustic Scenes and Events

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Alexander Schindler, Thomas Lidy, Andreas Rauber arXiv ID 1811.04419 Category cs.SD: Sound Cross-listed cs.MM, eess.AS Citations 14 Venue Workshop on Detection and Classification of Acoustic Scenes and Events Last Checked 3 months ago
Abstract
In this paper we present a Deep Neural Network architecture for the task of acoustic scene classification which harnesses information from increasing temporal resolutions of Mel-Spectrogram segments. This architecture is composed of separated parallel Convolutional Neural Networks which learn spectral and temporal representations for each input resolution. The resolutions are chosen to cover fine-grained characteristics of a scene's spectral texture as well as its distribution of acoustic events. The proposed model shows a 3.56% absolute improvement of the best performing single resolution model and 12.49% of the DCASE 2017 Acoustic Scenes Classification task baseline.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Sound

Died the same way โ€” ๐Ÿ‘ป Ghosted