Utilize este identificador para referenciar este registo: http://hdl.handle.net/10400.14/4801
Título: Modeling Grouping Cues for Auditory Scene Analysis Using a Spectral Clustering Formulation
Autor: Martins, Luís Gustavo
Lagrange, Mathieu
Tzanetaks, George
Data: 2011
Editora: InformatIon Science Reference
Citação: MARTINS, Luís Gustavo ; LAGRANGE, Mathieu ; TZANETAKIS, George - Modeling Grouping Cues for Auditory Scene Analysis Using a Spectral Clustering Formulation. In WANG, Wenwu - Machine Audition: Principles, Algorithms and Systems. Hershey: InformatIon Science Reference, cop. 2011. ISBN 978-1-61520-919-4. Cap. 2, p. 22-60.
Resumo: Computational Auditory Scene Analysis (CASA) is challenging problem for which many different approaches have been proposed. These approaches can be based on statistical and signal processing methods such as Independent Component Analysis or can be based on our current knowledge about human auditory perception. Learning happens at the boundary interactions between prior knowledge and incoming data. Separating complex mixtures of sound sources such as music requires a complex interplay between prior knowledge and analysis of incoming data. Many approaches to CASA can also be broadly categorized as either model-based or grouping-based. Although it is known that our perceptual-system utilizes both of these types of processing, building such systems computationally has been challenging. As a result most existing systems either rely on prior source models or are solely based on grouping cues. In this chapter the authors argue that formulating this integration problem as clustering based on similarities between time-frequency atoms provides an expressive yet disciplined approach to building sound source characterization and separation systems and evaluating their performance. After describing the main components of such an architecture, the authors describe a concrete realization that is based on spectral clustering of a sinusoidal representation. They show how this approach can be used to model both traditional grouping cues such as frequency and amplitude continuity as well as other types of information and prior knowledge such as onsets, harmonicity and timbre-models for specific instruments.Experiments supporting their approach to integration are also described. The description also covers issues of software architecture, implementation and efficiency, which are frequently not analyzed in depth for many existing algorithms. The resulting system exhibits practical performance (approximately realtime with consistent results without requiring example-specific parameter optimization and is available as part of the Marsyas open source audio processing framework.
URI: http://hdl.handle.net/10400.14/4801
Aparece nas colecções:EA - Livros e partes de livros / Books and chapters

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
Machine Audition Book (Ch. 2).pdf10,59 MBAdobe PDFVer/Abrir    Acesso Restrito. Solicitar cópia ao autor!

FacebookTwitterDeliciousLinkedInDiggGoogle BookmarksMySpace
Formato BibTex MendeleyEndnote 

Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.