• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Cemgil, “Score guided audio restoration via generalised coupled tensor factorisation (2012)

by U Simsekli, Y K Yilmaz, A T
Venue:in ICASSP
Add To MetaCart

Tools

Sorted by:
Results 1 - 5 of 5

Cemgil, “Score guided musical source separation using generalized coupled tensor factorization

by A. Taylan Cemgil - in EUSIPCO , 2012
"... Providing prior knowledge about sources to guide source sep-aration is known to be useful in many audio applications. In this paper we present two tensor factorization models for mu-sical source separation where musical information is incorpo-rated by using the Generalized Coupled Tensor Factorizati ..."
Abstract - Cited by 8 (0 self) - Add to MetaCart
Providing prior knowledge about sources to guide source sep-aration is known to be useful in many audio applications. In this paper we present two tensor factorization models for mu-sical source separation where musical information is incorpo-rated by using the Generalized Coupled Tensor Factorization (GCTF) framework. The approach is an extension of Non-negative Matrix Factorization where more than one matrix or tensor object is simultaneously factorized. The first model uses a temporally aligned transcription of the mixture and in-corporates spectral knowledge via coupling. In contrast of using a temporally aligned transcription, the second model incorporates harmonic information by taking an approximate, incomplete, and not necessarily aligned transcription of the musical piece as input. We evaluate our models on piano and cello duets where the experiments show that instead of using a temporally aligned transcription, we can achieve competitive results by using only a partial and incomplete transcription.

HIERARCHICAL AND COUPLED NON-NEGATIVE DYNAMICAL SYSTEMS WITH APPLICATION TO AUDIO MODELING

by U. Le Roux, J. Hershey, Jonathan Le Roux, John R. Hershey , 2013
"... Many kinds of non-negative data, such as power spectra and count data, have been modeled using non-negative matrix factorization. Even though this modeling paradigm has yielded successful applications, it falls short when the data have certain hierarchical and temporal structure. In this study, we p ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
Many kinds of non-negative data, such as power spectra and count data, have been modeled using non-negative matrix factorization. Even though this modeling paradigm has yielded successful applications, it falls short when the data have certain hierarchical and temporal structure. In this study, we propose a novel dynamical system model that can handle these kinds of complex structures that often arise in non-negative data. We show that our model can be extended to handle heterogeneous data for data-driven regularization. We present convergence-guaranteed update rules for each latent factor. In order to assess the performance, we evaluate our model on the transcription of classical piano pieces, and show that it outperforms related models. We also illustrate that the performance can be further improved by making use of symbolic data.
(Show Context)

Citation Context

... an extension to do transfer learning using heterogeneous data. Recent studies suggest that providing additional sources of information to audio models can increase the performance on different tasks =-=[4, 13]-=-. Such a transfer-learning paradigm is compelling for music signals because large amounts of symbolic music data are available. Symbolic music data in the form of U ′ ≡ {u ′ km} encodes whether the no...

SCALABLE AUDIO SEPARATION WITH LIGHT KERNEL ADDITIVE MODELLING

by Antoine Liutkus, Derry Fitzgerald, Zafar Rafii, Antoine Liutkus, Derry Fitzgerald, Zafar Rafii , 2015
"... Scalable audio separation with light kernel additive modelling ..."
Abstract - Add to MetaCart
Scalable audio separation with light kernel additive modelling
(Show Context)

Citation Context

...trumental stems from a musical track. It is a topic that has many applications in the entertainment industry such as automatic karaoke [19], [29], music upmixing [21], [22], [23] or audio restoration =-=[31]-=-. For this reason, it has gathered the attention of a large community of researchers in the past 15 years [35], [34]. The inherent difficulty of audio source separation comes from the fact that it is ...

A Survey of Tensor Factorization Frameworks on Audio Modelling

by Ünsal Gökdağ , 2014
"... Abstract: This survey is about Tensor Factorization methods for audio modeling, which focuses on probabilistic latent tensor factorization and generalized coupled tensor factorization by expectation maximization method while using several linear and nonlinear distance measure methods. ..."
Abstract - Add to MetaCart
Abstract: This survey is about Tensor Factorization methods for audio modeling, which focuses on probabilistic latent tensor factorization and generalized coupled tensor factorization by expectation maximization method while using several linear and nonlinear distance measure methods.
(Show Context)

Citation Context

...the factor graph. The authors also noted onlysthe mathematical operations used in TF are analogous to thesfactor model in terms of inference algorithm of probabilisticsgraphical model.sAnother article=-=[3]-=- uses Generalized Coupled Tensor Factorizationsfor estimation of both TF model and missing values. In detail, thesmodel incorporates different kinds of musical information whilesestimating the missing...

Learning the β-Divergence in Tweedie Compound Poisson Matrix Factorization Models

by Ali Taylan Cemgil, Yusuf Kenan Yılmaz
"... In this study, we derive algorithms for estimating mixed β-divergences. Such cost functions are useful for Nonnegative Matrix and Tensor Factorization models with a compound Poisson observation model. Compound Poisson is a particular Tweedie model, an important special case of exponential dispersion ..."
Abstract - Add to MetaCart
In this study, we derive algorithms for estimating mixed β-divergences. Such cost functions are useful for Nonnegative Matrix and Tensor Factorization models with a compound Poisson observation model. Compound Poisson is a particular Tweedie model, an important special case of exponential dispersion models characterized by the fact that the variance is proportional to a power function of the mean. There are several well known matrix and tensor factorization algorithms that minimize the β-divergence; these estimate the mean parameter. The probabilistic interpretation gives us more flexibility and robustness by providing us additional tunable parameters such as power and dispersion. Estimation of the power parameter is useful for choosing a suitable divergence and estimation of dispersion is useful for data driven regularization and weighting in collective/coupled factorization of heterogeneous datasets. We present three inference algorithms for both estimating the factors and the additional parameters of the compound Poisson distribution. The methods are evaluated on two applications: modeling symbolic representations for polyphonic music and lyric prediction from audio features. Our conclusion is that the compound poisson based factorization models can be useful for sparse positive data. Proceedings of the 30 th
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University