A robust and precise method for solving the permutation problem of frequency-domain blind source separation H Sawada, R Mukai, S Araki, S Makino IEEE Transactions on Speech and Audio Processing 12 (5), 530-538, 2004 | 697 | 2004 |
The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech S Araki, R Mukai, S Makino, T Nishikawa, H Saruwatari IEEE Transactions on Speech and Audio Processing 11 (2), 109-116, 2003 | 468 | 2003 |
Underdetermined convolutive blind source separation via frequency bin-wise clustering and permutation alignment H Sawada, S Araki, S Makino IEEE Transactions on Audio, Speech, and Language Processing 19 (3), 516-527, 2010 | 352 | 2010 |
Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors S Araki, H Sawada, R Mukai, S Makino Signal Processing 87 (8), 1833-1847, 2007 | 294 | 2007 |
Multichannel extensions of non-negative matrix factorization with complex-valued data H Sawada, H Kameoka, S Araki, N Ueda IEEE Transactions on Audio, Speech, and Language Processing 21 (5), 971-982, 2013 | 246 | 2013 |
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices T Yoshioka, N Ito, M Delcroix, A Ogawa, K Kinoshita, M Fujimoto, C Yu, ... 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 200 | 2015 |
Polar coordinate based nonlinear function for frequency-domain blind source separation H Sawada, R Mukai, S Araki, S Makino IEICE Transactions on Fundamentals of Electronics, Communications and …, 2003 | 194 | 2003 |
The signal separation evaluation campaign (2007–2010): Achievements and remaining challenges E Vincent, S Araki, F Theis, G Nolte, P Bofill, H Sawada, A Ozerov, ... Signal Processing 92 (8), 1928-1936, 2012 | 190 | 2012 |
The 2008 signal separation evaluation campaign: A community-based approach to large-scale evaluation E Vincent, S Araki, P Bofill International Conference on Independent Component Analysis and Signal …, 2009 | 161 | 2009 |
Grouping separated frequency components by estimating propagation model parameters in frequency-domain blind source separation H Sawada, S Araki, R Mukai, S Makino IEEE Transactions on Audio, Speech, and Language Processing 15 (5), 1592-1604, 2007 | 156 | 2007 |
The 2011 signal separation evaluation campaign (SiSEC2011): Audio source separation S Araki, F Nesta, E Vincent, Z Koldovský, G Nolte, A Ziehe, A Benichoux International Conference on Latent Variable Analysis and Signal Separation …, 2012 | 154 | 2012 |
Measuring dependence of bin-wise separated signals for permutation alignment in frequency-domain BSS H Sawada, S Araki, S Makino 2007 IEEE International Symposium on Circuits and Systems, 3247-3250, 2007 | 139 | 2007 |
Frequency-domain blind source separation S Makino, H Sawada, S Araki Blind Speech Separation, 47-78, 2007 | 128 | 2007 |
Blind extraction of dominant target sources using ICA and time-frequency masking H Sawada, S Araki, R Mukai, S Makino IEEE Transactions on Audio, Speech, and Language Processing 14 (6), 2165-2173, 2006 | 116 | 2006 |
A two-stage frequency-domain blind source separation method for underdetermined convolutive mixtures H Sawada, S Araki, S Makino 2007 IEEE Workshop on Applications of Signal Processing to Audio and …, 2007 | 100 | 2007 |
Blind source separation of convolutive mixtures of speech in frequency domain S Makino, H Sawada, R Mukai, S Araki IEICE Transactions on Fundamentals of Electronics, Communications and …, 2005 | 100 | 2005 |
A realtime multimodal system for analyzing group meetings by combining face pose tracking and speaker diarization K Otsuka, S Araki, K Ishizuka, M Fujimoto, M Heinrich, J Yamato Proceedings of the 10th International Conference on Multimodal Interfaces …, 2008 | 95 | 2008 |
Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures S Araki, S Makino, Y Hinamoto, R Mukai, T Nishikawa, H Saruwatari EURASIP Journal on Advances in Signal Processing 2003 (11), 198923, 2003 | 95 | 2003 |
Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera T Hori, S Araki, T Yoshioka, M Fujimoto, S Watanabe, T Oba, A Ogawa, ... IEEE Transactions on Audio, Speech, and Language Processing 20 (2), 499-513, 2011 | 93 | 2011 |
DOA estimation for multiple sparse sources with normalized observation vector clustering S Araki, H Sawada, R Mukai, S Makino 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 88 | 2006 |