Prof. Dr.-Ing. Dorothea Kolossa

Ruhr-Universität Bochum
Digitale Signalverarbeitung
Fakultät für Elektrotechnik und Informationstechnik
Universitätsstr. 150
D-44780 Bochum
Raum: ID/2/325


Email: Dorothea.Kolossa@rub.de
Tel: +49 (0)234 32-28965
Fax: +49 (0)234 32-14165

Conferece Articles

B. Rafaely, D. Kolossa, Y. Maymon: "Towards acoustically robust localization of speakers in a reverberant environment", Proc. HSCMA, San Francisco, March 2017.

H. Meutzner, N. Ma, R. Nickel, C. Schymura, D. Kolossa: "Improving audio-visual speech recognition using deep neural networks with dynamic stream reliability estimates", Proc. ICASSP, New Orleans, March 2017.

B. Rafaely, Dorothea Kolossa: "Speaker localization in reverberant rooms based on direct path dominance test statistics", Proc. ICASSP, New Orleans, March 2017.

C. Schymura, J. Rios Grajales, D. Kolossa: "Monte Carlo exploration for active binaural localization", Proc. ICASSP, New Orleans, March 2017.

L. Schönherr, D. Orth, M. Heckmann, D. Kolossa: "Environmentally Robust Audio-Visual Speaker Identification", Proc. IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, USA, 13–16 December 2016

A. Hussen Abdelaziz, S. Watanabe, J. Hershey, E. Vincent, D. Kolossa: “Uncertainty Propagation through Deep Neural Networks,” in Proc. Interspeech, Dresden, Germany, September 2015.

C. Schymura, F. Winter, D. Kolossa, S. Spors: “Binaural Sound Source Localisation and Tracking using a dynamic Spherical Head Model,” in Proc. Interspeech, Dresden, Germany, September 2015.

R. Jacobi, G. vom Bögel, D. Kolossa: “Multilevel Decoding Scheme for RFID and Sensor Signals in Inductively Coupled Systems,” Proc. of the European Conference on Smart Objects, Systems and Technologies (SmartSysTech), Aachen, Germany, June 2015.

R. Jacobi, A. Süss, G. vom Bögel, D. Kolossa: “Determination of the Optimal Carrier Frequency in Harsh Environments by Parameter Estimation,” Proc. of the European Conference on Smart Objects, Systems and Technologies (SmartSysTech), Aachen, Germany, June 2015.

R. Jacobi, A. Süss, G. vom Bögel, D. Kolossa: “Carrier Frequency Adaptation Approach,” Proc. IEEE International Conference on RFID, San Diego, USA, April 2015.

M. Karbasi, D. Kolossa: “A Microscopic Approach to Speech Intelligibility Prediction using Auditory Models,” Proc. DAGA, Nürnberg, March 2015.

D. Kolossa: “Narrowing the gap: Probabilistic interfaces for signal enhancement and pattern recognition”, accepted for publication, Proc. IEEE GlobalSIP - Machine Learning Applications in Speech Processing, December 2014.

M. Darnstädt, H. Meutzner, D. Kolossa: “Reducing the Cost of Breaking Audio CAPTCHAs by Active and Semi-Supervised Learning”, Proc. ICMLA, December 2014.

Meutzner, H., Gupta, S., Kolossa, D. (2015). "Constructing Secure Audio CAPTCHAs by Exploiting Differences between Humans and Machines", Proc. ACM Conference on Human Factors in Computing Systems (CHI), Seoul, Korea, April 2015.

H. Meutzner, V. H. Nguyen, T. Holz, D. Kolossa: “Using Automatic Speech Recognition for Attacking Acoustic CAPTCHAs: The Trade-off between Usability and Security”, Proc. ACSAC, December 2014.

A. Hussen Abdelaziz, D. Kolossa: „Dynamic Stream Weight Estimation in Coupled-HMM- based Audio-visual Speech Recognition Using Multilayer Perceptrons”, Proc. Interspeech, September 2014.

M. Heckmann, P. Mikias, D. Kolossa: “The Impact of Word Alignment Accuracy on Audio- visual Word Prominence Detection”, Proc. ITG Fachtagung Sprachkommunikation, September 2014.

S. Zeiler, J. Cwiklak, D. Kolossa: “Robust Multimodal Human Machine Interaction using the Kinect Sensor”, Proc. ITG Fachtagung Sprachkommunikation, September 2014.

R. Jacobi, S. Grey, G. vom Bögel, D. Kolossa: “Digitally Controlled Analog Front End for Inductively Coupled Transponder Systems”, Proc. IEEE RFID Technology and Applications, September 2014.

C. Schymura , N. Ma,, G. J. Brown, G., T. Walther, D. Kolossa: "Binaural Sound Source Localisation using a Bayesian-network-based Blackboard System and Hypothesis-driven Feedback", in: Proc. FORUM ACUSTICUM 2014, PL-Krakow.

A. Hussen Abdelaziz, S. Zeiler, D. Kolossa: “A new EM estimation of dynamic stream weights for coupled-HMM-based audio-visual ASR”, Proc. ICASSP, Florence, May 2014.

A. Hussen Abdelaziz, S. Zeiler, D. Kolossa: „Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition”, Proc. Interspeech, Lyon, France, August 2013.

H. Meutzner, A. Schlesinger, S. Zeiler, D. Kolossa: „ Binaural Signal Processing for Enhanced Speech Recognition Robustness in Complex Listening Environments“, Proc. 2nd CHiME Workshop on Machine Listening in Multisource Environments, Vancouver, Canada, June 2013.

A. Hussen Abdelaziz, S. Zeiler, D. Kolossa: „Twin-HMM-based Audiovisual Speech
Enhancement“, Proc. ICASSP, Vancouver, Canada, May 2013.


A. Hussen Abdelaziz, S. Zeiler, D. Kolossa, V. Leutnant, R. Haeb-Umbach: „GMM-based
Significance Decoding“, Proc. ICASSP, Vancouver, Canada, May 2013.

A. Hussen Abdelaziz, D. Kolossa: “Decoding of Uncertain Features Using the Posterior Distribution of the Clean Data for Robust Speech Recognition”, Proc. Interspeech, Portland, Oregon, USA, September 9-13, 2012.

D. Kolossa, R. Nickel, S. Zeiler, R. Martin: “Inventory-Based Audio-Visual Speech Enhancement”, Proc. Interspeech, Portland, Oregon, USA, September 9-13, 2012.

R. Nickel, R. Fernandez Astudillo, D. Kolossa, S. Zeiler, R. Martin: „Inventory-Style Speech Enhancement with Uncertainty-of-Observation Techniques“, Proc. ICASSP, pp. 3877-3880, Kyoto, Japan, March 2012.

D. Kolossa, R. Fernandez Astudillo, A. Abad, S. Zeiler, R. Saeidi, P. Mowlaee, J. P. da Silva Neto, R. Martin: “CHiME Challenge: Approaches to Robustness using Beamforming and Uncertainty-of-Observation Techniques”, Proc. CHiME Workshop on Machine Listening in Multisource Environments, Florence, Italy, Sept.1, 2011.

F. Kohl, G. Wübbeler, D. Kolossa, C. Elster, M. Bär, R. Orglmeister: „Noise adjusted PCA for finding the subspace of evoked dependent signals from MEG data”, Latent Variable Analysis and Signal Separation (LVA 2010), Lecture Notes in Computer Science, vol. 6365, pp. 442 - 449, September 2010.

D. Kolossa, J. Chong, S. Zeiler, K. Keutzer: “Efficient Manycore CHMM Speech Recognition for Audiovisual and Multistream Data”, Proc. Interspeech 2010, pp. 2698 – 2701, Makuhari, Japan, September 26-30, 2010.

A. Vorwerk, X. Wang, D. Kolossa, S. Zeiler, R. Orglmeister: “WAPUSK20 - A Database for Robust Audiovisual Speech Recognition“, Proc. 7th Int. Conf. on International Language Resources and Evaluation (ELREC), pp. 3016 – 3019, 2010.

R. Fernandez Astudillo, D. Kolossa and R. Orglmeister: "Accounting for the Uncertainty of Speech Estimates in the Complex Domain for Minimum Mean Square Error Speech Enhancement", Interspeech 2009, Brighton, UK, September 2009.

D. Kolossa, S. Zeiler, A. Vorwerk, R. Orglmeister: "Audiovisual Speech Recognition with Missing or Unreliable Data", Audiovisual Speech Processing Workshop (AVSP 2009), Brighton, UK, September 10-13, 2009.

F. Kohl, G. Wübbeler, D. Kolossa, C. Elster, M. Bär, R. Orglmeister: "Non-Independent BSS: A Model for Evoked MEG Signals with Controllable Dependencies" in: Proceedings of the ICA 2009, pp. 443-450, Paraty, Brazil, March 15-18, 2009.

E. Hoffmann, D. Kolossa, R. Orglmeister: "Time Frequency Masking Strategy for Blind
Source Separation of Acoustic Signals Based on Optimally-Modified LOG-Spectral Amplitude Estimator" in: Proceedings of the ICA 2009, pp. 581-588, Paraty, Brazil, March 15-18, 2009.

F. Kohl, G. Wübbeler, D. Kolossa, R. Orglmeister, C. Elster, M. Bär: "Performance of ICA for MEG data generated from subspaces with dependent sources", Proc. European Biomedical Engineering Congress (EMBEC), Antwerpen, Nov. 2008.

D. Kolossa, S. Araki, M. Delcroix, T. Nakatani, R. Orglmeister and S. Makino: „Missing Feature Speech Recognition in a Meeting Situation with Maximum SNR Beamforming”, invited paper, Proc. ISCAS, pp. 3218-3221, Seattle, WA, May 2008.

E. Hoffmann, D. Kolossa and R. Orglmeister: “A Batch Algorithm for Blind Source Separation of Acoustic Signals Using ICA and Time-Frequency Masking”, Proc. ICA 2007, pp. 480-487, Springer Verlag, Berlin.


E. Hoffmann, D. Kolossa and R. Orglmeister: „A Soft Masking Strategy based on Multichannel Speech Probability Estimation for Source Separation and Robust Speech Recognition”, Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2007), pp. 118-121, New Paltz, NY.

R. Fernandez Astudillo, D. Kolossa and R. : “Propagation of Statistical Information Through Non-Linear Feature Extractions For Robust Speech Recognition”, Proc. MaxEnt2007, pp. 245-252, 27th Int. Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Saratoga Springs, July 2007.

D. Kolossa, H. Sawada, R. Fernandez Astudillo, R. Orglmeister and S. Makino: „Recognition of convolutive speech mixtures by missing feature techniques for ICA“, invited paper, in: Proc. 40th Asilomar Conference on Signals, Systems and Computers, pp. 1397-1401, October 29 - November 1, Pacific Grove, USA, 2006.

S. Maraboina, D. Kolossa, P. Bora and R. Orglmeister: „Multi-Speaker Voice Activity Detection using ICA and Beampattern Analysis“, in: Proc. Eusipco 2006, September 4-8, Florence, Italy.

D. Kolossa, A. Klimas and R. Orglmeister: „Separation and Robust Recognition of Noisy, Convolutive Speech Mixtures using Time-Frequency Masking and Missing Data Techniques“, in: Proceedings of the WASPAA 2005, pp. 82-85, New Paltz, NY, USA, October 16-19, 2005.

D. Kolossa and R. Orglmeister: „Nonlinear Postprocessing for Blind Speech Separation“, in: Proceedings of the ICA 2004, pp. 832–839, Granada, Spain, September 22-24, 2004.

W. Baumann, D. Kolossa and R. Orglmeister: „Beamforming-Based Convolutive Source
Separation“, in: Proceedings ICASSP 2003, pp. 357-360, Hong Kong, China, April 6-10,
2003.

W. Baumann, D. Kolossa and R. Orglmeister: „Maximum Likelihood Permutation Correction for Convolutive Source Separation“, in: Proceedings of the ICA 2003, pp. 373-378, Nara, Japan, April 1-4, 2003.

D. Kolossa and Q. Huo: „Using Time-Stretched Pulses for Accurate Splitting of Speech Utterances Played Back in Noisy Reverberant Environments“, in: Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp.1541-1544, September 16-20 2002, Denver, CO, USA.

D. Kolossa, B.-U. Köhler, M. Conrath and R. Orglmeister: „Optimal Permutation Correction by Multiobjective Genetic Algorithms”, in: Proceedings of the ICA2001, pp. 373-378, San Diego, California, USA.

W. Baumann, B.-U. Köhler, D. Kolossa and R. Orglmeister: „Real Time Separation of Convolutive Mixtures“, in: Proceedings of the ICA2001, pp.65-69, San Diego, California, USA.

D. Kolossa and G. Grübel: „Evolutionary Computation and Nonlinear Programming in Multi- Model Robust Control Design“, In: Lecture Notes on Computer Science vol. 1803, pp. 147- 157, Stefano Cagnoni et al. (Eds.), Springer Verlag, Berlin, Heidelberg, New York, 2000.

 

Other Conference Papers

C. Schymura, T. Walther, D. Kolossa, N. Ma and G. Brown: “Binaural Sound Source Localisation using a Bayesian-network-based Blackboard System and Hypothesis-driven Feedback,” Proc. Forum Acusticum, Krakow, September 2014.

D. Kolossa: „Methoden zur robusten Spracherkennung unter Beobachtungsunsicherheit,“ Proc. DAGA, March 2014.

A. Hussen Abdelaziz, L. Charaf, S. Zeiler, D. Kolossa: „On Dynamic Stream Weight Learning for Coupled-HMM-based Audio-visual Speech Recognition,“ Proc. DAGA, March 2014.

A. Raake, J. Blauert, J. Braasch, G. Brown, P. Danès, T. Dau, B. Gas, S. Argentieri, A. Kohlrausch, D. Kolossa, N. Le Goff, T. May, K. Obermayer, S. Spors: „TWO!EARS - integral interactive model of auditory perception and experience,“ Proc. DAGA 2014.

J. Blauert, D. Kolossa, P. Danès: “Feedback loops in engineering models of binaural listening,” 167th Meeting of the Acoustical Society of America, Providence, Rhode Island, May 2014.

H. Meutzner, S. Malik, D. Kolossa: “SVM-Based Preprocessing for Automatic Speech Recognition”, Winner of best presentation and paper award, invited paper, Proc. DAGA 2013, Meran, Italy, March 18-21, 2013.

D. Schmid, P. Thüne, D. Kolossa, G. Enzner: “Dereverberation preprocessing and training data adjustments for robust speech recognition in reverberant environments”, ITG Fachtagung Sprachkommunikation, Braunschweig, Germany, September 26-28, 2012.

A. Hussen Abdelaziz, S. Zeiler, D. Kolossa: “Audio-Visual Speech Recognition for Uncertain Acoustical Observations”, ITG Fachtagung Sprachkommunikation, Braunschweig, Germany, September 26-28, 2012.

R. C. Jacobi, A. Hennig, D. Kolossa: „Simulation Methods for Inductively Coupled Sensor Systems in Varying Environments“, Proc. PRIME, Aachen, Germany, June 12-15, 2012

E. Hoffmann, D. Kolossa, R. Orglmeister: „Time-Frequency-Processing for ICA-Supported Speech Recognition in Multitalker Conditions“, invited paper, Proc. DAGA, March 19-22, Darmstadt, Germany, 2012.

D. Kolossa: “High-Level Processing of Binaural Features”, invited paper, Proc. Forum
Acusticum, June 27-July 1, Aalborg, Denmark, 2011.

D. Kolossa, R. Fernandez Astudillo, S. Zeiler, A. Vorwerk, D. Lerch, J. Chong, R. Orglmeister: “Missing Feature Audiovisual Speech Recognition under Real-Time Constraints”, ITG Fachtagung Sprachkommunikation, paper 22, 4 pages, Bochum, Germany, October 6-8, 2010.

M. Jeub, D. Kolossa, R. Fernandez Astudillo, R. Orglmeister: "Performance Analysis of Wavelet-based Voice Activity Detection", invited paper, Proc. DAGA2009, pp. 407-408, Rotterdam, March 2009.

R. Fernandez Astudillo, D. Kolossa, R. Orglmeister: "Uncertainty Propagation for Speech Recognition using RASTA Features in Highly Nonstationary Noisy Environments", ITG Fachtagung Sprachkommunikation, Aachen, October 2008.

D. Kolossa, E. Hoffman, R. Orglmeister: "ICA-Based Bayesian Time-Frequency Masking", invited paper, ITG Fachtagung Sprachkommunikation, Aachen, October 2008.

F. Kohl, G. Wübbeler, T. Sander, L. Trahms, D. Kolossa, R. Orglmeister, C. Elster and M. Bär: " Performance of ICA for Dependent Sources using Synthetic Stimulus Evoked MEG Data", invited paper, Workshop Biosignalverarbeitung, pp. 32-35, Potsdam, July 2008.

D. Kolossa, R. Fernandez Astudillo, and R. Orglmeister: “Spracherkennung im Automobil durch Verwendung von Missing Feature Techniken“, invited paper, Proc. DAGA 2007, pp. 301-302.

D. Kolossa, A. Klimas, W. Baumann and R. Orglmeister: „Robuste Erkennung gestörter Sprache im Automobil durch MMSE-Störgeräuschunterdrückung und Missing-Data Spracherkennung“, invited paper, in: Proc. Daga 2006, March 20 - 23, Braunschweig, Germany.

W. Baumann, D. Kolossa and R. Orglmeister: „Realtime Implementation of a Beamforming Based Convolutive Source Separation Algorithm“, Tagung der Deutschen Gesellschaft für Akustik, DAGA 2003, Aachen, March 18-20, 2003.

back

 

Foto Kolossa