Dr.-Ing. Steffen Zeiler

Ruhr-Universität Bochum
Digitale Signalverarbeitung
Fakultät für Elektrotechnik und Informationstechnik
Universitätsstr. 150
D-44780 Bochum
Room: ID/2/327


Email: Steffen.Zeiler@rub.de
Tel.: +49 234 32 27585


Courses


Journal Articles

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2015). “Learning Dynamic Stream Weights For Coupled-HMM-based Audio-visual Speech Recognition”, IEEE Trans. Audio Speech and Language Processing, vol. 23, no. 5, pp. 863-876, May 2015.

Peer-Reviewed Conferences


Zeiler, S., Meutzner, H., Hussen Abdelaziz, A., Kolossa, D. (2016). "Introducing the Turbo-Twin-HMM for Audio-Visual Speech Enhancement", Proc. INTERSPEECH, San Francisco, USA, September 2016.

Gergen, S., Zeiler, S., Hussen Abdelaziz, A., Kolossa, D. (2016). "New Insights into Turbo-Decoding-Based AVSR with Dynamic Stream Weights", ITG-Fachtagung Sprachkommunikation, Paderborn, Germany, Oct. 2016.

Gergen, S., Zeiler, S., Hussen Abdelaziz, A., Nickel, R., Kolossa, D. (2016). "Dynamic Stream Weighting for Turbo-Decoding-Based Audiovisual ASR", Interspeech 2016, San Francisco, Sept. 2016.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2014). "A new EM estimation of dynamic stream weights for coupled-HMM-based audio-visual ASR", Proc. ICASSP, Florence, May 2014.

Zeiler, S., Cwiklak, J., Kolossa, D. (2014). "Robust Multimodal Human Machine Interaction using the Kinect Sensor", Proc. ITG Fachtagung Sprachkommunikation, September 2014.

Astudillo, F. R., Kolossa, D., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, P., da Silva Neto, J. P., Martin, R. (2013). "Integration of Beamforming and Uncertainty-of-Observation Techniques for Robust ASR in Multi-Source Environments", Computer Speech and Language, Special Issue on Multisource Environments, vol. 27, no. 3, pp. 837-850, May 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2013). "Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition", Proc. Interspeech, Lyon, France, August 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2013). "Twin-HMM-based audio-visual speech enhancement", Proc. ICASSP, Vancouver, Canada, May 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D., Leutnant, V., Haeb-Umbach, R. (2013). "GMM-based Significance Decoding", Proc. ICASSP, Vancouver, Canada, May 2013.

Kolossa, D., Zeiler, S., Saeidi, R., Astudillo, F. R. (2013). “Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty”, IEEE Signal Processing Letters, vol. 20, no. 11, pp. 1018-1021, 2013.

Meutzner, H., Schlesinger, A., Zeiler, S., Kolossa, D. (2013). "Binaural Signal Processing for Enhanced Speech Recognition Robustness in Complex Listening Environments", Proc. 2nd CHiME Workshop on Machine Listening in Multisource Environments, Vancouver, Canada, June 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2012). "Audio-Visual Speech Recognition for Uncertain Acoustical Observations", ITG Fachtagung Sprachkommunikation, 2012.

Nickel, R., Astudillo, F. R., Kolossa, D., Zeiler, S., Martin, R. (2012). "Inventory-Style Speech Enhancement with Uncertainty-of-Observation Techniques", ICASSP, pp. 4645-4648, Kyoto, Japan, March 2012.

Kolossa, D., Astudillo, R. F., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, P., da Silva Neto, J. P., Martin, R. (2011). “CHiME Challenge: Approaches to Robustness using Beamforming and Uncertainty-of-Observation Techniques”, to appear in Proc. CHiME 2011 Workshop on Machine Listening in Multisource Environments, Interspeech 2011 satellite event.

Vorwerk, A., Zeiler, S., Kolossa, D., Astudillo, F. R., Lerch, D. (2011). “Use of Missing and Unreliable Data for Audiovisual Speech Recognition”, in: Kolossa, D., Haeb-Umbach, R. (eds.): “Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications”, Springer Verlag, pp. 345-375, July 2011.

Kolossa, D., Astudillo, F. R., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, P., da Silva Neto, J. P., Martin, R. (2011). “CHiME Challenge: Approaches to Robustness using Beamforming and Uncertainty-of-Observation Techniques”, to appear in Proc. CHiME Workshop on Machine Listening in Multisource Environments, Florence, Italy, Sept. 1, 2011.

Kolossa, D., Astudillo, F. R., Zeiler, S., Vorwerk, A., Lerch, D., Chong, J., Orglmeister, R. (2010). “Missing Feature Audiovisual Speech Recognition under Real-Time Constraints”, ITG Fachtagung Sprachkommunikation, paper 22, 4 pages, Bochum, Germany, October 6-8, 2010.

Kolossa, D., Chong, J., Zeiler, S., Keutzer, K. (2010). “Efficient Manycore CHMM Speech Recognition for Audiovisual and Multistream Data”, Proc. Interspeech 2010, pp. 2698-2701, Makuhari, Japan, September 26-30, 2010.

Vorwerk, A., Wang, X., Kolossa, D., Zeiler, S., Orglmeister, R. (2010). "WAPUSK20 - A Database for Robust Audiovisual Speech Recognition", Proc. 7th Int. Conf. on Language Resources and Evaluation (LREC), pp. 3016-3019, 2010.

Kolossa, D., Zeiler, S., Vorwerk, A., Orglmeister, R. (2009). "Audiovisual Speech Recognition with Missing or Unreliable Data", Audiovisual Speech Processing Workshop (AVSP 2009), Brighton, UK, September 10-13, 2009.
