Papers and Book Chapters

List of publications

2021

Agcaer, S., & Martin, R. (2021). Binaural Speaker Localization Based on Front/Back-Beamforming and Modulation-Domain Features. In G. Schmidt & P. Jax (Eds.), ITG-Fachbericht: Vol. 298, Speech communication: 14th ITG Conference 29.09. - 01.10.2021, Online-Event (pp. 79–83). VDE Verlag.

Chen, S., Taghia, J., Fei, T., Kuhnau, U., Pohl, N., & Martin, R. (2021). A DNN Autoencoder for Automotive Radar Interference Mitigation. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4065–4069). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9413619)

Chinaev, A., Enzner, G., Gburrek, T., &Schmalenstroeer, J. (2021). Online Estimation of Sampling Rate Offsets in Wireless Acoustic Sensor Networks with Packet Loss. In 2021 29th European Signal Processing Conference (EUSIPCO) (pp. 1110–1114). IEEE.(https://doi.org/10.23919/EUSIPCO54536.2021.9616037)

Chinaev, A., Thuene, P., & Enzner, G. (2021). Double-Cross-Correlation Processing for Blind Sampling-Rate and Time-Offset Estimation. IEEE/ACM Transactions on Audio, Speech, and Language Processing. (https://doi.org/10.1109/TASLP.2021.3071967)

Chinaev, A., Wienand, S., & Enzner, G. (2021). Control Architecture of the Double-Cross-Correlation Processor for Sampling-Rate-Offset Estimation in Acoustic Sensor Networks. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 801–805). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9413768)

Gauer, J., Kleingarn, D., & Martin, R. (2021). Analysis and Improvements of the Cepstrum Method for Fundamental Frequency Estimation in Music Signals. In 2021 29th European Signal Processing Conference (EUSIPCO) (pp. 371–375). IEEE. (https://doi.org/10.23919/EUSIPCO54536.2021.9616178)

Gauer, J., Nagathil, A., Lentz, B., Martin, R. (2021). Can spectral complexity reduction improve music perception in cochlear implant users? Music and Cochlear Implants Symposium, 15.-16.09.2021, Cambridge, UK.

Gauer, J., Nagathil, A., Eckel, K., & Martin, R. (2021). A versatile deep-learning-based music preprocessing scheme for cochlear implant users, In 2021 Conference on Implantable Auditory Prostheses, p. 49 (Abstract).

Koppelmann, T., Nelus, A., Schönherr, L., Kolossa, D., & Martin, R. (2021). Privacy-Preserving Feature Extraction for Cloud-Based Wake Word Verification. Proc. Interspeech 2021, 876-880. (https://doi.org/10.21437/Interspeech.2021-262)

Markovic, N., Vahle, D., Staudt, V., & Kolossa, D. (2021). Condition Monitoring for Power Converters via Deep One-Class Classification. In 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA) (pp. 1513–1520). IEEE. (https://doi.org/10.1109/ICMLA52953.2021.00244)

Nagathil, A., Göbel, F., Nelus, A., & Bruce, I.C. (2021). Computationally Efficient DNN-based Approximation of an Auditory Model for Applications in Speech Processing. In Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (pp. 301-305). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9413993)

Nelus, A., Glitza, R., & Martin, R. (2021). Estimation of Microphone Clusters in Acoustic Sensor Networks Using Unsupervised Federated Learning. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 761–765). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9414186)

Nelus, A., & Martin, R. (2021). Privacy-preserving audio classification using variational information feature extraction. IEEE/ACM Transactions on Audio, Speech, and Language Processing. (https://doi.org/10.1109/TASLP.2021.3108063)

Nelus, A., Glitza, R., & Martin, R. (2021). Unsupervised Clustered Federated Learning in Complex Multi-source Acoustic Environments. In 2021 29th European Signal Processing Conference (EUSIPCO) (pp. 1115–1119). IEEE. (https://doi.org/10.23919/EUSIPCO54536.2021.9615980)

Oliveira, I., Latoschewski, D., Wiede, C., Oettmeier, M., Graurock, D., & Kolossa, D. (2021). Embedded acoustic fault monitorin for water pumps. In 2021 28th IEEE International Conference on Electronics, Circuits, and Systems (ICECS) (pp. 1–4). IEEE.(https://doi.org/10.1109/ICECS53924.2021.9665616)

Schymura, C., Ochiai, T., Delcroix, M., Kinoshita, K., Nakatani, T., Araki, S., & Kolossa, D. (2021). Exploiting attention-based sequence-to-sequence architectures for sound event localization. In 2020 28th European signal processing conference (Eusipco) (pp. 231–235). IEEE. (https://doi.org/10.23919/Eusipco47968.2020.9287224)

Thaleiser, S., & Enzner, G. (2021). Cue-Preserving MMSE Filter with Bayesian SNR Marginalization for Binaural Speech Enhancement. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6124–6128). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9414956)

Völter, C., Oberländer, K., Carroll, R., Dazert, S., Lentz, B., Martin, R., & Thomas, J. P. (2021). Nonauditory functions in low-performing adult cochlear implant users. Otology & Neurotology, 42(5), e543-e551. (https://doi.org/10.1097/MAO.0000000000003033)

Wissing, J., Boenninghoff, B., Kolossa, D., Ochiai, T., Delcroix, M., Kinoshita, K., Nakatani, T., Araki, S., & Schymura, C. (2021). Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4705–4709). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9413399)

Yang, F., Enzner, G., & Yang, J. (2021). New Insights into Convergence Theory of Constrained Frequency-Domain Adaptive Filters. Circuits, Systems, and Signal Processing, 40(4), 2076–2090. (https://doi.org/10.1007/s00034-020-01569-6)

Yu, W., Zeiler, S., & Kolossa, D. (2021). Fusing Information Streams in End-to-End Audio-Visual Speech Recognition. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASPP) (pp. 3430–3434). IEEE. (https://doi.org/10.1109/ICASSP39728.2021.9414553)

Yu, W., Zeiler, S., & Kolossa, D. (2021). Multimodal integration for large-vocabulary audio-visual speech recognition. In 2020 28th European signal processing conference (Eusipco) (pp. 341–345). IEEE. (https://doi.org/10.23919/Eusipco47968.2020.9287841)

Yu, W., Freiwald, J., Tewes, S., Huennemeyer, F., & Kolossa, D. (2021). Federated Learning in ASR: Not as Easy as You Think. In ITG-Fachbericht, 0932-6022: Vol. 298, Speech communication: 14th ITG Conference 29.09. - 01.10.2021, Online-Event. VDE- Verlag

Yu, W., Zeiler, S., & Kolossa, D. (2021). Large-vocabulary Audio-visual Speech Recognition in Noisy Environments. In 2021 IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP). IEEE. (https://doi.org/10.1109/MMSP53017.2021.9733452)

Institute of Communication acoustics

Publications

List of publications