RUB 

List of publications


Ordered by Year

 

2020

Becker, L., Nelus, A., Gauer, J., Rudolph, L., & Martin, R. (2020). Audio Feature Extraction for Vehicle Engine Noise Classification. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 711-715). IEEE. (https://doi.org/10.1109/ICASSP40776.2020.9053117)

Däubener, S., Schönherr, L., Fischer, A. , & Kolossa, D. (2020). Detecting Adversarial Examples for Speech Recognition via Uncertainty Quantification. In Interspeech 2020 (pp. 4661-4665). (https://doi.org/10.21437/Interspeech.2020-2734 ).

Frank, J., Eisenhofer, T., Schönherr, L., Fischer, A., Kolossa, D., & Holz, T. (2020). Leveraging Frequency Analysis for Deep Fake Image Recognition. In Proceedings of Machine Learning Research: Vol. 119. Proceedings of the 37th International Conference on Machine Learning (ICML) 2020, 13-18 July 2020 (pp. 3247-3258).

Freiwald, J., Schoenherr, L., Schymura, C., Zeiler, S., & Kolossa, D. (2020). Loss Functions for Deep Monaural Speech Enhancement. In 2020 International Joint Conference on Neural Networks (IJCNN).(https://doi.org/10.1109/IJCNN48605.2020.9207184)

Lentz, B., Nagathil, A., Gauer, J., & Martin, R. (2020). Harmonic/Percussive Sound Separation and Spectral Complexity Reduction of Music Signals for Cochlear Implant Listeners. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 8713-8717). IEEE. (https://doi.org/10.1109/ICASSP40776.2020.9052920)

Nabizadeh, N., Kolossa, D., & Heckmann, M. (2020). Myfixit: An Annotated Dataset, Annotation Tool, and Baseline Methods for Information Extraction from Repair Manuals. In Proceedings of The 12th Language Resources and Evaluation Conference (pp. 2120-2128). European Language Resources Association. (https://www.aclweb.org/anthology/2020.lrec-1.260)

Nabizadeh, N., Heckmann, M., & Kolossa, D. (2020). Target-aware Prediction of Tool Usage in Sequential Repair Tasks. In Lecture Notes in Computer Science. Machine Learning, Optimization, and Data Science,Vol. 12566 (pp. 156-168). Springer International Publishing. ( https://doi.org/10.1007/978-3-030-64580-9_13)

Neudek, D., Nagathil, A., Getzmann, S., Martin, R. (2020). Speaker Change Detection Based on Event-Related Potentials with a Consumer Brain Computer Interface. In Fortschritte der Akustik - DAGA 2020: 46. Deutsche Jahrestagung für Akustik, (pp. 949-951). DEGA.

Peifer, C., Kluge, A., Rummel, N., & Kolossa, D. (2020). Fostering Flow Experience in HCI to Enhance and Allocate Human Energy. In Lecture Notes in Computer Science. Engineering Psychology and Cognitive Ergonomics. Mental Workload, Human Physiology, and Human Energy Vol. 12186(pp. 204-220). Springer International Publishing. (https://doi.org/10.1007/978-3-030-49044-7_18)

Schymura, C., & Kolossa, D. (2020). Audiovisual Speaker Tracking Using Nonlinear Dynamical Systems With Dynamic Stream Weights. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 1065-1078. (https://doi.org/10.1109/TASLP.2020.2980974)

Schymura, C., Ochiai, T., Delcroix, M., Kinoshita, K., Nakatani, T., Araki, S., & Kolossa, D. (2020). A Dynamic Stream Weight Backprop Kalman Filter for Audiovisual Speaker Tracking. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 581-585). IEEE. (https://doi.org/10.1109/ICASSP40776.2020.9054005)

Thaleiser, S., & Enzner, G. (2020). A Computationally Light Algorithm for Bayesian Speech Enhancement with SNR Marginalization. In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6209-6213). IEEE. (https://doi.org/10.1109/ICASSP40776.2020.9054611)

Trowitzsch, I., Schymura, C., Kolossa, D., & Obermayer, K. (2020). Joining Sound Event Detection and Localization Through Spatial Segregation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 487-502. (https://doi.org/10.1109/TASLP.2019.2958408)

Urbanietz, C., & Enzner, G. (2020). Direct Spatial-Fourier Regression of HRIRs from Multi-Elevation Continuous-Azimuth Recordings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 1129-1142. (https://doi.org/10.1109/TASLP.2020.2982291)

Wolf, M., Trentsios, P., Kubatzki, N., Urbanietz, C., & Enzner, G. (2020). Implementing Continuous-Azimuth Binaural Sound in Unity 3D. In 2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW) (pp. 384-389). IEEE. (https://doi.org/10.1109/VRW50115.2020.00083)

Wolf, M., Trentsios, P., Urbanietz, C., & Enzner, G. (2020). Experiencing and Navigating Virtual Reality without Sight (The all-seeING ears). In 2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW) (pp. 525-526). IEEE. (https://doi.org/10.1109/VRW50115.2020.00114)

Zohourian, M., & Martin, R. (2020). Binaural Direct-to-Reverberant Energy Ratio and Speaker Distance Estimation. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 92-104 (https://doi.org/10.1109/TASLP.2019.2948730)