Programmübersicht
14:00 - 14:15
Opening Session
14:15 - 15:15 Keynote by on "Understanding the application of neural networks for signal enhancement"
15:15 - 15:45 Coffee Break
15:45 - 17:15 Oral Session: Iterative Algorithms & Machine Learning for Speech Enhancement
EXIT Charts for Turbo Automatic Speech Recognition: A Case Study
Timo Lohrenz, Simon Receveur and Tim Fingscheidt, TU Braunschweig
Introducing Block-Wise Processing into Turbo Viterbi ASR
Simon Receveur, Timo Lohrenz and Tim Fingscheidt, TU Braunschweig
Noise-Presence-Probability-Based Noise PSD Estimation by Using DNNs
Iterative Harmonic Speech Enhancement
Factor Graph Decoding for Speech Presence Probability Estimation
New Insights into Turbo-Decoding-Based AVSR with Dynamic Stream Weights
1Ruhr-Universität Bochum, 2International Computer Science Institute Berkeley
17:15 - 18:45 Poster Session: Iterative Algorithms & Machine Learning for Speech Enhancement
Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering
Ruhr-Universität Bochum, 2Bucknell University
A Combination of Pre-Trained Approaches and Generic Methods for an Improved Speech Enhancement
Balancing Gaussianity and sparseness in feature-space speaker adaptation for word prominence detection
1 TU Darmstadt, 2Honda Research Institute Europe GmbH
17:15 - 18:45 Poster Session: Selected Topics in Speech Processing
Evaluation of Enhanced F0-Trajectories for Speech Detection and Classification in Acoustic Monitoring
General Detection of Speech Signals in the Time-Frequency Plane
Improving Vector Quantization-Based Decoders for Correlated Processes in Error-Free Transmission
Head-Orientation-Based Device Selection: Are You Talking to Me?
1Jade Hochschule, 2Universität Oldenburg
Voice Activity Detection Based on Modulation-Phase Differences
1Nuance Communications Deutschland GmbH, 2Universität Kiel
A Method to Analyze the Spatial Response of Informed Spatial Filters
Estimating Source Dominated Microphone Clusters in Ad-Hoc Microphone Arrays by Fuzzy Clustering in the Feature Space
On the Bias of Direction of Arrival Estimation Using Linear Microphone Arrays
Coding of Parametric Models with Randomized Quantization in a Distributed Speech and Audio Codec
17:15 - 18:45 Poster Session: Emerging Topics and Applications
“Listen, Follow me”: The Transformational Leadership Corpus (TLC)
1Universität Wuppertal, 2Helmut-Schmidt-Universität, 3TU Dresden
Towards Opaque Audio Features for Privacy in Acoustic Sensor Networks
The Fraunhofer IAIS Audio Mining System: Current State and Future Directions
Personalized News Event Retrieval for Small Talk in Social Dialog Systems
1Karlsruher Institut für Technologie, 2Human Language Technology Fondazione Bruno Kessler
Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System
18:30 - 19:30 ITG Fachgruppensitzung
19:00 - 21:00 Welcome Reception
8:30-9:30 Keynote by
onOptimizing Speech Intelligibility in Noisy Environments Using a Simple Model of Communication"9:30 - 10:00 Coffee Break
10:00 - 11:00 Oral Session: Speech Processing for ear-mounted devices
Performance Comparison of Bilateral and Binaural MVDR-based Noise Reduction Algorithms in the Presence of DOA Estimation Errors
Active Cancellation of the Occlusion Effect in Hearing Aids by Time Invariant Robust Feedback
A Model-Based Placement Strategy for a Nearby External Microphone for Speech Enhancement in Hearing Aids
1Sivantos GmbH, 2Ruhr-Universität Bochum
On the Use of Beamforming Approaches for Binaural Speaker Localization
11:00 - 12:30 Poster Session: Speech Processing for ear-mounted devices
Probabilistic Spatial Filter Estimation for Multi-Channel Signal Enhancement in Hearing Aids
Development of a Sound Coding Strategy based on a Deep Recurrent Neural Network for Monaural Source Separation in Cochlear Implants
1Medizinische Hochschule Hannover, 2Universitat Pompeu Fabra
On The Impact of Quantization on Binaural MVDR Beamforming
1TU Delft, 2Aalborg University
A Robust Null-Steering Beamformer for Acoustic Feedback Cancellation for a Multi-Microphone Earpiece
1Universität Oldenburg, 2Curtin University
Two-channel Coherence-Based Own Voice Detection for Privacy-aware Long-term Acoustic Measurements
11:00 - 12:30 Poster Session: Quality Evaluation
Method for analyzing personalized telephone speech in quiet and noisy environments in normal-hearing and hearing-impaired listeners
1Fraunhofer IDMT, 2Hörzentrum Oldenburg GmbH
Design of Double Talk Sequences in Different Languages to Harmonize Third Party Listening Test Results
Towards VoIP quality testing with real-life devices and degradations
1TU Ilmenau, 2HEAD acoustics GmbH, 3AVM GmbH
Instrumental speech and noise quality assessment for super-wideband and fullband transmission
Emotion Intelligibility within Codec-Compressed and Reduced Bandwith Speech
1Otto von Guericke Universität, 2Hochschule für Telekommunikation Leipzig
Voice and Speech Assessment From Telephone Recordings Using Prosodic Analysis Based on mu-Law-Companded Features
Evaluation of Communication Systems for Full-Face Firefighter Masks
1Dräger Safety AG, 2Universität Kiel
11:00 - 12:30 Poster Session: Speech & Diagnostics
Large Sleepy Reading Corpus (LSRC): Applying Read Speech for Detecting Sleepiness
1Bergische Universität Wuppertal, 2Rheinische Fachhochschule Köln, 3Universität Tübingen, 4FH Schmalkalden
An Analysis of Perplexity to Reveal the Effects of Alzheimer's Disease on Language
Gender–dependent GMM–UBM for tracking Parkinson’s disease progression from speech
1Universidad de Antioquia, 2Universität Nürnberg-Erlangen
Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices
1Universität Passau, 2Universität München, 3Université Grenoble Alpes
Acoustic and grammatical characterization of crisis-related babblings in Italian persons undergoing Courts-of-Law examinations
Non-invasive photoglottography for use in the lab and the field
1TU Dresden, 2Universität Jena
On the Role of the Limbic Brain System in Recognizing Emotions From Paralinguistic Speech Features
12:30 - 13:30 Lunch Break
13:30 - 14:30 Oral Session: Quality Evaluation
Non-Intrusive Estimation Model for the Speech-Quality Dimension Loudness
Predicting the quality of processed speech by combining modulation based features and model-trees
1Fraunhofer IDMT, 2Institut National de la Recherche Scientifique, 3Universität Oldenburg, 4Imperial College London
A Paired-Comparison Listening Test for Collecting Voice Likability Scores
Objective Assessment of Artificial Speech Bandwidth Extension Approaches
1TU Braunschweig, 2NXP Software
14:30 - 15:30 Oral Session: Speech & Diagnostics
A Bag-of-Audio-Words Approach for Snore Sounds’ Excitation Localisation
1 Universität Passau, 2 TU München
Wavelet-Based Time-Frequency Representations for Automatic Recognition of Emotions from Speech
1Universidad de Antioquia, 2Universität Erlangen-Nürnberg
Detection of Intra-Personal Development of Cognitive Impairment From Conversational Speech
Parkinson-Speech Analysis: Methods and Aims
Universität Kiel
15:30 - 16:00 Coffee Break
16:00 - 18:45 Excursion Town & HNF Computer Museum
19:00 - 23:00 Dinner Gut Ringelsbruch
8:30-9:30 Keynote by
on "Multistream Recognition of Speech"9:30 - 10:00 Coffee Break
10:00 - 11:30 Oral Session: Speech Enhancement in Dynamic Acoustic Scenarios
Time Domain Approach for Listening Enhancement in Noisy Environments
Multiframe Echo Suppression Based on Orthogonal Signal Decompositions
1Northwestern Polytechnical University, 2Universität Nürnberg-Erlangen, 3University of Quebec
Combined Single-Microphone Wiener and MVDR Filtering based on Speech Interframe Correlations and Speech Presence Probability
1Universität Oldenburg, 2International Audio Laboratories Erlangen
A Priori SNR Estimation Using Weibull Mixture Model
Maximum-Likelihood Approach to Multichannel-Wiener-Postfiltering for Wind-Noise
Reduction
Kurtosis-Controlled Babble Noise Suppression
1 Nuance Communications Deutschland GmbH, 2 Universität Kiel
11:30 - 13:00 Poster Session: Speech Enhancement in Dynamic Acoustic Scenarios
Combined Linear and Nonlinear Residual Echo Suppression Using a Deficient Distortion Model - A Proof of Concept
1 Nuance Communications Deutschland GmbH, 2 Otto-von-Guericke Universität
On the Performance of LPTV Coherence Reduction Methods in the Sub-band Domain for Stereophonic Acoustic Echo Cancellation
Spectral Envelope Statistics for Source Modelling in Speech Enhancement
A Practical Beamformer-Postfilter System for Microphone Arrays on Seat Belts
1Hochschule Aschaffenburg, 2Paragon AG
HMM Embedded Conditional Vector Estimation Applied to Noisy Line Spectral Frequencies
Acoustic Feedback Compensation with Reverb-based Stepsize Control for In-car Communication Systems
1Daimler AG, 2Universität Kiel
Noise Reduction in the Time Domain Using ARMA Filtering
11:30 - 13:00 Poster Session: Efficient Modeling ASR
Phoneme Boundary Detection using Deep Bidirectional LSTMs
1Karlsruher Institut für Technologie, 2Zentrum für Allgemeine Sprachwissenschaft
Training Deep Neural Networks for Reverberation Robust Speech Recognition
Karlsruher Institut für Technologie
11:30 - 13:00 Poster Session: Show & Tell
Binaural Noise Reduction using Raspberry Pi
3PASS & HHP IV - up-to-date speech quality tests of terminals
Real-time Noise Reduction and Speech Dereverberation Using a Small Microphone Array
13:00 - 14:00 Lunch Break
14:00 - 15:30 Oral Session: Efficient Modeling ASR
Robust Online Multi-Channel Speech Recognition
1RWTH Aachen, 2Universität Paderborn
Modeling of Phone Features for Phoneme Perception
Language Feature Vectors for Resource Constraint Speech Recognition
Uncertainty Decoding Using a Sampling Strategy Based on the Eigenvalue Decomposition
Growing a Deep Neural Network Acoustic Model with Singular Value Decomposition
Karlsruher Institut für Technologie
Rank based Decoding for Improved DNN/HMM Hybrid Acoustic Models in the EML Transcription Platform
15:30 - 15:45 Closing Session
15:45 - 16:00 Coffee Break