A typical use case is access control in buildings: a person speaks a few words, and the door opens only if that person is authorized to enter. Authorized persons register beforehand by speaking a few sentences. Another use case is forensic phonetics, where a suspect is to be identified from a speech recording.
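The register-then-verify flow of such an access system can be illustrated with a minimal sketch. It assumes each utterance has already been reduced to a fixed-length feature vector by some front end, which is not shown here. The function names (`enroll`, `door_opens`), the use of cosine similarity, and the threshold of 0.9 are illustrative assumptions, not the method used in the project.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector matches nothing
    return dot / (norm_a * norm_b)

def enroll(feature_vectors):
    """Average the vectors of the registration sentences into one voice profile."""
    n = len(feature_vectors)
    return [sum(column) / n for column in zip(*feature_vectors)]

def door_opens(profiles, probe, threshold=0.9):
    """Return True if the probe is close enough to any registered profile.

    The threshold is a hypothetical value; a real system would tune it
    on recorded data.
    """
    return any(cosine_similarity(profile, probe) >= threshold
               for profile in profiles.values())

# Registration: one person speaks two sentences (toy 3-dimensional vectors).
profiles = {"alice": enroll([[1.0, 0.0, 0.2], [0.9, 0.1, 0.2]])}

# Verification: a similar voice is accepted, a dissimilar one is rejected.
accepted = door_opens(profiles, [1.0, 0.05, 0.2])
rejected = door_opens(profiles, [0.0, 1.0, 0.0])
```

Raising the threshold makes the door harder to open for impostors but also rejects authorized persons more often, so the value is a trade-off that depends on the application.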
So far, speaker recognition has been based almost exclusively on frequency-domain features. At Empa, we will compute temporal information of the voice and use it to improve automatic speaker recognition.
Areas of activity
- Software development
- Speech processing
- Temporal information
Research and development projects
"VoiceTime: Speaker recognition by temporal information", Gebert Rüf Stiftung, 2014 – 2016
Prof. Volker Dellwo, Phonetics Laboratory, University of Zurich