Collection and analysis of multi-condition audio recordings for forensic automatic speaker recognition

Katharina Klug; Michael Jessen; Yosef A. Solewicz; Isolde Wagner

doi:10.17469/O2108AISV000003

Authors

Katharina Klug Forensic Science Institute, Bundeskriminalamt, Germany; Department of Language and Linguistic Science, University of York, United Kingdom https://orcid.org/0000-0002-3629-750X
Michael Jessen Forensic Science Institute, Bundeskriminalamt, Germany https://orcid.org/0000-0001-5031-5279
Yosef A. Solewicz Division of Identification and Forensic Science, Israel Police, Israel https://orcid.org/0000-0003-3987-1201
Isolde Wagner Forensic Science Institute, Bundeskriminalamt, Germany

DOI:

https://doi.org/10.17469/O2108AISV000003

Keywords:

Forensic automatic speaker recognition, Real case recordings, Validation, Match and mismatch condition, Calibration

Abstract

The major aim of the project presented here is to compile a corpus from real case recordings to validate more recording conditions and languages under match and mismatch conditions for forensic automatic speaker recognition (FASR). The challenges and limitations of compiling a real case corpus are explained. First results of validation tests are presented for male speakers of German in the match condition [voice message – voice message] as well as in the mismatch condition [voice message – telephone]. Results for the match condition [voice message] are compared to previous findings for the match condition [telephone]. Variations of performance metrics such as Equal Error Rate (EER) and log-likelihoodratio cost (Cllr) are discussed with respect to effects of normalisation and calibration, and patterns of score distributions are analysed using Tippett plots.