Publications
COPYRIGHT NOTICE
The documents available on this web page have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work. Copyright and all rights of listed publications are maintained by the authors or by other copyright holders. It is understood that all individuals copying this information will adhere to the terms and constraints invoked by each copyright. These works may not be reposted without the explicit permission of the copyright holder.
2023
- Birkholz P, Blandin R, Kürbis, S (2023). Bandwidths of vocal tract resonances in physical models compared to transmission-line simulations. The Journal of the Acoustical Society of America, XX, pp. XX.
- Krug PK, Birkholz P, Gerazov B, van Niekerk DR, Xu A, Xu Y (2023). Artificial vocal learning guided by phoneme recognition and visual information. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 31, pp. 1734-1744. doi: 10.1109/TASLP.2023.3264454
- Häsner P, Birkholz P (2023). Reproducibility and aging of different silicone vocal folds models. Journal of Voice, 2023 Mar 23:S0892-1997(23)00085-1. doi: 10.1016/j.jvoice.2023.02.028
- Kleiner C, Häsner P, Birkholz P (2023). Intrinsic velocity differences between larynx raising and larynx lowering. PLOS ONE, 18(2): e0281877. doi: 10.1371/journal.pone.0281877
- van Niekerk DR, Xu A, Gerazov B, Krug PK, Birkholz P, Halliday L, Prom-on S, Xu Y (2023). Simulating vocal learning of spoken language: Beyond imitation. Speech Communication, 147, pp. 51-62. doi: 10.1016/j.specom.2023.01.003
- Kleiner K, Birkholz P (2023). Comparison of object tracking algorithms for larynx phantom movements in ultrasound videos. In: Draxler C (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2023 (TUDPress, Dresden), pp. 1-8 [pdf]
- Krug PK, Birkholz P, Gerazov B, van Niekerk DR, Xu A, Xu Y (accepted). Self-supervised solution to the control problem of articulatory synthesis. In Proc. of the Interspeech 2023, pp. XX, Dublin, Ireland
- Steiner P, Jalalvand A, Birkholz P (accepted). Non-standard Echo State Networks for video door state monitoring. In Proc. of the International Joint Conference on Neural Networks (IJCNN 2023), Queensland, Australia
- Xu A, Gerazov B, van Niekerk D, Krug PK, Prom-on S, Birkholz P, Xu Y (accepted). Articulatory learning of English diphthongs: One dynamic target vs. two static targets. In Proc. of the 20th International Congress of Phonetic Sciences (ICPhS 2023), Prague, Czech Republic
- Howson P, Birkholz P (accepted). An MRI examination of lingual fricatives in Upper Sorbian. In Proc. of the 20th International Congress of Phonetic Sciences (ICPhS 2023), Prague, Czech Republic
- Birkholz P, Stone S, Wagner C, Kürbis S, Wilbrandt A, Bosshammer M (accepted). A review of palatographic measurement devices developed at the TU Dresden from 2011 to 2022. In Proc. of the 20th International Congress of Phonetic Sciences (ICPhS 2023), Prague, Czech Republic
2022
- Birkholz P, Ossmann S, Blandin R, Wilbrandt A, Krug P, Fleischer M (2022). Modeling speech sound radiation with different degrees of realism for articulatory synthesis. IEEE Access, 10, pp. 95008-95019. doi: 10.1109/ACCESS.2022.3204816
- Blandin R, Arnela M, Félix S, Doc JB, Birkholz P (2022). Efficient 3D acoustic simulation of the vocal tract by combining the multimodal method and finite elements. IEEE Access, 10, pp. 69922-69938. doi: 10.1109/ACCESS.2022.3187424
- Steiner P, Jalalvand A, Stone S, Birkholz P (2022). PyRCN: A toolbox for exploration and application of reservoir computing networks. Engineering Applications of Artificial Intelligence, 113, 104964. doi: 10.1016/j.engappai.2022.104964
- Lapthawan T, Prom-On S, Birkholz P, Xu Y (2022). Estimating underlying articulatory targets of Thai vowels by using deep learning based on generating synthetic samples from a 3D vocal tract model and data augmentation. IEEE Access, 10, pp. 41489-41502. doi: 10.1109/ACCESS.2022.3166922
- Wagner C, Schaffer P, Amini Digehsara P, Bärhold M, Plettemeier D, Birkholz P (2022). Silent speech command word recognition using stepped frequency continuous wave radar. Scientific Reports, 12, 4192. doi: https://doi.org/10.1038/s41598-022-07842-9
- Steiner P, Jalalvand A, Birkholz P (2022). Cluster-based input weight initialization for Echo State Networks. IEEE Transactions on Neural Networks and Learning Systems, XX, pp. XX. doi: 10.1109/TNNLS.2022.3145565, [preprint]
- Kleiner C, Kainz MA, Echternach M, Birkholz P (2022). Velocity differences in laryngeal adduction and abduction gestures. The Journal of the Acoustical Society of America, 151(1), pp. 45-55. [pdf Copyright (2022) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. doi: 10.1121/10.0009141]
- Stone S, Gao Y, Birkholz P (in press). Articulatory synthesis of vocalized /r/ allophones in German. IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Winkelmann E, Shevchenko I, Steiner C, Kleiner C, Kaltenborn U, Birkholz P, Schwarz H, Steiner T (in press). Monitoring of partial discharges in HVDC power cables. IEEE Electrical Insulation Magazine
- Wagner C, Stappenbeck L, Wenzel H, Steiner P, Lehnert B, Birkholz P (2022). Evaluation of a non-personalized optopalatographic device for prospective use in functional post-stroke dysphagia therapy. IEEE Transactions on Biomedical Engineering, 69(1), pp. 356-365 , doi: 10.1109/TBME.2021.3094415
- Fietkau AL, Stone S, Birkholz P (2022). Relationship between the acoustic time intervals and tongue movements of German diphthongs. In Proc. of the Interspeech 2022, pp. 734-738, Incheon, Korea [pdf]
- Amini Digehsara P, Possamai de Menezes JV, Wagner C, Bärhold M, Schaffer P, Plettemeier D, Birkholz P (2022). A user-friendly headset for radar-based silent speech recognition. In Proc. of the Interspeech 2022, pp. 4835-4839, Incheon, Korea [pdf]
- Possamai de Menezes JV, Amini Digehsara P, Wagner C, Mütze M, Bärhold M, Schaffer P, Plettemeier D, Birkholz P (2022). Evaluation of different antenna types and positions in a stepped frequency continuous-wave radar-based silent speech interface. In Proc. of the Interspeech 2022, pp. 3633-3637, Incheon, Korea [pdf]
- Liebig L, Wagner C, Mainka A, Birkholz P (2022). An investigation of regression-based prediction of the femininity or masculinity in speech of transgender people. In Proc. of the Interspeech 2022, pp. 4676-4680, Incheon, Korea [pdf]
- Langheinrich I, Stone S, Zhang X, Birkholz P (2022). Glottal inverse filtering based on articulatory synthesis and deep learning. In Proc. of the Interspeech 2022, pp. 1327-1331, Incheon, Korea [pdf]
- Mohapatra DR, Fleischer M, Zappi V, Birkholz P, Fels S (2022). Three-dimensional finite-difference time-domain acoustic analysis of simplified vocal tract shapes. In Proc. of the Interspeech 2022, pp. 764-768, Incheon, Korea [pdf]
- van Niekerk DR, Xu A, Gerazov B, Krug PK, Birkholz P, Xu, Y (2022). Exploration strategies for articulatory synthesis of complex syllable onsets. In Proc. of the Interspeech 2022, pp. 635-639, Incheon, Korea [pdf]
- Krug PK, Birkholz P, Gerazov B, van Niekerk DR, Xu A, Xu Y (2022). Articulatory synthesis for data augmentation in phoneme recognition. In Proc. of the Interspeech 2022, pp. 1228-1232, Incheon, Korea [pdf]
- Xu Y, Xu A, van Niekerk DR, Gerazov B, Birkholz P, Krug PK, Prom-on S, Halliday LF (2022). Evoc-Learn — High quality simulation of early vocal learning. In Proc. of the Interspeech 2022, pp. 3665-3666, Incheon, Korea [pdf]
- Birkholz P, Häsner P, Kürbis S (2022). Acoustic comparison of physical vocal tract models with hard and soft walls. In Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 8242-8246, Singapore [pdf]
- Kath H, Stone S, Rapp S, Birkholz P (2022). CARINA – A corpus of aligned German read speech including annotations. In Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 6157-6161, Singapore [pdf]
- Krug PK, Birkholz P, Gerazov B, van Niekerk DR, Xu A, Xu Y (2022). Efficient exploration of articulatory dimensions. In: Niebuhr O, Lundmark MS, Weston H (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2022 (TUDPress, Dresden), pp. 51-58 [pdf]
- Birkholz P, Mayer CK, Häsner P (2022). Towards a soft fluidic elastomer tongue for a mechanical vocal tract. In: Niebuhr O, Lundmark MS, Weston H (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2022 (TUDPress, Dresden), pp. 24-31 [pdf]
- Stone S, Abdul-Hak P, Birkholz P (2022). Perceptual cues for smiled voice - an articulatory synthesis study. In: Niebuhr O, Lundmark MS, Weston H (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2022 (TUDPress, Dresden), pp. 131-138 [pdf]
2021
- Köberlein M, Birkholz P, Burdumy M, Richter B, Burk F, Traser L, Echternach M (2021). Investigation of resonance strategies of high pitch singing sopranos using dynamic 3D Magnetic Resonance Imaging. The Journal of the Acoustical Society of America
- Cucchi M, Gruener C, Petrauskas L, Steiner P, Tseng H, Fischer A, Penkovsky B, Matthus C, Birkholz P, Kleemann H, Leo K (2021). Reservoir computing with biocompatible organic electrochemical networks for brain-inspired biosignal classification. Science Advances, 7(34), eabh0693, doi: 10.1126/sciadv.abh0693
- Krug PK, Gerazov B, van Niekerk DR, Xu A, Xu Y, Birkholz P (2021). Modelling microprosodic effects can lead to an audible improvement in articulatory synthesis. The Journal of the Acoustical Society of America, 150(2), pp. 1209-1217. [pdf Copyright (2021) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. doi: 10.1121/10.0005876]
- Gao Y, Ding H, Birkholz P, Lin Y (2021). Comparing fundamental frequency of German vowels produced by German native speakers and Mandarin Chinese learners. JASA Express Letters, 1(7), 075203. doi: 10.1121/10.0005593, [pdf]
- Birkholz P, Drechsel S (2021). Effects of the piriform fossae, transvelar acoustic coupling, and laryngeal wall vibration on the naturalness of articulatory speech synthesis. Speech Communication, 132, pp. 96-105. doi: 10.1016/j.specom.2021.06.002, [preprint]
- Häsner P, Prescher A, Birkholz P (2021). Effect of wavy trachea walls on the oscillation onset pressure of silicone vocal folds. The Journal of the Acoustical Society of America, 149(1), pp. 466-475. [pdf Copyright (2021) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. doi: 10.1121/10.0003362]
- Xue Y, Marxen M, Akagi M, Birkholz P (accepted). Acoustic and articulatory analysis and synthesis of shouted vowels. Computer Speech & Language, XX
- Birkholz P, Kleiner C (2021). Velocity differences between velum raising and lowering movements. In: Karpov A, Potapova R (eds.) Speech and Computer. SPECOM 2021. Lecture Notes in Computer Science, vol 12997, pp. 70-80. Springer, Cham. doi: 10.1007/978-3-030-87802-3_7, [pdf]
- Blandin R, Félix S, Doc JB, Birkholz P (2021). Combining multimodal method and 2D finite elements for the efficient simulation of vocal tract acoustics. In Proc. of the 27th International Congress on Sound and Vibration (ICSV27), Czech Republic [pdf]
- Steiner P, Jalalvand A, Birkholz P (accepted). Unsupervised pretraining of Echo State Networks for onset detection. In: Farkas I, Wermter S, Kurkova V (eds.) Artificial Neural Networks and Machine Learning - ICANN 2021 (Springer International Publishing), pp. XX-XX,
- Krug PK, Stone S, Birkholz P (2021). Intelligibility and naturalness of articulatory synthesis with VocalTractLab compared to established speech synthesis technologies. In Proc. of the 11th ISCA Speech Synthesis Workshop (SSW 11), pp. 102-107, Budapest, Hungary. doi: 10.21437/SSW.2021-18, [pdf]
- Wilbrandt A, Stone S, Birkholz P (2021). Articulatory data recorder: a framework for real-time articulatory data recording. In Proc. of the Interspeech 2021, pp. 3313-3314, Brno, Czech Republic [pdf]
- Xu A, Van Niekerk D, Gerazov B, Krug PK, Prom-on S, Birkholz P, Xu Y (2021). Model-based exploration of linking between vowel articulatory space and acoustic space. In Proc. of the Interspeech 2021, pp. 3191-3195, Brno, Czech Republic [pdf]
- Blandin R, Arnela M, Félix S, Doc JB, Birkholz P (2021). Comparison of the finite element method, the multimodal method and the transmission-line model for the computation of vocal tract transfer functions. In Proc. of the Interspeech 2021, pp. 3330-3334, Brno, Czech Republic [pdf]
- Steiner P, Jalalvand A, Birkholz P (2021). Improved acoustic modeling for automatic piano music transcription using Echo State Networks. In: Rojas I, Joya G, Catala A (eds) Advances in Computational Intelligence. IWANN 2021. Lecture Notes in Computer Science, vol 12862. Springer, Cham. [pdf]
- Winkelmann E, Kleiner C, Shevchenko I, Steiner C, Birkholz P, Kaltenborn U, Steiner T (2021). Partial discharge localization and characterization on power cables using an adaptive model. In Proc. of the IEEE Conference on Electrical Insulation and Dielectric Phenomena (CEIDP 2021), pp. XX, Vancouver, BC, Canada
- Winkelmann E, Steiner C, Shevchenko I, Steiner P, Birkholz P, Kaltenborn U (2021). Machine learning based evaluation of dynamic events in medium voltage grid components. In Proc. of the CIRED 2021, pp. XX, Geneva, Switzerland
- Barth S, Stone S, Birkholz P (2021). Artificial bandwidth extension using a glottal excitation model. In: Hillmann S, Weiss B, Michael T, Möller S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2021 (TUDPress, Dresden), pp. 95-103 [pdf]
- Digehsara PA, Wagner C, Schaffer P, Bärhold M, Stone S, Plettemeier D, Birkholz P (2021). On the optimal set of features and the robustness of classifiers in radar-based silent phoneme recognition. In: Hillmann S, Weiss B, Michael T, Möller S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2021 (TUDPress, Dresden), pp. 112-119 [pdf]
- Krug PK, Stone S, Wilbrandt A, Birkholz P (2021). TargetOptimizer 2.0: Enhanced estimation of articulatory targets. In: Hillmann S, Weiss B, Michael T, Möller S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2021 (TUDPress, Dresden), pp. 145-152 [pdf]
- Steiner P, Howard IS, Birkholz P (2021). Glottal closure instant detection using Echo State Networks. In: Hillmann S, Weiss B, Michael T, Möller S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2021 (TUDPress, Dresden), pp. 161-168 [pdf]
2020
- Toya T, Birkholz P, Unoki M (accepted). Measurements of transmission characteristics related to bone-conducted speech using excitation signals in the oral cavity. Journal of Speech, Language, and Hearing Research, XX
- Birkholz P, Kürbis S, Stone S, Häsner P, Blandin R, Fleischer M (2020). Printable 3D vocal tract shapes from MRI data and their acoustic and aerodynamic properties. Scientific Data, 7, 255, doi: 10.1038/s41597-020-00597-w
- Gao Y, Ding H, Birkholz P (2020). An acoustic comparison of German tense and lax vowels produced by German native speakers and Mandarin Chinese learners. The Journal of the Acoustical Society of America, 148(1), pp. EL112-EL118, doi: 10.1121/10.0001628
- Thiele C, Mooshammer C, Belz M, Rasskazova O, Birkholz P (in print). An experimental study of tongue body loops in V1-V2-V1 sequences. Journal of Phonetics, 80, pp. 100965, doi: 10.1016/j.wocn.2020.100965
- Thuan Van Ngo, Akagi M, Birkholz P (2020). Effect of articulatory and acoustic features on the intelligibility of speech in noise: an articulatory synthesis study. peech Communication, 117, pp. 13-20. doi: 10.1016/j.specom.2020.01.004
- Steiner P, Jalalvand A, Stone S, Birkholz P (2020). Feature engineering and stacked echo state networks for musical onset detection. In Proc. of the 25th International Conference on Pattern Recognition (ICPR 2020), pp. XX, Milan, Italy
- Gao Y, Zhang X, Xu Y, Zhang J, Birkholz P (2020). An investigation of the target approximation model for tone modeling and recognition in continuous Mandarin speech. In Proc. of the Interspeech 2020, pp. XX, Shanghai, China
- van Niekerk DR, Xu A, Gerazov B, Krug PK, Birkholz P, Xu Y (2020). Finding intelligible consonant-vowel sounds using high-quality articulatory synthesis. In Proc. of the Interspeech 2020, pp. XX, Shanghai, China
- Steiner P, Stone S, Birkholz P, Jalalvand A (2020). Multipitch tracking in music signals using Echo State Networks. In: Proc. of the 28th European Signal Processing Conference (EUSIPCO 2020), pp. 126-130, Amsterdam, Netherlands [pdf]
- Engeln L, Groh R, Gabriel F, Birkholz P, Jäckel R, Hoffmann R, Felten J, Bergmann R, Scharloth J, Lüneburg L, Krzywinski J, Neumann J, Plaßmeyer P (2020). Faszination sprechende Maschinen: Technologischer Wandel der Sprachsynthese über zwei Jahrhunderte. In: Fortschritte der Akustik - DAGA 2020. Tagungsband der 46. deutschen Jahrestagung für Akustik, pp. 44-47, ISBN 978-3-939296-17-1
- Gao Y, Steiner P, Birkholz P (2020). Articulatory copy synthesis using long short-term memory networks. In: Böck R, Siegert I, Wendemuth A (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (TUDPress, Dresden), pp. 52-59
- Große K, Birkholz P (2020). Tongue mouse - comparison of existing approaches. In: Böck R, Siegert I, Wendemuth A (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (TUDPress, Dresden), pp. 34-43
- Steiner P, Stone S, Birkholz P (2020). Note onset detection using echo state networks. In: Böck R, Siegert I, Wendemuth A (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2020 (TUDPress, Dresden), pp. 157-164
- Stone S, Birkholz P (2020). Cross-speaker silent-speech command word recognition using electro-optical stomatography. In Proc. of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7849-7853, Barcelona, Spain [pdf]
- Stone S, Schmidt P, Birkholz P (2020). Prediction of voicing and the f0 contour from electromagnetic articulography data for articulation-to-speech synthesis. In Proc. of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 7329-7333, Barcelona, Spain [pdf]
- Birkholz P, Zhang X (2020). Accounting for microprosody in modeling intonation. In Proc. of the 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), pp. 8099-8103, Barcelona, Spain [pdf]
2019
- Birkholz P, Gabriel F, Kürbis S, Echternach M (2019). How the peak glottal area affects linear predictive coding-based formant estimates of vowels. The Journal of the Acoustical Society of America, 146(1), pp. 223-232. [pdf Copyright (2019) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. doi: 10.1121/1.5116137]
- Birkholz P, Pape D (2019). How modeling entrance loss and flow separation in a two-mass model affects the oscillation and synthesis quality. Speech Communication, 110, pp. 108-116.
-
Bellinghausen C, Fangmeier T, Schröder B, Keller J, Drechsel S, Birkholz P, van Elst LT, Riedel A (accepted). On the role of disfluent speech for uncertainty in articulatory speech synthesis. In Proc. of the Workshop on Disfluency in Spontaneous Speech (DISS2019), pp. XX-XX, Budapest
- Birkholz P, Drechsel S, Stone S (2019). Perceptual optimization of an enhanced geometric vocal fold model for articulatory speech synthesis. In Proc. of the Interspeech 2019, pp. 3765-3769, Graz, Austria [pdf]
- Gao Y, Stone S, Birkholz P (2019). Articulatory copy synthesis based on a genetic algorithm. In Proc. of the Interspeech 2019, pp. 3770-3774, Graz, Austria [pdf]
- Toya T, Birkholz P, Unoki M (2019). Estimates of transmission characteristics related to perception of bone-conducted speech using real utterances and transcutaneous vibration on larynx. In: International Conference on Speech and Computer (SPECOM 2019), pp. 491-500
- Birkholz P, Stone S, eds. (2019). Studientexte zur Sprachkommunikation (vol. 93): Elektronische Sprachsignalverarbeitung 2019. Tagungsband der 30. Konferenz. TUDpress, Dresden
- Gao Y, Ding H, Birkholz P, Jäckel R, Lin Y (2019). Perception of German tense and lax vowel contrast by Chinese learners. In: Birkholz P, Stone S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 (TUDPress, Dresden), pp. 25-32 [pdf]
- Birkholz P, Stone S, Kürbis S (2019). Comparison of different methods for the voiced excitation of physical vocal tract models. In: Birkholz P, Stone S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 (TUDPress, Dresden), pp. 84-94 [pdf]
- Gabriel F, Häsner P, Dohmen E, Borin D, Birkholz P (2019). Surface stickiness and waviness of two-layer silicone structures for synthetic vocal folds. In: Birkholz P, Stone S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 (TUDPress, Dresden), pp. 221-230 [pdf]
- Drechsel S, Gao Y, Frahm J, Birkholz P (2019). Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab. In: Birkholz P, Stone S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 (TUDPress, Dresden), pp. 239-246 [pdf]
- Howard I, Birkholz P (2019). Modelling vowel acquisition using the Birkholz synthesizer. In: Birkholz P, Stone S (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 (TUDPress, Dresden), pp. 304-311 [pdf]
- Fuchs S, Birkholz P (2019). Phonetics of Consonants. In Oxford Research Encyclopedia of Linguistics. Oxford University Press. doi:10.1093/acrefore/9780199384655.013.410
- Xu A, Birkholz P, Yi Xu (2019). Coarticulation as synchronized dimension-specific sequential target approximation: An articulatory synthesis simulation. In Proc. of the International Congress of Phonetic Sciences (ICPhS 2019), Melbourne, Australia
2018
- Birkholz P, Stone S, Wolf K, Plettemeier D (2018). Non-invasive silent phoneme recognition using microwave signals. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(12), pp. 2404-2411. doi: 10.1109/TASLP.2018.2865609 [preprint]
- Wagner C, Stone S, Birkholz P (2018). Optical force and distance sensing in intraoral devices for stroke rehabilitation: a distance calibration and force classification approach. In Proc. of the 13th ITG Conference on Speech Communication, pp. 345-349, Oldenburg, Germany [pdf]
- Echternach M, Kob M, Sundberg J, Traser L, Birkholz P, Köberlein M, Richter B (2018). Bestimmung von Vokaltraktresonanzen und Formanten mittels verschiedener Analyseansätze. In: Wissenschaftliche Jahrestagung der DGPP, 4. Dreiländertagung D-A-CH, Innsbruck, Austria.
- Birkholz P, Venus E (2018). Considering lip geometry in one-dimensional tube models of the vocal tract. In: Fang Q et al. (eds.) Studies on Speech Production: 11th International Seminar on Speech Production (ISSP 2017), LNAI 10733, pp. 78–86 [pdf]
- Hoffmann R, Birkholz P, Gabriel F, Jäckel R (2018). From Kratzenstein to the Soviet vocoder: some results of a historic research project in speech technology. In: Karpov A, Jokisch O, Potapova R (eds.) Speech and Computer (SPECOM 2018), LNAI 11096 (Springer Nature Swizerland), pp. 215-225 [pdf]
- Stone S, Marxen M, Birkholz P (2018). Construction and evaluation of a parametric one-dimensional vocal tract model. IEEE/ACM Transactions on Audio, Speech and Language Processing, 26(8), pp. 1381-1392. doi: 10.1109/TASLP.2018.2825601
- Fleischer M, Mainka A, Kürbis S, Birkholz P (2018). How to precisely measure the volume velocity transfer function of physical vocal tract models by external excitation. PLoS ONE 13(3): e0193708. doi: 10.1371/journal.pone.0193708 [Open Access]
- Dohmen E, Borin D, Gabriel F, Odenbach S, Birkholz P (2018). Artificial vocalis muscles for speech synthesis made from contact-less adaptable magnetorheological elastomer. In: Proc. of the International Electorheological Fluids and Magnetorheological Suspensions Conference (ERMR2018), Maryland, USA, [abstract]
- Birkholz P, Schmager P, Xu Y (2018). Estimation of Pitch Targets from Speech Signals by Joint Regularized Optimization. In: Proc. of the 26th European Signal Processing Conference (EUSIPCO 2018), pp. 2089-2093, Rome, Italy
- Gao Y, Birkholz P (2018). Speaking Rate Changes Affect Phone Durations Differently for Neutral and Emotional Speech. In: Proc. of the 26th European Signal Processing Conference (EUSIPCO 2018), pp. 2084-2088, Rome, Italy
- Marwitz JA, Stone S, Birkholz P (2018). Optimierung der Numerik eines linearen Gleichungssystems für die Simulation des Schallfeldes im Vokaltrakt. In: Berton A, Haiber U, Minker W (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2018 (TUDPress, Dresden), pp. 359-366 [pdf]
- Howard IS, Birkholz P (2018). Using state feedback to control an articulatory synthesizer. In: Berton A, Haiber U, Minker W (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2018 (TUDPress, Dresden), pp. 351-358
- Steiner P, Stone S, Birkholz P (2018). PianoTranscriber - A note-based approach for multipitch tracking. In: Berton A, Haiber U, Minker W (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2018 (TUDPress, Dresden), pp. 326-333 [pdf]
- Wagner C, Stone S, Birkholz P (2018). Towards combined force and distance sensing using only optical sensors to aid in stroke habilitation. In: Berton A, Haiber U, Minker W (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2018 (TUDPress, Dresden), pp. 318-325 [pdf]
2017
- Fleischer M, Kürbis S, Mainka A, Birkholz P, Mürbe D (2017). Measuring the transfer function of physical vocal tract models - a comparative study. In: Proc. of the 12th Pan-European Voice Conference (PEVOC 12), Ghent, Belgium. [abstract]
- Birkholz P, Venus E (2017). Considering lip geometry in one-dimensional tube models of the vocal tract. In: Proc. of the 11th International Seminar on Speech Production (ISSP 2017), Tianjin, China [extended abstract]
- Jiao L, Wang C, Hsu C, Birkholz P, Xu Y (2017). Posh accent and vocal attractiveness in British English. In Proc. of the 8th Tutorial and Research Workshop on Experimental Linguistics, pp. 45-48, Heraklion, Crete, Greece
- Schröder A, Stone S, Birkholz P (2017). The sound of deception - What makes a speaker credible? In Proc. of the Interspeech 2017, pp. 1467-1471, Stockholm, Sweden [pdf]
- Stone S, Steiner P, Birkholz P (2017). A time-warping pitch tracking algorithm considering fast f0 changes. In Proc. of the Interspeech 2017, pp. 419-423, Stockholm, Sweden [pdf]
- Jiao L, Wang C, Hsu C, Birkholz P , Xu Y (2017). Does posh english sound attractive? In Proc. of the Interspeech 2017, pp. 2257-2261, Stockholm, Sweden [pdf]
- Traser L, Birkholz P, Flügge TV, Kamberger R, Burdumy M, Richter B, Korvink JG, Echternach M (accepted). Relevance of teeth implementation in three dimensional vocal tract models. Journal of Speech, Language, and Hearing Research, XX, pp. XX-XX.
- Weitz B, Steiner I, Birkholz P (2017). Gesture-based articulatory text to speech synthesis. In: Trouvain J, Steiner I, Möbius B (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2017 (TUDPress, Dresden), pp. 324-331 [pdf]
- Tayal S, Stone S, Birkholz P (2017). Towards the measurement of the actor's formant in female voices. In: Trouvain J, Steiner I, Möbius B (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2017 (TUDPress, Dresden), pp. 286-293 [pdf]
- Stone S, Schulze K, Steiner P, Birkholz P (2017). Real-time manipulation of the F0-contour in synthetic speech using the Fujisaki model. In: Trouvain J, Steiner I, Möbius B (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2017 (TUDPress, Dresden), pp. 278-285 [pdf]
- Klause F, Stone S, Birkholz P (2017). A head-mounted camera system for the measurement of lip protrusion and opening during speech production. In: Trouvain J, Steiner I, Möbius B (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2017 (TUDPress, Dresden), pp. 145-151 [pdf]
- Birkholz P, Wang L (2017). Herstellung und Charakterisierung künstlicher Stimmlippen aus Silikonkautschuk. In: Trouvain J, Steiner I, Möbius B (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2017 (TUDPress, Dresden), pp. 58-66 [pdf]
- Birkholz P, Neuschaefer-Rube C (in press). Articulatory models. In: am Zehnhoff-Dinnesen A, Wiskirska-Woznica B, Neumann K, Nawka T (eds.) Phoniatrics I: Fundamentals – Voice Disorders – Disorders of Language and Hearing Development (Springer-Verlag Berlin Heidelberg), pp. XX-XX
- Stone S, Birkholz P (2017). Angle Correction in Optopalatographic Tongue Distance Measurements. IEEE Sensors Journal, 17(2), pp. 459-468. doi: 10.1109/JSEN.2016.2630742
- Zaretsky E, Pluschinski P, Sader R, Birkholz P, Neuschaefer-Rube C, Hey C (2017). Identification of the most significant electrode positions in electromyographic evaluation of swallowing-related movements in humans. European Archives of Oto-Rhino-Laryngology, 274(2), pp. 989–995. doi: 10.1007/s00405-016-4288-7
- Heinen E, Birkholz P, Willmes K, Neuschaefer-Rube C (2017). Do long-term tongue piercings affect speech quality? Logopedics Phoniatrics Vocology, 42(3), pp. 126-132
2016
- Suthau E, Birkholz P, Mainka A, Simpson AP (2016). Non-invasive photoglottography for use in the lab and the field. In Proc. of the 12th ITG Conference on Speech Communication, pp. 273-277, Paderborn, Germany. [pdf]
- Zaretsky E, Pluschinski P, Birkholz P, Neuschaefer-Rube C, Sader R, Hey C (2016). Kurvenmorphologische und physiologische Korrelate des On- und Offsets der schluckassoziierten Signale in der Oberflächenelektromyographie. In: 33. Wissenschaftliche Jahrestagung der DGPP, Regensburg, Germany. doi: 10.3205/16dgpp31
- Mainka A, Fleischer M, Kürbis S, Mürbe D, Birkholz P (2016). Studie zur Transferfunktion des Vokaltraktes – akustische Analyse am gedruckten 3D-Modell mittels retrograder Schallanregung. In: 33. Wissenschaftliche Jahrestagung der DGPP, Regensburg, Germany. doi: 10.3205/16dgpp24
- Labrunie M, Badin P, Lamalle L, Vilain C, Boë LJ, Frahm J, Birkholz P (2016). Suivi de contours d'articulateurs orofaciaux à partir d'IRM dynamique. In Proc. of the 31èmes Journées d'Etude de la Parole, vol. 1 : JEP, pp. 687-695, Paris, France [pdf]
- Birkholz P, Bakardjiev P, Kürbis S, Petrick R (2016). Towards minimally invasive velar state detection in normal and silent speech. In Proc. of the Interspeech 2016, pp. 1780-1784, San Francisco, USA [pdf]
- Preuß S, Birkholz P (2016). Silent-speech command word recognition using electro-optical stomatography. In Proc. of the Interspeech 2016, pp. 2350-2351, San Francisco, USA [pdf]
- Birkholz P, Martin L, Xu Y, Scherbaum S, Neuschaefer-Rube C (2016). Manipulation of the prosodic features of vocal tract length, nasality and articulatory precision using articulatory synthesis. Computer Speech & Language, 41, pp. 116-127. doi: 10.1016/j.csl.2016.06.004
- Echternach M, Birkholz P, Sundberg J, Traser L, Korvink JG, Richter B (2016) . Resonatory properties in professional tenors singing above the passaggio. Acta Acustica united with Acustica , 102(2), pp. 298-306.
- Birkholz P (2016). GlottalImageExplorer – an open source tool for glottis segmentation in endoscopic high-speed videos of the vocal folds. In: Jokisch O (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2016 (TUDPress, Dresden), pp. 39-44 [pdf]
- Wang L, Preuß S, Birkholz P (2016). Untersuchung elastischer Materialien für künstliche Stimmlippen. In: Jokisch O (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2016 (TUDPress, Dresden), pp. 45-52 [pdf]
2015
- Birkholz P, Martin L, Willmes K, Kröger BJ, Neuschaefer-Rube C (2015). The contribution of phonation type to the perception of vocal emotions in German: an articulatory synthesis study. Journal of the Acoustical Society of America, 137(3), pp. 1503–1512. [link]
- Löscher V, Birkholz P, Neuschaefer-Rube C (2015). Vergleichende Untersuchung von Elektropalatographie und Optopalatographie anhand der Artikulation von Normalsprechern des Deutschen. Sprache-Stimme-Gehör, 2015;39, Supplement 1:e1-e2.
- Preuß S, Birkholz P (2015). Fortschritte in der Elektro-Optischen Stomatographie. In: Wirsching G (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2015 (TUDPress, Dresden), pp. 248-255 [pdf]
- Echternach M, Birkholz P, Traser L, Flügge T, Kamberger R, Burk F, Burdumy M, Richter B (2015). Articulation and vocal tract acoustics at soprano subject`s high fundamental frequencies. Journal of the Acoustical Society of America, 137(5), pp. 2586-2595. [link]
- Murakami M, Kröger BJ, Birkholz P, Triesch J (2015). Seeing [u] aids vocal learning: babbling and imitation of vowels using a 3D vocal tract model, reinforcement learning, and reservoir computing. In: Proc. of the 5th International Conference on Development and Learning and on Epigenetic Robotics, pp. 208-213, Providence, Rhode Island, USA [pdf]
- Preuß S, Birkholz P (2015). Optical sensor calibration for electro-optical stomatography. In Proc. of the Interspeech 2015, pp. 618-622, Dresden, Germany [pdf]
- Pape D, Jesus LMT, Birkholz P (2015). Intervocalic fricative perception in European Portuguese: An articulatory synthesis study. Speech Communication , 74, pp. 93-103. [link]
- Birkholz P (2015). Modellierung des Sprechapparats als akustisches Netzwerk. In: Gerlach G, Marschner U, Starke E (eds.) Nichtelektrische Netzwerke: Wie die Systemtheorie hilft, die Welt zu verstehen (TUDPress, Dresden), pp. 65-71 [pdf]
- Martin L, Birkholz P, Neuschaefer-Rube C (2015). Manipulation sekundärer prosodischer Merkmale mittels artikulatorischer Sprachsynthese. In: 32. Wissenschaftliche Jahrestagung der DGPP, Oldenburg, Germany. doi: 10.3205/15dgpp28
- Zaretsky E, Pluschinski P, Birkholz P, Strecha G, Neuschaefer-Rube C, Sader R, Hey C (2015). Visualisierung des Schluckvorgangs mittels Oberflächenelektromyographie und Elektropalatographie. In: 32. Wissenschaftliche Jahrestagung der DGPP, Oldenburg, Germany. doi: 10.3205/15dgpp50
- Pluschinski P, Zaretsky E, Grabmann N, Birkholz P, Neuschaefer-Rube C, Sader R, Hey C (2015). Intrapersonelle Unterschiede in schluckrelevanten sEMG-Signalen gleicher Volumina und Konsistenzen. In: 32. Wissenschaftliche Jahrestagung der DGPP, Oldenburg, Germany. doi: 10.3205/15dgpp53
2014
- Junger J, Habel U, Bröhr S, Neulen J, Neuschaefer-Rube C, Birkholz B, Kohler C, Schneider F, Derntl B, Pauly K (2014). More than just two sexes: The neural correlates of voice gender perception in gender dysphoria. PLoS ONE, 9(11): e111672. doi: 10.1371/journal.pone.0111672 [Open Access]
- Neuschaefer-Rube C, Preuß S, Eckers C, Birkholz P (2014). Entwicklung eines OPG-gesteuerten Serious Games als innovatives therapeutisches Hilfsmittel zur Durchführung mundmotorischer Übungen. In: 31. Wissenschaftliche Jahrestagung der DGPP, Lübeck, Germany. doi: 10.3205/14dgpp19
- Prom-on S, Birkholz P, Xu Y (2014). Estimating vocal tract shapes of Thai vowels from contextual tonal variation. In Proc. of the 17th Oriental COCOSDA, Phuket, Thailand [pdf]
- Prom-on S, Birkholz P, Xu Y (2014). Identifying underlying articulatory targets of Thai vowels from acoustic data based on an analysis-by-synthesis approach. EURASIP Journal on Audio, Speech, and Music Processing, 2014:23, doi:10.1186/1687-4722-2014-23 [Open Access]
- Mumtaz R, Preuß S, Neuschaefer-Rube C, Hey C, Sader R, Birkholz P (2014). Tongue contour reconstruction from optical and electrical palatography. IEEE Signal Processing Letters, 21(6), pp. 658-662, doi: 10.1109/LSP.2014.2312456 [link]
- Birkholz P (2014). Enhanced area functions for noise source modeling in the vocal tract. In Proc. of the 10th International Seminar on Speech Production (ISSP 2014), pp. 37-40, Cologne, Germany [pdf]
- Preuß S, Neuschaefer-Rube C, Birkholz P (2014). Evaluation of an OPG-controlled animated vocal tract model as a biofeedback system. In Proc. of the 10th International Seminar on Speech Production (ISSP 2014), pp. 340-343, Cologne, Germany [pdf]
- Birkholz P, Schutte M, Preuß S, Neuschaefer-Rube C (2014). Towards non-invasive velum state detection during speaking using high-frequency acoustic chirps. In: Hoffmann R (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2014 (TUDPress, Dresden), pp. 126-133 [pdf]
- Preuß S, Eckers C, Birkholz P, Neuschaefer-Rube C (2014). Ein OPG-gesteuertes Serious Game zur Unterstützung mundmotorischer Übungen. In: Hoffmann R (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2014 (TUDPress, Dresden), pp. 134-141 [pdf]
- Lasarcyk E, Birkholz P, Barry WJ (accepted). Imitating a bi-dialectal speaker using acoustic-to-articulatory inversion: Articulatory basis of vowels in Saxon and Standard High German. In Proc. of the International Workshop on Multilinguality in Speech Research: Data, Methods and Models, Schloss Dagstuhl
2013
- Birkholz P (2013). Modeling consonant-vowel coarticulation for articulatory speech synthesis. PLoS ONE, 8(4): e60603. doi:10.1371/journal.pone.0060603 [link]
- Junger J, Pauly K, Bröhr S, Birkholz P, Neuschaefer-Rube C, Kohler C, Schneider F, Derntl B, Habel U (2013). Sex matters: Neural correlates of voice gender perception. NeuroImage, 79, pp. 275-287
- Xu Y, Lee A, Wu W-L, Liu X, Birkholz P (2013). Human vocal attractiveness as signaled by body size projection. PLoS ONE, 8(4): e62397. doi:10.1371/journal.pone.0062397 [link]
- Heinen E, Birkholz P, Willmes K, Neuschaefer-Rube C (2013). Beeinflussen Zungenpiercings die Sprechqualität? In: 30. Wissenschaftliche Jahrestagung der DGPP, Bochum, Germany. doi: 10.3205/13dgpp24
- Neuschaefer-Rube C, Junger J, Derntl B, Habel U, Frölich D, Birkholz P (2013). Gibt es eine genderspezifische Stimmverarbeitung im Gehirn? Eine fMRI-Studie bei gesunden Erwachsenen. In: 30. Wissenschaftliche Jahrestagung der DGPP, Bochum, Germany. doi: 10.3205/13dgpp56
- Pluschinski P, Zaretsky Y, Sader R, Birkholz P, Mumtaz R, Neuschaefer-Rube C, Hey C (2013). Oberflächenmyographie als Biofeedback-Verfahren für Dysphagiepatienten: Bestimmung der optimalen Elektrodenpositionen und -anzahl. In: 30. Wissenschaftliche Jahrestagung der DGPP, Bochum, Germany. doi: 10.3205/13dgpp66
- Preuß S, Neuschaefer-Rube C, Birkholz P (2013). Real-time control of a 2D animation model of the vocal tract using optopalatography. In Proc. of the Interspeech 2013, pp. 997-1001, Lyon, France [pdf]
- Prom-on S, Birkholz P, Xu Y (2013). Training an articulatory synthesizer with continuous acoustic data. In Proc. of the Interspeech 2013, pp. 349-353, Lyon, France [pdf]
- Neuschaefer-Rube C, Junger J, Pauly K, Birkholz P, Schneider F, Kohler C, Bröhr S, Derntl B, Habel U (2013). Gender-specific voice perception in the brain. fMRI-data in adult volunteers. In: Proc. of the 29th World Congress of the IALP, Turin, Italy
- Preuß S, Neuschaefer-Rube C, Birkholz P (2013). Prospects of EPG and OPG sensor fusion in pursuit of a 3D real-time representation of the oral cavity. In: Wagner P (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2013 (TUDPress, Dresden), pp. 144-151 [pdf]
- Preuß S, Neuschaefer-Rube C, Birkholz P (2013). Real-time feedback of speech movements based on optopalatography. LingUnite – Tag der Sprachforschung, RWTH Aachen University, Germany [abstract]
- Birkholz P (2013). Artikulatorisch-akustische Simulation der Spracherzeugung. LingUnite – Tag der Sprachforschung, RWTH Aachen University, Germany [abstract]
- Birkholz P (2013). Elektromyographische Analyse von Sprech- und Schluckbewegungen. In: Wagner P (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2013 (TUDPress, Dresden), pp. 119 [abstract]
2012
- Birkholz P, Neuschaefer-Rube C (2012). A new artificial palate design for the optical measurement of tongue and lip movements. In: Wolff M (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2012 (TUDPress, Dresden), pp. 89-95 [pdf]
- Birkholz P, Hoole P (2012). Intrinsic velocity differences of lip and jaw movements: preliminary results. In Proc. of the Interspeech 2012, Portland, Oregon, USA [pdf]
- Birkholz P, Dächert P, Neuschaefer-Rube C (2012). Advances in combined electro-optical palatography. In Proc. of the Interspeech 2012, Portland, Oregon, USA [pdf]
- Birkholz P, Neuschaefer-Rube C (2012). A system for the comparison of glottal source models for articulatory speech synthesis. Abstract for the 8th International
- Conference on Voice Physiology and Biomechanics, Erlangen, Germany [pdf]
2011
- Kröger BJ, Birkholz P, Neuschaefer-Rube C (2011). Towards an articulation-based developmental robotics approach for word processing in face-to-face communication. PALADYN Journal of Behavioral Robotics, 2(2), pp. 82-93
- Kröger BJ, Birkholz P, Kannampuzha J, Kaufmann E, Mittelberg I (2011). Movements and holds in fluent sentence production of American Sign Language: The action-based approach. Cognitive Computation, 3(3), pp. 449-465
- Birkholz P, Kröger BJ, Neuschaefer-Rube C (2011). Model-based reproduction of articulatory trajectories for consonant-vowel sequences. IEEE Transactions on Audio, Speech and Language Processing, 19(5), pp. 1422-1433
- Kröger BJ, Birkholz P, eds. (2011). Studientexte zur Sprachkommunikation (Bd. 61): Elektronische Sprachsignalverarbeitung 2011. Tagungsband der 22. Konferenz. TUDpress, Dresden
- Birkholz P, Neuschaefer-Rube C (2012). Messung von Sprechbewegungen durch eine Gaumenplatte mit integrierten Abstands- und Kontaktsensoren. In: 28th Jahrestagung der DGPP, Zurich, Switzerland
- Birkholz P (2011). A survey of self-oscillating lumped-element models of the vocal folds. In: Kröger BJ, Birkholz P (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2011 (TUDPress, Dresden), pp. 47-58 [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Eckers C, Kaufmann E, Neuschaefer-Rube C (2011). Neurobiological interpretation of a quantitative target approximation model for speech actions. In: Kröger BJ, Birkholz P (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2011 (TUDPress, Dresden), pp. 184-194 [pdf]
- Kröger BJ, Birkholz P, Kaufmann E, Neuschaefer-Rube C (2011). Beyond vocal tract actions: Speech prosody and co-verbal gesturing in face-to-face communication. In: Kröger BJ, Birkholz P (eds.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2011 (TUDPress, Dresden), pp. 195-204 [pdf]
- Birkholz P, Neuschaefer-Rube C (2011). Combined optical distance sensing and electropalatography to measure articulation. In Proc. of the Interspeech 2011, pp. 285-288, Florence, Italy [pdf]
- Birkholz P, Kröger BJ, Neuschaefer-Rube C (2011). Synthesis of breathy, normal, and pressed phonation using a two-mass model with a triangular glottis. In Proc. of the Interspeech 2011, pp. 2681-2684, Florence, Italy [pdf]
- Birkholz P, Hoole P, Kröger BJ, Neuschaefer-Rube C (2011). Tongue body loops in vowel sequences. In Proc. of the 9th International Seminar on Speech Production (ISSP 2011), pp. 203-210, Montreal, Canada [pdf]
- Birkholz P, Kröger BJ, Neuschaefer-Rube C (2011). Articulatory synthesis of words in six voice qualities using a modified two-mass model of the vocal folds. In Proc. of the First International Workshop on Performative Speech and Singing Synthesis (p3s 2011), Vancouver, BC, Canada [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2011). Categorical perception of consonants and vowels: Evidence from a neurophonetic model of speech production and perception. In: Esposito A, Esposito AM, Martone R, Müller VC, Scarpetta G (eds.) Towards Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues. LNCS 6456 (Springer, Berlin), pp. 354-361 [pdf]
2010
- Birkholz P, Kröger BJ, Neuschaefer-Rube C (2010). Articulatory synthesis and perception of plosive-vowel syllables with virtual consonant targets. In Proc. of the Interspeech 2010, pp. 1017-1020, Makuhari, Japan [pdf]
- Birkholz P, Kröger BJ, Neuschaefer-Rube C (2010). Stimmsynthese mit einem Zwei-Massen-Modell der Stimmlippen mit dreieckigem Öffnungsquerschnitt. In: 27th Jahrestagung der DGPP, Aachen, Germany [pdf]
- Bauer D, Birkholz P, Kannampuzha J, Kröger BJ (2010). Evaluation of articulatory speech synthesis: a perception study. In: 36th Deutsche Jahrestagung für Akustik (DAGA 2010), pp. 1003-1004, Berlin, Germany [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2010). Modeling different voice qualities for female and male talkers using a geometric-kinematic articulatory voice source model: preliminary results. In: Fuchs S, Hoole P, Mooshammer C, Zygis M (eds.) Between the Regular and the Particular in Speech and Language. Frankfurt/M: Peter Lang Verlag, pp. 97-124.
- Kröger BJ, Birkholz P, Hoffmann R, Meng H (2010). Audiovisual tools for phonetic and articulatory visualisation in computer-aided pronunciation training. In: Esposito A, Campbell N, Vogel N, Hussain A, Nijholt A (eds.) Development of Multimodal Interfaces: Active Listening and Synchrony. LNCS 5967 (Springer Verlag, Berlin), pp. 337-345 [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2010). Categorical perception results from clustering of supramodal neural representations of sounds and syllables: Evidence from computer simulation experiments. Abstracts of the Second Annual Neurobiology of Language Conference NCL 2010, pp. 129, San Diego, CA, United States
2009
- Birkholz P, Lehnert B, Neuschaefer-Rube C (2009). VocalTractLab – Ein neues Softwaretool für die artikulatorische Sprachsynthese in der Lehre. In: 26th Jahrestagung der DGPP, pp. 209–211, Leipzig, Germany
- Kröger BJ, Birkholz P (2009). Artikulatorische Sprachsynthese. In: Hoffmann R (ed.) Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2009 (TUDPress, Dresden), vol. 1, pp. 182-189 [pdf]
- Kröger BJ, Birkholz P (2009). Articulatory synthesis of speech and singing: State of the art and suggestions for future research. In: Esposito A, Hussain A, Marinaro M (eds.) Multimodal Signals: Cognitive and Algorithmic Issues. LNAI 5398 (Springer Verlag, Berlin), pp. 306–319 [pdf]
2008
- Boë LJ, Ménard L, Serkhane J, Birkholz P, Kröger BJ, Badin P, Captier G, Canault M, Kielwasser N (2008). La croissance de l’instrument vocal : contrôle, modélisation, potentialités acoustiques et conséquences perceptives. Revue Française de Linguistique Appliquée, 13, pp. 59–80
- Boë LJ, Captier G, Granat J, Deshayes MJ, Heim LJ, Birkholz P, Badin P, Kielwasser N, Sawallis T (2008). Skull and vocal tract growth from fetus to 2 years. In 8th International Seminar on Speech Production (ISSP 2008), pp. 157–160, Strasbourg, France
2007
- Birkholz P, Jackèl D, Kröger BJ (2007). Simulation of losses due to turbulence in the time-varying vocal system. IEEE Transactions on Audio, Speech and Language Processing, 15(4), pp. 1218–1226
- Kröger BJ, Birkholz P, Neuschaefer-Rube C (2007). Ein neuronales Modell zur sensomotorischen Entwicklung des Sprechens. Laryngo-Rhino-Otologie, 86, pp. 365–370
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2007). Modeling the perceptual magnet effect and categorical perception using self-organizing neural networks. In Proc. of the 16th International Congress of Phonetic Sciences (ICPhS 2007), pp. 789–792, Saarbrücken, Germany [pdf]
- Birkholz P, Kröger BJ (2007). Simulation of vocal tract growth for articulatory speech synthesis. In Proc. of the 16th International Congress of Phonetic Sciences (ICPhS 2007), pp. 377–380, Saarbrücken, Germany [pdf]
- Birkholz P, Steiner I, Breuer S (2007). Control concepts for articulatory speech synthesis. In Proc. of the 6th ISCA Workshop on Speech Synthesis, pp. 5–10, Bonn, Germany [pdf]
- Birkholz P (2007). Control of an articulatory speech synthesizer based on dynamic approximation of spatial articulatory targets. In Proc. of the Interspeech 2007 - Eurospeech, pp. 2865–2868, Antwerp, Belgium [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2007). Multidirectional mappings and the concept of a mental syllabary in a neural model of speech production. In: 33rd Deutsche Jahrestagung für Akustik (DAGA 2007), pp. 91–92, Stuttgart, Germany [pdf]
- Kröger BJ, Birkholz P (2007). A gesture–based concept for speech movement control in articulatory speech synthesis. In: Esposito A, Faundez-Zanuy M, Keller E, Marinaro M (eds.) Verbal and Nonverbal Communication Behaviours, LNAI 4775 (Springer Verlag, Berlin), pp. 174–189 [pdf]
- Birkholz P (2007). Articulatory synthesis of singing. In: Bloothooft G (ed.) Synthesis of Singing Challenge. Antwerp, Belgium, URL: http://www.let.uu.nl/~Gerrit.Bloothooft/personal/SSC/ index.htm
2006
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2006). Learning to associate speech-like sensory and motor states during babbling. In Proc. of the 7th International Seminar on Speech Production (ISSP 2006), pp. 67–74, Ubatuba, Brazil [pdf]
- Birkholz P, Kröger BJ (2006). Vocal tract model adaptation using magnetic resonance imaging. In Proc. of the 7th International Seminar on Speech Production (ISSP 2006), pp. 493–500, Ubatuba, Brazil [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2006). Modeling sensory-to-motor mappings using neural nets and a 3d articulatory speech synthesizer. In Proc. of the Interspeech 2006–ICSLP, pp. 565–568, Pittsburgh, Pennsylvania, USA [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2006). Spatial-to-joint coordinate mapping in a neural model of speech production. In: 32nd Deutsche Jahrestagung für Akustik (DAGA 2006), pp. 561–562, Braunschweig, Germany [pdf]
- Birkholz P, Jackèl D (2006). Modellierung des subglottalen Systems für die Artikulatorische Sprachsynthese. In: 32nd Deutsche Jahrestagung für Akustik (DAGA 2006), pp. 557–558, Braunschweig, Germany
- Birkholz P, Jackèl D, Kröger BJ (2006). Construction and control of a three-dimensional vocal tract model. In Proc. of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), pp. 873–876, Toulouse, France [pdf]
- Kröger BJ, Birkholz P, Kannampuzha J, Neuschaefer-Rube C (2006). Somatosensory, auditory, and motor representations in a neural model of speech production. In Stem-, Spraak- en Taalpathologie 14, Suppl. – Abstracts of the 5th International Conference for Speech Motor Control, pp. 33, Nijmegen, Netherlands
- Birkholz P, Jackèl D (2006). Noise sources and area functions for the synthesis of fricative consonants. Rostocker Informatik Berichte, 30, pp. 17–23 [pdf]
2005
- Birkholz P (2005). 3D-Artikulatorische Sprachsynthese. Logos Verlag, Berlin [pdf]
- Birkholz P, Jackèl D (2005). Artikulatorische Sprachsynthese mit dem Programm TractSyn - Ein Überblick. In: 31st Deutsche Jahrestagung für Akustik (DAGA 2005), pp. 79–80, Munich, Germany
2004
- Birkholz P, Jackèl D (2004). Influence of temporal discretization schemes on formant frequencies and bandwidths in time domain simulations of the vocal tract system. In Proc. of the Interspeech 2004-ICSLP, pp. 1125–1128, Jeju, Korea [pdf]
- Birkholz P, Jackèl D (2004). Boundary-layer resistance in time-domain simulations of the vocal tract system. In Proc. of the 12th European Signal Processing Conference (EUSIPCO 2004), pp. 999–1002, Vienna, Austria [pdf]
- Birkholz P, Jackèl D (2004). Simulation of flow and acoustics in the vocal tract. In: 30th Deutsche Jahrestagung für Akustik (CFA/DAGA 2004), pp. 895–896, Strasbourg, France [pdf]
- Birkholz P, Jackèl D (2004). Automatic fricative noise generation in the vocal tract based on aeroacoustic principles (abstract only). In: German-French Summerschool on “Cognitive and Physical Models of Speech Production, Perception and Perception-Production Interaction”, Lubmin, Germany
2003
- Birkholz P, Jackèl D (2003). A three-dimensional model of the vocal tract for speech synthesis. In Proc. of the 15th International Congress of Phonetic Sciences (ICPhS 2003), pp. 2597–2600, Barcelona, Spain [pdf]
- Birkholz P (2003). Grundfrequenzbestimmung unter Berücksichtigung linearer Frequenzänderungen. In: 29th Deutsche Jahrestagung für Akustik (DAGA 2003), pp. 768–769, Aachen, Germany [pdf]
- Birkholz P, Jackèl D (2003). Sprachgesteuerte Gesichtsanimation. Rostocker Informatik Berichte, 28
2002
- Birkholz P (2002). Entwicklung eines dreidimensionalen Artikulatormodells für die Sprachsynthese. Diploma thesis, University of Rostock, Germany [pdf]