TY - JOUR A1 - Hecker, Pascal A1 - Steckhan, Nico A1 - Eyben, Florian A1 - Schuller, Björn Wolfgang A1 - Arnrich, Bert T1 - Voice Analysis for Neurological Disorder Recognition – A Systematic Review and Perspective on Emerging Trends JF - Frontiers in Digital Health N2 - Quantifying neurological disorders from voice is a rapidly growing field of research and holds promise for unobtrusive and large-scale disorder monitoring. The data recording setup and data analysis pipelines are both crucial aspects to effectively obtain relevant information from participants. Therefore, we performed a systematic review to provide a high-level overview of practices across various neurological disorders and highlight emerging trends. PRISMA-based literature searches were conducted through PubMed, Web of Science, and IEEE Xplore to identify publications in which original (i.e., newly recorded) datasets were collected. Disorders of interest were psychiatric as well as neurodegenerative disorders, such as bipolar disorder, depression, and stress, as well as amyotrophic lateral sclerosis, Alzheimer's, and Parkinson's disease, and speech impairments (aphasia, dysarthria, and dysphonia). Of the 43 retrieved studies, Parkinson's disease is represented most prominently with 19 discovered datasets. Free speech and read speech tasks are most commonly used across disorders. Besides popular feature extraction toolkits, many studies utilise custom-built feature sets. Correlations of acoustic features with psychiatric and neurodegenerative disorders are presented. In terms of analysis, statistical analysis for significance of individual features is commonly used, as well as predictive modeling approaches, especially with support vector machines and a small number of artificial neural networks. 
An emerging trend and recommendation for future studies is to collect data in everyday life to facilitate longitudinal data collection and to capture the behavior of participants more naturally. Another emerging trend is to record additional modalities to voice, which can potentially increase analytical performance. KW - neurological disorders KW - voice KW - speech KW - everyday life KW - multiple modalities KW - machine learning KW - disorder recognition Y1 - 2022 U6 - https://doi.org/10.3389/fdgth.2022.842301 SN - 2673-253X PB - Frontiers Media SA CY - Lausanne, Switzerland ER - TY - GEN A1 - Hecker, Pascal A1 - Steckhan, Nico A1 - Eyben, Florian A1 - Schuller, Björn Wolfgang A1 - Arnrich, Bert T1 - Voice Analysis for Neurological Disorder Recognition – A Systematic Review and Perspective on Emerging Trends T2 - Zweitveröffentlichungen der Universität Potsdam : Reihe der Digital Engineering Fakultät N2 - Quantifying neurological disorders from voice is a rapidly growing field of research and holds promise for unobtrusive and large-scale disorder monitoring. The data recording setup and data analysis pipelines are both crucial aspects to effectively obtain relevant information from participants. Therefore, we performed a systematic review to provide a high-level overview of practices across various neurological disorders and highlight emerging trends. PRISMA-based literature searches were conducted through PubMed, Web of Science, and IEEE Xplore to identify publications in which original (i.e., newly recorded) datasets were collected. Disorders of interest were psychiatric as well as neurodegenerative disorders, such as bipolar disorder, depression, and stress, as well as amyotrophic lateral sclerosis, Alzheimer's, and Parkinson's disease, and speech impairments (aphasia, dysarthria, and dysphonia). Of the 43 retrieved studies, Parkinson's disease is represented most prominently with 19 discovered datasets. 
Free speech and read speech tasks are most commonly used across disorders. Besides popular feature extraction toolkits, many studies utilise custom-built feature sets. Correlations of acoustic features with psychiatric and neurodegenerative disorders are presented. In terms of analysis, statistical analysis for significance of individual features is commonly used, as well as predictive modeling approaches, especially with support vector machines and a small number of artificial neural networks. An emerging trend and recommendation for future studies is to collect data in everyday life to facilitate longitudinal data collection and to capture the behavior of participants more naturally. Another emerging trend is to record additional modalities to voice, which can potentially increase analytical performance. T3 - Zweitveröffentlichungen der Universität Potsdam : Reihe der Digital Engineering Fakultät - 13 KW - neurological disorders KW - voice KW - speech KW - everyday life KW - multiple modalities KW - machine learning KW - disorder recognition Y1 - 2023 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-581019 IS - 13 ER - TY - JOUR A1 - Garcia, Rowena A1 - Dery, Jeruen E. A1 - Roeser, Jens A1 - Höhle, Barbara T1 - Word order preferences of Tagalog-speaking adults and children JF - First language N2 - This article investigates the word order preferences of Tagalog-speaking adults and five- and seven-year-old children. The participants were asked to complete sentences to describe pictures depicting actions between two animate entities. Adults preferred agent-initial constructions in the patient voice but not in the agent voice, while the children produced mainly agent-initial constructions regardless of voice. This agent-initial preference, despite the lack of a close link between the agent and the subject in Tagalog, shows that this word order preference is not merely syntactically-driven (subject-initial preference). 
Additionally, the children’s agent-initial preference in the agent voice, contrary to the adults’ lack of preference, shows that children do not respect the subject-last principle of ordering Tagalog full noun phrases. These results suggest that language-specific optional features like a subject-last principle take longer to be acquired. KW - Child language acquisition KW - sentence production KW - Tagalog acquisition KW - voice KW - word order Y1 - 2018 U6 - https://doi.org/10.1177/0142723718790317 SN - 0142-7237 SN - 1740-2344 VL - 38 IS - 6 SP - 617 EP - 640 PB - Sage Publ. CY - London ER - TY - THES A1 - Garcia, Rowena T1 - Thematic role assignment and word order preferences in the child language acquisition of Tagalog T1 - Zuweisung thematischer Rollen und Wortstellungspräferenz im kindlichen Spracherwerb von Tagalog N2 - A critical task in daily communications is identifying who did what to whom in an utterance, or assigning the thematic roles agent and patient in a sentence. This dissertation is concerned with Tagalog-speaking children’s use of word order and morphosyntactic markers for thematic role assignment. It aims to explain children’s difficulties in interpreting sentences with a non-canonical order of arguments (i.e., patient-before-agent) by testing the predictions of the following accounts: the frequency account (Demuth, 1989), the Competition model (MacWhinney & Bates, 1989), and the incremental processing account (Trueswell & Gleitman, 2004). Moreover, the experiments in this dissertation test the influence of a word order strategy in a language like Tagalog, where the thematic roles are always unambiguous in a sentence, due to its verb-initial order and its voice-marking system. In Tagalog’s voice-marking system, the inflection on the verb indicates the thematic role of the noun marked by 'ang.' 
First, the possible basis for a word order strategy in Tagalog was established using a sentence completion experiment given to adults and 5- and 7-year-old children (Chapter 2) and a child-directed speech corpus analysis (Chapter 3). In general, adults and children showed an agent-before-patient preference, although adults’ preference was also affected by sentence voice. Children’s comprehension was then examined through a self-paced listening and picture verification task (Chapter 3) and an eye-tracking and picture selection task (Chapter 4), where word order (agent-initial or patient-initial) and voice (agent voice or patient voice) were manipulated. Offline (i.e., accuracy) and online (i.e., listening times, looks to the target) measures revealed that 5- and 7-year-old Tagalog-speaking children had a bias to interpret the first noun as the agent. Additionally, the use of word order and morphosyntactic markers was found to be modulated by voice. In the agent voice, children relied more on a word order strategy; while in the patient voice, they relied on the morphosyntactic markers. These results are only partially explained by the accounts being tested in this dissertation. Instead, the findings support computational accounts of incremental word prediction and learning such as Chang, Dell, & Bock’s (2006) model. N2 - A central requirement in daily communication is to identify who did what to whom in an utterance, that is, to assign the thematic roles of agent and patient in a sentence. This dissertation addresses Tagalog-speaking children’s use of word order and morphosyntactic markers for this assignment of thematic roles. The aim is to clarify why children have difficulty understanding sentences with a non-canonical order of arguments (e.g., patient-before-agent). 
To this end, the predictions of the following accounts were tested: the frequency account (Demuth, 1989), the Competition Model (MacWhinney & Bates, 1989), and the incremental processing account (Trueswell & Gleitman, 2004). Primarily, the experiments examine the effects of a possible word order strategy in a language like Tagalog, in which the thematic roles in a sentence are always unambiguously identifiable owing to its verb-initial word order and its transparent voice marking. In Tagalog’s voice marking, the inflection on the verb indicates the thematic role of the noun marked with 'ang'. First, the possible basis for a word order strategy in Tagalog was established. For this purpose, a sentence completion experiment with adults and 5- and 7-year-old children (Chapter 2) was conducted alongside an analysis of a child-directed speech corpus. In general, adults and children showed an agent-before-patient preference, although the adults’ preference was influenced by sentence voice. The children’s comprehension was assessed with a self-paced listening and picture verification task (Chapter 3) and a picture selection task with eye-tracking (Chapter 4). In the latter task, word order (agent-initial or patient-initial) and voice (agent voice or patient voice) were manipulated. Offline (i.e., accuracy) and online (i.e., listening times, looking times) measures showed that 5- and 7-year-old Tagalog-speaking children tend to interpret the first noun as the agent. In addition, the use of word order and morphosyntactic markers was found to be modulated by voice. 
In the agent voice, the children relied more on the word order strategy, whereas in the patient voice they relied more on morphosyntactic markers. These results can be only partially explained by the accounts tested in this dissertation. Instead, the findings appear to support computational theories of incremental word prediction and its acquisition, such as the model of Chang, Dell, & Bock (2006). KW - thematic role assignment KW - Zuweisung thematischer Rollen KW - Tagalog acquisition KW - sentence processing KW - production KW - word order KW - voice KW - Erwerb des Tagalog KW - Satzverarbeitung KW - Produktion KW - Wortstellung KW - Diathese KW - eye-tracking KW - Blickbewegungsmessung Y1 - 2018 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-421742 CY - Potsdam ER -