|About Research Members Publications Resources Contact|
How do human children learn to understand and produce speech without explicit teaching? What aspects of language development are built-in to our brains and bodies, and how much is actually learnable from the
environment using generic cognitive skills? How can we make machines to use and understand language in the way humans do, not necessarily
through textual representations, but by truly understanding and communicating meanings in the signal?
These are some of the key questions that we work on in the Speech and Cognition research group. Our primary research method is computational modeling that combines signal processing and machine learning to (potentially large-scale) language and multimodal data in order to address these questions. In addition, we work on various other topics related to speech technology and signal processing, such as development of automatic detection of neurophysiological problems in infants and development of technological tools for large-scale audio- and language data analysis.
Some selected publications
Khorrami, K. & Räsänen, O. (2021). Can phones, syllables, and words emerge as side-products of cross-situational audiovisual learning? – A computational investigation. Language Development Research, https://doi.org/10.34842/w3vw-s845
Räsänen, O., Seshadri, S., Lavechin, M., Cristia, A., & Casillas, M. (in press). ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings.
Behavior Research Methods
(PsyArXiv) (ALICE open-source).
Airaksinen, M., Räsänen, O., Ilén, E., Häyrinen, T., Kivi, A., Marchi, V., Gallen, A., Blom, S., Varhe, A., Kaartinen, N., Haataja, L., & Vanhatalo, S. (2020). Automatic posture and movement
tracking of infants with wearable movement sensors.
Scientific Reports, 10:169
Räsänen, O., Doyle, G., & Frank, M. C. (2018). Pre-linguistic segmentation of speech into syllable-like units. Cognition, 171, 130–150
Kakouros, S., Salminen, N. & Räsänen, O. (2018). Making predictable unpredictable with style
— Behavioral and electrophysiological evidence for the critical role of prosodic expectations in the perception of prominence in speech.
Neuropsychologia, 109, 181–199
Räsänen, O., Kakouros, S. & Soderstrom, M. (2018). Is infant-directed speech interesting because it is surprising? — Linking properties of IDS to statistical learning and attention at the prosodic level. Cognition, 178, 193–206 (.pdf).
Räsänen, O. & Rasilo, H. (2015). A joint model of word segmentation and meaning acquisition through cross-situational learning. Psychological Review, 122(4), 792–829 (.pdf).
Räsänen, O. & Laine, U. K. (2013). Time-frequency integration characteristics of hearing are optimized for perception of speech-like acoustic patterns. The Journal of the Acoustical Society of America, 134, 407–419 (web).