Week 13

This week I focused on trying to build an outline and presentation for the paper I’m hoping to be submitting to CogSci in February. Sidenote: with my past research experiences, the presentation and paper go hand in hand. That is, it’s easier to write a paper or create a presentation first and then do the other. Ideally, you work on them in tandem…

The overall goal of the project is try and use speech recognition to better understand language acquisition. To do this, we want to train a speech recognition system on child-directed speech to try and see if the acoustics of the parent input accounts for any of the variation in how children acquire language: slide1

Here I outline how typical human speech recognition works. When it’s between two adult humans who don’t have any aphasia’s or communication disorders it’s usually very accurate. Two individuals can carry on in a conversation where multiple acoustic strings are exchanged: slide2

slide3

In order to better understand how humans got so good, it’s important to look at the field of early (first) language acquisition: slide4

First language acquisition outlines a number of theories on how humans acquire language: slide5

One of the main theories of how children acquire is by reasoning with analogy or imitating their linguistic input (parents): slide6

Perhaps it’s reinforcement learning and correction that explains the ability for children make progress. More clearly, children have some internal policy that is a function of their parents such that when they utter incorrect phrases or construct they are “punished” (there is some correction cost to an incorrect utterance). However, when children say correct utterances the lack of correct reinforces good behavior: slide7

Parental input has become a crucial component to assessing how children acquire language. Of course, parent input (how parents choose to teach their children language) differs across cultures. These cultures differences are often based on geographical location, class, and ethnicity. For example, there are many Native American tribes that engage with their children as true conversational partners as opposed to talking to their children with impoverished speech like some of their midwestern middle-class American counterparts. slide8

In fact, because we notice the variance in parental input and because it’s possible to find counterexamples to reinforcement and correction, and pattern recognition. Perhaps we have to appeal to something much deeper, intrinsic to children. The theory of Universal Grammar allows us to vision language acquisition as much more than reinforcement and pattern recognition, but outlines how all humans utilize their intuitions about the systematicity of language:

slide9

Here I outline how typical human to computer speech recognition works: slide10

slide11

Outline of ASR System: slide12

slide13

slide14

After sufficient training is done one using Word error rate as a form of testing and assessment: slide15

slide16

Using ASR to better understand language acquisition: slide17

slide18

slide19

slide20

slide21

Best,
EO