• Week 18

    Cleaning up data This is the most tedious part of the project. I think that’s why I’ve been avoiding it for so long. It’s nothing but cleaning up the usable data so that it can be properly formatted for the actual ASR construction and analysis. For my data clean up...

  • Week 17

    Happy New Year! Still working on my CogSci submission. Best, EO

  • Week 16

    This week I will be working on my CogSci paper. I’ve attached the Overleaf link here. As a group we decided that we could write the entire paper independently of the results. So we outlined a specific hypothesis and the potential results that we would generate. Then, we complete our results...

  • Week 15

    Group meeting We had another group meeting to catch up on some last-minute details and go over our CogSci outlines. We were still unclear about how much usable data from the CHILDES corpus we have, so we talked about different ways to extract all the transcripts/audio files that had...

  • Week 14

    Revisiting data clean up This entire semester we’ve been trying to get an appropriate amount of data to train our automatic speech recognition system and build co-occurrence networks. It has been a little difficult because of the massive amount of data available on CHILDES. We still are in the process...

  • Week 13

    This week I focused on trying to build an outline and presentation for the paper I’m hoping to be submitting to CogSci in February. Sidenote: with my past research experiences, the presentation and paper go hand in hand. That is, it’s easier to write a paper or create a presentation...

  • Week 12

    Working with MATLAB This week I started playing around with MATLAB because I thought it would be useful for visualizing some of our summary statistics. MATLAB also has a suite of really nice audio functions that I thought might be helpful. Actually, I ended up writing a script that would...

  • Week 11

    Decoding, decoding, decoding Just to recap, I collected VoxForge speech data, formatted the data for Kaldi, split the VoxForge data into training and testing sets, and finished installing the language modeling system and the dictionary system. This week I started doing decoding on the VoxForge data. Decoding is essentially the last...