Coding a Overheard Speech (OvS)Files
This page contains instructions for annotating the overheard speech files. Any questions, email Jasenia Hartman at jaseniahartman@fas.harvard.edu
NOTE: When you are working on a file, mark the file as in progress on Asana by moving it to the "in progress" column
Naming system
OvS files are named the following format OvS_subID_subMo
The first code stands for Overheard Speech.
The second code refers to the subject’s seedlings subject ID
The third code refers to the infant’s age at which recording occurred
So for example, the file OvS_45_07 refers to SEEDLinG subject 45 at 07 months.
Paths and Folders
1. The path to overheard speech directory is /Volumes/Fas-Phyc-PEB-Lab/OvSpeech/SubjectFiles/Seedlings/overheard_speech
. Here, you will find several subfolders. The relevant folders for transcription are the following: eafs, annotations-in-progress, annotations-to-be-superchecked, and annotations-complete.
Getting Started
The eaf folder contains all the files for the overheard speech. Find the folder that you are assigned to and copy it the annotations-in-progress folder.
Go to the
eaf folder.
Find the folder that corresponds to the file you are assigned to annotate. Copy this folder into theannotations-in-progress
folder.In the
annotations-in-progress
folder, rename the copied folder by adding your initials to the end of the folder name. So for example, if I were assigned to annotate OvS_45_07, then I would renamed the folder OvS_45_07_JH. Do the same for the eaf file.Open the eaf file. There are two things you need to do before you can begin annotation. The first is linking the audio file. The second is adding the cds tier type.
Link the audio file following the steps below:
a. In ELAN, go to Edit > Linked Files…
b. Click Add..
c. Go to Seedlings/Subject_Files/SubID/subID_subMo/Home_Visit/Processing/Audio_Files/subID_subMo.wav
(Note: SubID will be the second code of the overheard speech file, subMo will the third code of the overheard speech file. So, in the case of OvS_45_07, the path is the following: Seedlings/Subject_Files/45/45_07/Home_Visit/Processing/Audio_Files/45_07.wav)
Add the cds tier type in the following the steps below:
a. In Elan, go to Edit>Edit Control Vocabularies...
b. Click external CV and enter this url link: https://raw.githubusercontent.com/BergelsonLab/public-files/main/ACLEW-blab-vocabularies.ecv
c. Click Ok and then Close.
d. Go to Tier > Add New Tier Type...
e. Enter the following for CDS
f. Click add and then close
5. You can now begin annotating. Annotate the .eaf file, one clip at a time, following ACLEW standards, in the verision of your folder in annotations-in-progress.
A few notes
Take notes about any coding issues you face as you go (e.g., if there is a annotation that you would like reviewed or if there is anything weird or difficult to code with this file)
if there are any utterances directed to a child, you will have to create a dependent cds subtier under the xds for the speaker (For instructions, See Step 4 under Gold Standard Test under the coding for different types of child directed speech page).
Once you're done, slide the task to the “ready for superchecker” on asana.
Last updated