File Organization
All VIHI data lives on Fas-Phyc-PEB-Lab/VIHI.
IRB protocols and associated paperwork and flyers live with the other lab IRB protocols. It is organized by the substudy under the VIHI umbrella.
consent_sharing
This folder contains information about how we're allowed to share recordings we collected for VIHI. DataLibrariesPermissions.xlsx contains the permissions we obtained at the time of data collection. VIHI_recordings_sharing_permissions.xlsx is updated from when we moved from to Duke to Harvard and were required to get moving and sharing consent again as part of our data use agreement. Use VIHI_recordings_sharing_permissions.xlsx unless there is no response from a family included. Then default to the older DataLibrariesPermissions.xlsx.
Lab_visit_studies
Lab_visit_studies is for all the studies that involve a family coming in to the lab for a session, like pay sessions, eyetracking, or EEG studies. Inside, each study has its own folder by name and contains study checklists, experimenter scripts, experiments, stimuli, and the videos and data we collect during visits.
ELSSP
Stands for Early Learning Sensory Support Program. This is a study conducted by Erin Campbell in 2019 assessing the services being accessed by young children with hearing loss in North Carolina. All data and metadata for this study are contained in this folder.
Recruiting
This folder contains eligibility information and phone/email scripts for each of the studies in VIHI.
Scripts
Scripts contains (computational, not for speaking to families) scripts used for data or stimuli processing for VIHI projects, organized by script.
Surveys
This contains forms, participant information, and response data from each type of survey.
Photos
Contains photos of children during VIHI studies, with information about how we're allowed to share them.
Metadata
contains miscellaneous in-lab relevant information that is relevant to multiple substudies, including the naming scheme, keeping track of paying participants, participation dates, and VIHI_contact_information.xlsx, which is the spreadsheet that links identifiable information to participant subject numbers.
Corpus
This contains studcture and demographic information about the corpus data, instructions for collecting LENA recordings and project files that use the corpus. The most useful document in this folder is match_key.xlsx, which shows the VI and HI subjects side by side with which TD files they are matched to.
SubjectFiles
SubjectFiles contains corpus data from participants only, and is specific to the annotation process. There is a directory layer called LENA directly under this, which then breaks into folders by the stage of annotations:
annotations
is the permanent, clean, most up to date version of .eaf files and their associated notes. This folder is broken down by sensory group:

Then by participant number (without age in days):

Then by participant number (with age in days):

The participant folder with age in days should include all files from that data collection session.

Derivatives
contains metadata we extracted during the hivol sampling process.
annotations-in-progress
contains temporary versions of files from annotations (above), each tagged with a coder who is in the process of annotating it for something. These are regularly checked and merged back to annotations.
rawish-data
contains .its files, which are the files generated by the LENA software with metadata about each recording. Also contains the audio recordings themselves: .wav and .mp3 versions.
exported-annotations
are different, dated versions of the text annotations we export en masse from the .eaf files in VIHI.
Other folders are reliability files, which are sub-sampled to be recoded and assess intercoder agreement.
Last updated