File Organization

All VIHI data lives on Fas-Phyc-PEB-Lab/VIHI, which is organized by substudy under the VIHI umbrella. IRB protocols and their associated paperwork and flyers live with the other lab IRB protocols.

This folder contains information about how we're allowed to share the recordings we collected for VIHI. DataLibrariesPermissions.xlsx contains the permissions we obtained at the time of data collection. VIHI_recordings_sharing_permissions.xlsx is the updated version, created when we moved from Duke to Harvard and were required to obtain consent for moving and sharing the recordings again as part of our data use agreement. Use VIHI_recordings_sharing_permissions.xlsx unless it has no response from a given family; in that case, default to the older DataLibrariesPermissions.xlsx.
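The fallback rule above can be sketched in Python. This is only an illustration: the function name, the dictionary layout, and the example IDs and permission values are hypothetical, and the real spreadsheets' column names are not assumed here. Each spreadsheet is represented as a plain dictionary from a family identifier to that family's recorded response.

```python
from typing import Mapping, Optional


def resolve_sharing_permission(
    family_id: str,
    updated: Mapping[str, Optional[str]],
    original: Mapping[str, Optional[str]],
) -> Optional[str]:
    """Prefer the updated permission; fall back to the original one.

    `updated` stands in for VIHI_recordings_sharing_permissions.xlsx and
    `original` for DataLibrariesPermissions.xlsx (hypothetical layout:
    family identifier -> response text).
    """
    response = updated.get(family_id)
    # A missing key, None, or a blank cell all count as "no response".
    if response is not None and response.strip() != "":
        return response
    return original.get(family_id)


# Made-up example values, for illustration only.
updated_sheet = {"family_A": "share with labs", "family_B": None}
original_sheet = {"family_A": "no sharing", "family_B": "share publicly"}

permission_a = resolve_sharing_permission("family_A", updated_sheet, original_sheet)
permission_b = resolve_sharing_permission("family_B", updated_sheet, original_sheet)
```

Here family_A resolves to its updated response, while family_B, which has no updated response, falls back to the older record.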

Lab_visit_studies

Lab_visit_studies is for all the studies that involve a family coming in to the lab for a session, like pay sessions, eyetracking, or EEG studies. Inside, each study has its own folder, named for the study, which contains study checklists, experimenter scripts, experiments, stimuli, and the videos and data we collect during visits.

ELSSP

ELSSP stands for the Early Learning Sensory Support Program. This is a study conducted by Erin Campbell in 2019 assessing the services accessed by young children with hearing loss in North Carolina. All data and metadata for this study are contained in this folder.

Recruiting

This folder contains eligibility information and phone/email scripts for each of the studies in VIHI.

Scripts

Scripts contains computational scripts (not scripts for speaking to families) used for data or stimulus processing for VIHI projects, organized by script.

Surveys

Surveys contains forms, participant information, and response data for each type of survey.

Photos

Photos contains photos of children taken during VIHI studies, along with information about how we're allowed to share them.

Metadata

Metadata contains miscellaneous in-lab information that is relevant to multiple substudies, including the naming scheme, records of participant payments, participation dates, and VIHI_contact_information.xlsx, the spreadsheet that links identifiable information to participant subject numbers.

Corpus

Corpus contains structure and demographic information about the corpus data, instructions for collecting LENA recordings, and project files that use the corpus. The most useful document in this folder is match_key.xlsx, which shows the VI and HI subjects side by side with the TD files they are matched to.

SubjectFiles

SubjectFiles contains corpus data from participants only and is specific to the annotation process. Directly under it is a directory layer called LENA, which breaks into folders by annotation stage:

annotations

annotations holds the permanent, clean, most up-to-date versions of the .eaf files and their associated notes. This folder is broken down by sensory group:

Folders in SubjectFiles > LENA > annotations

Then by participant number (without age in days):

Folders in SubjectFiles > LENA > annotations > HI

Then by participant number (with age in days):

Folders within LENA > HI > HI_423

The participant folder with age in days should include all files from that data collection session.

Files within HI_423_959
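As a rough illustration of the nesting described above, the following Python sketch builds the path to one session folder from its parts. The function name and its parameters are hypothetical, and the folder-name format is inferred only from the HI_423 and HI_423_959 examples; the naming scheme documented in Metadata is the authoritative source.

```python
from pathlib import PurePosixPath


def session_folder(group: str, participant: int, age_days: int,
                   root: str = "SubjectFiles") -> PurePosixPath:
    """Build the annotations path for one data collection session.

    Layout assumed from the examples in this document:
    SubjectFiles > LENA > annotations > group > participant > session.
    """
    participant_id = f"{group}_{participant}"        # e.g. HI_423
    session_id = f"{participant_id}_{age_days}"      # e.g. HI_423_959
    return (PurePosixPath(root) / "LENA" / "annotations"
            / group / participant_id / session_id)


example_path = session_folder("HI", 423, 959)
print(example_path)
# SubjectFiles/LENA/annotations/HI/HI_423/HI_423_959
```

PurePosixPath is used so the sketch only manipulates path strings and never touches the lab drive.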

Derivatives

Derivatives contains metadata we extracted during the hivol sampling process.

annotations-in-progress

annotations-in-progress contains temporary versions of files from annotations (above), each tagged with the coder who is currently annotating it for a particular purpose. These are regularly checked and merged back into annotations.

rawish-data

rawish-data contains .its files, which the LENA software generates with metadata about each recording. It also contains the audio recordings themselves, in .wav and .mp3 versions.
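When checking that a session has all of its raw files, it can help to sort file names by type. The sketch below is a minimal Python example under stated assumptions: the function name is hypothetical, and the example file names are made up (this document does not specify how files in rawish-data are named).

```python
from collections import defaultdict
from pathlib import PurePosixPath

# The three file types this document says live in rawish-data.
RAWISH_EXTENSIONS = {".its", ".wav", ".mp3"}


def group_rawish_files(filenames):
    """Group file names by extension, ignoring any other file types."""
    grouped = defaultdict(list)
    for name in filenames:
        suffix = PurePosixPath(name).suffix.lower()
        if suffix in RAWISH_EXTENSIONS:
            grouped[suffix].append(name)
    return dict(grouped)


# Hypothetical file names, for illustration only.
example_files = ["HI_423_959.its", "HI_423_959.wav", "HI_423_959.mp3", "notes.txt"]
grouped_files = group_rawish_files(example_files)
```

A session whose result lacks one of the three keys is missing that file type.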

exported-annotations

exported-annotations contains different, dated versions of the text annotations we export en masse from the .eaf files in VIHI.

The other folders hold reliability files, which are sub-sampled to be recoded so that we can assess intercoder agreement.
