The BiRCh corpus is a work in progress. These files are updated whenever changes are made, and new files are posted as they become ready. The sample files below all pertain to child B (USA, born Dec. 30, 2012).
Morphologically tagged pseudonymized transcriptions (XML)
Because of quirks in our pilot annotation software, these tagged files group utterances by speaker rather than in chronological order: first all of the child's utterances, then the mother's, then other speakers'.
In future, tagged files will be chronological and will include speaker tags (child, mother, father, etc.).