Refine
Document Type
- Article (3) (remove)
Language
- English (3) (remove)
Keywords
- Child language (1)
- Lexical database (1)
- Reading development (1)
Institute
This article introduces childLex, an online database of German read by children. childLex is based on a corpus of children's books and comprises 10 million words that were syntactically annotated and lemmatized. childLex reports linguistic norms for lexical, superlexical, and sublexical variables in three different age groups: 6-8 (grades 1-2), 9-10 (grades 3-4), and 11-12 years (grades 5-6). Here, we describe how childLex was collected and analyzed. In addition, we provide information about the distributions of word frequency, word length, and orthographic neighborhood size, as well as their intercorrelations. Finally, we explain how childLex can be accessed using a Web interface.
Inhalt: Introduction Developments in creating corpora dlexDB, subtitles, and tabloid newspapers Rating corpus emotionality Current study Method Materials Corpora Results Type-token ratio Validity: Effects of task difficulty Emotionality of a corpus Validity: Effects of emotionality Discussion Outlook References