Download
Two types of files (.xlsx format) are available for download
• Lexical databases: The Eqol-infra-All
contains all the lexical entries (about 13,500)
whereas the Eqol-infra-Lemme
file contains only the orthographic forms
corresponding to the lemmas (lexemes; about 6,300). The
grapho-phonological statistics of the words are independent. The
Eqol-infra-lemme file allows to characterize the
grapho-phonological properties of words independently of
gender/number nominal inflections and verbal inflections. Note
that words that appear in schoolbooks in an inflected form only
are not included in this second analysis, since the
non-inflected form is not encountered by children when reading
print.
• Full lists of G-Ph, Ph-G, and rimes associations: Frequency
and consistency statistics for each association are generated
from all lexical entries in the EqolAll-Associations
file and from lexical entries corresponding to
lemmas in the EqolLemme-associations
file.
Filters are added at the top of the word lists to help
selection
The Eqol-All and Eqol-Lemme files contain the following information:
• Orthographic and phonological codes of the words
• Grammatical category according to the EQOL database
• Number of letters, phonemes, graphemes, syllables
• Graphemic complexity (n of letters / n of phonemes)
• Word frequency (per million words) from Grade 1 to Grade 6
according to EQOL
• Orthographic neighborhood (Levenshtein OLD20 index)
• G-Ph segmentation and Ph-G segmentation
• Phonological rime and orthographic counterpart
• Consistency and frequency of orthography-phonology (reading
direction) or phonology-orthography (writing direction)
associations on the word's phonological rime. Values by type and
token
• Average frequency and consistency of G-Ph and Ph-G
associations (values by type and by token)
• Least frequent or least consistent G-Ph and Ph-G associations
of the word. The least consistent association is not necessarily
the least frequent, and vice versa.
• Frequency and consistency of G-Ph associations (values by type
and by token) as a function of the position within the word
(initial, internal, final)
• Frequency and consistency of Ph-G associations (values by type
and by token) as a function of the position within the word
(initial, internal, final)
Note: Google Sheets allows you to browse the files from your
Google Drive. To import files directly into your Google Drive,
use Chrome and the "Save to Google Drive" extension available
on the Chrome Web Store. Then right-click on the file link to
save it to your Google Drive