Experiments with the Lexique French frequency database
Lexique 3 is a French word database from the Université de Savoie. SQLite 3 is a fast, local database. IPython is a cool way to work with and visualize scientific data.
I'm also merging in the French verb conjugation rules data set, which will provide detailed information about regular verbs.
A notebook with lots of interesting data is available via nbviewer.
Lexique380
is distributed under a Creativeverbs-0-2-0.xml
are in the publicverb-prototypes.tsv
file is generated using both the LexiqueYou will need:
iconv
sqlite3
pip
Run the following commands from the command line:
# Install ipython and supporting libraries.
pip install -U pandas
pip install -U ipython[notebook]
pip install -U brewer2mpl
# Generate our database from the raw Lexique data.
make
# Open up our interactive notebook in a web browser.
ipython notebook 'French Vocabulary Frequency with Lexique.ipynb'