Alpage software and resources


Installation assistant for Alpage tools and resources

alpi (ALPage Installer)

alpi is a Perl script that helps users install Alpage software locally. It can also be used to install the whole Alpage linguistic processing chain for French. The script is reasonably user-friendly: it detects and installs software prerequisites (such as non-standard Perl packages).


Alpage software

alpc (Alpage Linguistic Processing Chain)

The Alpage team develops and maintains a full-featured linguistic processing chain for French (see our online Demos). This chain relies on DyALog, FRMG, Lefff and SxPipe (see below for a description of these tools and resources).

DyALog

DyALog is an environment for compiling and using logic programs and chart parsers for natural languages. It handles various grammatical formalisms (DCG, TAG, TIG, RCG). (Page on INRIA GForge).

SYNTAX

A set of tools for the automatic construction of efficient parsers from syntactic descriptions. SYNTAX handles several formalisms, such as (deterministic and non-deterministic) CFGs, TAGs, LFGs and RCGs.

ALPAGE Linguistic Workbench

ALPAGE Linguistic Workbench provides several modules for setting up and using a linguistic processing chain, in particular for French, including the shallow processing chain SxPipe and the POS-tagger MElt.

BONSAI

BONSAI is a complete processing chain for statistical dependency parsing of French, based on models trained on the French Treebank.

MElt

MElt is a freely available (LGPL) state-of-the-art sequence labeller aimed at generating morphosyntactic (POS) taggers trained on both annotated corpora and external lexicons. MElt is provided with a state-of-the-art tagging model for French, as well as tagging models for other languages (English, Spanish, Italian, German). MElt also includes a normalization wrapper that helps process noisy text, such as user-generated data retrieved from the web (French and English only).

MetaGrammar Toolkit

MetaGrammar Toolkit includes several tools for developing and compiling TAG metagrammars. It also contains a large-coverage metagrammar for French, FRMG. (Page on INRIA GForge)

SxLFG

SxLFG is a parser generator for Lexical Functional Grammars (LFG) that relies on SYNTAX.


Lexical resources

Alexina

Atelier pour les LEXiques INformatiques et leur Acquisition (Workbench for electronic lexica and their acquisition) - Development of morphological and syntactic lexica for NLP. Includes tools as well as several lexica: the Lefff (French), the Leffe (Spanish), PolLex (Polish), SkLex (Slovak), DeLex (German), PerLex (Persian), KurLex (Kurmanji Kurdish) and SoraLex (Sorani Kurdish). Two other freely-available lexica have been imported within the Alexina architecture, namely the morphological lexica for Dutch and Italian distributed respectively within the Alpino project and the Morph-it! lexicon.

WOLF

The WOLF (Wordnet Libre du Français) is a freely-available semantic lexicon (wordnet) for French.


Corpora

French Social Media Bank

A treebank built on data extracted from French social media (Facebook, Twitter) and French forums (Doctissimo, JeuxVideos.com). The main interest of this corpus is that it provides annotated data for texts whose quality ranges from medium to very noisy.

Sequoia Treebank

The corpus contains 3200 French sentences from Europarl, the Est Republicain newspaper, French Wikipedia and the European Medicines Agency. Each sentence is annotated for part-of-speech and phrase structure, following the French Treebank guidelines. The constituency trees were then automatically converted to dependency trees.

