word processor

Word processor
Higher Edu - Research dev card
Development from the higher education and research community
  • Creation or important update: 12/03/10
  • Minor correction: 12/03/10

String process : implementation of algorithms for automatic processing of natural language

This software was developed (or is under development) within the higher education and research community. Its stability can vary (see fields below) and its working state is not guaranteed.
  • Web site
  • System:
  • Current version: 1.0 - 2001
  • License(s): not yet chosen
  • Status: stable release
  • Support: maintained, no ongoing development
  • Designer(s): Maxime Crochemore, Christophe Hancart, Thierry Lecroq
  • Contact designer(s): Thierry.Lecroq @ univ-rouen.fr
  • Laboratory, service:

 

General software features

Set of C and Java code to implement algorithms on strings :

  • Pattern matching automata
  • String searching with a sliding window
  • Suffix arrays
  • Structures for indexes
  • Indexes
  • Alignments
  • Approximate patterns
  • Local periods

These algorithms are presented in [Crochemore, Hancart, Lecroq, 2001].

Context in which the software is used

This set of programs was provided to explain the algorithms of the book [Crochemore, Hancart, Lecroq, 2001].
This is the first French book dedicated to string process algorithms.

Publications related to the software
  • Algorithmique du texte, M. Crochemore, C. Hancart et T. Lecroq, Vuibert, 2001
  • Algorithms on Strings, M. Crochemore, C. Hancart et T. Lecroq, Cambridge University Press, 2007
Higher Edu - Research dev card
Development from the higher education and research community
  • Creation or important update: 24/03/09
  • Minor correction: 10/07/13

Unitex : corpus processing using finite state technology

This software was developed (or is under development) within the higher education and research community. Its stability can vary (see fields below) and its working state is not guaranteed.
  • Web site
  • System:
  • Current version: 3.0 stable - september 2012
  • License(s): LGPL - - The language resources distributed with the software are licensed LGPLLR, a license developed by the Université Paris-Est Marne-la-Vallée and validated by the FSF as the equivalent of LGPL for linguistic data. http://igm.univ-mlv.fr/~unitex/lgpllr.html
  • Status: validated (according to PLUME), stable release, under development
  • Support: maintained, ongoing development
  • Designer(s): Sébastien Paumier
  • Contact designer(s): unitex@univ-mlv.fr
  • Laboratory, service:

 

General software features

The Unitex system provides tools to build language resources such as electronic dictionaries and grammars to use them in advanced searches in texts and in generating concordances.

The French validated software index card Fiche Plume describes the software in detail.

Context in which the software is used

Exploration tool used for research by the language processing team of the computer laboratory.
It is also used in several universities at international level as a tool for research and teaching in computer language studies.

Publications related to the software
  • Sébastien Paumier. 2000. Nouvelles méthodes pour la recherche d'expressions dans de grands corpus. In A. Dister (ed.), Actes des 3èmes Journées INTEX. Revue Informatique et Statistique dans les Sciences Humaines, 36ème année, n° 1 à 4.
  • Sébastien Paumier. 2003. A Time-Efficient Token Representation for Parsers, Proceedings of the EACL Workshop on Finite-State Methods in Natural Language Processing, Budapest, pp. 83-90.
  • Other publications associated with the project can be found at its website.
Syndicate content