The Ubuntu NLP Repository v8.04LTS
Powered by Falcon
Component nlp
Language-independent NLP tools.
You can use apt to download and install the packages. Use the following lines in /etc/apt/sources.list and use the command sudo apt-get update to enable downloading from this component.
Don't forget to read the notice on the frontpage!
deb-src http://cl.naist.jp/~eric-n/ubuntu-nlp hardy nlp
Packages
- freeling
Version: 2.1-beta1-9nlp2~0hardy1 Source (dsc): freeling_2.1-beta1-9nlp2~0hardy1.dsc Source (tar.gz): freeling_2.1-beta1-9nlp2~0hardy1.tar.gz - freeling
Description: an open-source suite of language analyzers More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>Package: freeling_2.1-beta1-9nlp2~0hardy1_amd64.deb - freeling-data
Description: Linguistic data used by FreeLing More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the linguistic data needed by Freeling
Package: freeling-data_2.1-beta1-9nlp2~0hardy1_amd64.deb - freeling-doc
Description: Documentation for FreeLing More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the FreeLing documentation.
Package: freeling-doc_2.1-beta1-9nlp2~0hardy1_all.deb - libmorfo-dev
Description: FreeLing's morfo library (dev) More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the development files of the morph library used by FreeLing.
Package: libmorfo-dev_2.1-beta1-9nlp2~0hardy1_amd64.deb - libmorfo2
Description: FreeLing's morfo library More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the morph library used by FreeLing.
Package: libmorfo2_2.1-beta1-9nlp2~0hardy1_amd64.deb - giza-pp
Version: 1:1.0.5-1nlp1~0hardy1 Source (dsc): giza-pp_1.0.5-1nlp1~0hardy1.dsc Source (tar.gz): giza-pp_1.0.5-1nlp1~0hardy1.tar.gz - giza++
Description: A tool for training statistical alignment models More... GIZA++: Training of statistical translation models.
GIZA++ is an extension of the program GIZA (part of the SMT toolkit EGYPT)
which was developed by the Statistical Machine Translation team during the
summer workshop in 1999 at the Center for Language and Speech Processing
at Johns-Hopkins University (CLSP/JHU). GIZA++ includes a lot of additional
features. The extensions of GIZA++ were designed and written by Franz Josef
Och.About GIZA++
The program includes the following extensions to GIZA:
* IBM Model 4;
* IBM Model 5;
* Alignment models depending on word classes
* Implements the HMM alignment model: Baum-Welch training, Forward-Backward
algorithm, empty word, dependency on word classes, transfer to fertility
models
* Includes a variant of Model 3 and Model 4 which allow the training of the
parameter p_0;
* Various smoothing techniques for fertility, distortion/alignment parameters;
* Significant more efficient training of the fertility models;
* Correct implementation of pegging as described in (Brown et al. 1993), a
series of heuristics in order to make pegging sufficiently efficient;For more information, consult the following publication:
@ARTICLE{och03:asc,
AUTHOR = {Franz Josef Och and Hermann Ney},
TITLE = {A Systematic Comparison of Various Statistical Alignment Models},
JOURNAL= {Computational Linguistics},
NUMBER = 1,
VOLUME = 29,
YEAR = 2.0.2003,
PAGES = {19--51}}or the GIZA++ project homepage <http://www.fjoch.com/GIZA++.html>
Package: giza++_1.0.5-1nlp1~0hardy1_i386.deb Package: giza++_1.0.5-1nlp1~0hardy1_amd64.deb - mkcls
Description: A tool for training statistical alignment models More... mkcls: word class training with maximum likelihood-criterion.
mkcls is a tool to train word classes by using a maximum-likelihood-criterion.
The resulting word classes are especially suited for language models or
statistical translation models. The program mkcls was written by Franz Josef
Och.For more information, consult the following publication:
* Franz Josef Och: "An Efficient Method for Determining Bilingual Word
Classes"; pp. 71-76, Ninth Conf. of the Europ. Chapter of the Association
for Computational Linguistics; EACL'99, Bergen, Norway, June 1999.or the mkcls project homepage <http://www.fjoch.com/mkcls.html>
Package: mkcls_1.0.5-1nlp1~0hardy1_i386.deb Package: mkcls_1.0.5-1nlp1~0hardy1_amd64.deb - libcfg+
Version: 0.6.2-1nlp2~0hardy1 Source (dsc): libcfg+_0.6.2-1nlp2~0hardy1.dsc Source (tar.gz): libcfg+_0.6.2-1nlp2~0hardy1.tar.gz - libcfg+
Description: command line and configuration file parsing library More... libcfg+ is a C library that features multi- command line and configuration
file parsing. It is possible to set up various special properties such as
quoting characters, deliminator strings, file comment prefixes, multi-line
postfixes, and more. It supports many data types such as booleans, integers,
decimal numbers, strings with many additional data type flags (such as
multiple values for a single option).For more information see the project homepage at:
<http://platon.sk/projects/main_page.php?project_id=3>Package: libcfg+_0.6.2-1nlp2~0hardy1_i386.deb Package: libcfg+_0.6.2-1nlp2~0hardy1_amd64.deb - libcfg+-dev
Description: command line and configuration file parsing library (dev) More... libcfg+ is a C library that features multi- command line and configuration
file parsing. It is possible to set up various special properties such as
quoting characters, deliminator strings, file comment prefixes, multi-line
postfixes, and more. It supports many data types such as booleans, integers,
decimal numbers, strings with many additional data type flags (such as
multiple values for a single option).For more information see the project homepage at:
<http://platon.sk/projects/main_page.php?project_id=3>This package contains the libcfg+ headers and other development files.
Package: libcfg+-dev_0.6.2-1nlp2~0hardy1_i386.deb Package: libcfg+-dev_0.6.2-1nlp2~0hardy1_amd64.deb - libfries
Version: 1.0-1nlp4~0hardy1 Source (dsc): libfries_1.0-1nlp4~0hardy1.dsc Source (tar.gz): libfries_1.0-1nlp4~0hardy1.tar.gz - libfries-dev
Description: Feature Retriever for Intensional Encoding of Sentences (dev) More... FRIES provides an expressive feature definition language that enables
the extraction of advanced patterns from input data. FRIES is specially
oriented to encode Natural Language sentences and corpora into feature
vectors.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+fries/>Package: libfries-dev_1.0-1nlp4~0hardy1_i386.deb Package: libfries-dev_1.0-1nlp4~0hardy1_amd64.deb - libfries1
Description: Feature Retriever for Intensional Encoding of Sentences More... FRIES provides an expressive feature definition language that enables
the extraction of advanced patterns from input data. FRIES is specially
oriented to encode Natural Language sentences and corpora into feature
vectors.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+fries/>This package contains the libfries headers and other development files.
Package: libfries1_1.0-1nlp4~0hardy1_i386.deb Package: libfries1_1.0-1nlp4~0hardy1_amd64.deb - libomlet
Version: 1.0-1nlp4~0hardy1 Source (dsc): libomlet_1.0-1nlp4~0hardy1.dsc Source (tar.gz): libomlet_1.0-1nlp4~0hardy1.tar.gz - libomlet-dev
Description: Open-source Machine Learning Extensible Toolkit (dev) More... OMLET provides an extensible framework where new ML algorithms and
techniques can be integrated, tested, and combined.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+fries/>This package contains the libomlet headers and other development files.
Package: libomlet-dev_1.0-1nlp4~0hardy1_i386.deb Package: libomlet-dev_1.0-1nlp4~0hardy1_amd64.deb - libomlet1
Description: Open-source Machine Learning Extensible Toolkit More... OMLET provides an extensible framework where new ML algorithms and
techniques can be integrated, tested, and combined.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+omlet/>Package: libomlet1_1.0-1nlp4~0hardy1_i386.deb Package: libomlet1_1.0-1nlp4~0hardy1_amd64.deb - libtext-meteor-perl
Version: 0.6-2nlp1~0hardy1 Source (dsc): libtext-meteor-perl_0.6-2nlp1~0hardy1.dsc Source (tar.gz): libtext-meteor-perl_0.6-2nlp1~0hardy1.tar.gz - libtext-meteor-perl
Description: The METEOR Automatic Machine Translation Evaluation System More... The METEOR Automatic Machine Translation Evaluation System
METEOR is a system that automatically evaluates the output of machine
translation engines by comparing to them to (one or more) reference
translations. For a given pair of hypothesis and reference strings,
the evaluation proceeds in a sequence of stages, with different criteria
being used at each stage to find and score unigram matches. By default,
at the first stage all exact matches are detected between the two
strings, while in the second stage the words not matched in the first
stage are stemmed using the Porter stemmer and then matches are found
between these stemmed words. For further details, please refer Banerjee &
Lavie,2005The matching system is written in Perl, and each matching stage is
implemented as a separate Perl module. In addition to the two default
matching modules (exact matching and stemmed matching), a WordNet based
stemmed matching module and a WordNet based synonym matching module are
also provided with this distribution. METEOR can be run with the default
modules, or the user can override the defaults, and use one or more of
the given modules in any order of preference. Further, the user can write
his own matching module and plug it into the generic matching system.METEOR's input file format is exactly the same as those of Bleu and
NIST's Machine Translation Evaluation system. Thus all translation data
that can be evaluated using Bleu (such as the TIDES data) can also be
directly evaluated using METEOR. Starting from version 0.5 METEOR can
take as input n-best lists and score them.Package: libtext-meteor-perl_0.6-2nlp1~0hardy1_i386.deb Package: libtext-meteor-perl_0.6-2nlp1~0hardy1_amd64.deb - mgiza++
Version: 0.6.3-1nlp3~0hardy1 Source (dsc): mgiza++_0.6.3-1nlp3~0hardy1.dsc Source (tar.gz): mgiza++_0.6.3-1nlp3~0hardy1.tar.gz - mgiza++
Description: A multi-threaded tool for training statistical alignment models More... Multi-Threaded GIZA++ is an extension to the GIZA++ word aligning tool by
Qin Gao <qing@cs.cmu.edu> of CMU. It can perform much faster training
than origin GIZA++ if you have more than one CPUs. In addition it fixed
some bugs in GIZA, and the final aligning perplexity is generally lower
than the original GIZA++.GIZA++ is an extension of the program GIZA (part of the SMT toolkit EGYPT)
which was developed by the Statistical Machine Translation team during the
summer workshop in 1999 at the Center for Language and Speech Processing
at Johns-Hopkins University (CLSP/JHU). GIZA++ includes a lot of additional
features. The extensions of GIZA++ were designed and written by Franz Josef
Och.About GIZA++
The program includes the following extensions to GIZA:
* IBM Model 4;
* IBM Model 5;
* Alignment models depending on word classes
* Implements the HMM alignment model: Baum-Welch training, Forward-Backward
algorithm, empty word, dependency on word classes, transfer to fertility
models
* Includes a variant of Model 3 and Model 4 which allow the training of the
parameter p_0;
* Various smoothing techniques for fertility, distortion/alignment parameters;
* Significant more efficient training of the fertility models;
* Correct implementation of pegging as described in (Brown et al. 1993), a
series of heuristics in order to make pegging sufficiently efficient;For more information, consult the following publication:
@ARTICLE{och03:asc,
AUTHOR = {Franz Josef Och and Hermann Ney},
TITLE = {A Systematic Comparison of Various Statistical Alignment Models},
JOURNAL= {Computational Linguistics},
NUMBER = 1,
VOLUME = 29,
YEAR = 2.0.2003,
PAGES = {19--51}}or the GIZA++ project homepage <http://www.fjoch.com/GIZA++.html>
or Qin Gao's homepage <http://www.cs.cmu.edu/~qing/>Package: mgiza++_0.6.3-1nlp3~0hardy1_i386.deb Package: mgiza++_0.6.3-1nlp3~0hardy1_amd64.deb - moses
Version: 20101125svn-1nlp4~0hardy1 Source (dsc): moses_20101125svn-1nlp4~0hardy1.dsc Source (tar.gz): moses_20101125svn-1nlp4~0hardy1.tar.gz - moses
Description: Moses: a factored phrase-based beam-search decoder for machine translation More... Moses is a statistical machine translation system that allows you to automatically train translation
models for any language pair. All you need is a collection of translated texts (parallel corpus).
* beam-search: an efficient search algorithm finds quickly the highest probability translation
among the exponential number of choices
* phrase-based: the state-of-the-art in statistical machine translation allows the translation of
short text chunks
* factored: words may have factored representation (surface forms, lemma, part-of-speech,
morphology, word classes...)Features
* Moses is a drop-in replacement for Pharaoh, the popular phrase-based decoder, with many extensions.
* Moses allows the decoding of confusion networks, enabling easy integration with ambiguous
upstream tools, such as automatic speech recognizers
* Moses features novel factored translation models, which enable the integration linguistic and
other information at many stages of the translation processFor more information, visit <http://www.statmt.org/moses/>
Package: moses_20101125svn-1nlp4~0hardy1_i386.deb Package: moses_20101125svn-1nlp4~0hardy1_amd64.deb - moses-doc
Description: Documentation for Moses More... Moses is a statistical machine translation system that allows you to automatically train translation
models for any language pair. All you need is a collection of translated texts (parallel corpus).
* beam-search: an efficient search algorithm finds quickly the highest probability translation
among the exponential number of choices
* phrase-based: the state-of-the-art in statistical machine translation allows the translation of
short text chunks
* factored: words may have factored representation (surface forms, lemma, part-of-speech,
morphology, word classes...)Features
* Moses is a drop-in replacement for Pharaoh, the popular phrase-based decoder, with many extensions.
* Moses allows the decoding of confusion networks, enabling easy integration with ambiguous
upstream tools, such as automatic speech recognizers
* Moses features novel factored translation models, which enable the integration linguistic and
other information at many stages of the translation processThis package contains additional documentation for Moses.
Package: moses-doc_20101125svn-1nlp4~0hardy1_all.deb - mosesmake
Version: 0.0.20091215hg-3nlp2~0hardy1 Source (dsc): mosesmake_0.0.20091215hg-3nlp2~0hardy1.dsc Source (tar.gz): mosesmake_0.0.20091215hg-3nlp2~0hardy1.tar.gz - mosesmake
Description: Makefile utilities for rapid deployment of Moses SMT systems More... Moses Make is a set of makefiles and utilities for automatic setup of Moses SMT systems.
Moses Make will tokenize and annotate data with POS, lemma form, and morphology factors.
Currently, Moses Make supports English, Italian, Japanese, and Spanish, but it can easily be extended to support any language with a POS tagger and morphological analyzer.For more information, see Moses Make's homepage at http://cl.naist.jp/~/eric-n/hg/mosesmake/
Package: mosesmake_0.0.20091215hg-3nlp2~0hardy1_all.deb - python-nltk
Version: 0.9.2-1nlp2~0hardy1 Source (dsc): python-nltk_0.9.2-1nlp2~0hardy1.dsc Source (tar.gz): python-nltk_0.9.2-1nlp2~0hardy1.tar.gz - python-nltk
Description: Natural Language Toolkit More... NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing.NLTK contains Code supporting dozens of NLP tasks, along with
40 popular Corpora and extensive Documentation including a 375-page
online Book.For more information, see the project homepage:
<http://nltk.org>Package: python-nltk_0.9.2-1nlp2~0hardy1_i386.deb Package: python-nltk_0.9.2-1nlp2~0hardy1_amd64.deb - python-nltk-data
Version: 0.9.2-1nlp2~0hardy1 Source (dsc): python-nltk-data_0.9.2-1nlp2~0hardy1.dsc Source (tar.gz): python-nltk-data_0.9.2-1nlp2~0hardy1.tar.gz - python-nltk-data
Description: Natural Language Toolkit Data More... NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing.NLTK contains Code supporting dozens of NLP tasks, along with
40 popular Corpora and extensive Documentation including a 375-page
online Book.For more information, see the project homepage:
<http://nltk.org>This package contains data including corpora for use with NLTK.
Package: python-nltk-data_0.9.2-1nlp2~0hardy1_all.deb - python-nltk-doc
Version: 0.9.2-2nlp1~0hardy1 Source (dsc): python-nltk-doc_0.9.2-2nlp1~0hardy1.dsc Source (tar.gz): python-nltk-doc_0.9.2-2nlp1~0hardy1.tar.gz - python-nltk-doc
Description: Natural Language Toolkit Documentation More... NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing.NLTK contains Code supporting dozens of NLP tasks, along with
40 popular Corpora and extensive Documentation including a 375-page
online Book.For more information, see the project homepage:
<http://nltk.org>This package contains documentation and examples for NLTK.
Package: python-nltk-doc_0.9.2-2nlp1~0hardy1_all.deb - srilm
Version: 1.5.11-1~0hardy1 Source (dsc): srilm_1.5.11-1~0hardy1.dsc Source (tar.gz): srilm_1.5.11-1~0hardy1.tar.gz - srilm
Description: The SRI Language Model Toolkit More... SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and Research
Laboratory since 1995.SRILM consists of the following components:
* A set of C++ class libraries implementing language models, supporting data
stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to perform
standard tasks such as training LMs and testing them on data, tagging or
segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.For more information, visit <http://www.speech.sri.com/projects/srilm/>
Package: srilm_1.5.11-1~0hardy1_i386.deb Package: srilm_1.5.11-1~0hardy1_amd64.deb - srilm-dev
Description: The SRI Language Model Toolkit More... SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and Research
Laboratory since 1995.SRILM consists of the following components:
* A set of C++ class libraries implementing language models, supporting data
stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to perform
standard tasks such as training LMs and testing them on data, tagging or
segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.This package contains headers and other files used for development with SRILM.
Package: srilm-dev_1.5.11-1~0hardy1_i386.deb Package: srilm-dev_1.5.11-1~0hardy1_amd64.deb - srilm-doc
Description: Documentation for the SRI Language Model Toolkit More... SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and Research
Laboratory since 1995.SRILM consists of the following components:
* A set of C++ class libraries implementing language models, supporting data
stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to perform
standard tasks such as training LMs and testing them on data, tagging or
segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.This package contains additional documentation for SRILM.
Package: srilm-doc_1.5.11-1~0hardy1_i386.deb Package: srilm-doc_1.5.11-1~0hardy1_amd64.deb - treetagger
Version: 3.2-3nlp2~0hardy1 Source (dsc): treetagger_3.2-3nlp2~0hardy1.dsc Source (tar.gz): treetagger_3.2-3nlp2~0hardy1.tar.gz - treetagger
Description: a language independent part-of-speech tagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
English, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the TreeTagger binaries and
helper scripts. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger_3.2-3nlp2~0hardy1_i386.deb Package: treetagger_3.2-3nlp2~0hardy1_amd64.deb - treetagger-english
Version: 3.1-1nlp2~0hardy1 Source (dsc): treetagger-english_3.1-1nlp2~0hardy1.dsc Source (tar.gz): treetagger-english_3.1-1nlp2~0hardy1.tar.gz - treetagger-english
Description: English language parameter files for TreeTagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
English, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the parameter files necessary for
tagging Englis data. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger-english_3.1-1nlp2~0hardy1_all.deb - treetagger-italian
Version: 3.1-1nlp1~0hardy1 Source (dsc): treetagger-italian_3.1-1nlp1~0hardy1.dsc Source (tar.gz): treetagger-italian_3.1-1nlp1~0hardy1.tar.gz - treetagger-italian
Description: Italian language parameter files for TreeTagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
Italian, French, Italian, Dutch, Italian, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the parameter files necessary for
tagging Englis data. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger-italian_3.1-1nlp1~0hardy1_all.deb - treetagger-spanish
Version: 3.1-1nlp1~0hardy1 Source (dsc): treetagger-spanish_3.1-1nlp1~0hardy1.dsc Source (tar.gz): treetagger-spanish_3.1-1nlp1~0hardy1.tar.gz - treetagger-spanish
Description: Spanish language parameter files for TreeTagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
Spanish, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the parameter files necessary for
tagging Englis data. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger-spanish_3.1-1nlp1~0hardy1_all.deb