The Ubuntu NLP Repository v8.04LTS
Powered by Falcon
Component all
This is the 'all' metacomponent containing all packages in all components.
You can use apt to download and install the packages. Use the following lines in /etc/apt/sources.list and use the command sudo apt-get update to enable downloading from this component.
Don't forget to read the notice on the frontpage!
deb-src http://cl.naist.jp/~eric-n/ubuntu-nlp hardy all
Packages
- amarok-moodbar
Version: 0.1.2-1nlp3~0hardy1 Source (dsc): amarok-moodbar_0.1.2-1nlp3~0hardy1.dsc Source (tar.gz): amarok-moodbar_0.1.2-1nlp3~0hardy1.tar.gz - amarok-moodbar
Description: Moodbar plugin for Amarok More... The Amarok Moodbar analyzes songs and displays their moods in the Amarok GUI.
Package: amarok-moodbar_0.1.2-1nlp3~0hardy1_i386.deb Package: amarok-moodbar_0.1.2-1nlp3~0hardy1_amd64.deb - cabocha
Version: 0.60-4nlp1~0hardy1 Source (dsc): cabocha_0.60-4nlp1~0hardy1.dsc Source (tar.gz): cabocha_0.60-4nlp1~0hardy1.tar.gz - cabocha
Description: A Japanese dependency/case structure analysis system More... CaboCha is a parser which detects dependency/case structures of
Japanese sentences.Package: cabocha_0.60-4nlp1~0hardy1_i386.deb Package: cabocha_0.60-4nlp1~0hardy1_amd64.deb - cabocha-dic
Description: Dictionaries of Cabocha (EUC-JP) More... CaboCha is a parser which detects dependency/case structures of
Japanese sentences.This package contains EUC-JP formatted CaboCha dictionaries.
Package: cabocha-dic_0.60-4nlp1~0hardy1_all.deb - cabocha-dic-utf8
Description: Dictionaries of Cabocha (UTF-8) More... CaboCha is a parser which detects dependency/case structures of
Japanese sentences.This package contains UTF-8 formatted CaboCha dictionaries.
Package: cabocha-dic-utf8_0.60-4nlp1~0hardy1_all.deb - libcabocha-dev
Description: Header files of Cabocha More... CaboCha is a parser which detects dependency/case structures of
Japanese sentences.Package: libcabocha-dev_0.60-4nlp1~0hardy1_i386.deb Package: libcabocha-dev_0.60-4nlp1~0hardy1_amd64.deb - libcabocha2
Description: Libraries of Cabocha More... CaboCha is a parser which detects dependency/case structures of
Japanese sentences.Package: libcabocha2_0.60-4nlp1~0hardy1_i386.deb Package: libcabocha2_0.60-4nlp1~0hardy1_amd64.deb - chasen
Version: 2.4.4-1nlp1~0hardy1 Source (dsc): chasen_2.4.4-1nlp1~0hardy1.dsc Source (tar.gz): chasen_2.4.4-1nlp1~0hardy1.tar.gz - chasen
Description: a Japanese Morphological Analysis System More... ChaSen is a morphological analysys system. It can segment and
tokenize Japanese text string, and can output with many additional
informations (pronunciation, semantic information, and others).
It will print the result of such an operation to the standard output,
so that it can be either written to a file or further processed.Package: chasen_2.4.4-1nlp1~0hardy1_i386.deb Package: chasen_2.4.4-1nlp1~0hardy1_amd64.deb - chasen-dictutils
Description: a Japanese Morphological Analysis System - utilities for dictionary More... Dictionary utilities for ChaSen.
ChaSen is a morphological analysys system. It can segment and
tokenize Japanese text string, and can output with many additional
informations (pronunciation, semantic information, and others).Package: chasen-dictutils_2.4.4-1nlp1~0hardy1_i386.deb Package: chasen-dictutils_2.4.4-1nlp1~0hardy1_amd64.deb - libchasen-dev
Description: a Japanese Morphological Analysis System (libraries and headers) More... You can use ChaSen library to put ChaSen's module into other programs.
ChaSen is a morphological analysys system. It can segment and
tokenize Japanese text string, and can output with many additional
informations (pronunciation, semantic information, and others).Package: libchasen-dev_2.4.4-1nlp1~0hardy1_i386.deb Package: libchasen-dev_2.4.4-1nlp1~0hardy1_amd64.deb - libchasen0c2a
Description: a Japanese Morphological Analysis System (shared libraries) More... Shared libraries for ChaSen.
ChaSen is a morphological analysys system. It can segment and
tokenize Japanese text string, and can output with many additional
informations (pronunciation, semantic information, and others).Package: libchasen0c2a_2.4.4-1nlp1~0hardy1_i386.deb Package: libchasen0c2a_2.4.4-1nlp1~0hardy1_amd64.deb - chicken
Version: 3.4.0-0.1nlp2~0hardy1 Source (dsc): chicken_3.4.0-0.1nlp2~0hardy1.dsc Source (tar.gz): chicken_3.4.0-0.1nlp2~0hardy1.tar.gz - chicken-bin
Description: A practical and portable Scheme system - compiler More... CHICKEN is a Scheme compiler which compiles a subset of R5RS into C.
It uses the ideas presented in Baker's paper "Cheney on the MTA", and
has a small core and is easily extendable.This package contains the compiler.
Package: chicken-bin_3.4.0-0.1nlp2~0hardy1_i386.deb Package: chicken-bin_3.4.0-0.1nlp2~0hardy1_amd64.deb - libchicken-dev
Description: A practical and portable Scheme system - development More... CHICKEN is a Scheme compiler which compiles a subset of R5RS into C.
It uses the ideas presented in Baker's paper "Cheney on the MTA", and
has a small core and is easily extendable.This package contains the header file and static library for developing
with chicken.Package: libchicken-dev_3.4.0-0.1nlp2~0hardy1_i386.deb Package: libchicken-dev_3.4.0-0.1nlp2~0hardy1_amd64.deb - libchicken3
Description: A practical and portable Scheme system - runtime More... CHICKEN is a Scheme compiler which compiles a subset of R5RS into C.
It uses the ideas presented in Baker's paper "Cheney on the MTA", and
has a small core and is easily extendable.This package contains the shared library needed to run programs using
chicken.Package: libchicken3_3.4.0-0.1nlp2~0hardy1_i386.deb Package: libchicken3_3.4.0-0.1nlp2~0hardy1_amd64.deb - corenlp
Version: 1.3.0-2nlp1~0hardy1 Source (dsc): corenlp_1.3.0-2nlp1~0hardy1.dsc Source (tar.gz): corenlp_1.3.0-2nlp1~0hardy1.tar.gz - corenlp
Description: Stanford CoreNLP: A Suite of Core NLP Tools More... Stanford CoreNLP provides a set of natural language analysis tools
which can take raw English language text input and give the base
forms of words, their parts of speech, whether they are names of
companies, people, etc., normalize dates, times, and numeric
quantities, and mark up the structure of sentences in terms of
phrases and word dependencies, and indicate which noun phrases refer
to the same entities. It provides the foundational building blocks
for higher level text understanding applications.Stanford CoreNLP integrates all our NLP tools for the English
language, including the part-of-speech (POS) tagger, the named entity
recognizer (NER), the parser, and the coreference resolution
system. The goal of this project is to enable people to quickly and
painlessly get complete linguistic annotations of natural language
texts. It is designed to be highly flexible and extensible, i.e.,
with a single option you can change which tools should be enabled and
which should be disabled.The Stanford CoreNLP code is licensed under the GNU General Public
License (v2 or later). Source is included. Note that this is the full
GPL, which allows many free uses, but not its use in distributed
proprietary software. The download is 195 MB and requires Java 1.5+.Package: corenlp_1.3.0-2nlp1~0hardy1_all.deb - crf++
Version: 0.54-1nlp1~0hardy1 Source (dsc): crf++_0.54-1nlp1~0hardy1.dsc Source (tar.gz): crf++_0.54-1nlp1~0hardy1.tar.gz - crf++
Description: Yet Another CRF toolkit More... CRF++ is a simple, customizable, and open source implementation of Conditional
Random Fields (CRFs) for segmenting/labeling sequential data. CRF++ is
designed for generic purpose and will be applied to a variety of NLP tasks,
such as Named Entity Recognition, Information Extraction and Text Chunking.Features:
* Can redefine feature sets
* Written in C++ with STL
* Fast training based on LBFGS, a quasi-newton algorithm for large scale
numerical optimization problem
* Less memory usage both in training and testing
* encoding/decoding in practical time
* Can perform n-best outputs
* Can perform single-best MIRA training
* Can output marginal probabilities for all candidates
* Available as an open source softwareFor more information see the project homepage at <http://crfpp.sourceforge.net/>
Package: crf++_0.54-1nlp1~0hardy1_i386.deb Package: crf++_0.54-1nlp1~0hardy1_amd64.deb - crf++-doc
Description: Documentation for CRF++ More... CRF++ is a simple, customizable, and open source implementation of Conditional
Random Fields (CRFs) for segmenting/labeling sequential data. CRF++ is
designed for generic purpose and will be applied to a variety of NLP tasks,
such as Named Entity Recognition, Information Extraction and Text Chunking.Features:
* Can redefine feature sets
* Written in C++ with STL
* Fast training based on LBFGS, a quasi-newton algorithm for large scale
numerical optimization problem
* Less memory usage both in training and testing
* encoding/decoding in practical time
* Can perform n-best outputs
* Can perform single-best MIRA training
* Can output marginal probabilities for all candidates
* Available as an open source softwareFor more information see the project homepage at <http://crfpp.sourceforge.net/>
This package contains the documentation for CRF++.
Package: crf++-doc_0.54-1nlp1~0hardy1_all.deb - libcrf++-dev
Description: Libraries and headers for development with CRF++ More... CRF++ is a simple, customizable, and open source implementation of Conditional
Random Fields (CRFs) for segmenting/labeling sequential data. CRF++ is
designed for generic purpose and will be applied to a variety of NLP tasks,
such as Named Entity Recognition, Information Extraction and Text Chunking.Features:
* Can redefine feature sets
* Written in C++ with STL
* Fast training based on LBFGS, a quasi-newton algorithm for large scale
numerical optimization problem
* Less memory usage both in training and testing
* encoding/decoding in practical time
* Can perform n-best outputs
* Can perform single-best MIRA training
* Can output marginal probabilities for all candidates
* Available as an open source softwareFor more information see the project homepage at <http://crfpp.sourceforge.net/>
This package contains the libraries and headers for development with CRF++.
Package: libcrf++-dev_0.54-1nlp1~0hardy1_i386.deb Package: libcrf++-dev_0.54-1nlp1~0hardy1_amd64.deb - libcrf++0
Description: CRF++ shared libraries More... CRF++ is a simple, customizable, and open source implementation of Conditional
Random Fields (CRFs) for segmenting/labeling sequential data. CRF++ is
designed for generic purpose and will be applied to a variety of NLP tasks,
such as Named Entity Recognition, Information Extraction and Text Chunking.Features:
* Can redefine feature sets
* Written in C++ with STL
* Fast training based on LBFGS, a quasi-newton algorithm for large scale
numerical optimization problem
* Less memory usage both in training and testing
* encoding/decoding in practical time
* Can perform n-best outputs
* Can perform single-best MIRA training
* Can output marginal probabilities for all candidates
* Available as an open source softwareFor more information see the project homepage at <http://crfpp.sourceforge.net/>
This package contains the CRF++ shared libraries.
Package: libcrf++0_0.54-1nlp1~0hardy1_i386.deb Package: libcrf++0_0.54-1nlp1~0hardy1_amd64.deb - ecl0.9h
Version: 0.9h-1nlp1~0hardy1 Source (dsc): ecl0.9h_0.9h-1nlp1~0hardy1.dsc Source (tar.gz): ecl0.9h_0.9h-1nlp1~0hardy1.tar.gz - ecl0.9h
Description: ECL - a Common-Lisp implementation More... ECL stands for Embeddable Common-Lisp. The ECL project is an effort to modernize
Giuseppe Attardi's ECL environment to produce an implementation of the Common-
Lisp language which complies to the ANSI X3J13 definition of the language.The current ECL implementation features:
* A bytecodes compiler and interpreter.
* A translator to C.
* A UFFI-compatible interface to C code.
* A dynamic loader.
* The possibility to build standalone executables and DLLs.
* The Common-Lisp Object System (CLOS).
* Conditions and restarts for handling errors.
* Sockets as ordinary streams.
* The Gnu Multiprecision library for fast bignum operations.
* A simple conservative mark & sweep garbage collector.
* The Boehm-Weiser garbage collector.The original version was written by Giuseppe Attardi. The current maintainer of
ECLS is Juan Jose Garcia Ripoll, who can be reached at the ECLS mailing list.For more information, visit the ECL project homepage at:
<http://ecls.sourceforge.net/>Package: ecl0.9h_0.9h-1nlp1~0hardy1_i386.deb Package: ecl0.9h_0.9h-1nlp1~0hardy1_amd64.deb - ecl0.9h-doc
Description: Documentation for ECL More... ECL stands for Embeddable Common-Lisp. The ECL project is an effort to modernize
Giuseppe Attardi's ECL environment to produce an implementation of the Common-
Lisp language which complies to the ANSI X3J13 definition of the language.The current ECL implementation features:
* A bytecodes compiler and interpreter.
* A translator to C.
* A UFFI-compatible interface to C code.
* A dynamic loader.
* The possibility to build standalone executables and DLLs.
* The Common-Lisp Object System (CLOS).
* Conditions and restarts for handling errors.
* Sockets as ordinary streams.
* The Gnu Multiprecision library for fast bignum operations.
* A simple conservative mark & sweep garbage collector.
* The Boehm-Weiser garbage collector.The original version was written by Giuseppe Attardi. The current maintainer of
ECLS is Juan Jose Garcia Ripoll, who can be reached at the ECLS mailing list.For more information, visit the ECL project homepage at:
<http://ecls.sourceforge.net/>This package contains documentation for ECL.
Package: ecl0.9h-doc_0.9h-1nlp1~0hardy1_all.deb - erg200707
Version: 1nlp1~0hardy1 Source (dsc): erg200707_1nlp1~0hardy1.dsc Source (tar.gz): erg200707_1nlp1~0hardy1.tar.gz - erg200707
Description: English grammar for use with Cheap and LKB parsers More... The LinGO English Resource Grammar (ERG) is a broad-coverage,
linguistically precise HPSG-based grammar of English. It was
developed initially using the DFKI PAGE system, but the LKB
is now the primary grammar engineering environment. The ERG
is semantically grounded in Minimal Recursion Semantics (MRS),
which is a form of flat semantic representation capable of
supporting underspecification.For more information, see the project homepage:
<http://www.delph-in.net/erg/>
.Package: erg200707_1nlp1~0hardy1_all.deb - erg201004
Version: 1nlp1~0hardy1 Source (dsc): erg201004_1nlp1~0hardy1.dsc Source (tar.gz): erg201004_1nlp1~0hardy1.tar.gz - erg201004
Description: English grammar for use with Cheap and LKB parsers More... The LinGO English Resource Grammar (ERG) is a broad-coverage,
linguistically precise HPSG-based grammar of English. It was
developed initially using the DFKI PAGE system, but the LKB
is now the primary grammar engineering environment. The ERG
is semantically grounded in Minimal Recursion Semantics (MRS),
which is a form of flat semantic representation capable of
supporting underspecification.For more information, see the project homepage:
<http://www.delph-in.net/erg/>
.Package: erg201004_1nlp1~0hardy1_all.deb - falcon
Version: 1.5.4-2nlp1~0hardy1 Source (dsc): falcon_1.5.4-2nlp1~0hardy1.dsc Source (tar.gz): falcon_1.5.4-2nlp1~0hardy1.tar.gz - falcon
Description: Falcon repository creator More... Falcon is a tool that generates the repository meta-information (such as
package listings and release files) in order to transform a set of packages
into a proper repository.Features
* Every subdir of the pool is automatically a component
* Easy to use 'all' metacomponent that contains all packages in all components
* Support for automatically signing the Release file
* Easy template based HTML output for repository indices
* Support for moving all old versions of sources and debs to a morgue dir
* Easy support for complete and partial mirrors
* Support for multiple releases and architectures
* Easy creation of .iso files
* Easy import of sources from other debian repositoriesPackage: falcon_1.5.4-2nlp1~0hardy1_all.deb - freeling
Version: 2.1-beta1-9nlp2~0hardy1 Source (dsc): freeling_2.1-beta1-9nlp2~0hardy1.dsc Source (tar.gz): freeling_2.1-beta1-9nlp2~0hardy1.tar.gz - freeling
Description: an open-source suite of language analyzers More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>Package: freeling_2.1-beta1-9nlp2~0hardy1_amd64.deb - freeling-data
Description: Linguistic data used by FreeLing More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the linguistic data needed by Freeling
Package: freeling-data_2.1-beta1-9nlp2~0hardy1_amd64.deb - freeling-doc
Description: Documentation for FreeLing More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the FreeLing documentation.
Package: freeling-doc_2.1-beta1-9nlp2~0hardy1_all.deb - libmorfo-dev
Description: FreeLing's morfo library (dev) More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the development files of the morph library used by FreeLing.
Package: libmorfo-dev_2.1-beta1-9nlp2~0hardy1_amd64.deb - libmorfo2
Description: FreeLing's morfo library More... The FreeLing package consists of a library providing language analysis services.
FreeLing is designed to be used as an external library from any application
requiring this kind of services. Nevertheless, a simple main program is also
provided as a basic interface to the library, which enables the user to analyze
text files from the command line.Main services offered by FreeLing library:
* Text tokenization
* Sentence splitting
* Morphological analysis
* Suffix treatment, retokenization of clitic pronouns
* Flexible multiword recognition
* Contraction splitting
* Probabilistic prediction of unkown word categories
* Named entity detection
* Recognition of dates, numbers, ratios, currency, and physical magnitudes
(speed, weight, temperature, density, etc.)
* PoS tagging
* Chart-based shallow parsing
* Named entity classification
* WordNet based sense annotation
* Rule-based dependency parsingMost of these services are provided for all currently supported languages:
Spanish Catalan, Galician, Italian, and English.For more information see the project homepage at:
<http://garraf.epsevg.upc.es/freeling/index.php>This package contains the morph library used by FreeLing.
Package: libmorfo2_2.1-beta1-9nlp2~0hardy1_amd64.deb - geniatagger
Version: 3.0.1-2nlp1 Source (dsc): geniatagger_3.0.1-2nlp1.dsc Source (tar.gz): geniatagger_3.0.1-2nlp1.tar.gz - geniatagger
Description: GENIA Tagger - English part-of-speech tagger and shallow parser More... The GENIA tagger analyzes English sentences and outputs the base forms,
part-of-speech tags, and chunk tags. The tagger is specifically tuned
for biomedical text such as MEDLINE abstracts. If you need to extract
information from biomedical documents, this tagger could be a useful
preprocessing tool.This package contains the GENIA Tagger program.
Package: geniatagger_3.0.1-2nlp1_i386.deb Package: geniatagger_3.0.1-2nlp1_amd64.deb - geniatagger-doc
Description: Documentation for the GENIA Tagger More... The GENIA tagger analyzes English sentences and outputs the base forms,
part-of-speech tags, and chunk tags. The tagger is specifically tuned
for biomedical text such as MEDLINE abstracts. If you need to extract
information from biomedical documents, this tagger could be a useful
preprocessing tool.This package contains documentation for the GENIA Tagger.
Package: geniatagger-doc_3.0.1-2nlp1_all.deb - giza-pp
Version: 1:1.0.5-1nlp1~0hardy1 Source (dsc): giza-pp_1.0.5-1nlp1~0hardy1.dsc Source (tar.gz): giza-pp_1.0.5-1nlp1~0hardy1.tar.gz - mkcls
Description: Not yet built! More...
This binary package has not yet been built.- giza++
Description: Not yet built! More...
This binary package has not yet been built.- itsdb
Version: 20080208-2nlp1~0hardy1 Source (dsc): itsdb_20080208-2nlp1~0hardy1.dsc Source (tar.gz): itsdb_20080208-2nlp1~0hardy1.tar.gz - itsdb-doc
Description: The ITSDB documentation More... [incr tsdb()] is a hard to pronounce, powerful tool for profiling and treebanking.
This package contains documentation for ITSDB.
Package: itsdb-doc_20080208-2nlp1~0hardy1_all.deb - libitsdb
Description: The ITSDB library More... [incr tsdb()] is a hard to pronounce, powerful tool for profiling and treebanking.
This package contains the ITSDB documentation.
Package: libitsdb_20080208-2nlp1~0hardy1_i386.deb Package: libitsdb_20080208-2nlp1~0hardy1_amd64.deb - libitsdb-dev
Description: Development files for iTSDB More... [incr tsdb()] is a hard to pronounce, powerful tool for profiling and treebanking.
This package contains files useful in software development using ITSDB.
Package: libitsdb-dev_20080208-2nlp1~0hardy1_i386.deb Package: libitsdb-dev_20080208-2nlp1~0hardy1_amd64.deb - jacy20071022
Version: 1nlp1~0hardy1 Source (dsc): jacy20071022_1nlp1~0hardy1.dsc Source (tar.gz): jacy20071022_1nlp1~0hardy1.tar.gz - jacy20071022
Description: Japanese grammar for use with Cheap and LKB parsers More... The Jacy grammar is a broad-coverage linguistically precise grammar
of Japanese. It is based on the HPSG formalism with MRS semantics.
LKB is the primary grammar development environment, but the grammar
processing can be efficiently done with PET.For more information, see the project homepage:
<http://wiki.delph-in.net/moin/JacyTop/>
.Package: jacy20071022_1nlp1~0hardy1_all.deb - jacy20090705
Version: 1nlp1~0hardy1 Source (dsc): jacy20090705_1nlp1~0hardy1.dsc Source (tar.gz): jacy20090705_1nlp1~0hardy1.tar.gz - jacy20090705
Description: Japanese grammar for use with Cheap and LKB parsers More... The Jacy grammar is a broad-coverage linguistically precise grammar
of Japanese. It is based on the HPSG formalism with MRS semantics.
LKB is the primary grammar development environment, but the grammar
processing can be efficiently done with PET.For more information, see the project homepage:
<http://wiki.delph-in.net/moin/JacyTop/>
.Package: jacy20090705_1nlp1~0hardy1_all.deb - libcfg+
Version: 0.6.2-1nlp2~0hardy1 Source (dsc): libcfg+_0.6.2-1nlp2~0hardy1.dsc Source (tar.gz): libcfg+_0.6.2-1nlp2~0hardy1.tar.gz - libcfg+
Description: command line and configuration file parsing library More... libcfg+ is a C library that features multi- command line and configuration
file parsing. It is possible to set up various special properties such as
quoting characters, deliminator strings, file comment prefixes, multi-line
postfixes, and more. It supports many data types such as booleans, integers,
decimal numbers, strings with many additional data type flags (such as
multiple values for a single option).For more information see the project homepage at:
<http://platon.sk/projects/main_page.php?project_id=3>Package: libcfg+_0.6.2-1nlp2~0hardy1_i386.deb Package: libcfg+_0.6.2-1nlp2~0hardy1_amd64.deb - libcfg+-dev
Description: command line and configuration file parsing library (dev) More... libcfg+ is a C library that features multi- command line and configuration
file parsing. It is possible to set up various special properties such as
quoting characters, deliminator strings, file comment prefixes, multi-line
postfixes, and more. It supports many data types such as booleans, integers,
decimal numbers, strings with many additional data type flags (such as
multiple values for a single option).For more information see the project homepage at:
<http://platon.sk/projects/main_page.php?project_id=3>This package contains the libcfg+ headers and other development files.
Package: libcfg+-dev_0.6.2-1nlp2~0hardy1_i386.deb Package: libcfg+-dev_0.6.2-1nlp2~0hardy1_amd64.deb - libfries
Version: 1.0-1nlp4~0hardy1 Source (dsc): libfries_1.0-1nlp4~0hardy1.dsc Source (tar.gz): libfries_1.0-1nlp4~0hardy1.tar.gz - libfries-dev
Description: Feature Retriever for Intensional Encoding of Sentences (dev) More... FRIES provides an expressive feature definition language that enables
the extraction of advanced patterns from input data. FRIES is specially
oriented to encode Natural Language sentences and corpora into feature
vectors.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+fries/>Package: libfries-dev_1.0-1nlp4~0hardy1_i386.deb Package: libfries-dev_1.0-1nlp4~0hardy1_amd64.deb - libfries1
Description: Feature Retriever for Intensional Encoding of Sentences More... FRIES provides an expressive feature definition language that enables
the extraction of advanced patterns from input data. FRIES is specially
oriented to encode Natural Language sentences and corpora into feature
vectors.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+fries/>This package contains the libfries headers and other development files.
Package: libfries1_1.0-1nlp4~0hardy1_i386.deb Package: libfries1_1.0-1nlp4~0hardy1_amd64.deb - libmecab-perl
Version: 0.98pre3-1nlp1~0hardy1 Source (dsc): libmecab-perl_0.98pre3-1nlp1~0hardy1.dsc Source (tar.gz): libmecab-perl_0.98pre3-1nlp1~0hardy1.tar.gz - libmecab-perl
Description: Perl interface for MeCab More... libmecab-perl is a perl interface for the Japanese morphological analyzer,
MeCab.Mecab is a morphological analysis system. It reads Japanese sentences from
the standard input, segments them into morpheme sequences, and outputs them
to the standard output with many additional pieces of information
(pronunciation, semantic information, etc).For more information, see http://www.chasen.org/~taku/software/mecab/
(available in Japanese only).Package: libmecab-perl_0.98pre3-1nlp1~0hardy1_i386.deb Package: libmecab-perl_0.98pre3-1nlp1~0hardy1_amd64.deb - libomlet
Version: 1.0-1nlp4~0hardy1 Source (dsc): libomlet_1.0-1nlp4~0hardy1.dsc Source (tar.gz): libomlet_1.0-1nlp4~0hardy1.tar.gz - libomlet-dev
Description: Open-source Machine Learning Extensible Toolkit (dev) More... OMLET provides an extensible framework where new ML algorithms and
techniques can be integrated, tested, and combined.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+fries/>This package contains the libomlet headers and other development files.
Package: libomlet-dev_1.0-1nlp4~0hardy1_i386.deb Package: libomlet-dev_1.0-1nlp4~0hardy1_amd64.deb - libomlet1
Description: Open-source Machine Learning Extensible Toolkit More... OMLET provides an extensible framework where new ML algorithms and
techniques can be integrated, tested, and combined.For more information see the project homepage at:
<http://www.lsi.upc.edu/~nlp/omlet+omlet/>Package: libomlet1_1.0-1nlp4~0hardy1_i386.deb Package: libomlet1_1.0-1nlp4~0hardy1_amd64.deb - libtext-chasen-perl
Version: 1.04-4nlp2~0hardy1 Source (dsc): libtext-chasen-perl_1.04-4nlp2~0hardy1.dsc Source (tar.gz): libtext-chasen-perl_1.04-4nlp2~0hardy1.tar.gz - libtext-chasen-perl
Description: Japanese Morphological Analysis System module for Perl More... ChaSen (a Japanese morphological analysis system) module for Perl.
Package: libtext-chasen-perl_1.04-4nlp2~0hardy1_i386.deb Package: libtext-chasen-perl_1.04-4nlp2~0hardy1_amd64.deb - libtext-meteor-perl
Version: 0.6-2nlp1~0hardy1 Source (dsc): libtext-meteor-perl_0.6-2nlp1~0hardy1.dsc Source (tar.gz): libtext-meteor-perl_0.6-2nlp1~0hardy1.tar.gz - libtext-meteor-perl
Description: The METEOR Automatic Machine Translation Evaluation System More... The METEOR Automatic Machine Translation Evaluation System
METEOR is a system that automatically evaluates the output of machine
translation engines by comparing to them to (one or more) reference
translations. For a given pair of hypothesis and reference strings,
the evaluation proceeds in a sequence of stages, with different criteria
being used at each stage to find and score unigram matches. By default,
at the first stage all exact matches are detected between the two
strings, while in the second stage the words not matched in the first
stage are stemmed using the Porter stemmer and then matches are found
between these stemmed words. For further details, please refer Banerjee &
Lavie,2005The matching system is written in Perl, and each matching stage is
implemented as a separate Perl module. In addition to the two default
matching modules (exact matching and stemmed matching), a WordNet based
stemmed matching module and a WordNet based synonym matching module are
also provided with this distribution. METEOR can be run with the default
modules, or the user can override the defaults, and use one or more of
the given modules in any order of preference. Further, the user can write
his own matching module and plug it into the generic matching system.METEOR's input file format is exactly the same as those of Bleu and
NIST's Machine Translation Evaluation system. Thus all translation data
that can be evaluated using Bleu (such as the TIDES data) can also be
directly evaluated using METEOR. Starting from version 0.5 METEOR can
take as input n-best lists and score them.Package: libtext-meteor-perl_0.6-2nlp1~0hardy1_i386.deb Package: libtext-meteor-perl_0.6-2nlp1~0hardy1_amd64.deb - libwordnet-querydata-perl
Version: 1.49-1nlp1~0hardy1 Source (dsc): libwordnet-querydata-perl_1.49-1nlp1~0hardy1.dsc Source (tar.gz): libwordnet-querydata-perl_1.49-1nlp1~0hardy1.tar.gz - libwordnet-querydata-perl
Description: direct perl interface to WordNet database More... WordNet::QueryData provides a direct interface to the WordNet database
files. It requires the WordNet package
(http://www.cogsci.princeton.edu/~wn/). It allows the user direct
access to the full WordNet semantic lexicon. All parts of speech are
supported and access is generally very efficient because the index and
morphical exclusion tables are loaded at initialization. This
initialization step is slow (appx. 10-15 seconds), but queries are
very fast thereafter---thousands of queries can be completed every
second.This description was automagically extracted from the module by dh-make-perl.
Package: libwordnet-querydata-perl_1.49-1nlp1~0hardy1_all.deb - lkb
Version: 0.0.20080208-1nlp1~0hardy1 Source (dsc): lkb_0.0.20080208-1nlp1~0hardy1.dsc Source (tar.gz): lkb_0.0.20080208-1nlp1~0hardy1.tar.gz - lkb-src
Description: LKB source code More... The LKB system is a grammar and lexicon development environment for use with unification-based
linguistic formalisms. While not restricted to HPSG, the LKB implements the DELPH-IN reference
formalism of typed feature structures (jointly with other DELPH-IN software using the
same formalism).The primary documentation on the LKB is provided by the book Implementing Typed Feature
Structure Grammars. Excerpts from the book provide an tour of the LKB and the user manual.This package contains the source code for the LKB system.
Package: lkb-src_0.0.20080208-1nlp1~0hardy1_all.deb - logon
Version: 0.0.1nlp1~0hardy1 Source (dsc): logon_0.0.1nlp1~0hardy1.dsc Source (tar.gz): logon_0.0.1nlp1~0hardy1.tar.gz - logon
Description: LOGON dependency package More... This is a virtual package that installs the dependencies for the LOGON MT system.
Package: logon_0.0.1nlp1~0hardy1_all.deb - mecab
Version: 0.98-1nlp1~0hardy1 Source (dsc): mecab_0.98-1nlp1~0hardy1.dsc Source (tar.gz): mecab_0.98-1nlp1~0hardy1.tar.gz - libmecab-dev
Description: Header files of Mecab More... This package provides header files which are necessary to development
programs using runtime libraries of Mecab, that is a Japanese
morphological analysis system.Package: libmecab-dev_0.98-1nlp1~0hardy1_i386.deb Package: libmecab-dev_0.98-1nlp1~0hardy1_amd64.deb - libmecab1
Description: Libraries of Mecab More... This package provides runtime libraries of Mecab, that is a Japanese
morphological analysis system.Package: libmecab1_0.98-1nlp1~0hardy1_i386.deb Package: libmecab1_0.98-1nlp1~0hardy1_amd64.deb - mecab
Description: Japanese morphological analysis system More... Mecab is a morphological analysis system. It reads Japanese
sentences from the standard input, segments them into morpheme
sequences, and outputs them to the standard output with many
additional pieces of information (pronunciation, semantic
information, etc).Package: mecab_0.98-1nlp1~0hardy1_i386.deb Package: mecab_0.98-1nlp1~0hardy1_amd64.deb - mecab-utils
Description: Support programs of Mecab More... This package provides the dictionary compiler to convert a dictionary
written in text format to a binary data for Mecab, that is a Japanese
morphological analysis system. This package is necessary to install
dictionary packages for Mecab like mecab-jumandic.Package: mecab-utils_0.98-1nlp1~0hardy1_i386.deb Package: mecab-utils_0.98-1nlp1~0hardy1_amd64.deb - mecab-ipadic
Version: 2.7.0-20070801-3nlp1~0hardy1 Source (dsc): mecab-ipadic_2.7.0-20070801-3nlp1~0hardy1.dsc Source (tar.gz): mecab-ipadic_2.7.0-20070801-3nlp1~0hardy1.tar.gz - mecab-ipadic
Description: IPA dictionary compiled for Mecab More... This package provides IPA dictionary converted for Mecab, that is a
Japanese morphological analysis system. This dictionary written in
IPA grammar system.Package: mecab-ipadic_2.7.0-20070801-3nlp1~0hardy1_all.deb - mecab-ipadic-utf8
Description: IPA dictionary encoded in UTF-8 compiled for Mecab More... This package provides IPA dictionary converted for Mecab, that is a
Japanese morphological analysis system. This dictionary written in
IPA grammar system.Package: mecab-ipadic-utf8_2.7.0-20070801-3nlp1~0hardy1_all.deb - mecab-naist-jdic
Version: 0.6.3-20100801-1nlp1~0hardy1 Source (dsc): mecab-naist-jdic_0.6.3-20100801-1nlp1~0hardy1.dsc Source (tar.gz): mecab-naist-jdic_0.6.3-20100801-1nlp1~0hardy1.tar.gz - mecab-naist-jdic
Description: free Japanese Dictionaries for mecab (replacement of mecab-ipadic) in UTF-8 More... NAIST Japanese Dictionary is a Dictionary for MeCab, Japanese morphological
analysis implementation.This is based on mecab-ipadic, however it is released under BSD style license
now by NAIST (Nara Institute of Science and Technology, Japan), so it is free
and can replace mecab-ipadic.This package provides EUC-JP data, not UTF-8.
Package: mecab-naist-jdic_0.6.3-20100801-1nlp1~0hardy1_all.deb - mecab-naist-jdic-utf8
Description: free Japanese Dictionaries for mecab (replacement of mecab-ipadic) in UTF-8 More... NAIST Japanese Dictionary is a Dictionary for MeCab, Japanese morphological
analysis implementation.This is based on mecab-ipadic, however it is released under BSD style license
now by NAIST (Nara Institute of Science and Technology, Japan), so it is free
and can replace mecab-ipadic.
.Package: mecab-naist-jdic-utf8_0.6.3-20100801-1nlp1~0hardy1_all.deb - mgiza++
Version: 0.6.3-1nlp3~0hardy1 Source (dsc): mgiza++_0.6.3-1nlp3~0hardy1.dsc Source (tar.gz): mgiza++_0.6.3-1nlp3~0hardy1.tar.gz - mgiza++
Description: A multi-threaded tool for training statistical alignment models More... Multi-Threaded GIZA++ is an extension to the GIZA++ word aligning tool by
Qin Gao <qing@cs.cmu.edu> of CMU. It can perform much faster training
than origin GIZA++ if you have more than one CPUs. In addition it fixed
some bugs in GIZA, and the final aligning perplexity is generally lower
than the original GIZA++.GIZA++ is an extension of the program GIZA (part of the SMT toolkit EGYPT)
which was developed by the Statistical Machine Translation team during the
summer workshop in 1999 at the Center for Language and Speech Processing
at Johns-Hopkins University (CLSP/JHU). GIZA++ includes a lot of additional
features. The extensions of GIZA++ were designed and written by Franz Josef
Och.About GIZA++
The program includes the following extensions to GIZA:
* IBM Model 4;
* IBM Model 5;
* Alignment models depending on word classes
* Implements the HMM alignment model: Baum-Welch training, Forward-Backward
algorithm, empty word, dependency on word classes, transfer to fertility
models
* Includes a variant of Model 3 and Model 4 which allow the training of the
parameter p_0;
* Various smoothing techniques for fertility, distortion/alignment parameters;
* Significant more efficient training of the fertility models;
* Correct implementation of pegging as described in (Brown et al. 1993), a
series of heuristics in order to make pegging sufficiently efficient;For more information, consult the following publication:
@ARTICLE{och03:asc,
AUTHOR = {Franz Josef Och and Hermann Ney},
TITLE = {A Systematic Comparison of Various Statistical Alignment Models},
JOURNAL= {Computational Linguistics},
NUMBER = 1,
VOLUME = 29,
YEAR = 2.0.2003,
PAGES = {19--51}}or the GIZA++ project homepage <http://www.fjoch.com/GIZA++.html>
or Qin Gao's homepage <http://www.cs.cmu.edu/~qing/>Package: mgiza++_0.6.3-1nlp3~0hardy1_i386.deb Package: mgiza++_0.6.3-1nlp3~0hardy1_amd64.deb - morph
Version: 0.0.20030918-2nlp1~0hardy1 Source (dsc): morph_0.0.20030918-2nlp1~0hardy1.dsc Source (tar.gz): morph_0.0.20030918-2nlp1~0hardy1.tar.gz - morph
Description: English morphological analyzer and generator More... Morpha is a fast and robust morphological analyser for English based
on finite-state techniques that returns the lemma and inflection type
of a word, given the word form and its part of speech. (The latter is
optional but accuracy is degraded if it is not present).Morphg generates a word form given a specification of the lemma,
part-of-speech, and the type of inflection required. Morphg is
derived automatically from morpha, ensuring consistency and
reversability of the tools. An option controls British English or
American English behaviour with respect to consonant doubling.For more information see John Carroll's homepage at:
http://www.informatics.sussex.ac.uk/research/groups/nlp/carroll/morph.htmlPackage: morph_0.0.20030918-2nlp1~0hardy1_all.deb - moses
Version: 20101125svn-1nlp4~0hardy1 Source (dsc): moses_20101125svn-1nlp4~0hardy1.dsc Source (tar.gz): moses_20101125svn-1nlp4~0hardy1.tar.gz - moses
Description: Moses: a factored phrase-based beam-search decoder for machine translation More... Moses is a statistical machine translation system that allows you to automatically train translation
models for any language pair. All you need is a collection of translated texts (parallel corpus).
* beam-search: an efficient search algorithm finds quickly the highest probability translation
among the exponential number of choices
* phrase-based: the state-of-the-art in statistical machine translation allows the translation of
short text chunks
* factored: words may have factored representation (surface forms, lemma, part-of-speech,
morphology, word classes...)Features
* Moses is a drop-in replacement for Pharaoh, the popular phrase-based decoder, with many extensions.
* Moses allows the decoding of confusion networks, enabling easy integration with ambiguous
upstream tools, such as automatic speech recognizers
* Moses features novel factored translation models, which enable the integration linguistic and
other information at many stages of the translation processFor more information, visit <http://www.statmt.org/moses/>
Package: moses_20101125svn-1nlp4~0hardy1_i386.deb Package: moses_20101125svn-1nlp4~0hardy1_amd64.deb - moses-doc
Description: Documentation for Moses More... Moses is a statistical machine translation system that allows you to automatically train translation
models for any language pair. All you need is a collection of translated texts (parallel corpus).
* beam-search: an efficient search algorithm finds quickly the highest probability translation
among the exponential number of choices
* phrase-based: the state-of-the-art in statistical machine translation allows the translation of
short text chunks
* factored: words may have factored representation (surface forms, lemma, part-of-speech,
morphology, word classes...)Features
* Moses is a drop-in replacement for Pharaoh, the popular phrase-based decoder, with many extensions.
* Moses allows the decoding of confusion networks, enabling easy integration with ambiguous
upstream tools, such as automatic speech recognizers
* Moses features novel factored translation models, which enable the integration linguistic and
other information at many stages of the translation processThis package contains additional documentation for Moses.
Package: moses-doc_20101125svn-1nlp4~0hardy1_all.deb - mosesmake
Version: 0.0.20091215hg-3nlp2~0hardy1 Source (dsc): mosesmake_0.0.20091215hg-3nlp2~0hardy1.dsc Source (tar.gz): mosesmake_0.0.20091215hg-3nlp2~0hardy1.tar.gz - mosesmake
Description: Makefile utilities for rapid deployment of Moses SMT systems More... Moses Make is a set of makefiles and utilities for automatic setup of Moses SMT systems.
Moses Make will tokenize and annotate data with POS, lemma form, and morphology factors.
Currently, Moses Make supports English, Italian, Japanese, and Spanish, but it can easily be extended to support any language with a POS tagger and morphological analyzer.For more information, see Moses Make's homepage at http://cl.naist.jp/~/eric-n/hg/mosesmake/
Package: mosesmake_0.0.20091215hg-3nlp2~0hardy1_all.deb - pet
Version: 0.99.14-1~20080917svn-1nlp1~0hardy1 Source (dsc): pet_0.99.14-1~20080917svn-1nlp1~0hardy1.dsc Source (tar.gz): pet_0.99.14-1~20080917svn-1nlp1~0hardy1.tar.gz - pet-cheap
Description: The Cheap unification parser More... The PET system for efficient processing of unification-based grammars is an industrial strength
implementation of the typed feature structure formalism used in DELPH-IN grammars. PET reads the
exact same source files (modulo some configuration options) as the LKB grammar development
environment and produces identical results. In a nutshell, PET can be viewed as a high-efficiency
batch processing and application delivery engine, while the LKB mainly targets interactive grammar
development.PET was originally developed by UlrichCallmeier at DFKI GmbH and Saarland University, and some
of its design is documented in his 2001 MSc thesis. The software subsequently served to build a
commercial email auto response product (by YY Technologies, Mountain View, CA), ported to Windows
NT, generally "hardened" (eliminating memory leakage, increasing robustness to exceptional
situations, et al.), and extended in functionality and interfaces (including UniCode support,
unknown word support, server and API library modes, lattice input, and initial MRS support); most
of this work was done by Ulrich with help from Stephan Oepen and Bernd Kiefer (of DFKI). As part
of the EU-funded Deep Thought project, Ulrich and Stephan later added support for subsumption-
based ambiguity factoring (giving a significant improvement in parsing efficiency for long inputs),
facilities to rank alternate parses according to a statistical (Maximum Entropy) parse selection
model (which, typically, one would obtain using the Redwoods tools and a hand-constructed treebank),
and the ability to compile in the (Common-Lisp) MRS code base also used in the LKB, thus enabling
output of (R)MRSs in various standard formats.Towards the end of 2003, Ulrich retired from active PET development, and Bernd has since been the
main developer (with occasional help from others, specifically Frederik Fouvry of Saarland
University and Stephan). PET has seen a range of substantial additions in functionality since,
including the ability to add (leaf) types at run-time, output fragmentary analysis hypotheses in
case of parse failures, and an XML-based input format that generalizes the lattice-oriented YY
input mode.For more information on PET and the DELPH-IN project, visit <http://www.delph-in.net/>
This package contains the Cheap chart parser from PET.
Package: pet-cheap_0.99.14-1~20080917svn-1nlp1~0hardy1_i386.deb Package: pet-cheap_0.99.14-1~20080917svn-1nlp1~0hardy1_amd64.deb - pet-doc
Description: Documentation for PET More... The PET system for efficient processing of unification-based grammars is an industrial strength
implementation of the typed feature structure formalism used in DELPH-IN grammars. PET reads the
exact same source files (modulo some configuration options) as the LKB grammar development
environment and produces identical results. In a nutshell, PET can be viewed as a high-efficiency
batch processing and application delivery engine, while the LKB mainly targets interactive grammar
development.PET was originally developed by UlrichCallmeier at DFKI GmbH and Saarland University, and some
of its design is documented in his 2001 MSc thesis. The software subsequently served to build a
commercial email auto response product (by YY Technologies, Mountain View, CA), ported to Windows
NT, generally "hardened" (eliminating memory leakage, increasing robustness to exceptional
situations, et al.), and extended in functionality and interfaces (including UniCode support,
unknown word support, server and API library modes, lattice input, and initial MRS support); most
of this work was done by Ulrich with help from Stephan Oepen and Bernd Kiefer (of DFKI). As part
of the EU-funded Deep Thought project, Ulrich and Stephan later added support for subsumption-
based ambiguity factoring (giving a significant improvement in parsing efficiency for long inputs),
facilities to rank alternate parses according to a statistical (Maximum Entropy) parse selection
model (which, typically, one would obtain using the Redwoods tools and a hand-constructed treebank),
and the ability to compile in the (Common-Lisp) MRS code base also used in the LKB, thus enabling
output of (R)MRSs in various standard formats.Towards the end of 2003, Ulrich retired from active PET development, and Bernd has since been the
main developer (with occasional help from others, specifically Frederik Fouvry of Saarland
University and Stephan). PET has seen a range of substantial additions in functionality since,
including the ability to add (leaf) types at run-time, output fragmentary analysis hypotheses in
case of parse failures, and an XML-based input format that generalizes the lattice-oriented YY
input mode.For more information on PET and the DELPH-IN project, visit <http://www.delph-in.net/>
This package contains the documentation for PET.
Package: pet-doc_0.99.14-1~20080917svn-1nlp1~0hardy1_all.deb - pet-flop
Description: The Flop unification grammar preprocessor More... The PET system for efficient processing of unification-based grammars is an industrial strength
implementation of the typed feature structure formalism used in DELPH-IN grammars. PET reads the
exact same source files (modulo some configuration options) as the LKB grammar development
environment and produces identical results. In a nutshell, PET can be viewed as a high-efficiency
batch processing and application delivery engine, while the LKB mainly targets interactive grammar
development.PET was originally developed by UlrichCallmeier at DFKI GmbH and Saarland University, and some
of its design is documented in his 2001 MSc thesis. The software subsequently served to build a
commercial email auto response product (by YY Technologies, Mountain View, CA), ported to Windows
NT, generally "hardened" (eliminating memory leakage, increasing robustness to exceptional
situations, et al.), and extended in functionality and interfaces (including UniCode support,
unknown word support, server and API library modes, lattice input, and initial MRS support); most
of this work was done by Ulrich with help from Stephan Oepen and Bernd Kiefer (of DFKI). As part
of the EU-funded Deep Thought project, Ulrich and Stephan later added support for subsumption-
based ambiguity factoring (giving a significant improvement in parsing efficiency for long inputs),
facilities to rank alternate parses according to a statistical (Maximum Entropy) parse selection
model (which, typically, one would obtain using the Redwoods tools and a hand-constructed treebank),
and the ability to compile in the (Common-Lisp) MRS code base also used in the LKB, thus enabling
output of (R)MRSs in various standard formats.Towards the end of 2003, Ulrich retired from active PET development, and Bernd has since been the
main developer (with occasional help from others, specifically Frederik Fouvry of Saarland
University and Stephan). PET has seen a range of substantial additions in functionality since,
including the ability to add (leaf) types at run-time, output fragmentary analysis hypotheses in
case of parse failures, and an XML-based input format that generalizes the lattice-oriented YY
input mode.For more information on PET and the DELPH-IN project, visit <http://www.delph-in.net/>
This package contains the Flop grammar pre-processor from PET.
Package: pet-flop_0.99.14-1~20080917svn-1nlp1~0hardy1_i386.deb Package: pet-flop_0.99.14-1~20080917svn-1nlp1~0hardy1_amd64.deb - plan9port
Version: 20100805-1nlp1 Source (dsc): plan9port_20100805-1nlp1.dsc Source (tar.gz): plan9port_20100805-1nlp1.tar.gz - plan9port
Description: A port of many Plan 9 programs from their native Plan 9 environment to Unix-like operating systems. More... Plan 9 from User Space (aka plan9port) is a port of many Plan 9 programs
from their native Plan 9 environment to Unix-like operating systems.The Plan 9 toolset is installed in /usr/share/plan9 and is not added to the system path by default.
To access the toolset, add the following to your profile environment:PLAN9=/usr/share/plan9 export PLAN9
PATH=$PATH:$PLAN9/bin export PATHTo get started see the overview in intro(1) by running "9 man 1 intro" after defining $PLAN9.
For more information, see the plan9port project homepage at: <http://swtch.com/plan9port/>Package: plan9port_20100805-1nlp1_i386.deb Package: plan9port_20100805-1nlp1_amd64.deb - python-chasen
Version: 0.01-1nlp2~0hardy1 Source (dsc): python-chasen_0.01-1nlp2~0hardy1.dsc Source (tar.gz): python-chasen_0.01-1nlp2~0hardy1.tar.gz - python-chasen
Description: Python interface to ChaSen More... python-chasen is a python interface for the Japanese morphological analyzer,
ChaSen.ChaSen is a morphological analysis system. It reads Japanese sentences from
the standard input, segments them into morpheme sequences, and outputs them
to the standard output with many additional pieces of information
(pronunciation, semantic information, etc).For more information, see http://www.chasen.org/ (available in Japanese only).
Package: python-chasen_0.01-1nlp2~0hardy1_i386.deb Package: python-chasen_0.01-1nlp2~0hardy1_amd64.deb - python-mecab
Version: 0.98pre3-1nlp1~0hardy1 Source (dsc): python-mecab_0.98pre3-1nlp1~0hardy1.dsc Source (tar.gz): python-mecab_0.98pre3-1nlp1~0hardy1.tar.gz - python-mecab
Description: Python interface to MeCab More... python-mecab is a python interface for the Japanese morphological analyzer,
MeCab.Mecab is a morphological analysis system. It reads Japanese sentences from
the standard input, segments them into morpheme sequences, and outputs them
to the standard output with many additional pieces of information
(pronunciation, semantic information, etc).For more information, see http://mecab.sourceforge.jp/
(available in Japanese only).Package: python-mecab_0.98pre3-1nlp1~0hardy1_i386.deb Package: python-mecab_0.98pre3-1nlp1~0hardy1_amd64.deb - python-nltk
Version: 0.9.2-1nlp2~0hardy1 Source (dsc): python-nltk_0.9.2-1nlp2~0hardy1.dsc Source (tar.gz): python-nltk_0.9.2-1nlp2~0hardy1.tar.gz - python-nltk
Description: Natural Language Toolkit More... NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing.NLTK contains Code supporting dozens of NLP tasks, along with
40 popular Corpora and extensive Documentation including a 375-page
online Book.For more information, see the project homepage:
<http://nltk.org>Package: python-nltk_0.9.2-1nlp2~0hardy1_i386.deb Package: python-nltk_0.9.2-1nlp2~0hardy1_amd64.deb - python-nltk-data
Version: 0.9.2-1nlp2~0hardy1 Source (dsc): python-nltk-data_0.9.2-1nlp2~0hardy1.dsc Source (tar.gz): python-nltk-data_0.9.2-1nlp2~0hardy1.tar.gz - python-nltk-data
Description: Natural Language Toolkit Data More... NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing.NLTK contains Code supporting dozens of NLP tasks, along with
40 popular Corpora and extensive Documentation including a 375-page
online Book.For more information, see the project homepage:
<http://nltk.org>This package contains data including corpora for use with NLTK.
Package: python-nltk-data_0.9.2-1nlp2~0hardy1_all.deb - python-nltk-doc
Version: 0.9.2-2nlp1~0hardy1 Source (dsc): python-nltk-doc_0.9.2-2nlp1~0hardy1.dsc Source (tar.gz): python-nltk-doc_0.9.2-2nlp1~0hardy1.tar.gz - python-nltk-doc
Description: Natural Language Toolkit Documentation More... NLTK — the Natural Language Toolkit — is a suite of open source
Python modules, data and documentation for research and development
in natural language processing.NLTK contains Code supporting dozens of NLP tasks, along with
40 popular Corpora and extensive Documentation including a 375-page
online Book.For more information, see the project homepage:
<http://nltk.org>This package contains documentation and examples for NLTK.
Package: python-nltk-doc_0.9.2-2nlp1~0hardy1_all.deb - python-pywordnet
Version: 2.0.1-2nlp1~0hardy1 Source (dsc): python-pywordnet_2.0.1-2nlp1~0hardy1.dsc Source (tar.gz): python-pywordnet_2.0.1-2nlp1~0hardy1.tar.gz - python-pywordnet
Description: Python interface to WordNet 2.0 More... PyWordNet is a Python interface to the WordNet database
of word meanings and lexical relationships[1].PyWordNet presents a concise interface to WordNet,
that allows the user to type expressions such as
N['dog'], hyponyms(N['dog'][0]), and
closure(ADJ['red'], SYNONYM) to query the database.>>> N['dog']
dog(n.)
>>> N['dog'].getSenses()
('dog' in {noun: dog, domestic dog, Canis familiaris},
'dog' in {noun: frump, dog}, 'dog' in {noun: dog},
'dog' in {noun: cad, bounder, blackguard, dog, hound, heel},
'dog' in {noun: pawl, detent, click, dog},
'dog' in {noun: andiron, firedog, dog, dogiron})For more information, see http://sourceforge.net/projects/pywordnet
Package: python-pywordnet_2.0.1-2nlp1~0hardy1_i386.deb Package: python-pywordnet_2.0.1-2nlp1~0hardy1_amd64.deb - python-romkan
Version: 0.02-3nlp1~0hardy1 Source (dsc): python-romkan_0.02-3nlp1~0hardy1.dsc Source (tar.gz): python-romkan_0.02-3nlp1~0hardy1.tar.gz - python-romkan
Description: Romaji <-> Kana conversion module for Python More... python-romkan is a Python rewrite of the Text::Romkan Romaji <-> Kana conversion Perl module.
Package: python-romkan_0.02-3nlp1~0hardy1_i386.deb Package: python-romkan_0.02-3nlp1~0hardy1_amd64.deb - srilm
Version: 1.5.11-1~0hardy1 Source (dsc): srilm_1.5.11-1~0hardy1.dsc Source (tar.gz): srilm_1.5.11-1~0hardy1.tar.gz - srilm
Description: The SRI Language Model Toolkit More... SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and Research
Laboratory since 1995.SRILM consists of the following components:
* A set of C++ class libraries implementing language models, supporting data
stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to perform
standard tasks such as training LMs and testing them on data, tagging or
segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.For more information, visit <http://www.speech.sri.com/projects/srilm/>
Package: srilm_1.5.11-1~0hardy1_i386.deb Package: srilm_1.5.11-1~0hardy1_amd64.deb - srilm-dev
Description: The SRI Language Model Toolkit More... SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and Research
Laboratory since 1995.SRILM consists of the following components:
* A set of C++ class libraries implementing language models, supporting data
stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to perform
standard tasks such as training LMs and testing them on data, tagging or
segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.This package contains headers and other files used for development with SRILM.
Package: srilm-dev_1.5.11-1~0hardy1_i386.deb Package: srilm-dev_1.5.11-1~0hardy1_amd64.deb - srilm-doc
Description: Documentation for the SRI Language Model Toolkit More... SRILM is a toolkit for building and applying statistical language models (LMs),
primarily for use in speech recognition, statistical tagging and segmentation.
It has been under development in the SRI Speech Technology and Research
Laboratory since 1995.SRILM consists of the following components:
* A set of C++ class libraries implementing language models, supporting data
stuctures and miscellaneous utility functions.
* A set of executable programs built on top of these libraries to perform
standard tasks such as training LMs and testing them on data, tagging or
segmenting text, etc.
* A collection of miscellaneous scripts facilitating minor related tasks.This package contains additional documentation for SRILM.
Package: srilm-doc_1.5.11-1~0hardy1_i386.deb Package: srilm-doc_1.5.11-1~0hardy1_amd64.deb - tinysvm
Version: 0.09-3nlp2~0hardy1 Source (dsc): tinysvm_0.09-3nlp2~0hardy1.dsc Source (tar.gz): tinysvm_0.09-3nlp2~0hardy1.tar.gz - libtinysvm-dev
Description: Libraries and headers for development with TinySVM More... TinySVM is an implementation of Support Vector Machines (SVMs) [Vapnik 95], [Vapnik 98] for the problem of pattern recognition.
Support Vector Machines is a new generation learning algorithms based on recent advances in statistical learning theory, and
applied to large number of real-world applications, such as text categorization, hand-written character recognition.Package: libtinysvm-dev_0.09-3nlp2~0hardy1_i386.deb Package: libtinysvm-dev_0.09-3nlp2~0hardy1_amd64.deb - libtinysvm1
Description: TinySVM shared libraries More... TinySVM is an implementation of Support Vector Machines (SVMs) [Vapnik 95], [Vapnik 98] for the problem of pattern recognition.
Support Vector Machines is a new generation learning algorithms based on recent advances in statistical learning theory, and
applied to large number of real-world applications, such as text categorization, hand-written character recognition.Package: libtinysvm1_0.09-3nlp2~0hardy1_i386.deb Package: libtinysvm1_0.09-3nlp2~0hardy1_amd64.deb - tinysvm
Description: An implementation of Support Vector Machines (SVMs) More... TinySVM is an implementation of Support Vector Machines (SVMs) [Vapnik 95], [Vapnik 98] for the problem of pattern recognition.
Support Vector Machines is a new generation learning algorithms based on recent advances in statistical learning theory, and
applied to large number of real-world applications, such as text categorization, hand-written character recognition.Package: tinysvm_0.09-3nlp2~0hardy1_i386.deb Package: tinysvm_0.09-3nlp2~0hardy1_amd64.deb - tinysvm-doc
Description: Documentation for TinySVM More... TinySVM is an implementation of Support Vector Machines (SVMs) [Vapnik 95], [Vapnik 98] for the problem of pattern recognition.
Support Vector Machines is a new generation learning algorithms based on recent advances in statistical learning theory, and
applied to large number of real-world applications, such as text categorization, hand-written character recognition.Package: tinysvm-doc_0.09-3nlp2~0hardy1_all.deb - treetagger
Version: 3.2-3nlp2~0hardy1 Source (dsc): treetagger_3.2-3nlp2~0hardy1.dsc Source (tar.gz): treetagger_3.2-3nlp2~0hardy1.tar.gz - treetagger
Description: a language independent part-of-speech tagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
English, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the TreeTagger binaries and
helper scripts. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger_3.2-3nlp2~0hardy1_i386.deb Package: treetagger_3.2-3nlp2~0hardy1_amd64.deb - treetagger-english
Version: 3.1-1nlp2~0hardy1 Source (dsc): treetagger-english_3.1-1nlp2~0hardy1.dsc Source (tar.gz): treetagger-english_3.1-1nlp2~0hardy1.tar.gz - treetagger-english
Description: English language parameter files for TreeTagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
English, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the parameter files necessary for
tagging Englis data. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger-english_3.1-1nlp2~0hardy1_all.deb - treetagger-italian
Version: 3.1-1nlp1~0hardy1 Source (dsc): treetagger-italian_3.1-1nlp1~0hardy1.dsc Source (tar.gz): treetagger-italian_3.1-1nlp1~0hardy1.tar.gz - treetagger-italian
Description: Italian language parameter files for TreeTagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
Italian, French, Italian, Dutch, Italian, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the parameter files necessary for
tagging Englis data. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger-italian_3.1-1nlp1~0hardy1_all.deb - treetagger-spanish
Version: 3.1-1nlp1~0hardy1 Source (dsc): treetagger-spanish_3.1-1nlp1~0hardy1.dsc Source (tar.gz): treetagger-spanish_3.1-1nlp1~0hardy1.tar.gz - treetagger-spanish
Description: Spanish language parameter files for TreeTagger More... The TreeTagger is a tool for annotating text with part-of-speech and
lemma information which has been developed within the TC project at
the Institute for Computational Linguistics of the University of
Stuttgart. The TreeTagger has been successfully used to tag German,
Spanish, French, Italian, Dutch, Spanish, Bulgarian, Russian, Greek,
Portuguese, Chinese and old French texts and is easily adaptable to
other languages if a lexicon and a manually tagged training corpus
are available.This package downloads and installs the parameter files necessary for
tagging Englis data. The source code for the TreeTagger has not been
released but its license permits free use "for research purposes."
Installation of this package implies consent with its terms. For the
full text of the license, see
http://www.ims.uni-stuttgart.de/~schmid/Tagger-Licence or the
TreeTagger's homepage at
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/Package: treetagger-spanish_3.1-1nlp1~0hardy1_all.deb - utool
Version: 3.1.1-1nlp1~0hardy1 Source (dsc): utool_3.1.1-1nlp1~0hardy1.dsc Source (tar.gz): utool_3.1.1-1nlp1~0hardy1.tar.gz - utool
Description: utool: The Swiss Army Knife of Underspecification More... utool is a tool that performs various tasks related to scope
underspecification, a particular technique in computational
semantics of natural language. It implements the most efficient
known solver for underspecified descriptions, will convert
between various underspecification formalisms, and does a
variety of helper jobs.utool is implemented in Java, and is thus efficient and portable.
It can be used as a command-line tool, via a GUI, or as a server.
It is well documented and distributed under the GPL.utool was created within the CHORUS project at Saarland University
(Computational Linguistics and Programming Systems Lab). The
primary developers are Alexander Koller, Stefan Thater, and
Michaela Regneri, with help by Marco Kuhlmann.This package contains the utool jar file.
Package: utool_3.1.1-1nlp1~0hardy1_all.deb - utool-doc
Description: Documentation for utool More... utool is a tool that performs various tasks related to scope
underspecification, a particular technique in computational
semantics of natural language. It implements the most efficient
known solver for underspecified descriptions, will convert
between various underspecification formalisms, and does a
variety of helper jobs.utool is implemented in Java, and is thus efficient and portable.
It can be used as a command-line tool, via a GUI, or as a server.
It is well documented and distributed under the GPL.utool was created within the CHORUS project at Saarland University
(Computational Linguistics and Programming Systems Lab). The
primary developers are Alexander Koller, Stefan Thater, and
Michaela Regneri, with help by Marco Kuhlmann.This package contains documentation for utool.
Package: utool-doc_3.1.1-1nlp1~0hardy1_all.deb - yamcha
Version: 0.33-1nlp2~0hardy1 Source (dsc): yamcha_0.33-1nlp2~0hardy1.dsc Source (tar.gz): yamcha_0.33-1nlp2~0hardy1.tar.gz - libyamcha-dev
Description: Header files of YamCha More... YamCha is a general-purpose tagger based on SVMs.
Package: libyamcha-dev_0.33-1nlp2~0hardy1_i386.deb Package: libyamcha-dev_0.33-1nlp2~0hardy1_amd64.deb - libyamcha1
Description: Libraries of YamCha More... YamCha is a general-purpose tagger based on SVMs.
Package: libyamcha1_0.33-1nlp2~0hardy1_i386.deb Package: libyamcha1_0.33-1nlp2~0hardy1_amd64.deb - yamcha
Description: Yet Another Multipurpose CHunk Annotator More... YamCha stands for Yet Another Multipurpose CHunk Annotator, which is
designed as a general-purpose tagger based on SVMs.Package: yamcha_0.33-1nlp2~0hardy1_i386.deb Package: yamcha_0.33-1nlp2~0hardy1_amd64.deb