The Ubuntu NLP Repository v8.04LTS
Powered by Falcon
Component english
NLP tools for processing English.
You can use apt to download and install the packages. Use the following lines in /etc/apt/sources.list and use the command sudo apt-get update to enable downloading from this component.
Don't forget to read the notice on the frontpage!
deb-src http://cl.naist.jp/~eric-n/ubuntu-nlp hardy english
Packages
- corenlp
Version: 1.3.0-2nlp1~0hardy1 Source (dsc): corenlp_1.3.0-2nlp1~0hardy1.dsc Source (tar.gz): corenlp_1.3.0-2nlp1~0hardy1.tar.gz - corenlp
Description: Stanford CoreNLP: A Suite of Core NLP Tools More... Stanford CoreNLP provides a set of natural language analysis tools
which can take raw English language text input and give the base
forms of words, their parts of speech, whether they are names of
companies, people, etc., normalize dates, times, and numeric
quantities, and mark up the structure of sentences in terms of
phrases and word dependencies, and indicate which noun phrases refer
to the same entities. It provides the foundational building blocks
for higher level text understanding applications.Stanford CoreNLP integrates all our NLP tools for the English
language, including the part-of-speech (POS) tagger, the named entity
recognizer (NER), the parser, and the coreference resolution
system. The goal of this project is to enable people to quickly and
painlessly get complete linguistic annotations of natural language
texts. It is designed to be highly flexible and extensible, i.e.,
with a single option you can change which tools should be enabled and
which should be disabled.The Stanford CoreNLP code is licensed under the GNU General Public
License (v2 or later). Source is included. Note that this is the full
GPL, which allows many free uses, but not its use in distributed
proprietary software. The download is 195 MB and requires Java 1.5+.Package: corenlp_1.3.0-2nlp1~0hardy1_all.deb - geniatagger
Version: 3.0.1-2nlp1 Source (dsc): geniatagger_3.0.1-2nlp1.dsc Source (tar.gz): geniatagger_3.0.1-2nlp1.tar.gz - geniatagger
Description: GENIA Tagger - English part-of-speech tagger and shallow parser More... The GENIA tagger analyzes English sentences and outputs the base forms,
part-of-speech tags, and chunk tags. The tagger is specifically tuned
for biomedical text such as MEDLINE abstracts. If you need to extract
information from biomedical documents, this tagger could be a useful
preprocessing tool.This package contains the GENIA Tagger program.
Package: geniatagger_3.0.1-2nlp1_i386.deb Package: geniatagger_3.0.1-2nlp1_amd64.deb - geniatagger-doc
Description: Documentation for the GENIA Tagger More... The GENIA tagger analyzes English sentences and outputs the base forms,
part-of-speech tags, and chunk tags. The tagger is specifically tuned
for biomedical text such as MEDLINE abstracts. If you need to extract
information from biomedical documents, this tagger could be a useful
preprocessing tool.This package contains documentation for the GENIA Tagger.
Package: geniatagger-doc_3.0.1-2nlp1_all.deb - libwordnet-querydata-perl
Version: 1.49-1nlp1~0hardy1 Source (dsc): libwordnet-querydata-perl_1.49-1nlp1~0hardy1.dsc Source (tar.gz): libwordnet-querydata-perl_1.49-1nlp1~0hardy1.tar.gz - libwordnet-querydata-perl
Description: direct perl interface to WordNet database More... WordNet::QueryData provides a direct interface to the WordNet database
files. It requires the WordNet package
(http://www.cogsci.princeton.edu/~wn/). It allows the user direct
access to the full WordNet semantic lexicon. All parts of speech are
supported and access is generally very efficient because the index and
morphical exclusion tables are loaded at initialization. This
initialization step is slow (appx. 10-15 seconds), but queries are
very fast thereafter---thousands of queries can be completed every
second.This description was automagically extracted from the module by dh-make-perl.
Package: libwordnet-querydata-perl_1.49-1nlp1~0hardy1_all.deb - morph
Version: 0.0.20030918-2nlp1~0hardy1 Source (dsc): morph_0.0.20030918-2nlp1~0hardy1.dsc Source (tar.gz): morph_0.0.20030918-2nlp1~0hardy1.tar.gz - morph
Description: English morphological analyzer and generator More... Morpha is a fast and robust morphological analyser for English based
on finite-state techniques that returns the lemma and inflection type
of a word, given the word form and its part of speech. (The latter is
optional but accuracy is degraded if it is not present).Morphg generates a word form given a specification of the lemma,
part-of-speech, and the type of inflection required. Morphg is
derived automatically from morpha, ensuring consistency and
reversability of the tools. An option controls British English or
American English behaviour with respect to consonant doubling.For more information see John Carroll's homepage at:
http://www.informatics.sussex.ac.uk/research/groups/nlp/carroll/morph.htmlPackage: morph_0.0.20030918-2nlp1~0hardy1_all.deb - python-pywordnet
Version: 2.0.1-2nlp1~0hardy1 Source (dsc): python-pywordnet_2.0.1-2nlp1~0hardy1.dsc Source (tar.gz): python-pywordnet_2.0.1-2nlp1~0hardy1.tar.gz - python-pywordnet
Description: Python interface to WordNet 2.0 More... PyWordNet is a Python interface to the WordNet database
of word meanings and lexical relationships[1].PyWordNet presents a concise interface to WordNet,
that allows the user to type expressions such as
N['dog'], hyponyms(N['dog'][0]), and
closure(ADJ['red'], SYNONYM) to query the database.>>> N['dog']
dog(n.)
>>> N['dog'].getSenses()
('dog' in {noun: dog, domestic dog, Canis familiaris},
'dog' in {noun: frump, dog}, 'dog' in {noun: dog},
'dog' in {noun: cad, bounder, blackguard, dog, hound, heel},
'dog' in {noun: pawl, detent, click, dog},
'dog' in {noun: andiron, firedog, dog, dogiron})For more information, see http://sourceforge.net/projects/pywordnet
Package: python-pywordnet_2.0.1-2nlp1~0hardy1_i386.deb Package: python-pywordnet_2.0.1-2nlp1~0hardy1_amd64.deb