ucto 0.30-3 source package in Ubuntu

Changelog

ucto (0.30-3) unstable; urgency=medium

  * Team upload
  * Move to unstable

 -- Bastian Germann <email address hidden>  Tue, 07 May 2024 18:58:42 +0000

Upload details

Uploaded by:
Debian Science Team
Uploaded to:
Sid
Original maintainer:
Debian Science Team
Architectures:
any
Section:
science
Urgency:
Medium Urgency

See full publishing history Publishing

Series Pocket Published Component Section

Downloads

File Size SHA-256 Checksum
ucto_0.30-3.dsc 2.0 KiB ac2ed740e1f453592ebab3bdb34cfca59fa94eea9ce5e1421c9c5ed1c6e6fdae
ucto_0.30.orig.tar.gz 403.8 KiB c7b32976648fa484cf81770f19ff35f260819b518928087ee35c94d6c11e6a02
ucto_0.30-3.debian.tar.xz 5.8 KiB 7cec00e8f74d47a11f63d9072c8ed7e91d9dd6fe1e33388a869e2c6807dffdf8

No changes file available.

Binary packages built by this source

libucto-dev: Unicode Tokenizer - development

 Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
 punctuation, split sentences, generate n-grams), and offers several other
 basic preprocessing steps that make your text suited for further processing
 such as indexing, part-of-speech tagging, or machine translation.
 .
 This package provides C++ headers for the programming library.
 .
 Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto
 was funded by NWO, the Netherlands Organisation for Scientific Research,
 under the Implicit Linguistics project, the CLARIN-NL program, and the
 CLARIAH project.
 .
 Ucto is a product of the Centre of Language and Speech Technology (Radboud
 University Nijmegen), the KNAW Humanities Cluster, and previously the ILK
 Research Group (Tilburg University, The Netherlands).
 .
 If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
 do scientific research in natural language processing, ucto will likely be of
 use to you.

libucto6: Unicode Tokenizer - runtime

 Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
 punctuation, split sentences, generate n-grams), and offers several other
 basic preprocessing steps that make your text suited for further processing
 such as indexing, part-of-speech tagging, or machine translation.
 .
 This package provides the programming library only.
 .
 Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto
 was funded by NWO, the Netherlands Organisation for Scientific Research,
 under the Implicit Linguistics project, the CLARIN-NL program, and the
 CLARIAH project.
 .
 Ucto is a product of the Centre of Language and Speech Technology (Radboud
 University Nijmegen), the KNAW Humanities Cluster, and previously the ILK
 Research Group (Tilburg University, The Netherlands).
 .
 If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
 do scientific research in natural language processing, ucto will likely be of
 use to you.

libucto6-dbgsym: debug symbols for libucto6
ucto: Unicode Tokenizer

 Ucto can tokenize UTF-8 encoded text files (i.e. separate words from
 punctuation, split sentences, generate n-grams), and offers several other
 basic preprocessing steps that make your text suited for further processing
 such as indexing, part-of-speech tagging, or machine translation.
 .
 This package provides the command-line tool itself.
 .
 Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto
 was funded by NWO, the Netherlands Organisation for Scientific Research,
 under the Implicit Linguistics project, the CLARIN-NL program, and the
 CLARIAH project.
 .
 Ucto is a product of the Centre of Language and Speech Technology (Radboud
 University Nijmegen), and previously the ILK Research Group
 (Tilburg University, The Netherlands).
 .
 If you are interested in machine parsing of UTF-8 encoded text files, e.g. to
 do scientific research in natural language processing, ucto will likely be of
 use to you.

ucto-dbgsym: debug symbols for ucto