Port details on branch 2022Q2
- ucto Advanced rule-based (regular-expression) and unicode-aware tokenizer
- 0.24.1_2 textproc
- Version of this port present on the latest quarterly branch: 0.24.1_2
- Maintainer: yuri@FreeBSD.org
- Port Added: 2022-04-24 04:16:20
- Last Update: 2022-04-10 19:47:23
- Commit Hash: 035e778
- License: APACHE20
- WWW:
- https://languagemachines.github.io/ucto/
- Description:
- Ucto tokenizes text files: it separates words from punctuation and splits
sentences. It also offers several other basic preprocessing steps, such as
changing case, all of which you can use to make your text suitable for further
processing such as indexing, part-of-speech tagging, or machine translation.
Ucto comes with tokenization rules for several languages and can easily be
extended to suit other languages. It has been incorporated for tokenizing Dutch
text in Frog, our Dutch morpho-syntactic processor.
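To make the description above concrete, here is a minimal Python sketch of regular-expression tokenization with naive sentence splitting and optional lowercasing. It is not ucto's implementation and does not use ucto's rule files; the names `TOKEN_RE`, `SENTENCE_END`, and `tokenize` are hypothetical, and the rules are deliberately simplistic (abbreviations such as "e.g." would be split incorrectly).

```python
import re

# Hypothetical rules for illustration only; these are not ucto's rules.
# A token is either a word (optionally with internal apostrophes or hyphens)
# or a single punctuation character. In Python 3, \w is Unicode-aware.
TOKEN_RE = re.compile(r"\w+(?:['’-]\w+)*|[^\w\s]")
SENTENCE_END = {".", "!", "?"}


def tokenize(text, lowercase=False):
    """Return a list of sentences, each a list of word/punctuation tokens."""
    sentences, current = [], []
    for match in TOKEN_RE.finditer(text):
        token = match.group(0)
        current.append(token.lower() if lowercase else token)
        # Naive rule: any sentence-final punctuation closes the sentence.
        if token in SENTENCE_END:
            sentences.append(current)
            current = []
    if current:  # trailing tokens without final punctuation
        sentences.append(current)
    return sentences
```

For example, `tokenize("Hello, world! It's fine.")` yields two sentences, with the comma and the final punctuation as separate tokens and "It's" kept whole.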
- To install the port:
- cd /usr/ports/textproc/ucto/ && make install clean
- To add the package, run one of these commands:
- pkg install textproc/ucto
- pkg install ucto
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
- PKGNAME: ucto
- Flavors: there is no flavor information for this port.
- distinfo:
- TIMESTAMP = 1640970217
SHA256 (LanguageMachines-ucto-v0.24.1_GH0.tar.gz) = f386c3a1f000255153c52044e64257789b301428f525711aeaccfc020ff38827
SIZE (LanguageMachines-ucto-v0.24.1_GH0.tar.gz) = 399511
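The distinfo entries above record the expected SHA256 digest and size of the distribution file. As a rough sketch of how such entries could be checked by hand, the Python below parses distinfo-style lines and compares them against a local file. The function names `parse_distinfo` and `verify` are hypothetical, and the ports framework performs its own checksum verification; this is only an illustration of the idea.

```python
import hashlib
import os
import re


def parse_distinfo(text):
    """Return {filename: {"SHA256": hex_digest, "SIZE": int}} from distinfo text."""
    entries = {}
    for line in text.splitlines():
        # Matches lines such as: SHA256 (name.tar.gz) = abcdef...
        m = re.match(r"\s*(SHA256|SIZE) \((.+)\) = (\S+)\s*$", line)
        if m:
            key, name, value = m.groups()
            entries.setdefault(name, {})[key] = int(value) if key == "SIZE" else value
    return entries


def verify(path, expected):
    """Check a local file against one parsed distinfo entry."""
    if os.path.getsize(path) != expected["SIZE"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large distfiles are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(1 << 16), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected["SHA256"]
```

Given the distinfo text, `parse_distinfo(text)["LanguageMachines-ucto-v0.24.1_GH0.tar.gz"]` returns the expected digest and size, which can then be passed to `verify` together with the path of a downloaded copy.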
- No package information for this port in our database. Sometimes this happens. Not all ports have packages.
- Dependencies
- NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
- Build dependencies:
- autoconf-archive>0 : devel/autoconf-archive
- uctodata>0 : textproc/uctodata
- gmake>=4.3 : devel/gmake
- pkgconf>=1.3.0_1 : devel/pkgconf
- autoconf>=2.69 : devel/autoconf
- automake>=1.16.1 : devel/automake
- libtoolize : devel/libtool
- Runtime dependencies:
- uctodata>0 : textproc/uctodata
- Library dependencies:
- libexttextcat-2.0.so : textproc/libexttextcat
- libfolia.so : textproc/libfolia
- libicuio.so : devel/icu
- libticcutils.so : devel/ticcutils
- libxml2.so : textproc/libxml2
- libedit.so.0 : devel/libedit
- libreadline.so.8 : devel/readline
- There are no ports dependent upon this port
Configuration Options:
- No options to configure
- Options name:
- textproc_ucto
- USES:
- autoreconf compiler:c++11-lang gmake gnome libedit libtool pkgconfig readline
- FreshPorts was unable to extract/find any pkg message
- Master Sites:
Number of commits found: 1
Commit History (may be incomplete: for full details, see links to repositories near top of page)
- Commit: 0.24.1_2, 10 Apr 2022 19:47:23
- Credits: Charlie Li (vishwin)
- Log message:
textproc/libxml2: bump all LIB_DEPENDS consumers
This is a separate and direct commit to quarterly as PORTREVISIONs
may not match from main.
PR: 262853, 262940, 262877
Approved by: fluffy (mentor)