Port details on branch 2022Q2 |
- py-ocrmypdf Adds an OCR text layer to scanned PDF files
- 13.4.7 textproc =0 13.4.7Version of this port present on the latest quarterly branch.
- Maintainer: kai@FreeBSD.org
- Port Added: 2022-04-24 04:16:01
- Last Update: 2022-06-18 14:35:34
- Commit Hash: 0ac71c6
- Also Listed In: python
- License: MPL20
- WWW:
- https://github.com/ocrmypdf/OCRmyPDF
- Description:
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be
searched or copy-pasted.
Main features:
* Generates a searchable PDF/A file from a regular PDF
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a "lossless" operation without
disrupting any other content
* Optimizes PDF images, often producing files smaller than the input file
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Distributes work across all available CPU cores
* Uses Tesseract OCR engine to recognize more than 100 languages
* Scales properly to handle files with thousands of pages
* Battle-tested on millions of PDFs
WWW: https://github.com/ocrmypdf/OCRmyPDF
- ¦ ¦ ¦ ¦
- Manual pages:
- FreshPorts has no man page information for this port.
- pkg-plist: as obtained via:
make generate-plist - There is no configure plist information for this port.
- Dependency lines:
-
- ${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}
- To install the port:
- cd /usr/ports/textproc/py-ocrmypdf/ && make install clean
- To add the package, run one of these commands:
- pkg install textproc/py-ocrmypdf
- pkg install py38-ocrmypdf
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above. NOTE: This is a Python port. Instead of py38-ocrmypdf listed in the above command, you can pick from the names under the Packages section.- PKGNAME: py38-ocrmypdf
- Package flavors (<flavor>: <package>)
- distinfo:
- TIMESTAMP = 1655549092
SHA256 (ocrmypdf-13.4.7.tar.gz) = 8a0a2fa07cf0aac4dea11990d27a15b552afa7ff2dfffdb322bfd8bd0b77751d
SIZE (ocrmypdf-13.4.7.tar.gz) = 7899982
No package information for this port in our database- Sometimes this happens. Not all ports have packages.
- Dependencies
- NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
- Build dependencies:
-
- py38-setuptools_scm>=0 : devel/py-setuptools_scm@py38
- py38-setuptools_scm_git_archive>=0 : devel/py-setuptools_scm_git_archive@py38
- py38-setuptools>0 : devel/py-setuptools@py38
- python3.8 : lang/python38
- Runtime dependencies:
-
- py38-coloredlogs>=14.0 : devel/py-coloredlogs@py38
- py38-img2pdf>=0.3.0<0.5 : graphics/py-img2pdf@py38
- py38-packaging>=20 : devel/py-packaging@py38
- py38-pillow>=8.2.0 : graphics/py-pillow@py38
- py38-tqdm>=4 : misc/py-tqdm@py38
- py38-reportlab>=3.5.66 : print/py-reportlab@py38
- py38-pdfminer.six>=20191110 : textproc/py-pdfminer.six@py38
- py38-pikepdf>=4.0.0 : textproc/py-pikepdf@py38
- py38-pluggy>=0.13.0<2.0 : devel/py-pluggy@py38
- tesseract : graphics/tesseract
- pngquant : graphics/pngquant
- unpaper : graphics/unpaper
- ghostscript9-agpl-base>=9.16_2 : print/ghostscript9-agpl-base
- py38-setuptools>0 : devel/py-setuptools@py38
- python3.8 : lang/python38
- py38-importlib-resources>=5 : devel/py-importlib-resources@py38
- There are no ports dependent upon this port
Configuration Options:
- ===> The following configuration options are available for py38-ocrmypdf-13.4.7:
PNGQUANT=on: Optimizes the encoding of PNG-style images in PDFs
UNPAPER=on: Deskew and clean up pages before OCR processing
===> Use 'make config' to modify these settings
- Options name:
- textproc_py-ocrmypdf
- USES:
- ghostscript:run python:3.7+ shebangfix
- FreshPorts was unable to extract/find any pkg message
- Master Sites:
|
Number of commits found: 2
Number of commits found: 2
|