Port details |
- py-ocrmypdf Adds an OCR text layer to scanned PDF files
- 14.2.1 textproc
- 14.0.4: Version of this port present on the latest quarterly branch.
- Maintainer: kai@FreeBSD.org
- Port Added: 2019-07-12 15:08:19
- Last Update: 2023-06-04 06:51:33
- Commit Hash: 3b4f49f
- Also Listed In: python
- License: MPL20
- Description:
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be
searched or copy-pasted.
Main features:
* Generates a searchable PDF/A file from a regular PDF
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a "lossless" operation without
disrupting any other content
* Optimizes PDF images, often producing files smaller than the input file
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Distributes work across all available CPU cores
* Uses Tesseract OCR engine to recognize more than 100 languages
* Scales properly to handle files with thousands of pages
* Battle-tested on millions of PDFs
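As a rough illustration of the features listed above, the following Python sketch assembles an `ocrmypdf` command line. The helper name `build_ocrmypdf_command` and the file names are hypothetical and not part of the port; the flags `-l`, `--deskew`, `--clean`, and `--jobs` are those of the upstream command-line tool, and `--clean` assumes unpaper is installed.

```python
def build_ocrmypdf_command(src, dst, languages=("eng",), deskew=False,
                           clean=False, jobs=None):
    """Assemble an ocrmypdf command line (hypothetical helper, not part of the port)."""
    # Tesseract language codes are joined with "+", e.g. "eng+deu".
    cmd = ["ocrmypdf", "-l", "+".join(languages)]
    if deskew:
        cmd.append("--deskew")   # straighten skewed pages before OCR
    if clean:
        cmd.append("--clean")    # clean pages with unpaper before OCR
    if jobs is not None:
        cmd += ["--jobs", str(jobs)]  # cap the number of worker processes
    cmd += [src, dst]
    return cmd


if __name__ == "__main__":
    # File names are placeholders chosen for this example.
    print(" ".join(build_ocrmypdf_command("scan.pdf", "scan_ocr.pdf", deskew=True)))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` once the package is installed and the input file exists.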
- pkg-plist: as obtained via: make generate-plist
- There is no configure plist information for this port.
- Dependency lines:
-
- ${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}
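The dependency line above contains two make variables. A minimal sketch of how it expands for one flavor follows, assuming that `PYTHON_PKGNAMEPREFIX` is the flavor name followed by a hyphen (consistent with the `py39-ocrmypdf` package name on this page). The function `expand_dependency` is hypothetical and only mimics the substitution; it is not how the ports framework itself works.

```python
from string import Template

DEPENDENCY_LINE = "${PYTHON_PKGNAMEPREFIX}ocrmypdf>0:textproc/py-ocrmypdf@${PY_FLAVOR}"


def expand_dependency(line, flavor):
    """Expand the two make variables for a flavor such as 'py39' (illustrative only)."""
    expanded = Template(line).substitute(
        PYTHON_PKGNAMEPREFIX=f"{flavor}-",  # assumption: prefix is "<flavor>-"
        PY_FLAVOR=flavor,
    )
    # The part before ":" is the package requirement, the part after is the port origin.
    package_spec, origin = expanded.split(":", 1)
    return package_spec, origin


print(expand_dependency(DEPENDENCY_LINE, "py39"))
# prints ('py39-ocrmypdf>0', 'textproc/py-ocrmypdf@py39')
```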
- To install the port:
- cd /usr/ports/textproc/py-ocrmypdf/ && make install clean
- To add the package, run one of these commands:
- pkg install textproc/py-ocrmypdf
- pkg install py39-ocrmypdf
NOTE: If this package has multiple flavors (see below), then use one of them instead of the name specified above.
NOTE: This is a Python port. Instead of py39-ocrmypdf listed in the above command, you can pick from the names under the Packages section.
- PKGNAME: py39-ocrmypdf
- Package flavors (<flavor>: <package>)
- distinfo:
- TIMESTAMP = 1685725110
SHA256 (ocrmypdf-14.2.1.tar.gz) = ebe0fb54e3174b49e53f823a27b191fa7a53862c94b6ee3c005f8a6528a4126c
SIZE (ocrmypdf-14.2.1.tar.gz) = 6516930
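The SHA256 and SIZE values above can be used to check a downloaded distfile by hand. The sketch below is one way to do that with the Python standard library; the function name `verify_distfile` and the local file path are assumptions made for this example, and the ports framework performs its own check during `make checksum`.

```python
import hashlib
import os

# Values copied from the distinfo entries above.
EXPECTED_SHA256 = "ebe0fb54e3174b49e53f823a27b191fa7a53862c94b6ee3c005f8a6528a4126c"
EXPECTED_SIZE = 6516930


def verify_distfile(path, expected_sha256, expected_size):
    """Return True if the file at `path` matches both the SIZE and SHA256 values."""
    # Compare the size first, since it is cheap and catches truncated downloads.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256
```

For example, `verify_distfile("ocrmypdf-14.2.1.tar.gz", EXPECTED_SHA256, EXPECTED_SIZE)` would return `True` only for an intact copy of the tarball in the current directory.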
No package information for this port in our database
- Sometimes this happens. Not all ports have packages.
- Dependencies
- NOTE: FreshPorts displays only information on required and default dependencies. Optional dependencies are not covered.
- Build dependencies:
-
- py39-setuptools>=61 : devel/py-setuptools@py39
- py39-setuptools_scm>=0 : devel/py-setuptools_scm@py39
- py39-wheel>=0 : devel/py-wheel@py39
- python3.9 : lang/python39
- py39-build>=0 : devel/py-build@py39
- py39-installer>=0 : devel/py-installer@py39
- Test dependencies:
-
- py39-hypothesis>=6.0.0 : devel/py-hypothesis@py39
- py39-pytest-xdist>=2.2.0 : devel/py-pytest-xdist@py39
- py39-python-xmp-toolkit>=2.0.1 : textproc/py-python-xmp-toolkit@py39
- py39-pytest>=7,1 : devel/py-pytest@py39
- python3.9 : lang/python39
- Runtime dependencies:
-
- py39-coloredlogs>=14.0 : devel/py-coloredlogs@py39
- py39-deprecation>=2.1.0 : devel/py-deprecation@py39
- py39-img2pdf>=0.3.0 : graphics/py-img2pdf@py39
- py39-packaging>=20 : devel/py-packaging@py39
- py39-pillow>=8.2.0 : graphics/py-pillow@py39
- py39-tqdm>=4 : misc/py-tqdm@py39
- py39-reportlab>=3.5.66 : print/py-reportlab@py39
- py39-pdfminer.six>=20201018 : textproc/py-pdfminer.six@py39
- py39-pikepdf>=5.0.1 : textproc/py-pikepdf@py39
- py39-pluggy>=0.13.0 : devel/py-pluggy@py39
- tesseract : graphics/tesseract
- pngquant : graphics/pngquant
- unpaper : graphics/unpaper
- ghostscript9-agpl-base>=9.16_2 : print/ghostscript9-agpl-base
- python3.9 : lang/python39
- py39-typing-extensions>=4 : devel/py-typing-extensions@py39
- There are no ports dependent upon this port
Configuration Options:
- ===> The following configuration options are available for py39-ocrmypdf-14.2.1:
PNGQUANT=on: Optimizes the encoding of PNG-style images in PDFs
UNPAPER=on: Deskew and clean up pages before OCR processing
===> Use 'make config' to modify these settings
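Both options above pull in an external program at runtime. A small sketch for checking whether those programs are present on a given system follows; the mapping of option names to executable names (`pngquant`, `unpaper`) is an assumption based on the runtime dependencies listed on this page, and `find_optional_tools` is a hypothetical helper.

```python
import shutil

# Option name -> executable assumed to be provided by the matching dependency.
OPTIONAL_TOOLS = {"PNGQUANT": "pngquant", "UNPAPER": "unpaper"}


def find_optional_tools(tools=OPTIONAL_TOOLS):
    """Map each option name to the executable's path, or None if it is not on PATH."""
    return {option: shutil.which(binary) for option, binary in tools.items()}


for option, path in find_optional_tools().items():
    print(f"{option}: {path or 'not found on PATH'}")
```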
- Options name:
- textproc_py-ocrmypdf
- USES:
- ghostscript:run python:3.8+ shebangfix
- FreshPorts was unable to extract/find any pkg message
- Master Sites:
|
Commit History - (may be incomplete: for full details, see links to repositories near top of page) |
Commit | Credits | Log message |
14.2.1 04 Jun 2023 06:51:33
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 14.2.1
* Simplify Makefile by switching to USE_PYTHON=pytest.
Changelog since 14.1.0:
https://github.com/ocrmypdf/OCRmyPDF/blob/v14.2.1/docs/release_notes.rst |
14.1.0_1 25 Apr 2023 15:17:15
    |
Christian Weisgerber (naddy)  |
audio/opus: bump consumers after update to 1.4 |
14.1.0 16 Apr 2023 16:01:30
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 14.1.0
* Remove no longer required USE_LOCALE as the UTF-8 mode is enabled by
default when the locale is C since Python 3.7.
Changelog:
https://github.com/ocrmypdf/OCRmyPDF/blob/v14.1.0/docs/release_notes.rst |
14.0.4_1 04 Apr 2023 18:29:11
    |
Antoine Brodin (antoine)  |
USE_PYTHON=pep517: bump PORTREVISION |
14.0.4 21 Mar 2023 12:23:18
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 14.0.4
Changelog:
https://github.com/ocrmypdf/OCRmyPDF/blob/v14.0.4/docs/release_notes.rst |
14.0.3 24 Feb 2023 05:53:05
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 14.0.3
* Switch to PEP-517 build framework.
Changelogs since 13.7.0:
https://github.com/ocrmypdf/OCRmyPDF/blob/v14.0.3/docs/release_notes.rst
MFH: No (PEP-517 support not present in 2023Q1 branch) |
13.7.0 11 Jan 2023 15:58:34
    |
Dmitry Marakasov (amdmi3)  |
*/*: rename CHEESESHOP to PYPI in MASTER_SITES
PR: 267994
Differential revision: D37518
Approved by: bapt |
13.7.0 22 Oct 2022 17:26:59
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 13.7.0
There is already a new major release (= 14.x) out, but the build system
has been changed here, which therefore takes a little more time to
implement. So use the last release of the 13.x series for a while.
Changelog since 13.6.0:
https://github.com/ocrmypdf/OCRmyPDF/blob/v13.7.0/docs/release_notes.rst
MFH: 2022Q4 |
07 Sep 2022 21:58:51
    |
Stefan Eßer (se)  |
Remove WWW entries moved into port Makefiles
Commit b7f05445c00f has added WWW entries to port Makefiles based on
WWW: lines in pkg-descr files.
This commit removes the WWW: lines of moved-over URLs from these
pkg-descr files.
Approved by: portmgr (tcberner) |
13.6.0 07 Sep 2022 21:10:59
    |
Stefan Eßer (se)  |
Add WWW entries to port Makefiles
It has been common practice to have one or more URLs at the end of the
ports' pkg-descr files, one per line and prefixed with "WWW:". These
URLs should point at a project website or other relevant resources.
Access to these URLs required processing of the pkg-descr files, and
they have often become stale over time. If more than one such URL was
present in a pkg-descr file, only the first one was transferred into
the port INDEX, but for many ports only the last line did contain the
port specific URL to further information.
There have been several proposals to make a project URL available as
a macro in the ports' Makefiles, over time.
(Only the first 15 lines of the commit message are shown above ) |
13.6.0 09 Jul 2022 11:51:17
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 13.6.0
Changelog since 13.4.7:
https://github.com/ocrmypdf/OCRmyPDF/blob/v13.6.0/docs/release_notes.rst
MFH: 2022Q3 |
13.4.7 18 Jun 2022 14:32:16
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 13.4.7
* Update WWW field to spare redirects.
Changelog since 13.4.2:
https://github.com/ocrmypdf/OCRmyPDF/blob/v13.4.7/docs/release_notes.rst
MFH: 2022Q2 |
13.4.2 10 Apr 2022 10:39:26
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 13.4.2
* Remove a copy/paste leftover from the previous commit.
Changelog:
https://github.com/ocrmypdf/OCRmyPDF/blob/v13.4.2/docs/release_notes.rst
MFH: 2022Q2 |
13.4.1 21 Mar 2022 16:42:47
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 13.4.1
Changelog since 12.7.2:
https://github.com/ocrmypdf/OCRmyPDF/blob/v13.4.1/docs/release_notes.rst
MFH: No (feature release) |
12.7.2 07 Nov 2021 10:03:57
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 12.7.2
Changelog since 12.4.0:
https://github.com/jbarlow83/OCRmyPDF/blob/v12.7.2/docs/release_notes.rst |
12.4.0 13 Sep 2021 07:54:21
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 12.4.0
Changelog since 12.3.2:
https://github.com/jbarlow83/OCRmyPDF/blob/v12.4.0/docs/release_notes.rst |
12.3.2 10 Aug 2021 15:42:26
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 12.3.2
Changelog since 12.2.0:
https://github.com/jbarlow83/OCRmyPDF/blob/v12.3.2/docs/release_notes.rst |
12.2.0 05 Jul 2021 10:30:38
    |
Kai Knoblich (kai)  |
textproc/py-ocrmypdf: Update to 12.2.0
Changelogs since 11.7.3:
https://github.com/jbarlow83/OCRmyPDF/blob/v12.2.0/docs/release_notes.rst
MFH: No (feature release) |
11.7.3 07 Apr 2021 08:09:01
    |
Mathieu Arnold (mat)  |
One more small cleanup, forgotten yesterday.
Reported by: lwhsu |
11.7.3 06 Apr 2021 14:31:07
    |
Mathieu Arnold (mat)  |
Remove # $FreeBSD$ from Makefiles. |
11.7.3 24 Mar 2021 14:10:42
  |
kai  |
textproc/py-ocrmypdf: Update to 11.7.3
Changelog since 11.6.2:
https://github.com/jbarlow83/OCRmyPDF/blob/v11.7.3/docs/release_notes.rst |
11.6.2 27 Feb 2021 17:53:36
  |
kai  |
textproc/py-ocrmypdf: Update to 11.6.2
Changelog since 11.6.0:
https://github.com/jbarlow83/OCRmyPDF/blob/v11.6.2/docs/release_notes.rst |
11.6.0 01 Feb 2021 12:53:18
  |
kai  |
textproc/py-ocrmypdf: Update to 11.6.0
Changelog since 11.3.1:
https://github.com/jbarlow83/OCRmyPDF/blob/v11.6.0/docs/release_notes.rst |
11.3.1 30 Oct 2020 07:28:15
  |
kai  |
textproc/py-ocrmypdf: Update to 11.3.1
* The license was changed by upstream with the 11.0.0 release.
Adapt LICENSE accordingly.
Changelog since 10.3.1:
https://github.com/jbarlow83/OCRmyPDF/blob/v11.3.1/docs/release_notes.rst |
10.3.1 29 Jul 2020 06:21:41
  |
kai  |
textproc/py-ocrmypdf: Update to 10.3.1
Changelog since 10.2.0:
https://github.com/jbarlow83/OCRmyPDF/blob/v10.3.1/docs/release_notes.rst |
10.2.0 06 Jul 2020 06:39:04
  |
kai  |
textproc/py-ocrmypdf: Update to 10.2.0
Changelog since 9.8.2:
https://github.com/jbarlow83/OCRmyPDF/blob/v10.2.0/docs/release_notes.rst |
9.8.2 22 Jun 2020 10:08:59
  |
kai  |
textproc/py-ocrmypdf: Fix typo in COMMENT
Reported by: Frank Ebert (via private e-mail) |
9.8.2 09 Jun 2020 11:16:29
  |
kai  |
textproc/py-ocrmypdf: Update to 9.8.2
* Remove unnecessary pytest-cov dependency from TEST_DEPENDS while I'm here.
Changelog since 9.8.0:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.8.2/docs/release_notes.rst |
9.8.0 30 Apr 2020 08:45:34
  |
kai  |
textproc/py-ocrmypdf: Update to 9.8.0
* Pet portclippy while I'm here by placing related variables to the USES
block.
Changelog:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.8.0/docs/release_notes.rst |
9.7.1 14 Apr 2020 11:39:58
  |
kai  |
textproc/py-ocrmypdf: Update to 9.7.1
Changelog:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.7.1/docs/release_notes.rst |
9.7.0 10 Apr 2020 10:03:01
  |
kai  |
textproc/py-ocrmypdf: Update to 9.7.0
Changelog:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.7.0/docs/release_notes.rst
MFH: 2020Q2 (fix regressions with Ghostscript >= 9.52) |
9.6.0 24 Mar 2020 19:54:57
  |
antoine  |
Convert the tree to use PY_PILLOW
With hat: portmgr
Originally submitted by: kai |
9.6.0 17 Feb 2020 08:42:11
  |
kai  |
textproc/py-ocrmypdf: Update to 9.6.0
Changelog since 9.3.0:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.6.0/docs/release_notes.rst
MFH: 2020Q1 (blanket: regression fix) |
9.3.0 02 Jan 2020 19:14:49
  |
kai  |
textproc/py-ocrmypdf: Update to 9.3.0
Changelog since 9.1.1:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.3.0/docs/release_notes.rst
MFH: 2020Q1 |
9.1.1 22 Nov 2019 10:20:47
  |
kai  |
textproc/py-ocrmypdf: Update to 9.1.1
Changelog since 9.0.5:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.1.1/docs/release_notes.rst |
9.0.5 13 Nov 2019 16:26:14
  |
kai  |
textproc/py-ocrmypdf: Update to 9.0.5
* Remove the workaround that coped with the files that were generated in the
CFFI out-of-line ABI mode (through devel/py-cffi) as they are now captured
correctly during the installation process.
Changelog since 9.0.3:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.5/docs/release_notes.rst
MFH: 2019Q4 |
9.0.3 10 Sep 2019 20:10:53
  |
kai  |
textproc/py-ocrmypdf: Update to 9.0.3
* Introduce new default option UNPAPER after graphics/unpaper was updated to
a more recent version in r511568.
Changelog:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.3/docs/release_notes.rst |
9.0.2 05 Sep 2019 16:15:10
  |
kai  |
textproc/py-ocrmypdf: Update to 9.0.2
* Set the PNGQUANT option as default and remove the related comments since
now the issue with the incorrect inverted PNG images has been fixed.
Changelog:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.2/docs/release_notes.rst |
9.0.1 13 Aug 2019 19:32:17
  |
kai  |
textproc/py-ocrmypdf: Update to 9.0.1
* Make the dependency for graphics/pngquant optional as pngquant is not
strictly required but should be provided where possible.
* Set the new option as non-default for a while because there's a bug that
leads to incorrect inverted PNG images in conjunction with pngquant.
* Install shell auto-completions to improve user experience.
* Remove the "slow" tests from the "do-test" target. There's very little
point to run those at downstream.
Changelog since 8.3.2:
https://github.com/jbarlow83/OCRmyPDF/blob/v9.0.1/docs/release_notes.rst |
8.3.2 03 Aug 2019 22:21:57
  |
kai  |
textproc/py-ocrmypdf: Update to 8.3.2
Changelog:
https://github.com/jbarlow83/OCRmyPDF/blob/v8.3.2/docs/release_notes.rst |
8.3.1 15 Jul 2019 10:14:38
  |
kai  |
textproc/py-ocrmypdf: Use defined locale in whole build process
* Set the locale globally for {CONFIGURE,MAKE}_ENV so that the locale
doesn't need to be set explicitly for the do-test target.
* Use ECHO_CMD instead of ECHO as the latter one is already defined in
/usr/share/mk/sys.mk, which can either be "echo" or "true" if the make
flag -s is given.
Suggested by: sunpoet |
8.3.1 12 Jul 2019 17:20:33
  |
kai  |
textproc/py-ocrmypdf: Apply workaround for missing files a bit earlier
* The post-stage target runs very late in the install process and should be
only used if the post-install target ran too early.
Thus place the workaround that copes with the missing files to the
post-install target where it fits better.
Suggested by: mat |
8.3.1 12 Jul 2019 15:08:04
  |
kai  |
New port: textproc/py-ocrmypdf
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be
searched or copy-pasted.
Main features:
* Generates a searchable PDF/A file from a regular PDF
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a "lossless" operation without
disrupting any other content
* Optimizes PDF images, often producing files smaller than the input file
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Distributes work across all available CPU cores
* Uses Tesseract OCR engine to recognize more than 100 languages
* Scales properly to handle files with thousands of pages
* Battle-tested on millions of PDFs
WWW: https://github.com/jbarlow83/OCRmyPDF
Reviewed by: 0mp, koobs
Differential Revision: https://reviews.freebsd.org/D20927 |