Apache Tika 3.0.0 released - Available in the Domino Container
Daniel Nashed – 24 November 2024 14:01:58
Apache Tika is a Java based project leveraged in Domin to parse text from attachments when full text indexing using the search filters.
It's a single JAR running as a separate process listening on the loopback interface to perform attachment parsing.
Tika could actually also be used for your own applications, if you start another instance.
I blogged about it some time ago --> https://blog.nashcom.de/nashcomblog.nsf/dx/tika-in-notesdomino.htm.
Domino 14.5 EA1 and 14.0 FP2 containes the latest stable Tika Server 2.9.2 release.
Now that Tika 3.0.0 is finally released, you can expect Domino 14.5 also to switch to the new major version.
The container project provides a build option to replace the Tika version
I have just updated Tika to 3.0.0 in the container build and did a quick test.
- Comments [3]