Two different highlighting methods are supported and can be used depending on data available:
Highlight document using a search query
In this case, Highlighter takes the user's query string and finds hits to highlight using internal search engine. Typically, you would use this method when integrating with Elasticsearch or Apache Solr based search solutions.
Highlight document using PDF highlight file
In this case, terms in PDF are highlighted using position of text in the document. This is the recommended approach if your search tool can generate highlighting files compatible with Adobe’s PDF Highlight File Format; that way, the PDF will have marked the exact terms as found by the search engine. Use this method when integrating with dtSearch based search solutions or NLP tools.
Self-hosted PDF Highlighter installation comes with a set of examples that you can try at http://localhost:8998/examples/.
comments powered by Disqus