Apache Tika html parser module


Apache Tika html parser module

Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Compile beroenden (2)

Grupp / Artefakt Version Nyare Version
org.ccil.cowan.tagsoup » tagsoup 1.2.1 NA
commons-codec » commons-codec 1.15 NA

Provided beroenden (1)

Grupp / Artefakt Version Nyare Version
org.apache.tika » tika-core 2.7.0 NA