Apache Tika WARC parser module


Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Compile dependencies (2)

Group / Artifact                          Version   Newer Version
org.netpreserve » jwarc                   0.19.0    NA
org.apache.commons » commons-compress     1.21      NA

Provided dependencies (1)

Group / Artifact                          Version   Newer Version
org.apache.tika » tika-core               2.5.0     NA
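
For reference, the dependencies listed here could be declared in a Maven pom.xml roughly as follows. This is only a sketch built from the coordinates and versions shown in the listing. It assumes the two compile dependencies use Maven's default scope and that tika-core is declared with the provided scope, as the section names suggest. The module's own coordinates are not given in this listing, so they are left out.

```xml
<dependencies>
  <!-- Compile dependencies (default scope) -->
  <dependency>
    <groupId>org.netpreserve</groupId>
    <artifactId>jwarc</artifactId>
    <version>0.19.0</version>
  </dependency>
  <dependency>
    <groupId>org.apache.commons</groupId>
    <artifactId>commons-compress</artifactId>
    <version>1.21</version>
  </dependency>

  <!-- Provided dependency: expected to be supplied by the consuming project or runtime -->
  <dependency>
    <groupId>org.apache.tika</groupId>
    <artifactId>tika-core</artifactId>
    <version>2.5.0</version>
    <scope>provided</scope>
  </dependency>
</dependencies>
```

Because tika-core has the provided scope, it is not pulled in transitively. A project that uses this module would need to declare its own tika-core dependency.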