Apache Tika

The Tika application jar can be used as a command line utility for extracting text content and metadata from all sorts of files.

Cost / License

  • Free
  • Open Source

Platforms

  • Mac
  • Windows
  • Linux
  • Java

Tags

  • extractor

Apache Tika information

  • Developed by: Apache Software Foundation
  • Licensing: Open Source and Free product.
  • Supported Languages: English

Apache Tika was added to AlternativeTo by DzmitryLahoda.

What is Apache Tika?

The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more. You can find the latest release on the download page. Please see the Getting Started page for more information on how to start using Tika.
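The command line use of the Tika application jar might look roughly like the Java sketch below, which builds and optionally runs `java -jar <jar> --text <file>` and `java -jar <jar> --metadata <file>`. The `--text` and `--metadata` options belong to the Tika app; the class name `TikaCli`, the jar path `tika-app.jar` (released jars carry a version number in the file name), and the input file `report.pdf` are placeholders chosen for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper for illustration; not part of Tika itself.
class TikaCli {

    // Builds the argument list for a single Tika app invocation.
    // jarPath is a placeholder: point it at the jar you actually downloaded.
    static List<String> command(String jarPath, String option, String file) {
        return List.of("java", "-jar", jarPath, option, file);
    }

    // Runs the command and returns whatever it wrote to standard output.
    static String run(List<String> command) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(command)
                .redirectError(ProcessBuilder.Redirect.INHERIT) // pass errors through to the console
                .start();
        String output = new String(process.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exitCode = process.waitFor();
        if (exitCode != 0) {
            throw new IOException("Tika exited with status " + exitCode);
        }
        return output;
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            // Without a jar and an input file, only show the commands that would be run.
            System.out.println(String.join(" ", command("tika-app.jar", "--text", "report.pdf")));
            System.out.println(String.join(" ", command("tika-app.jar", "--metadata", "report.pdf")));
            return;
        }
        // args[0] = path to the Tika app jar, args[1] = file to extract from.
        System.out.println(run(command(args[0], "--text", args[1])));
        System.out.println(run(command(args[0], "--metadata", args[1])));
    }
}
```

Run with no arguments, the program only prints the two command lines. Given a jar path and a file, it prints the extracted plain text followed by the metadata. The same two commands can also be typed directly into a shell.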

The Parser and Detector pages describe the main interfaces of Tika and how they work.

If you're interested in contributing to Tika, please see the Contributing page or send an email to the Tika development list.

Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.