Tags : Browse Projects

Apache Tika

Claimed by Apache Software Foundation. Analyzed 2 days ago.

The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.

393K lines of code

19 current contributors

3 days since last commit

23 users on Open Hub

High Activity
5.0
 
Apache Solr for TYPO3

Analyzed 2 days ago

Open Source Enterprise Search meets Open Source Enterprise Content Management System. A TYPO3 extension that integrates the Apache Solr enterprise search server with TYPO3. Features include:
* User Access Groups Support
* Multi-Language Handling
* File Indexing
* Faceting & Filters
* Sorting
* Field Boosting
* Spellchecking
* Search Word Highlighting
* Auto Suggest
* Multisite Support
* Advanced Templating Engine
* Index Reports

99.5K lines of code

22 current contributors

16 days since last commit

3 users on Open Hub

Moderate Activity
5.0
 
Apache Tika for TYPO3

Analyzed about 18 hours ago

Apache Tika for TYPO3 offers several services to extract metadata and content from files. The extension also comes with a service to detect the language of a text (requires Tika 0.8+). EXT:tika can use either a locally available Tika CLI app or a remote Apache Solr server. The provided services can then be used by other extensions such as EXT:dam or EXT:solr.

4.91K lines of code

2 current contributors

19 days since last commit

1 user on Open Hub

Low Activity
5.0
 