Application in php to test load of pdf files, using docker-compose and apache-tika.
-
Updated
Dec 1, 2018 - PHP
Application in php to test load of pdf files, using docker-compose and apache-tika.
This repository holds everything that is required to run the Apache Solr Engine and its functionality to crawl documents
بفهرسة اغلب انواع الوثائق والبحث فيها , استبدال العملات وتوحيد صيغ التواريخ والاوقات , يدعم الوثائق شبه المهيكلة باعطاء وزن اعلى للتاغ ذو الاهميه الاكبر, ويوسع الاستعلام باخذ مرادفات مفرداته باستخدام مكتبة ووردنت
[SLOW][WIP] Broodmother is a high performance, distributed, search engine using Apache Tika, Apache Solr, Akka, Neo4j, and Spring.
Using Apache Lucene, TIKI, Solr
This API use Annif as local server, NER component is included. It also includes Tesseract and uses Apache-tika software for language detection. It also has a limited multilingual support.
AWS Lambda code to index S3 buckets into Elasticsearch
Run Apache Tika as a service in AWS Lambda by scanning documents in S3 and storing the extracted text back to S3
PDF parsing and extraction utility using Apache Tika
Information Retrieval system for indexing and searching files stored on disk, with support for Romanian language
Apache Tika integration built in scala for indexing OneDrive files into ElasticSearch.
A vanilla PHP wrapper for Apache Tika and Google Cloud Translate to help them work in harmony.
Apache Tika adapter in Go
Analysis of PixStory social media data combined with Snapchat, COVID-19, and YouTube data. This project uses the Apache Tika Clustering software to cluster certain social media posts together.
microservice web application for uploading and downloading audio files
Secure file uploader web application
a tool set for indexing and searching through documents
Add a description, image, and links to the apache-tika topic page so that developers can more easily learn about it.
To associate your repository with the apache-tika topic, visit your repo's landing page and select "manage topics."