nutch
Here are 29 public repositories matching this topic...
DataHarvest: Dockerized Web Crawling, Indexing, and Storage Solution
-
Updated
Jun 19, 2023 - Python
✨ 🧬 Apache Nutch Plugin for Viglet Turing Search
-
Updated
Aug 5, 2021 - Java
Apache Nutch system adapter for ORCA
-
Updated
Sep 19, 2019 - Java
Rest Service for Spring/Solr backed search engine.
-
Updated
Aug 22, 2021 - Java
Developed as part of an Information Retrieval coursework, this project showcases a search engine that efficiently indexes and retrieves information from a given dataset.
-
Updated
Aug 25, 2023 - Python
Search engine knowledge systems(搜索引擎知识体系).
-
Updated
Feb 22, 2020
Developed a Spatial Search website that allow users to search documents from FBI Vault website. Extract the most frequently occurring location in each of documents, and load the geo-tagged data into Apache Solr to index the documents, visualize search results using the Google Maps API.
-
Updated
Sep 11, 2014 - Java
Nutch 1.x Indexer Plugin that runs against ES6.7
-
Updated
Aug 12, 2019 - Java
Launch fast and easy an Apache Solr linked with Apache Nutch in separated docker containers.
-
Updated
Dec 3, 2015
Simple crawler using apache nutch and elasticsearch
-
Updated
May 27, 2020 - Shell
A simple web crawler inside a docker container using Apache Nutch 1 and Solr.
-
Updated
Jan 15, 2021 - Dockerfile
Python port of Nutch that allows controlling Apache Nutch via its REST API.
-
Updated
Dec 2, 2015 - Python
A very simple search engine "specialised" in searching financial news.
-
Updated
Dec 5, 2016 - Shell
Improve this page
Add a description, image, and links to the nutch topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the nutch topic, visit your repo's landing page and select "manage topics."