Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

The Giles Ecosystem is a distributed version of Giles. It is designed to handle high numbers of request by distributing work employing Apache Kafka.

Currently, there are four six applications in the Giles Ecosystem:

Giles Head

A Giles Head looks and behaves the same way as the full Giles version. However, instead of extracting images and running OCR on them, a Giles Head inserts extraction, OCR, etc. requests into Apache Kafka for other components to fulfill the request processing. The main responsibility of a Giles Head is to provide a stable API and user interface and to coordinate the file processing workflow.

...

Cassiopeia is an app to run OCR routines on images using Tesseract. It listens to OCR requests in Apache Kafka and sends OCR complete requests after successful processing.

Freddie

Freddie is an app that sends any text file to Solr for indexing. Giles has an API endpoint to search the documents of an uploaded user by querying Freddie. Freddie is an optional component. If no search functionality is required, Freddie does not need to be installed.

September

September is a monitoring app for the Giles Ecosystem. Other apps can send messages about errors or warnings into Kafka, which are then picked up by September.