Overview
Arkindex is Teklia’s web solution to extract and analyze text from images of documents.
All of its features are fully accessible from its web interface, but also from a Rest API.
Two versions of Arkindex are available: open-source and enterprise.
Document management¶
Arkindex is first and foremost a web platform to process thousands of documents. Document management is an essential part of its features. We use a generic term Element
to talk about a Document and its components. We also use a graph structure in database to be able to represent any of your documents.
You can read more about document structure on the Data Concepts page and Projects section.
Machine Learning workflows¶
We rely heavily on Machine Learning processes to analyze the text in your documents. Using segmentation, transcription and named entity recognition software, we are able to make sense of most handwritten text.
These complex tools are easily launched from your web browser, and run on powerful servers. Using Arkindex you’ll be able to watch the completion progress of your current analysis.
Data Import¶
Arkindex can import your data from different sources:
- your computer using an image or PDF upload form
- your own IIIF server
- a Gitlab server
- an Amazon S3 bucket
- Transkribus collections
Customizable user access¶
Info
Limited to Enterprise Edition
Each Arkindex platform instance can be shared amongst multiple people, projects, and teams. As a project administrator you will have fine-grained control over which users can access your document collections (so called Projects
in Arkindex).
You can read more about controlling user access.