Ambar: Simple Document Archive

If you like Ambar please ⭐ it!

What is Ambar

Ambar is a simple document archive with automated crawling, OCR, deduplication and ultra-fast full-text search. Imagine having billions of files in different formats such as xls, doc, txt, pdf and ppt, in any encoding. Ambar securely stores them and lets you search through their content and metadata in milliseconds. It is lightweight, simple and intuitive, yet fast and powerful in terms of data volume and scaling. All the rocket science is hidden behind a simple UI.

Features

Ambar features overview (Vimeo)

Full-text Search Tutorial: Mastering Ambar Search Queries
Files Crawling (SMB, FTP, Mail) Tutorial: Crawling Your Own Shared Folders
Scheduled Crawling
Dropbox Integration Tutorial: How to Search Through Your Dropbox Files Content
Advanced OCR
Files Deduplication
Secure Storage
Real-Time Statistics
Web UI
REST API

Cloud

It's the latest full-featured Ambar, hosted on our servers. All accounts and data are secured and carefully stored. You can connect Ambar directly to your Dropbox account and use Ambar's powerful search over your Dropbox files. Trying Ambar Cloud is a perfect way to get a taste of what Ambar is.

Signup
That's it!

A Basic Ambar Cloud account gives you space to store up to 2000 documents. To store more files, you can upgrade to the Pro version.

Hosting Ambar on Your Own Servers

Self-hosted Ambar can be installed as a set of Docker images. The Community Edition is available for free. It's a small version of the Enterprise Edition with a limited number of pipelines and crawlers and with authentication disabled, while preserving full functionality. You can also request a trial of the Enterprise Edition by emailing us at hello@ambar.cloud.

Installation Instructions

Docker images can be found on Docker Hub

How it Works

Under the Hood
REST API Documentation
The source code is freely available under Fair Source License 1 (Frontend, Crawler, ElasticSearch, Rabbit, Mongo, Installer).

FAQ

Is it open-source?

Yes, almost every Ambar module is published on GitHub under Fair Source License 1.

Is it free?

Yes, the Community Edition is forever free. We will NOT charge you a penny to use it. The Basic cloud account is also forever free.

Does it perform OCR?

Yes, it performs OCR on images (jpg, tiff, bmp, etc.) and PDFs. OCR is performed by the well-known open-source library Tesseract. We tuned it to achieve the best performance and quality on scanned documents.

Which languages are supported by OCR?

Supported languages: Eng, Rus, Ita, Deu, Fra, Spa, Nld. If your language is missing, please create an issue on GitHub and we'll add it ASAP.

Does it support tagging?

Not yet, but we're working on it. As a workaround, you can use the folder hierarchy as a set of tags.
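The folder-as-tags workaround can be sketched in a few lines of Python. This is only an illustration of the idea, not part of Ambar: the function name `tags_from_path`, the `root` parameter and the example path are all hypothetical, and the sketch assumes POSIX-style paths.

```python
from pathlib import PurePosixPath
from typing import List


def tags_from_path(file_path: str, root: str = "/") -> List[str]:
    """Return the folder names between `root` and the file as a list of tags.

    Hypothetical helper: Ambar itself does not expose this function.
    """
    relative = PurePosixPath(file_path).relative_to(PurePosixPath(root))
    # Every path component except the file name itself is treated as a tag.
    return list(relative.parts[:-1])


if __name__ == "__main__":
    print(tags_from_path("/archive/finance/2017/invoices/acme.pdf", root="/archive"))
```

With this convention, a document stored under `finance/2017/invoices` can be found by searching on any of those folder names, so a consistent folder layout does the job of a tag set.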

What about searching in PDF?

Yes, it can search through any PDF, even badly encoded ones or ones with scans inside. We did our best to make search over any kind of PDF document smooth.

I miss XXX language analyzer. Can you add it?

Yes, please create an issue on GitHub.

Are you going to add UI localizations?

We're working on it. Be patient.

What is the maximum file size it can handle?

It's limited by the amount of RAM on your machine, typically 500MB. It's an awesome result, as typical document management systems limit processed files to 30MB.

What is the difference between Ambar CE and Ambar EE?

Basically Ambar CE is a downscaled Ambar EE. For more details check this

Can anyone else see my documents?

Nope, check our Privacy Policy.

I have a problem. What should I do?

Submit an issue or chat with us on https://ambar.cloud

Change Log

Contributors

hartmch

Privacy Policy

License

Fair Source 1 License v0.9
