(Originally from : https://medium.com/@mathiasconradt/using-scanbot-and-nextcloud-for-scanning-ocr-and-file-management-229750420882)
For my personal documents, I use a Nextcloud installation on a RaspberryPi 3 and reachable from the outside. In order to digitalize paper documents that I receive, I’ve been using Scanbot for some years now.
It’s an Android app that lets you photograph documents and save them as images or (my preference) pdf files.
With Scanbot Pro you can even let an OCR engine analyze the text, which stores the text as an extra layer in the pdf, making it searchable and even indexable (great if you use the Nextcloud Nextant app powered by Solr).
In order to connect Scanbot with your Nextcloud, you can simply define a custom WebDav server endpoint and select a folder in the app preferences. In Scanbot, you can then upload the scanned documents manually or have them uploaded automatically.
In my case, I have a folder named “Scanbot” in my root directory under my user. So the WebDav endpoint url would look like this:
When I now scan new documents in the Scanbot app, they are automatically uploaded to my “Scanbot” folder on my Nextcloud.
Thanks @mathiasconradt for posting!
Community! I have a spare Scanbot Pro license! Anyone interested in getting it, give the above OP a and on Friday 4th I’ll randomly select a winner!
Scanbot also has an option for an Owncloud account (which is compatible with Nextcloud) to make the setup a bit easier because you only need your Nextcloud domain instead of the full Webdav address.
@mathiasconradt great piece of work!! <3 <3 <3
only one little concern… wouldn’t nextant go out of service sometime during next future (like from NC 14 on)? have you tried replacing it with Full Text Search as well?
apart from that… wonderful. thanks so much for your contribution
For whatever reason, I was not able to use the ownCloud connector for my instance, so also went with webDAV. Didn’t need a full path either as it brought up an interactive folder picker for me.
“interactive folder picker” Sorry what does this mean?
@JimmyKater When Full Text Search becomes stable (should be out of beta state pretty soon), Nextant isn’t needed anymore. The result should be pretty similar though… it will index the OCR’ed PDFs just fine; just using ElasticSearch instead of Solr, should make no difference for the end user.
Yes, I tested Full Text Search a while ago.
@tv_cologne What do you mean with the “workflow part” ?
and so it makes the whole story round… yay
thanks again for your great howto!
My winner is about to get a PM. Thanks for playing!
I used the owncloud connector and then used the syntax of: > https://<mydomain>/remote.php/dav/files/<username>/Scanbot/ and it worked.
I had tried it before with just > https://<mydomain>/ but never thought to specify the full path.
Really handy, thanks! I use Scanbot every day and was emailing docs manually…horrible!
Since today Scanbot offers nextcloud as a direct cloud setup option.
Does the Nextcloud integration work?
When I want to add an account for Nextcloud, the connection cannot be established. No problems by using WebDAV…
I believe it does not work. Attempted to add my Nextcloud account to Scanbot in all possible ways (URL with and without
https:// prefix and with and without
/remote.php/webdav suffix, user password and app password), but it never managed to establish the connection.
Interestingly, I can successfully connect to my Nextcloud instance with the ‘Owncloud’ offering, works fine. This forum thread was the first hit when I searched the web for ‘scanbot nextcloud’, so if anyone else gets here looking for a solution, just use the ‘Owncloud’ or the WebDAV option.
I got this to work using nextcloud connector. Open your nextcloud in a web browser, hit the hamburger on top left, settings, then you’ll see a url for your server. Copy paste that, along with user name and password, and it worked. I can also browse for existing folders for default upload location. Back up and running.
Keep in mind Scanbot is not open source. I use Open Note Scanner, instead - but it’s not as featureful.