I wanted to share my experience trying to set up OCR in Nextcloud over the past month. This isnāt a call for help - Iām done trying. Just feedback that I hope might be useful for the Nextcloud team or others considering OCR functionality.
Iāve spent a month attempting various approaches:
Full Text Search with Tesseract OCR app
OCRmyPDF integration
Paperless-ngx bridges with Nextcloud
Various custom configurations and workarounds
For context, Iām comfortable managing my own server - Docker environments, PostgreSQL databases, reverse proxies, etc. Iāve successfully set up complex self-hosted systems before. This isnāt about lacking technical skills.
Yet nothing has worked reliably for OCR. Things that should be straightforward turned into endless debugging sessions. Components that claim to work together simply donāt. Manual tests succeed while automated processes fail silently.
After a month of troubleshooting, reading documentation, checking logs, adjusting configurations, and rebuilding indexes, Iām calling it. The time investment just isnāt worth it anymore.
I love Nextcloud for collaboration, file sharing, and team workflows. But the OCR functionality feels fundamentally broken or at least severely under-documented. Itās genuinely disappointing because everything else works so well.
Just wanted to put this out there. Maybe itāll help set expectations for others, or maybe itāll reach someone who can improve the situation.
I have to admit Nextcloud is not good with āforeignā or ānon-coreā tasks. File management, sharing and Office (Collabora integration ) work well, but other things like Photos app, Paperless-ngx integration are really basic and unready. Mail app becomes somewhat useable after years
In my setup I switched to Joplin long time ago for managing notes, Iām planning so start using Immich or Photoprism for pictures and most likely Paperless-ngx is better for structured documents management.
Would be really great if there would be less āhalf-readyā apps in favor of solid 3rd party integrations.
Speaking of Nextcloud Mail, Roundcube has been under Nextcloudās umbrella for quite some time now. Why not just use that and adapt the design a little more to Nextcloud? It might not be all that easy to integrate it seamlessly into Nextcloudās features, but at least the core of Roundcube, the email program itself, has been rock solid for years and years, even with very large mailboxes.
Agree. After all, neither M365 nor Google Workspace have a built-in DMS or photo management solution. I think, integrations with well-established products such as PaperlessNGX or Immich, which specialise in these things, would indeed be more useful.
Regarding OCR: Fulltextsearch tesseract is ābrokenā since years. The good thing is: you donāt need it.
workflow_ocr does a great job and provides good defaults and the possibility for custom adjustments if required. Processed pdfs are automatically picked up by fulltextsearch. Thus, no need for fulltextsearch tesseract.
Thatās what Iām using. However, it still relies on Nextcloudās auto upload and Recognize for object and face recognition. In those categories, Immich is simply better.
Hereās an example. I set up two test instances. Immich and Nextcloud with Recognize and uploading the same few hundred photos to both instances. Immich recognised most of the faces within just a few minutes, whereas Recognize stubbornly insisted that there were too few faces to build a cluster or something like that..
Another reason, and in my opinion a more important one, is that the more apps you install on Nextcloud, the greater the likelihood of problems arising.
Despite this, I do like Memories quite a bit, and I think the Android app is actually quite good as well.
Just my two pennies worth on OCR. I use OCRmyPDF a lot and have it running locally, but it is always a little hit and miss depending on the format of the text in the document. I have played a lot with selecting different options in Tesseract to get better results but with little success on problem documents. I have an old copy of Finereader on an old W10 box and often drop back to that to get a clean result, but only for useful stuff, and those I spend the time spellchecking. Tesseract is not a good base to convert a scanned PDF into a āword processorā document which Finereader is a lot more capable of doing.
All that said, on the whole having thrown a lot of magazines through the scanner and added search to them, they work well once then fed into FTS options. I can find the mags containing a particular target easily enough direct from Dolphin and the same capability should work well enough in Nextcloud?
Yes, but full-text search is another one of those things (like Recognize) that, at best, only generates error messages in the log and, at worst, simply doesnāt work for a few weeks if youāve updated to a new Nextcloud version too early. In the end, it doesnāt matter which OCR backend you use, because if the Fuill Text search app itself doesnāt work, it wonāt work, even if you only upload PDFs where the text is already available.
And honestly. Now you could say, donāt always update right away, and I say that myself when others here have problems. But keeping track of a multidude apps and backends, whether they are all compatible with the next version, and how they might affect each other, is a full-time job. At the end of the day, itās easier to use dedicated services that (more or less) just work on their own, instead of trying to squeeze everything into Nextcloud.
But just to be clear, Iām talking about full-text search, which is something you would expect from a modern file storage/sharing service today, i.e it should be part of the core functionality, not an app, in my humble opinion. OCR, however, is another topic altogether.
For Google this is possibly true. M365 is SharePoint at its core, which was developed as a DMS in the beginning. I deal with it professionally. You can manage tens of thousands of documents, have automated multilingual metadata, document workflows, ocr and machine learning models, identifying metadata and so on.