This is a thread you probably couldn’t find:)
However, I want to draw attention to something Matias and I have been working on, and it is here:
This is a Nextcloud app that does something similar to what the OP is looking for, but for faces only. The app:
- is fully self-hosted,
- has no dependencies on Python, Docker, or external APIs,
- doesn't need a GPU (but it does help :D),
- gives very good results.
It is not yet in the Nextcloud app store, though. It is almost ready, but to get there we need to polish the documentation and add Debian packages for PDLib (to make onboarding smoother). Other than that, it has all the functionality we wanted: face detection, grouping of faces, and search.
I encourage everyone to try it and give us feedback (some people are already playing with it), but beware: you will need at least 2 GB of RAM, and you will have to get your hands dirty by compiling DLib and PDLib. Ask us anything on GitHub and propose new features, as we expect the real value will come from other scenarios built on top of basic recognition and clustering! The whole functional spec is located here.