Fulltetxtsearch not found results

hermann1514 · October 26, 2018, 6:12am

Hello

i have installed on my NC14.03 elasticsearch and the 3 search apps. When I create the first index on the command line this works without error messages.

Unfortunately I only see “No results” in the Nextcloud when i perfomed a search.

I have already uninstalled the 3 apps, deleted tables and reinstalled. Unfortunately, no change.

The files are all in PDF format. I have also installed the AddOn for Elasticsearch.

What else can I do?

    Test Elasticsearch:
        ~$ curl -XGET 'localhost:9200/?pretty'
        {
          "name" : "DRuPAvx",
          "cluster_name" : "elasticsearch",
          "cluster_uuid" : "8wMByPh2QTO37bgeWbwYbw",
          "version" : {
            "number" : "6.4.2",
            "build_flavor" : "default",
            "build_type" : "deb",
            "build_hash" : "04711c2",
            "build_date" : "2018-09-26T13:34:09.098244Z",
            "build_snapshot" : false,
            "lucene_version" : "7.4.0",
            "minimum_wire_compatibility_version" : "5.6.0",
            "minimum_index_compatibility_version" : "5.0.0"
          },
          "tagline" : "You Know, for Search"
        }

create Index:

  Options: []
Memory: 25 MB
┌─ Indexing  ────
│ Action: indexDocument
│ Provider: Files                Account: Testuser
│ Document: 296
│ Info: httpd/unix-directory
│ Title: flowupload
│ Content size: 0
│ Progress:    all/383
└──
┌─ Results ────
│ Result:    416/416
│ Index: files:296
│ Status: ok
│ Message: {"_index":"myfiles_idx","_type":"standard","_id":"files:296","_version":1,"result":"created","_shards":{"total":2,"successful":1,"failed":0},"_seq_no":81,"_primary_term":1}
│
│
└──
┌─ Errors ────
│ Error:      0/0
│ Index:
│ Exception:
│ Message:
│
│
└──
## x:first result ## c/v:prec/next result ## b:last result
## f:first error ## h/j:prec/next error ## d:delete error ## l:last error
## q:quit ## p:pause

A fulltextsearch:test:

.Testing your current setup:
Creating mocked content provider. ok
Testing mocked provider: get indexable documents. (2 items) ok
Loading search platform. (Elasticsearch) ok
Testing search platform. ok
Locking process ok
Removing test. ok
Pausing 3 seconds 1 2 3 ok
Initializing index mapping. ok
Indexing generated documents. ok
Pausing 3 seconds 1 2 3 ok
Retreiving content from a big index (license). (size: 32386) ok
Comparing document with source. ok
Searching basic keywords:
 - 'test' (result: 1, expected: ["simple"]) ok
 - 'document is a simple test' (result: 2, expected: ["simple","license"]) ok
 - '"document is a test"' (result: 0, expected: []) ok
 - '"document is a simple test"' (result: 1, expected: ["simple"]) ok
 - 'document is a simple -test' (result: 1, expected: ["license"]) ok
 - 'document is a simple +test' (result: 1, expected: ["simple"]) ok
 - '-document is a simple test' (result: 0, expected: []) ok
Updating documents access. ok
Pausing 3 seconds 1 2 3 ok
Searching with group access rights:
 - 'license' - [] -  (result: 0, expected: []) ok
 - 'license' - ["group_1"] -  (result: 1, expected: ["license"]) ok
 - 'license' - ["group_1","group_2"] -  (result: 1, expected: ["license"]) ok
 - 'license' - ["group_3","group_2"] -  (result: 1, expected: ["license"]) ok
 - 'license' - ["group_3"] -  (result: 0, expected: []) ok
Searching with share rights:
 - 'license' - notuser -  (result: 0, expected: []) ok
 - 'license' - user2 -  (result: 1, expected: ["license"]) ok
 - 'license' - user3 -  (result: 1, expected: ["license"]) ok
Removing test. ok
Unlocking process ok

How long does an Index running for the first time? Can i check if the index running?

Thx.
Hermann

Cult · October 29, 2018, 3:38pm

Time depends on the size of your data.

The interface, during the index, displays enough information to do your own estimation.

hermann1514 · November 13, 2018, 2:17pm

OK. I tried something again.

if I take PDFs from other companies that I have received by mail (invoices), then the full-text search works. Only with my own PDF’s not. Do I have to pay attention to something while scanning?
I’m scanning with an ix500 from Fujitsu. Do you need to activate special options for PDF generation there? (Searchable PDF’s) ??

I also have the OCR software installed in Nextcloud … so it should be recognizable.

Cult · November 13, 2018, 6:17pm

No, the default pdf analysis will only get content from a layered pdf, and the OCR app does not scan pdf file.

The implementation of scanning pdf in the ocr app would be a good feature.