Indexing breaks with an null pointer exception NC21 #616

dragonpil · 2021-03-27T10:17:49Z

Hi,
I am running Nextcloud 21 with Full text search, Full text search - Elasticsearch Platform, Full text search - Files all with version 21.0.0 and Full text search - Files - Tesseract OCR with version 20.0.1
All is running in a docker container from the official base image with additional installation of tesseract, tesseract-deu, ghostscript and ImageMagic-7 and I modified the policy so that it can read and write pdfs.
For some of my pdfs I get the following exception. Workaround at the moment is moving them into a separate folder with .noindex in it. That is of cause just a ugly workaround It seams that the OCR could find something and that's why the second parameter here is null. Wouldn't it be better if there is a kind of try expect / try-catch around that and if you can't index a single file, ignore it and go to the next.

An unhandled exception has been thrown:

TypeError: array_key_exists() expects parameter 2 to be array, null given in /var/www/html/custom_apps/fulltextsearch_elasticsearch/lib/Platform/ElasticSearchPlatform.php:310
Stack trace:
#0 /var/www/html/custom_apps/fulltextsearch_elasticsearch/lib/Platform/ElasticSearchPlatform.php(310): array_key_exists('reason', NULL)
#1 /var/www/html/custom_apps/fulltextsearch_elasticsearch/lib/Platform/ElasticSearchPlatform.php(289): OCA\FullTextSearch_Elasticsearch\Platform\ElasticSearchPlatform->parseIndexErrorException(Object(Elasticsearch\Common\Exceptions\BadRequest400Exception))
#2 /var/www/html/custom_apps/fulltextsearch_elasticsearch/lib/Platform/ElasticSearchPlatform.php(237): OCA\FullTextSearch_Elasticsearch\Platform\ElasticSearchPlatform->manageIndexErrorException(Object(OCA\Files_FullTextSearch\Model\FilesDocument), Object(Elasticsearch\Common\Exceptions\BadRequest400Exception))
#3 /var/www/html/custom_apps/fulltextsearch/lib/Service/IndexService.php(375): OCA\FullTextSearch_Elasticsearch\Platform\ElasticSearchPlatform->indexDocument(Object(OCA\Files_FullTextSearch\Model\FilesDocument))
#4 /var/www/html/custom_apps/fulltextsearch/lib/Service/IndexService.php(326): OCA\FullTextSearch\Service\IndexService->indexDocument(Object(OCA\FullTextSearch_Elasticsearch\Platform\ElasticSearchPlatform),Object(OCA\Files_FullTextSearch\Model\FilesDocument))
#5 /var/www/html/custom_apps/fulltextsearch/lib/Service/IndexService.php(204): OCA\FullTextSearch\Service\IndexService->indexDocuments(Object(OCA\FullTextSearch_Elasticsearch\Platform\ElasticSearchPlatform), Object(OCA\Files_FullTextSearch\Provider\FilesProvider), Array, Object(OCA\FullTextSearch\Model\IndexOptions))
#6 /var/www/html/custom_apps/fulltextsearch/lib/Command/Index.php(410): OCA\FullTextSearch\Service\IndexService->indexProviderContentFromUser(Object(OCA\FullTextSearch_Elasticsearch\Platform\ElasticSearchPlatform), Object(OCA\Files_FullTextSearch\Provider\FilesProvider), 'Alex_Gilgamesch', Object(OCA\FullTextSearch\Model\IndexOptions))
#7 /var/www/html/custom_apps/fulltextsearch/lib/Command/Index.php(273): OCA\FullTextSearch\Command\Index->indexProvider(Object(OCA\Files_FullTextSearch\Provider\FilesProvider), Object(OCA\FullTextSearch\Model\IndexOptions))
#8 /var/www/html/3rdparty/symfony/console/Command/Command.php(255): OCA\FullTextSearch\Command\Index->execute(Object(Symfony\Component\Console\Input\ArgvInput), Object(Symfony\Component\Console\Output\ConsoleOutput))
#9 /var/www/html/core/Command/Base.php(169): Symfony\Component\Console\Command\Command->run(Object(Symfony\Component\Console\Input\ArgvInput), Object(Symfony\Component\Console\Output\ConsoleOutput))
#10 /var/www/html/3rdparty/symfony/console/Application.php(1009): OC\Core\Command\Base->run(Object(Symfony\Component\Console\Input\ArgvInput), Object(Symfony\Component\Console\Output\ConsoleOutput))
#11 /var/www/html/3rdparty/symfony/console/Application.php(273): Symfony\Component\Console\Application->doRunCommand(Object(OCA\FullTextSearch\Command\Index), Object(Symfony\Component\Console\Input\ArgvInput), Object(Symfony\Component\Console\Output\ConsoleOutput))
#12 /var/www/html/3rdparty/symfony/console/Application.php(149): Symfony\Component\Console\Application->doRun(Object(Symfony\Component\Console\Input\ArgvInput), Object(Symfony\Component\Console\Output\ConsoleOutput))
#13 /var/www/html/lib/private/Console/Application.php(215): Symfony\Component\Console\Application->run(Object(Symfony\Component\Console\Input\ArgvInput), Object(Symfony\Component\Console\Output\ConsoleOutput))
#14 /var/www/html/console.php(100): OC\Console\Application->run()
#15 /var/www/html/occ(11): require_once('/var/www/html/c...')

Is there somehow a possibility to ignore a single file, so that I don't have to move them?

fwolfst · 2021-04-27T04:57:45Z

Wouldn't it be better if there is a kind of try expect / try-catch around that and if you can't index a single file, ignore it and go to the next.

Or null-guard the array access, something like:

 if (!is_null($arr['error']['root_cause'][0]) && array_key_exists('reason', $arr['error']['root_cause'][0])) {
 #...
}

ArtificialOwl · 2021-05-06T12:47:30Z

Hello, this patch should fix your issue: nextcloud/fulltextsearch_elasticsearch#141

ArtificialOwl closed this as completed Aug 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Indexing breaks with an null pointer exception NC21 #616

Indexing breaks with an null pointer exception NC21 #616

dragonpil commented Mar 27, 2021

fwolfst commented Apr 27, 2021

ArtificialOwl commented May 6, 2021

Indexing breaks with an null pointer exception NC21 #616

Indexing breaks with an null pointer exception NC21 #616

Comments

dragonpil commented Mar 27, 2021

fwolfst commented Apr 27, 2021

ArtificialOwl commented May 6, 2021