This package can be used to index a large number of nodes in Elasticsearch indexes. It uses the Flowpack JobQueue packages to handle the indexing asynchronously.
You need to install the correct Queue package based on your needs.
Available packages:
Please check the package documentation for specific configurations.
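As an illustration, a queue package is typically added with Composer. The sketch below assumes the Doctrine-backed queue and its Packagist name flowpack/jobqueue-doctrine; the helper name and the COMPOSER_BIN variable are hypothetical and not part of this package.

```shell
# Hypothetical helper, not shipped with this package.
# COMPOSER_BIN is an assumed override so the command can be dry-run.
COMPOSER_BIN="${COMPOSER_BIN:-composer}"

# Install one queue backend. The default package name is an assumption
# (the Doctrine queue as published on Packagist).
install_queue_package() {
    package="${1:-flowpack/jobqueue-doctrine}"
    "$COMPOSER_BIN" require "$package"
}
```

Calling install_queue_package without arguments would then run composer require for the Doctrine queue; pass another package name to pick a different backend.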
The default configuration uses the FakeQueue, which is provided by the JobQueue.Common package. Note that with this queue, jobs are executed synchronously by the flow nodeindexqueue:build command.
Adapt the className in Settings.yaml to match the Queue package you installed:
Flowpack:
  JobQueue:
    Common:
      presets:
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer':
          className: 'Flowpack\JobQueue\Common\Queue\FakeQueue'
If you use the doctrine package, you have to set the tableName manually:
Flowpack:
  JobQueue:
    Common:
      presets:
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer':
          className: 'Flowpack\JobQueue\Doctrine\Queue\DoctrineQueue'
      queues:
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer':
          options:
            tableName: 'flowpack_jobqueue_QueueIndexer'
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer.Live':
          options:
            tableName: 'flowpack_jobqueue_QueueIndexerLive'
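Once the queues are configured, they may need to be initialized before first use. The sketch below assumes that the queue:setup command from Flowpack.JobQueue.Common is available in your installation and that the two queue names match the configuration above; the helper name and the FLOW_BIN variable are hypothetical.

```shell
# Hypothetical helper, not shipped with this package.
# FLOW_BIN is an assumed override for the path to the flow CLI.
FLOW_BIN="${FLOW_BIN:-./flow}"

# Run queue:setup (assumed to come from Flowpack.JobQueue.Common) once per
# configured queue, stopping at the first failure.
setup_indexer_queues() {
    for queue in \
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer' \
        'Flowpack.ElasticSearch.ContentRepositoryQueueIndexer.Live'
    do
        "$FLOW_BIN" queue:setup "$queue" || return 1
    done
}
```

Check the documentation of your queue package to confirm whether an explicit setup step is required.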
Build the indexing jobs for a workspace with this CLI command:
flow nodeindexqueue:build --workspace live
You can use this CLI command to process the batch indexing jobs:
flow nodeindexqueue:work --queue batch
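The two batch steps can be chained in a small wrapper. This is a minimal sketch that only uses the two commands shown in this document; the function name, the FLOW_BIN variable, and the default workspace are assumptions for illustration.

```shell
# Hypothetical wrapper, not shipped with this package.
# FLOW_BIN is an assumed override for the path to the flow CLI.
FLOW_BIN="${FLOW_BIN:-./flow}"

# Queue indexing jobs for one workspace, then start a worker on the
# batch queue. The worker only starts if the build step succeeded.
index_workspace() {
    workspace="${1:-live}"
    "$FLOW_BIN" nodeindexqueue:build --workspace "$workspace" &&
        "$FLOW_BIN" nodeindexqueue:work --queue batch
}
```

With the default FakeQueue the jobs already run during the build step, so the worker call is only useful with a real queue backend.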
You can disable asynchronous live indexing by editing Settings.yaml:
Flowpack:
  ElasticSearch:
    ContentRepositoryQueueIndexer:
      enableLiveAsyncIndexing: false
You can use this CLI command to process the live indexing jobs:
flow nodeindexqueue:work --queue live
You can use tools like supervisord to manage long-running processes. Below you can find a basic configuration:
[supervisord]
[supervisorctl]
[program:elasticsearch_batch_indexing]
command=php flow nodeindexqueue:work --queue batch
stdout_logfile=AUTO
stderr_logfile=AUTO
numprocs=4
process_name=elasticsearch_batch_indexing_%(process_num)02d
environment=FLOW_CONTEXT="Production"
autostart=true
autorestart=true
stopsignal=QUIT
[program:elasticsearch_live_indexing]
command=php flow nodeindexqueue:work --queue live
stdout_logfile=AUTO
stderr_logfile=AUTO
numprocs=4
process_name=elasticsearch_live_indexing_%(process_num)02d
environment=FLOW_CONTEXT="Production"
autostart=true
autorestart=true
stopsignal=QUIT
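After changing a configuration like the one above, supervisord has to pick it up. The sketch below uses the standard supervisorctl subcommands reread, update, and status; the function name and the SUPERVISORCTL_BIN variable are hypothetical, and it assumes supervisord is already running with this file loaded.

```shell
# Hypothetical helper, not shipped with this package.
# SUPERVISORCTL_BIN is an assumed override so the commands can be dry-run.
SUPERVISORCTL_BIN="${SUPERVISORCTL_BIN:-supervisorctl}"

# Re-read the configuration, apply the changes to the affected program
# groups, then print the state of all managed processes.
reload_indexing_workers() {
    "$SUPERVISORCTL_BIN" reread &&
        "$SUPERVISORCTL_BIN" update &&
        "$SUPERVISORCTL_BIN" status
}
```

With numprocs=4 for each program, the status output should list four batch workers and four live workers.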
- Previously the Beanstalk queue package was installed by default; this is no longer the case.
- The Beanstalk queue configuration has been removed. The FakeQueue is used unless another queue package is configured.
Licensed under MIT, see LICENSE