tao-test/app/taoTaskQueue/README.md

202 lines
7.9 KiB
Markdown

# Distributed Task Queue
> This article describes the functioning of an ASYNC Task Queue.
## Install
You can add the Task Queue as a standard TAO extension to your current TAO instance.
```bash
$ composer require oat-sa/extension-tao-task-queue
```
### Queue component
Queue can work with different types of queue brokers, here two types are to accomplish ASYNC mechanism:
- **RdsQueueBroker** which stores tasks in RDS.
- **SqsQueueBroker** which is for using AWS SQS.
_Note_:
> When SqsQueueBroker is used, please make sure that "**oat-sa/lib-generis-aws**" is included in the main composer.json and you have
> generis/awsClient.conf.php properly configured.
#### Weight
A Queue can have a weight. If multiple Queues are in use, this weight will be used for randomly select a Queue to be consumed.
For example, if QueueA has weight of 1 and QueueB has weight of 2, then QueueB has about a 66% chance of being selected.
### Worker component
Here we have a so called `LongRunningWorker` which can run unlimited time.
It has built-in signal handling for the following actions:
- Shutting down the worker gracefully: SIGTERM/SIGINT/SIGQUIT
- Pausing task processing: SIGUSR2
- Resuming task processing: SIGCONT
_Note_:
> Multiple workers can be run at the same time.
After processing the given task, the worker saves the generated report for the task through the Task Log.
## Service setup examples
### Multiple Queues settings
In this case we have 3 Queues registered: one of them is using SQS broker, the other two RDS.
Every Queue has its own weight (like 90, 30, 10) which will be used at selecting the next queue to be consumed.
And we have two tasks linked to different queues, furthermore the default queue is specified ('background')
what will be used for every other tasks not defined in OPTION_TASK_TO_QUEUE_ASSOCIATIONS.
```php
use oat\tao\model\taskQueue\QueueDispatcher;
use oat\tao\model\taskQueue\Queue;
use oat\taoTaskQueue\model\QueueBroker\RdsQueueBroker;
use oat\taoTaskQueue\model\QueueBroker\SqsQueueBroker;
use oat\tao\model\taskQueue\TaskLogInterface;
use oat\tao\model\taskQueue\QueueDispatcherInterface;
$queueService = new QueueDispatcher(array(
QueueDispatcherInterface::OPTION_QUEUES => [
new Queue('priority', new SqsQueueBroker('default', \common_cache_Cache::SERVICE_ID, 10), 90),
new Queue('standard', new RdsQueueBroker('default', 5), 30),
new Queue('background', new RdsQueueBroker('default', 5), 10)
],
QueueDispatcherInterface::OPTION_TASK_LOG => TaskLogInterface::SERVICE_ID,
QueueDispatcherInterface::OPTION_TASK_TO_QUEUE_ASSOCIATIONS => [
SomeImportantAction::class => 'priority',
SomeLessImportantTask::class => 'standard'
]
));
$queueService->setOption(QueueDispatcherInterface::OPTION_DEFAULT_QUEUE, 'background');
$this->getServiceManager()->register(QueueDispatcherInterface::SERVICE_ID, $queueService);
```
If the queue has not been initialized, meaning the required queue container has not been created yet:
```php
try {
$queueService->initialize();
} catch (\Exception $e) {
return \oat\oatbox\reporting\Report::createError('Initializing queues failed');
}
```
### Initializing the queues and the task log container
You can run this script if you want to be sure that the required queues and the task log container are created.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\InitializeQueue'
```
_Note_:
> This script also can be used to change the current queues to use a different queue broker.
- Changing every existing queue to use InMemoryQueueBroker. (Sync Queue)
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\InitializeQueue' --broker=memory
```
- Changing every existing queue to use RdsQueueBroker.
Option "persistence" is required, "receive" (Maximum amount of tasks that can be received when polling the queue) is optional.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\InitializeQueue' --broker=rds --persistence=default --receive=10
```
- Changing every existing queue to use SqsQueueBroker. Option "aws-profile" is required, "receive" is optional.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\InitializeQueue' --broker=sqs --aws-profile=default --receive=10
```
- If you want to apply the settings above for a specific queue, you can add `--queue=...` option to the command. In the following case, only `myQueue` will be modified.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\InitializeQueue' --queue=myQueue --broker=rds --persistence=default --receive=10
```
- Setting a task selector strategy.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\InitializeQueue' --strategy="\oat\taoTaskQueue\model\TaskSelector\StrictPriorityStrategy"
```
### Running a worker
To run a worker, use the following command. It will start a worker for running infinitely and iterating over every registered Queues based in their weights.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\RunWorker'
```
If you want the worker running for a dedicated Queue, pass the name of the queue to the command like this:
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\RunWorker' --queue=priority
```
You can limit the iteration of the worker. It can be used only on a dedicated queue.
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\RunWorker' --queue=standard --limit=5
```
If you want to associate specyfic task to new queue you can use this command:
```bash
$ sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\ManageAssociationMap' \
-t '{ you fully qualified task class name }' -q queue-name
```
Next time when defined task will be created, it will be assign to specified queue.
### Summarize stuck tasks
Execute this command if you want to summarize stuck tasks. Example:
```shell
sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\StuckTaskSummary' \
--queue indexation_queue \
--age 300 \
--whitelist "oat\tao\model\search\tasks\UpdateResourceInIndex,oat\tao\model\search\tasks\UpdateClassInIndex"
```
### Restart stuck tasks
Execute this command if you want to restart stuck tasks. Example:
```shell
sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\StuckTaskRestart' \
--queue indexation_queue \
--age 300 \
--whitelist "oat\tao\model\search\tasks\UpdateResourceInIndex,oat\tao\model\search\tasks\UpdateClassInIndex"
```
## Rest API
The task log reports can be viewed/consume using the Application Programming Interface (API).
In order to use it please check the swagger file in (doc/taskApi.yml).
## Command Line Utility
Besides using the API to check reports of tasks, another way it's using the command line.
```bash
sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\TaskLogUtility' --help
```
This command will show you all the possibilities action the the utility can have.
```text
Examples
1. Stats
Description: Return stats about the tasks logs statuses
Example: sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\TaskLogUtility' --stats
2. List Task Logs
Description: List All the tasks that are not archived will be retrived, default limit is 20
Example: sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\TaskLogUtility' --available --limit[optional]=20 --offset[optional]=10
3. Get Task Log
Description: Get an specific task log by id
Example: sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\TaskLogUtility' --get-task=[taskdId]
4. Archive a Task Log
Description: Archive a task log
Example: sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\TaskLogUtility' --archive=[taskdId] --force[optional]
5. Cancel a Task Log
Description: Cancel a task log
Example: sudo -u www-data php index.php 'oat\taoTaskQueue\scripts\tools\TaskLogUtility' --cancel=[taskdId] --force[optional]
```