Automatic compaction API
This topic describes the status and configuration API endpoints for automatic compaction using Coordinator duties in Apache Druid. You can configure automatic compaction in the Druid web console or API.
Instead of the automatic compaction API, you can use the supervisor API to submit auto-compaction jobs using compaction supervisors. For more information, see Auto-compaction using compaction supervisors.
In this topic, http://ROUTER_IP:ROUTER_PORT
is a placeholder for your Router service address and port. Replace it with the information for your deployment. For example, use http://localhost:8888
for quickstart deployments.
Manage automatic compactionโ
Create or update automatic compaction configurationโ
Creates or updates the automatic compaction configuration for a datasource. Pass the automatic compaction as a JSON object in the request body.
The automatic compaction configuration requires only the dataSource
property. Druid fills all other properties with default values if not specified. See Automatic compaction dynamic configuration for configuration details.
Note that this endpoint returns an HTTP 200 OK
message code even if the datasource name does not exist.
URLโ
POST
/druid/coordinator/v1/config/compaction
Responsesโ
- 200 SUCCESS
Successfully submitted auto compaction configuration
Sample requestโ
The following example creates an automatic compaction configuration for the datasource wikipedia_hour
, which was ingested with HOUR
segment granularity. This automatic compaction configuration performs compaction on wikipedia_hour
, resulting in compacted segments that represent a day interval of data.
In this example:
wikipedia_hour
is a datasource withHOUR
segment granularity.skipOffsetFromLatest
is set toPT0S
, meaning that no data is skipped.partitionsSpec
is set to the defaultdynamic
, allowing Druid to dynamically determine the optimal partitioning strategy.type
is set toindex_parallel
, meaning that parallel indexing is used.segmentGranularity
is set toDAY
, meaning that each compacted segment is a day of data.
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction"\
--header 'Content-Type: application/json' \
--data '{
"dataSource": "wikipedia_hour",
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"partitionsSpec": {
"type": "dynamic"
},
"type": "index_parallel"
},
"granularitySpec": {
"segmentGranularity": "DAY"
}
}'
POST /druid/coordinator/v1/config/compaction HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Content-Type: application/json
Content-Length: 281
{
"dataSource": "wikipedia_hour",
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"partitionsSpec": {
"type": "dynamic"
},
"type": "index_parallel"
},
"granularitySpec": {
"segmentGranularity": "DAY"
}
}
Sample responseโ
A successful request returns an HTTP 200 OK
message code and an empty response body.
Remove automatic compaction configurationโ
Removes the automatic compaction configuration for a datasource. This updates the compaction status of the datasource to "Not enabled."
URLโ
DELETE
/druid/coordinator/v1/config/compaction/{dataSource}
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully deleted automatic compaction configuration
Datasource does not have automatic compaction or invalid datasource name
Sample requestโ
- cURL
- HTTP
curl --request DELETE "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction/wikipedia_hour"
DELETE /druid/coordinator/v1/config/compaction/wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
A successful request returns an HTTP 200 OK
message code and an empty response body.
Update capacity for compaction tasksโ
This API is now deprecated. Use Update cluster-level compaction config instead.
Updates the capacity for compaction tasks. The minimum number of compaction tasks is 1 and the maximum is 2147483647.
Note that while the max compaction tasks can theoretically be set to 2147483647, the practical limit is determined by the available cluster capacity and is capped at 10% of the cluster's total capacity.
URLโ
POST
/druid/coordinator/v1/config/compaction/taskslots
Query parametersโ
To limit the maximum number of compaction tasks, use the optional query parameters ratio
and max
:
ratio
(optional)- Type: Float
- Default: 0.1
- Limits the ratio of the total task slots to compaction task slots.
max
(optional)- Type: Int
- Default: 2147483647
- Limits the maximum number of task slots for compaction tasks.
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully updated compaction configuration
Invalid max
value
Sample requestโ
- cURL
- HTTP
curl --request POST "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction/taskslots?ratio=0.2&max=250000"
POST /druid/coordinator/v1/config/compaction/taskslots?ratio=0.2&max=250000 HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
A successful request returns an HTTP 200 OK
message code and an empty response body.
View automatic compaction configurationโ
Get all automatic compaction configurationsโ
Retrieves all automatic compaction configurations. Returns a compactionConfigs
object containing the active automatic compaction configurations of all datasources.
You can use this endpoint to retrieve compactionTaskSlotRatio
and maxCompactionTaskSlots
values for managing resource allocation of compaction tasks.
URLโ
GET
/druid/coordinator/v1/config/compaction
Responsesโ
- 200 SUCCESS
Successfully retrieved automatic compaction configurations
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction"
GET /druid/coordinator/v1/config/compaction HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"compactionConfigs": [
{
"dataSource": "wikipedia_hour",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
},
{
"dataSource": "wikipedia",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
}
],
"compactionTaskSlotRatio": 0.1,
"maxCompactionTaskSlots": 2147483647,
}
Get automatic compaction configurationโ
Retrieves the automatic compaction configuration for a datasource.
URLโ
GET
/druid/coordinator/v1/config/compaction/{dataSource}
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully retrieved configuration for datasource
Invalid datasource or datasource does not have automatic compaction enabled
Sample requestโ
The following example retrieves the automatic compaction configuration for datasource wikipedia_hour
.
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction/wikipedia_hour"
GET /druid/coordinator/v1/config/compaction/wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"dataSource": "wikipedia_hour",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
}
Get automatic compaction configuration historyโ
Retrieves the history of the automatic compaction configuration for a datasource. Returns an empty list if the datasource does not exist or there is no compaction history for the datasource.
The response contains a list of objects with the following keys:
globalConfig
: A JSON object containing automatic compaction configuration that applies to the entire cluster.compactionConfig
: A JSON object containing the automatic compaction configuration for the datasource.auditInfo
: A JSON object containing information about the change made, such asauthor
,comment
orip
.auditTime
: The date and time when the change was made.
URLโ
GET
/druid/coordinator/v1/config/compaction/{dataSource}/history
Query parametersโ
interval
(optional)- Type: ISO-8601
- Limits the results within a specified interval. Use
/
as the delimiter for the interval string.
count
(optional)- Type: Int
- Limits the number of results.
Responsesโ
- 200 SUCCESS
- 400 BAD REQUEST
Successfully retrieved configuration history
Invalid count
value
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction/wikipedia_hour/history"
GET /druid/coordinator/v1/config/compaction/wikipedia_hour/history HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
[
{
"globalConfig": {
"compactionTaskSlotRatio": 0.1,
"maxCompactionTaskSlots": 2147483647,
"compactionPolicy": {
"type": "newestSegmentFirst",
"priorityDatasource": "wikipedia"
},
"useSupervisors": true,
"engine": "native"
},
"compactionConfig": {
"dataSource": "wikipedia_hour",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "P1D",
"tuningConfig": null,
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
},
"auditInfo": {
"author": "",
"comment": "",
"ip": "127.0.0.1"
},
"auditTime": "2023-07-31T18:15:19.302Z"
},
{
"globalConfig": {
"compactionTaskSlotRatio": 0.1,
"maxCompactionTaskSlots": 2147483647,
"compactionPolicy": {
"type": "newestSegmentFirst"
},
"useSupervisors": false,
"engine": "native"
},
"compactionConfig": {
"dataSource": "wikipedia_hour",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
},
"auditInfo": {
"author": "",
"comment": "",
"ip": "127.0.0.1"
},
"auditTime": "2023-07-31T18:16:16.362Z"
}
]
View automatic compaction statusโ
Get segments awaiting compactionโ
Returns the total size of segments awaiting compaction for a given datasource. Returns a 404 response if a datasource does not have automatic compaction enabled.
URLโ
GET
/druid/coordinator/v1/compaction/progress?dataSource={dataSource}
Query parameterโ
dataSource
(required)- Type: String
- Name of the datasource for this status information.
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully retrieved segment size awaiting compaction
Unknown datasource name or datasource does not have automatic compaction enabled
Sample requestโ
The following example retrieves the remaining segments to be compacted for datasource wikipedia_hour
.
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/compaction/progress?dataSource=wikipedia_hour"
GET /druid/coordinator/v1/compaction/progress?dataSource=wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"remainingSegmentSize": 7615837
}
Get compaction status and statisticsโ
Retrieves an array of latestStatus
objects representing the status and statistics from the latest automatic compaction run for all datasources with automatic compaction enabled.
Compaction status responseโ
The latestStatus
object has the following properties:
dataSource
: Name of the datasource for this status information.scheduleStatus
: Automatic compaction scheduling status. Possible values areNOT_ENABLED
andRUNNING
. ReturnsRUNNING
if the datasource has an active automatic compaction configuration submitted. Otherwise, returnsNOT_ENABLED
.bytesAwaitingCompaction
: Total bytes of this datasource waiting to be compacted by the automatic compaction (only consider intervals/segments that are eligible for automatic compaction).bytesCompacted
: Total bytes of this datasource that are already compacted with the spec set in the automatic compaction configuration.bytesSkipped
: Total bytes of this datasource that are skipped (not eligible for automatic compaction) by the automatic compaction.segmentCountAwaitingCompaction
: Total number of segments of this datasource waiting to be compacted by the automatic compaction (only consider intervals/segments that are eligible for automatic compaction).segmentCountCompacted
: Total number of segments of this datasource that are already compacted with the spec set in the automatic compaction configuration.segmentCountSkipped
: Total number of segments of this datasource that are skipped (not eligible for automatic compaction) by the automatic compaction.intervalCountAwaitingCompaction
: Total number of intervals of this datasource waiting to be compacted by the automatic compaction (only consider intervals/segments that are eligible for automatic compaction).intervalCountCompacted
: Total number of intervals of this datasource that are already compacted with the spec set in the automatic compaction configuration.intervalCountSkipped
: Total number of intervals of this datasource that are skipped (not eligible for automatic compaction) by the automatic compaction.
URLโ
GET
/druid/coordinator/v1/compaction/status
Query parametersโ
dataSource
(optional)- Type: String
- Filter the result by name of a specific datasource.
Responsesโ
- 200 SUCCESS
Successfully retrieved latestStatus
object
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/compaction/status"
GET /druid/coordinator/v1/compaction/status HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"latestStatus": [
{
"dataSource": "wikipedia_api",
"scheduleStatus": "RUNNING",
"bytesAwaitingCompaction": 0,
"bytesCompacted": 0,
"bytesSkipped": 64133616,
"segmentCountAwaitingCompaction": 0,
"segmentCountCompacted": 0,
"segmentCountSkipped": 8,
"intervalCountAwaitingCompaction": 0,
"intervalCountCompacted": 0,
"intervalCountSkipped": 1
},
{
"dataSource": "wikipedia_hour",
"scheduleStatus": "RUNNING",
"bytesAwaitingCompaction": 0,
"bytesCompacted": 5998634,
"bytesSkipped": 0,
"segmentCountAwaitingCompaction": 0,
"segmentCountCompacted": 1,
"segmentCountSkipped": 0,
"intervalCountAwaitingCompaction": 0,
"intervalCountCompacted": 1,
"intervalCountSkipped": 0
}
]
}
[Experimental] Unified Compaction APIsโ
This section describes the new unified compaction APIs which can be used regardless of whether compaction supervisors are enabled (i.e. useSupervisors
is true
) or not in the compaction dynamic config.
- If compaction supervisors are disabled, the APIs read or write the compaction dynamic config, same as the Coordinator-based compaction APIs above.
- If compaction supervisors are enabled, the APIs read or write the corresponding compaction supervisors. In conjunction with the APIs described below, the supervisor APIs may also be used to read or write the compaction supervisors as they offer greater flexibility and also serve information related to supervisor and task statuses.
Update cluster-level compaction configโ
Updates cluster-level configuration for compaction tasks which applies to all datasources, unless explicitly overridden in the datasource compaction config. This includes the following fields:
Config | Description | Default value |
---|---|---|
compactionTaskSlotRatio | Ratio of number of slots taken up by compaction tasks to the number of total task slots across all workers. | 0.1 |
maxCompactionTaskSlots | Maximum number of task slots that can be taken up by compaction tasks and sub-tasks. Minimum number of task slots available for compaction is 1. When using MSQ engine or Native engine with range partitioning, a single compaction job occupies more than one task slot. In this case, the minimum is 2 so that at least one compaction job can always run in the cluster. | 2147483647 (i.e. total task slots) |
compactionPolicy | Policy to choose intervals for compaction. Currently, the only supported policy is Newest segment first. | Newest segment first |
useSupervisors | Whether compaction should be run on Overlord using supervisors instead of Coordinator duties. | false |
engine | Engine used for running compaction tasks, unless overridden in the datasource-level compaction config. Possible values are native and msq . msq engine can be used for compaction only if useSupervisors is true . | native |
Compaction policy newestSegmentFirst
โ
Field | Description | Default value |
---|---|---|
type | This must always be newestSegmentFirst | |
priorityDatasource | Datasource to prioritize for compaction. The intervals of this datasource are chosen for compaction before the intervals of any other datasource. Within this datasource, the intervals are prioritized based on the chosen compaction policy. | None |
URLโ
POST
/druid/indexer/v1/compaction/config/cluster
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully updated compaction configuration
Invalid max
value
Sample requestโ
- cURL
- HTTP
curl --request POST "http://ROUTER_IP:ROUTER_PORT/druid/coordinator/v1/config/compaction/cluster" \
--header 'Content-Type: application/json' \
--data '{
"compactionTaskSlotRatio": 0.5,
"maxCompactionTaskSlots": 1500,
"compactionPolicy": {
"type": "newestSegmentFirst",
"priorityDatasource": "wikipedia"
},
"useSupervisors": true,
"engine": "msq"
}'
POST /druid/indexer/v1/compaction/config/cluster HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Content-Type: application/json
{
"compactionTaskSlotRatio": 0.5,
"maxCompactionTaskSlots": 1500,
"compactionPolicy": {
"type": "newestSegmentFirst",
"priorityDatasource": "wikipedia"
},
"useSupervisors": true,
"engine": "msq"
}
Sample responseโ
A successful request returns an HTTP 200 OK
message code and an empty response body.
Get cluster-level compaction configโ
Retrieves cluster-level configuration for compaction tasks which applies to all datasources, unless explicitly overridden in the datasource compaction config. This includes all the fields listed in Update cluster-level compaction config.
URLโ
GET
/druid/indexer/v1/compaction/config/cluster
Responsesโ
- 200 SUCCESS
Successfully retrieved cluster compaction configuration
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/config/cluster"
GET /druid/indexer/v1/compaction/config/cluster HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"compactionTaskSlotRatio": 0.5,
"maxCompactionTaskSlots": 1500,
"compactionPolicy": {
"type": "newestSegmentFirst",
"priorityDatasource": "wikipedia"
},
"useSupervisors": true,
"engine": "msq"
}
Get automatic compaction configurations for all datasourcesโ
Retrieves all datasource compaction configurations.
URLโ
GET
/druid/indexer/v1/compaction/config/datasources
Responsesโ
- 200 SUCCESS
Successfully retrieved automatic compaction configurations
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/config/datasources"
GET /druid/indexer/v1/compaction/config/datasources HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"compactionConfigs": [
{
"dataSource": "wikipedia_hour",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
},
{
"dataSource": "wikipedia",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
}
]
}
Get automatic compaction configuration for a datasourceโ
Retrieves the automatic compaction configuration for a datasource.
URLโ
GET
/druid/indexer/v1/compaction/config/datasources/{dataSource}
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully retrieved configuration for datasource
Invalid datasource or datasource does not have automatic compaction enabled
Sample requestโ
The following example retrieves the automatic compaction configuration for datasource wikipedia_hour
.
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/config/datasources/wikipedia_hour"
GET /druid/indexer/v1/compaction/config/datasources/wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"dataSource": "wikipedia_hour",
"taskPriority": 25,
"inputSegmentSizeBytes": 100000000000000,
"maxRowsPerSegment": null,
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"maxRowsInMemory": null,
"appendableIndexSpec": null,
"maxBytesInMemory": null,
"maxTotalRows": null,
"splitHintSpec": null,
"partitionsSpec": {
"type": "dynamic",
"maxRowsPerSegment": 5000000,
"maxTotalRows": null
},
"indexSpec": null,
"indexSpecForIntermediatePersists": null,
"maxPendingPersists": null,
"pushTimeout": null,
"segmentWriteOutMediumFactory": null,
"maxNumConcurrentSubTasks": null,
"maxRetry": null,
"taskStatusCheckPeriodMs": null,
"chatHandlerTimeout": null,
"chatHandlerNumRetries": null,
"maxNumSegmentsToMerge": null,
"totalNumMergeTasks": null,
"maxColumnsToMerge": null,
"type": "index_parallel",
"forceGuaranteedRollup": false
},
"granularitySpec": {
"segmentGranularity": "DAY",
"queryGranularity": null,
"rollup": null
},
"dimensionsSpec": null,
"metricsSpec": null,
"transformSpec": null,
"ioConfig": null,
"taskContext": null
}
Create or update automatic compaction configuration for a datasourceโ
Creates or updates the automatic compaction configuration for a datasource. Pass the automatic compaction as a JSON object in the request body.
The automatic compaction configuration requires only the dataSource
property. Druid fills all other properties with default values if not specified. See Automatic compaction dynamic configuration for configuration details.
Note that this endpoint returns an HTTP 200 OK
message code even if the datasource name does not exist.
URLโ
POST
/druid/indexer/v1/compaction/config/datasources/wikipedia_hour
Responsesโ
- 200 SUCCESS
Successfully submitted auto compaction configuration
Sample requestโ
The following example creates an automatic compaction configuration for the datasource wikipedia_hour
, which was ingested with HOUR
segment granularity. This automatic compaction configuration performs compaction on wikipedia_hour
, resulting in compacted segments that represent a day interval of data.
In this example:
wikipedia_hour
is a datasource withHOUR
segment granularity.skipOffsetFromLatest
is set toPT0S
, meaning that no data is skipped.partitionsSpec
is set to the defaultdynamic
, allowing Druid to dynamically determine the optimal partitioning strategy.type
is set toindex_parallel
, meaning that parallel indexing is used.segmentGranularity
is set toDAY
, meaning that each compacted segment is a day of data.
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/config/datasources/wikipedia_hour"\
--header 'Content-Type: application/json' \
--data '{
"dataSource": "wikipedia_hour",
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"partitionsSpec": {
"type": "dynamic"
},
"type": "index_parallel"
},
"granularitySpec": {
"segmentGranularity": "DAY"
}
}'
POST /druid/indexer/v1/compaction/config/datasources/wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Content-Type: application/json
Content-Length: 281
{
"dataSource": "wikipedia_hour",
"skipOffsetFromLatest": "PT0S",
"tuningConfig": {
"partitionsSpec": {
"type": "dynamic"
},
"type": "index_parallel"
},
"granularitySpec": {
"segmentGranularity": "DAY"
}
}
Sample responseโ
A successful request returns an HTTP 200 OK
message code and an empty response body.
Delete automatic compaction configuration for a datasourceโ
Removes the automatic compaction configuration for a datasource. This updates the compaction status of the datasource to "Not enabled."
URLโ
DELETE
/druid/indexer/v1/compaction/config/datasources/{dataSource}
Responsesโ
- 200 SUCCESS
- 404 NOT FOUND
Successfully deleted automatic compaction configuration
Datasource does not have automatic compaction or invalid datasource name
Sample requestโ
- cURL
- HTTP
curl --request DELETE "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/config/datasources/wikipedia_hour"
DELETE /druid/indexer/v1/compaction/config/wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
A successful request returns an HTTP 200 OK
message code and an empty response body.
Get compaction status for all datasourcesโ
Retrieves an array of latestStatus
objects representing the status and statistics from the latest automatic compaction run for all the datasources to which the user has read access.
The response payload is in the same format as Compaction status response.
URLโ
GET
/druid/indexer/v1/compaction/status/datasources
Responsesโ
- 200 SUCCESS
Successfully retrieved latestStatus
object
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/status/datasources"
GET /druid/indexer/v1/compaction/status/datasources HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"latestStatus": [
{
"dataSource": "wikipedia_api",
"scheduleStatus": "RUNNING",
"bytesAwaitingCompaction": 0,
"bytesCompacted": 0,
"bytesSkipped": 64133616,
"segmentCountAwaitingCompaction": 0,
"segmentCountCompacted": 0,
"segmentCountSkipped": 8,
"intervalCountAwaitingCompaction": 0,
"intervalCountCompacted": 0,
"intervalCountSkipped": 1
},
{
"dataSource": "wikipedia_hour",
"scheduleStatus": "RUNNING",
"bytesAwaitingCompaction": 0,
"bytesCompacted": 5998634,
"bytesSkipped": 0,
"segmentCountAwaitingCompaction": 0,
"segmentCountCompacted": 1,
"segmentCountSkipped": 0,
"intervalCountAwaitingCompaction": 0,
"intervalCountCompacted": 1,
"intervalCountSkipped": 0
}
]
}
Get compaction status for a single datasourceโ
Retrieves the latest status from the latest automatic compaction run for a datasource. The response payload is in the same format as Compaction status response with zero or one entry.
URLโ
GET
/druid/indexer/v1/compaction/status/datasources/{dataSource}
Responsesโ
- 200 SUCCESS
Successfully retrieved latestStatus
object
Sample requestโ
- cURL
- HTTP
curl "http://ROUTER_IP:ROUTER_PORT/druid/indexer/v1/compaction/status/datasources/wikipedia_hour"
GET /druid/indexer/v1/compaction/status/datasources/wikipedia_hour HTTP/1.1
Host: http://ROUTER_IP:ROUTER_PORT
Sample responseโ
View the response
{
"latestStatus": [
{
"dataSource": "wikipedia_hour",
"scheduleStatus": "RUNNING",
"bytesAwaitingCompaction": 0,
"bytesCompacted": 5998634,
"bytesSkipped": 0,
"segmentCountAwaitingCompaction": 0,
"segmentCountCompacted": 1,
"segmentCountSkipped": 0,
"intervalCountAwaitingCompaction": 0,
"intervalCountCompacted": 1,
"intervalCountSkipped": 0
}
]
}