Milvus System Configurations Checklist
This topic introduces the general sections of the system configurations in Milvus.
Milvus maintains a considerable number of parameters that configure the system. Each configuration has a default value, which can be used directly. You can modify these parameters flexibly so that Milvus can better serve your application. See Configure Milvus for more information.
Sections
For the convenience of maintenance, Milvus classifies its configurations into 17 sections based on its components, dependencies, and general usage.
etcd
etcd is the metadata engine supporting Milvus’ metadata storage and access.
Under this section, you can configure etcd endpoints, relevant key prefixes, etc.
See etcd-related Configurations for detailed description for each parameter under this section.
minio
Milvus supports MinIO and Amazon S3 as the storage engine for data persistence of insert log files and index files. Whereas MinIO is the de facto standard for S3 compatibility, you can configure S3 parameters directly under MinIO section.
Under this section, you can configure MinIO or S3 address, relevant access keys, etc.
See MinIO-related Configurations for detailed description for each parameter under this section.
pulsar
Pulsar is the underlying engine supporting Milvus cluster’s reliable storage and publication/subscription of message streams.
Under this section, you can configure Pulsar address, the message size, etc.
See Pulsar-related Configurations for detailed description for each parameter under this section.
rocksmq
RocksMQ is the underlying engine supporting Milvus standalone’s reliable storage and publication/subscription of message streams. It is implemented on the basis of RocksDB.
Under this section, you can configure message size, retention time and size, etc.
See RocksMQ-related Configurations for detailed description for each parameter under this section.
kafka
Apache Kafka is an open-source distributed event streaming platform for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. It serves as an alternative to RocksMQ and Pulsar for reliable storage and publication/subscription of message streams.
See Kafka-related Configurations for detailed description for each parameter under this section.
rootCoord
Root coordinator (root coord) handles data definition language (DDL) and data control language (DCL) requests, manages TSO (timestamp Oracle), and publishes time tick messages.
Under this section, you can configure root coord address, index building threshold, etc.
See Root Coordinator-related Configurations for detailed description for each parameter under this section.
proxy
Proxy is the access layer of the system and endpoint to users. It validates client requests and reduces the returned results.
Under this section, you can configure proxy port, system limits, etc.
See Proxy-related Configurations for detailed description for each parameter under this section.
queryCoord
Query coordinator (query coord) manages topology and load-balancing of the query nodes, and handoff operation from growing segments to sealed segments.
Under this section, you can configure query coord address, auto handoff, auto load-balancing, etc.
See Query coordinator-related Configurations for detailed description for each parameter under this section.
queryNode
Query node performs hybrid search of vector and scalar data on both incremental and historical data.
Under this section, you can configure query node port, graceful time, etc.
See Query Node-related Configurations for detailed description for each parameter under this section.
indexCoord
Index coordinator (index coord) manages topology of the index nodes, and maintains index metadata.
Under this section, you can configure index coord address, etc.
See Index Coordinator-related Configurations for detailed description for each parameter under this section.
indexNode
Index node builds indexes for vectors.
Under this section, you can configure index node port, etc.
See Index Node-related Configurations for detailed description for each parameter under this section.
dataCoord
Data coordinator (data coord) manages the topology of data nodes, maintains metadata, and triggers flush, compact, and other background data operations.
Under this section, you can configure data coord address, segment settings, compaction, garbage collection, etc.
See Data Coordinator-related Configurations for detailed description for each parameter under this section.
dataNode
Data node retrieves incremental log data by subscribing to the log broker, processes mutation requests, and packs log data into log snapshots and stores them in the object storage.
Under this section, you can configure data node port, etc.
See Data Node-related Configurations for detailed description for each parameter under this section.
localStorage
Milvus stores the vector data in local storage during search or query to avoid repetitive access to MinIO or S3 service.
Under this section, you can enable local storage, and configure the path, etc.
See Local Storage-related Configurations for detailed description for each parameter under this section.
log
Using Milvus generates a collection of logs. By default, Milvus uses logs to record information at debug or even higher level for standard output (stdout) and standard error (stderr).
Under this section, you can configure the system log output.
See Log-related Configurations for detailed description for each parameter under this section.
msgChannel
Under this section, you can configure the message channel name prefixes and component subscription name prefixes.
See Message Channel-related Configurations for detailed description for each parameter under this section.
common
Under this section, you can configure the default names of partition and index, and the Time Travel (data retention) span of Milvus.
See Common Configurations for detailed description for each parameter under this section.
knowhere
Knowhere is the underlying engine supporting Milvus’ vector similarity search. Knowhere is the search engine of Milvus.
Under this section, you can configure the default SIMD instruction set type of the system.
See Knowhere-related Configurations for detailed description for each parameter under this section.
Frequently used parameters
Below list some frequently used parameters categorized in accordance with the purposes of modification.
Performance tuning
The following parameters control the system behaviors that influence the performance of index creation and vector similarity search.
queryNode.gracefulTime
rootCoord.minSegmentSizeToEnableIndex
dataCoord.segment.maxSize
dataCoord.segment.sealProportion
dataNode.flush.insertBufSize
queryCoord.autoHandoff
queryCoord.autoBalance
localStorage.enabled
Data and metadata retention
The following parameters control the retention of data and metadata.
common.retentionDuration
rocksmq.retentionTimeInMinutes
dataCoord.enableCompaction
dataCoord.enableGarbageCollection
dataCoord.gc.dropTolerance
Administration
The following parameters control the log output and object storage access.
Quotas and limits
quotaAndLimits.ddl.enabled
quotaAndLimits.ddl.collectionRate
quotaAndLimits.ddl.partitionRate
quotaAndLimits.indexRate.enabled
quotaAndLimits.indexRate.max
quotaAndLimits.flushRate.enabled
quotaAndLimits.flush.max
quotaAndLimits.compation.enabled
quotaAndLimits.compaction.max
quotaAndLimits.dml.enabled
quotaAndLimits.dml.insertRate.max
quotaAndLimits.dml.deleteRate.max
quotaAndLimits.dql.enabled
quotaAndLimits.dql.searchRate.max
quotaAndLimits.dql.queryRate.max
quotaAndLimits.limitWriting.ttProtection.enabled
quotaAndLimits.limitWriting.ttProtection.maxTimeTickDelay
quotaAndLimits.limitWriting.memProtection.enabled
quotaAndLimits.limitWriting.memProtection.dataNodeMemoryLowWaterLevel
quotaAndLimits.limitWriting.memProtection.queryNodeMemoryLowWaterLevel
quotaAndLimits.limitWriting.memProtection.dataNodeMemoryHighWaterLevel
quotaAndLimits.limitWriting.memProtection.queryNodeMemoryHighWaterLevel
quotaAndLimits.limitWriting.diskProtection.enabled
quotaAndLimits.limitWriting.diskProtection.diskQuota
quotaAndLimits.limitWriting.forceDeny
quotaAndLimits.limitReading.queueProtection.enabled
quotaAndLimits.limitReading.queueProtection.nqInQueueThreshold
quotaAndLimits.limitReading.queueProtection.queueLatencyThreshold
quotaAndLimits.limitReading.resultProtection.enabled
quotaAndLimits.limitReading.resultProtection.maxReadResultRate
quotaAndLimits.limitReading.forceDeny
What’s next
Learn how to configure Milvus before installation.
Learn more about the installation of Milvus: