🚀 Try Zilliz Cloud, the fully managed Milvus, for free—experience 10x faster performance! Try Now>>

Try Managed Milvus

Home

About Milvus
Get Started
User Guide
Administration Guide
Integrations
Benchmarks
Tools
Reference
Example Applications
FAQs
API Reference

Home
Docs
User Guide
Manage Data
Insert Entities

Insert Entities

This topic describes how to insert data in Milvus via client.

You can also migrate data to Milvus with MilvusDM, an open-source tool designed specifically for importing and exporting data with Milvus.

Milvus 2.1 supports the VARCHAR data type on scalar fields. When building indexes for VARCHAR-type scalar fields, the default index type is dictionary tree.

The following example inserts 2,000 rows of randomly generated data as the example data (Milvus CLI example uses a pre-built, remote CSV file containing similar data). Real applications will likely use much higher dimensional vectors than the example. You can prepare your own data to replace the example.

When interacting with Milvus using Python code, you have the flexibility to choose between PyMilvus and MilvusClient (new). For more information, refer to Python SDK.

Prepare data

First, prepare the data to insert. Data type of the data to insert must match the schema of the collection, otherwise Milvus will raise an exception.

Milvus supports default values for scalar fields, excluding a primary key field. This indicates that some fields can be left empty during data inserts or upserts. For more information, refer to Create a Collection.

Once you enable dynamic schema, you can append dynamic fields in the data. For details, refer to Dynamic Schema.

Python Java GO Node.js Curl

import random
data = [
  [i for i in range(2000)],
  [str(i) for i in range(2000)],
  [i for i in range(10000, 12000)],
  [[random.random() for _ in range(2)] for _ in range(2000)],
  # use `default_value` for a field
  [], 
  # or
  None,
  # or just omit the field
]

## Once your collection is enabled with dynamic schema,
## you can add non-existing field values.
data.append([str("dy"*i) for i in range(2000)])

const data = Array.from({ length: 2000 }, (v,k) => ({
  "book_id": k,
  "word_count": k+10000,
  "book_intro": Array.from({ length: 2 }, () => Math.random()),
}));

bookIDs := make([]int64, 0, 2000)
wordCounts := make([]int64, 0, 2000)
bookIntros := make([][]float32, 0, 2000)
for i := 0; i < 2000; i++ {
    bookIDs = append(bookIDs, int64(i))
    wordCounts = append(wordCounts, int64(i+10000))
    v := make([]float32, 0, 2)
    for j := 0; j < 2; j++ {
        v = append(v, rand.Float32())
    }
    bookIntros = append(bookIntros, v)
}
idColumn := entity.NewColumnInt64("book_id", bookIDs)
wordColumn := entity.NewColumnInt64("word_count", wordCounts)
introColumn := entity.NewColumnFloatVector("book_intro", 2, bookIntros)

Random ran = new Random();
List<Long> book_id_array = new ArrayList<>();
List<Long> word_count_array = new ArrayList<>();
List<List<Float>> book_intro_array = new ArrayList<>();
for (long i = 0L; i < 2000; ++i) {
    book_id_array.add(i);
    word_count_array.add(i + 10000);
    List<Float> vector = new ArrayList<>();
    for (int k = 0; k < 2; ++k) {
        vector.add(ran.nextFloat());
    }
    book_intro_array.add(vector);
}

# Prepare your data in a CSV file. Milvus CLI only supports importing data from local or remote files.

# See the following step.

Insert data to Milvus

Insert the data to the collection.

By specifying partition_name, you can optionally decide to which partition to insert the data.

Python Java GO Node.js CLI Curl

from pymilvus import Collection
collection = Collection("book")      # Get an existing collection.
mr = collection.insert(data)

const mr = await milvusClient.insert({
  collection_name: "book",
  fields_data: data,
});

_, err = milvusClient.Insert(
    context.Background(), // ctx
    "book",               // CollectionName
    "",                   // partitionName
    idColumn,             // columnarData
    wordColumn,           // columnarData
    introColumn,          // columnarData
)
if err != nil {
    log.Fatal("failed to insert data:", err.Error())
}

List<InsertParam.Field> fields = new ArrayList<>();
fields.add(new InsertParam.Field("book_id", book_id_array));
fields.add(new InsertParam.Field("word_count", word_count_array));
fields.add(new InsertParam.Field("book_intro", book_intro_array));

InsertParam insertParam = InsertParam.newBuilder()
  .withCollectionName("book")
  .withPartitionName("novel")
  .withFields(fields)
  .build();
milvusClient.insert(insertParam);

import -c book 'https://raw.githubusercontent.com/milvus-io/milvus_cli/main/examples/user_guide/search.csv'

# insert an entity to a collection
curl -X 'POST' \
  '${MILVUS_HOST}:${MILVUS_PORT}/v1/vector/insert'  \
  -H 'Authorization: Bearer ${TOKEN}' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
         "collectionName": "collection1",
         "data": {
             "vector": [0.1, 0.2, 0.3],
             "name": "tom",
             "email": "tom@zilliz.com",
             "date": "2023-04-13"
          }
     }'

# insert multiple entities
curl --request POST \
     --url '${MILVUS_HOST}:${MILVUS_PORT}/v1/vector/insert' \
     --header 'Authorization: Bearer <TOKEN>' \
     --header 'accept: application/json' \
     --header 'content-type: application/json' \
     -d '{
         "collectionName": "collection1",
         "data": [
             {
                "vector": [0.1, 0.2, 0.3],
                "name": "bob",
                "email": "bob@zilliz.com",
                "date": "2023-04-13"
             },{
                "vector": [0.1, 0.2, 0.3],
                "name": "ally",
                "email": "ally@zilliz.com",
                "date": "2023-04-11"
             }
         ]
     }'

Output:

{
    "code": 200,
    "data": {
        "insertCount": "integer"
    }
}

Parameter	Description
`data`	Data to insert into Milvus.
`partition_name` (optional)	Name of the partition to insert data into.

Parameter	Description
`collection_name`	Name of the collection to insert data into.
`partition_name` (optional)	Name of the partition to insert data into.
`fields_data`	Data to insert into Milvus.

Parameter	Description
`ctx`	Context to control API invocation process.
`CollectionName`	Name of the collection to insert data into.
`partitionName`	Name of the partition to insert data into. Data will be inserted in the default partition if left blank.
`columnarData`	Data to insert into each field.

Parameter	Description
`fieldName`	Name of the field to insert data into.
`DataType`	Data type of the field to insert data into.
`data`	Data to insert into each field.
`CollectionName`	Name of the collection to insert data into.
`PartitionName` (optional)	Name of the partition to insert data into.

Option	Description
-c	Name of the collection to insert data into.
-p (Optional)	Name of the partition to insert data into.

Parameter	Description	Option
`collectionName`	The name of the collection to which entities will be inserted.	N/A
`data`	Data to insert into Milvus.	N/A
`field_name`	An entity object. Note that the keys in the entity should match the collection schema.	N/A

After inserting entities into a collection that has previously been indexed, you do not need to re-index the collection, as Milvus will automatically create an index for the newly inserted data. For more information, refer to Can indexes be created after inserting vectors?

Flush the Data in Milvus

When data is inserted into Milvus, it is stored in segments. Segments have to reach a certain size before they can be sealed and indexed. Unsealed segments are searched using brute force. If you need to search the data immediately after insertion, you can call the flush() method once the data is inserted. This method seals any remaining segments and sends them for indexing. It is important to only call this method at the end of an insert session. Calling it too frequently will result in fragmented data that will require cleaning later on.

Milvus automatically triggers the flush() operation. In most cases, manual calls to this operation are not necessary.

Limits

Feature	Maximum limit
Dimensions of a vector	32,768

What’s next

Learn more basic operations of Milvus:

Insert Entities
Prepare data
Insert data to Milvus
Flush the Data in Milvus
Limits
What's next

Try Managed Milvus for Free

Zilliz Cloud is hassle-free, powered by Milvus and 10x faster.

Get Started

Feedback

Was this page helpful?