Yes, AutoML can handle unstructured data like images and text. AutoML platforms are designed to automate parts of the machine learning pipeline, including preprocessing, model selection, and hyperparameter tuning, even for non-tabular data. For example, Google’s AutoML Vision and Azure’s Custom Vision allow developers to upload image datasets, automatically train models for tasks like object detection or classification, and deploy them without writing complex code. Similarly, tools like AutoML Natural Language process text for sentiment analysis or entity recognition by automating tokenization, embedding, and model architecture decisions. These platforms abstract the complexity of deep learning frameworks like TensorFlow or PyTorch, making them accessible to developers without specialized ML expertise.
AutoML handles unstructured data by leveraging techniques like convolutional neural networks (CNNs) for images and transformer-based models for text. For images, AutoML tools often apply preprocessing steps like resizing, normalization, and data augmentation automatically. They then search through neural network architectures optimized for vision tasks, such as ResNet or EfficientNet variants, while tuning hyperparameters like learning rates. For text, AutoML systems might use embeddings (e.g., Word2Vec or BERT) to convert words into numerical representations and experiment with architectures like recurrent neural networks (RNNs) or transformers. These steps are wrapped in automated pipelines, allowing developers to focus on labeling data and defining objectives rather than manual model design.
However, AutoML has limitations with unstructured data. While it simplifies training, the quality of results still depends heavily on the dataset’s size and cleanliness. For instance, training an image classifier with poorly labeled or imbalanced data will lead to subpar performance, even with AutoML. Additionally, complex tasks like video analysis or multilingual text processing may require custom architectures beyond AutoML’s default options. Developers should also be mindful of computational costs: training large vision or language models can require significant GPU resources. AutoML is a practical starting point, but teams may need to combine it with manual fine-tuning or domain-specific optimizations for advanced use cases.
Zilliz Cloud is a managed vector database built on Milvus perfect for building GenAI applications.
Try FreeLike the article? Spread the word