Join our Discord Server
Ajeet Raina Ajeet Singh Raina is a former Docker Captain, Community Leader and Distinguished Arm Ambassador. He is a founder of Collabnix blogging site and has authored more than 700+ blogs on Docker, Kubernetes and Cloud-Native Technology. He runs a community Slack of 9800+ members and discord server close to 2600+ members. You can follow him on Twitter(@ajeetsraina).

MindsDB Docker Extension: Build ML powered applications at a much faster pace

3 min read

Imagine a world where anyone, regardless of technical expertise, can easily harness the power of artificial intelligence (AI) to gain insights from their data. Where building and deploying machine learning models is as intuitive as querying a database. This is the future promised by MindsDB, a revolutionary open-source platform that’s democratizing AI for everyone.

MindsDB = SQL + AI

MindsDB combines both AI and SQL functions in one; users can create, train, optimize, and deploy ML models without the need for external tools. Data analysts can create and visualize forecasts without having to navigate the complexities of ML pipelines.MindsDB is open-source and works with well-known databases like MySQL, Postgres, Redit, Snowflakes, etc.

How MindsDB works?

MindsDB boasts a unique architecture designed to bridge the gap between developers and AI through its AI SQL Server approach. Here’s a breakdown of its key components:

1. AI Building Blocks:

These are MindsDB’s core functionalities represented as virtual tables accessible via SQL queries. They include:

  • Models: Trained machine learning models, allowing predictions and inferences.
  • Agents: Pre-built AI functions for tasks like sentiment analysis or chatbots.
  • Knowledge Bases: Structured knowledge graphs for entity recognition and relationship extraction.

2. SQL Interface:

MindsDB leverages your existing SQL knowledge, allowing you to:

  • Fine-tune Models: Improve existing models using your data within familiar SQL syntax.
  • Create AI Tables: Define new AI components like agents or knowledge bases using CREATE statements.
  • Query AI Results: Integrate AI insights into your applications and pipelines using JOINs, filtering, and other familiar operations.

3. Datasources:

MindsDB can access data from various sources to fuel your AI models:

  • Databases: Connect directly to popular relational and NoSQL databases.
  • Applications: Integrate with APIs and web services for real-time data streams.
  • Vector Stores: Utilize dedicated data structures for efficient retrieval of high-dimensional data.

4. Jobs and Triggers:

Automate AI-powered tasks and workflows:

  • Jobs: Schedule queries or model training at specific times or intervals.
  • Triggers: React to data changes automatically by invoking AI operations.

5. Underlying Engine:

MindsDB utilizes various open-source AI frameworks under the hood to handle model training and inference, like PyTorch, TensorFlow, and ONNX. These frameworks remain transparent to the user, who interacts only through the SQL interface.

Why do you need MindsDB Docker Extension?

For Data Scientists, setting up MindsDB in a Docker container might be a tricky situation. With MindsDB Docker Extension, data scientists don’t need to understand Docker concepts but to install it with a single click.

The MindsDB Docker Extension empowers developers to:

  • Integrate AI seamlessly into containerized workflows. Ensure consistent environments from development to production.
  • Share and deploy AI models with ease.
  • Optimize resource usage and security.
  • Streamline deployment to production.

Getting Started

Pre-requisite

Install MindsDB Docker Extension

Open Docker Desktop > Dashboard > Extensions and search for “MindsDB” > Click on “Install”

Image3

You will need to wait for 1 minutes to get it fully installed on your laptop.

Image3

Don’t panic if you see this blank screen for next 20-30 seconds. Have patience and wait till the complete app is up and running.

Image4

You will soon the MindsDB Dashboard as shown below:

Image5

Predicting Text Sentiment with OpenAI GPT

You can find this sample tutorial in the MindsDB “Learning Hub” section. In this sample example, you will create a predictive model to infer emotions behind a text, a task also known as sentiment analysis. The model, or an AI table, leverages OpenAI’s large language models to complete this task.

Connect a Database

Start by connecting a demo database using the CREATE DATABASE statement:

CREATE DATABASE mysql_reviews_db
WITH ENGINE = "mysql",
PARAMETERS = {
    "user": "user",
    "password": "MindsDBUser123!",
    "host": "db-demo-data.cwoyhfn6bzs0.us-east-1.rds.amazonaws.com",
    "port": "3306",
    "database": "public"
    };
Image6

Let’s look at the data stored in the amazon_reviewstable that contains the input columns for the model:

SELECT *
FROM mysql_reviews_db.amazon_reviews;
Image7

Configure an OpenAI Engine in MindsDB

Please note that using OpenAI models require OpenAI API key. Therefore, before creating a model, you need to configure an engine by providing your OpenAI API key as below.

CREATE ML_ENGINE openai_engine
FROM openai
USING
    api_key = 'your-openai-api-key';

Create an OpenAI GPT Model

Our integration automatically manages requests to pre-trained large language models from OpenAI. For example, we can create a model for text sentiment analysis like this:

CREATE MODEL sentiment_classifier_gpt
PREDICT sentiment
USING
engine = 'openai_engine',
prompt_template = 'describe the sentiment of the reviews
strictly as "positive", "neutral", or "negative".
"I love the product":positive
"It is a scam":negative
"{{review}}.":';
Image8

In practice, the CREATE MODEL statement triggers MindsDB to generate an AI table called sentiment_classifier_gpt that uses the OpenAI integration to predict a column named sentiment.

The USINGclause specifies the parameters that this handler requires. You can check our documentation for detailed explanations of all parameters that can be used when creating various models with the OpenAI engine.

Once the above query has started execution, we can check the status of the creation process with the following query:

SELECT * FROM models
WHERE name = 'sentiment_classifier_gpt';

It may take a while to register as completedepending on the internet connection.

Image9

Make a Single Prediction

Once the creation is complete, the behavior is the same as with any other AI table – you can query it either by specifying synthetic data in the actual query:

SELECT review, sentiment
FROM sentiment_classifier_gpt
WHERE review = 'It is ok.';
Image10

Make Batch Predictions

Or by joining with another table for batch predictions:

SELECT input.review, output.sentiment
FROM mysql_reviews_db.amazon_reviews AS input
JOIN sentiment_classifier_gpt AS output
LIMIT 5;

The amazon_reviewstable is used to make batch predictions. Upon joining the sentiment_classifier_gptmodel with the amazon_reviewstable, the model uses all values from the reviewcolumn.

Image12

MindsDB enables in-database ML with your favorite frameworks. You need just a handler for MindsDB to integrate the ML framework of your interest.

Conclusion

In essence, MindsDB aims to make AI accessible and approachable for developers by leveraging familiar SQL tools and integrating seamlessly with existing workflows. This enables developers to build AI applications without extensive machine learning expertise, paving the way for wider adoption and democratization of AI.

References

Keep Reading

  • Testcontainers and Playwright

    Testcontainers and Playwright

    Discover how Testcontainers-Playwright simplifies browser automation and testing without local Playwright installations. Learn about its features, limitations, compatibility, and usage with code examples.

    Read More

  • Getting Started with the Low-Cost RPLIDAR Using NVIDIA Jetson Nano

    Getting Started with the Low-Cost RPLIDAR Using NVIDIA Jetson Nano

    Conclusion Getting started with low-code RPlidar with Jetson Nano is an exciting journey that can open up a wide range of possibilities for building robotics projects. In this blog post, we covered the basic steps to get started with low-code RPlidar with Jetson Nano, including setting up ROS, installing the RPlidar driver and viewing RPlidar…

    Read More

  • Docker and Wasm Containers – Better Together

    Docker and Wasm Containers – Better Together

    Learn how Docker Desktop and CLI both manages Linux containers and Wasm containers side by side.

    Read More

  • Introducing AutoGen v0.4: Revolutionizing Agentic AI with Enhanced Scalability, Flexibility, and Reliability

    Introducing AutoGen v0.4: Revolutionizing Agentic AI with Enhanced Scalability, Flexibility, and Reliability

    Over the past year, Microsoft developments with AutoGen have underscored the remarkable capabilities of agentic AI and multi-agent systems. Microsoft is thrilled to unveil AutoGen v0.4 , a major update shaped by invaluable feedback from our vibrant community of users and developers. This release marks a comprehensive overhaul of the AutoGen library, designed to elevate…

    Read More

  • Transforming Patient Care with IoT in Digital Health

    Transforming Patient Care with IoT in Digital Health

    Explore the impact of IoT on digital health, from improving patient monitoring to transforming healthcare services in general.

    Read More

  • How AI Enhances Cloud-Native App Deployment

    How AI Enhances Cloud-Native App Deployment

    Artificial Intelligence transforms how cloud-native applications are deployed. It simplifies tasks like managing resources and predicting workloads, offering real-time solutions to common challenges. Teams no longer struggle with manual adjustments or complex configurations. AI-powered tools streamline processes, making deployment faster and more efficient. This shift doesn’t just save time – it drives innovation. Developers focus…

    Read More

Have Queries? Join https://launchpass.com/collabnix

Ajeet Raina Ajeet Singh Raina is a former Docker Captain, Community Leader and Distinguished Arm Ambassador. He is a founder of Collabnix blogging site and has authored more than 700+ blogs on Docker, Kubernetes and Cloud-Native Technology. He runs a community Slack of 9800+ members and discord server close to 2600+ members. You can follow him on Twitter(@ajeetsraina).
Join our Discord Server
Index