Ingest MQTT Data into Microsoft SQL Server
TIP
The Microsoft SQL Server data integration is an EMQX Enterprise edition feature.
SQL Server is one of the leading relational commercial database solutions, widely used in enterprises and organizations of various sizes and types. EMQX supports integration with SQL Server, enabling you to save MQTT messages and client events to SQL Server. This facilitates the construction of complex data pipelines and analytical processes for data management and analysis, or for managing device connections and integrating with other enterprise systems such as ERP, CRM, and BI.
This page provides a detailed overview of the data integration between EMQX and Microsoft SQL Server with practical instructions on creating and validating the data integration.
TIP
The data integration with Microsoft SQL Server is supported in EMQX Enterprise 5.0.3 and above.
How It Works
Microsoft SQL Server data integration is an out-of-the-box feature in EMQX, combining EMQX's device connectivity and message transmission capabilities with the powerful data storage capabilities of Microsoft SQL Server. Through the built-in rule engine component and Sink, you can store MQTT messages and client events in Microsoft SQL Server. Additionally, events can trigger updates or deletions of data within Microsoft SQL Server, enabling the recording of information such as device online status and connection history. This integration simplifies the process of ingesting data from EMQX to SQL Server for storage and management, eliminating the need for complex coding.
The diagram below illustrates a typical architecture of data integration between EMQX and SQL Server:
Ingesting MQTT data into Microsoft SQL Server works as follows:
- Message publication and reception: Industrial IoT devices establish successful connections to EMQX through the MQTT protocol and publish real-time MQTT data from machines, sensors, and product lines based on their operational states, readings, or triggered events to EMQX. When EMQX receives these messages, it initiates the matching process within its rules engine.
- Message data processing: When a message arrives, it passes through the rule engine and is then processed by the rule defined in EMQX. The rules, based on predefined criteria, determine which messages need to be routed to Microsoft SQL Server. If any rules specify payload transformations, those transformations are applied, such as converting data formats, filtering out specific information, or enriching the payload with additional context.
- Data ingestion into SQL Server: The rule triggers the writing of messages to Microsoft SQL Server. With the help of SQL templates, users can extract data from the rule processing results to construct SQL and send it to SQL Server for execution, so that specific fields of the message can be written or updated into the corresponding tables and columns of the database.
- Data Storage and Utilization: With the data now stored in Microsoft SQL Server, businesses can harness its querying power for various use cases.
Features and Benefits
The data integration with Microsoft SQL Server offers a range of features and benefits tailored to ensure efficient data transmission, storage, and utilization:
- Real-time Data Streaming: EMQX is built for handling real-time data streams, ensuring efficient and reliable data transmission from source systems to Microsoft SQL Server. It enables organizations to capture and analyze data in real-time, making it ideal for use cases requiring immediate insights and actions.
- High Performance and Scalability: Both EMQX and Microsoft SQL Server feature expandability and reliability, suitable for handling large-scale IoT data. They can undergo uninterrupted horizontal and vertical expansion as demands grow, ensuring the continuity and reliability of IoT applications.
- Flexibility in Data Transformation: EMQX provides a powerful SQL-based Rule Engine, allowing organizations to pre-process data before storing it in Microsoft SQL Server. It supports various data transformation mechanisms, such as filtering, routing, aggregation, and enrichment, enabling organizations to shape the data according to their needs.
- Advanced Analytics: Microsoft SQL Server offers powerful analytical capabilities, such as building multi-dimensional data models through Analysis Services to support complex data analysis and data mining. It also enables the creation and publication of reports through Reporting Services, presenting insights and analysis results of IoT data to stakeholders.
Before You Start
This section describes the preparations you need to complete before you start to create the Microsoft SQL Server data integration, including how to install and connect to the Microsoft SQL Server, create database and data tables, and install and configure the ODBC driver.
Prerequisites
- Knowledge about EMQX data integration rules
- Knowledge about data integration
Install and Connect to Microsoft SQL Server
This section describes how to start Microsoft SQL Server 2019 on Linux/MacOS using Docker images and use sqlcmd
to connect to Microsoft SQL Server. For other installation methods of Microsoft SQL Server, please refer to Microsoft SQL Server Installation Guide.
Install Microsoft SQL Server via Docker, and then start the docker image with the command below. Use
mqtt_public1
as the password. For the password policy of Microsoft SQL Server, see Password Complexity.Note: By starting a Docker container with the environment variable
ACCEPT_EULA=Y
you agree to the terms of Microsoft EULA, see also End-User Licensing Agreement.bash# To start the Microsoft SQL Server docker image and set the password as `mqtt_public1` $ docker run --name sqlserver -p 1433:1433 -e ACCEPT_EULA=Y -e MSSQL_SA_PASSWORD=mqtt_public1 -d mcr.microsoft.com/mssql/server:2022-CU15-ubuntu-22.04
Access the container.
bashdocker exec -it sqlserver bash
Enter the preset password to connect to the server in the container. The characters are not echoed when entering the password. Click
Enter
directly after entering the password.bash$ /opt/mssql-tools18/bin/sqlcmd -S localhost -U sa -P mqtt_public1 -N -C 1>
TIP
The
mssql-tools18
package have been installed in the Microsoft SQL Server container provided by Microsoft, but the executable file is not in$PATH
. Therefore, you need to specify the executable file path forsqlcmd
before proceeding. As for the Docker deployment in this example, the file path should be/opt
.For more information on how to use
mssql-tools18
, see sqlcmd-utility.
So far, the Microsoft SQL Server 2022 instance has been deployed and can be connected.
Create Database and Data Tables
Use the connection created from the previous section and the following SQL statements to create data tables.
Create the following data table for storing the MQTT message, including the message ID, topic, QoS, payload, and publish time of each message.
sqlCREATE TABLE dbo.t_mqtt_msg (id int PRIMARY KEY IDENTITY(1000000001,1) NOT NULL, msgid VARCHAR(64) NULL, topic VARCHAR(100) NULL, qos tinyint NOT NULL DEFAULT 0, payload VARCHAR(100) NULL, arrived DATETIME NOT NULL DEFAULT CURRENT_TIMESTAMP); GO
Create the following data table for recording the online/offline status of clients.
sqlCREATE TABLE dbo.t_mqtt_events (id int PRIMARY KEY IDENTITY(1000000001,1) NOT NULL, clientid VARCHAR(255) NULL, event_type VARCHAR(255) NULL, event_time DATETIME NOT NULL DEFAULT CURRENT_TIMESTAMP); GO
Install and Configure ODBC Driver
You need to configure the ODBC driver to be able to access the Microsoft SQL Server database. You can use either FreeTDS or the msodbcsql18 driver provided by Microsoft as the ODBC driver.
EMQX uses the DSN Name specified in the odbcinst.ini
configuration to determine the path to the driver dynamic library. In the examples below, the DSN Name is ms-sql
. For more information, refer to Connection Properties.
Note
You can choose your own DSN name according to your preference, but it is recommended to use only English letters. Additionally, the DSN Name is case-sensitive.
Install and Configure msodbcsql18 Driver as ODBC Driver
If you need to use msodbcsql18 driver as the ODBC driver, refer to the Microsoft instructions:
- Install the Microsoft ODBC driver for SQL Server (Linux)
- Install the Microsoft ODBC driver for SQL Server (macOS)
Restricted by Microsoft EULA terms, the Docker image provided by EMQX does not include the msodbcsql18 driver. To use it in Docker or Kubernetes, you need to create a new image installed with ODBC driver based on the image provided by EMQX Enterprise to access the Microsoft SQL Server database. Using the new image means that you agree to the Microsoft SQL Server EULA.
Follow the instructions below to build a new image:
Use the following Dockerfile to build a new image.
The base image version in this example is
emqx/emqx-enterprise:5.8.1
. You can build the image based on the EMQX Enterprise version you need, or use the latest version imageemqx/emqx-enterprise:latest
.
FROM emqx/emqx-enterprise:5.8.1
USER root
RUN apt-get -qq update && apt-get install -yqq curl gpg && \
. /etc/os-release && \
curl -fsSL https://packages.microsoft.com/keys/microsoft.asc | gpg --dearmor -o /usr/share/keyrings/microsoft-prod.gpg && \
curl -fsSL "https://packages.microsoft.com/config/${ID}/${VERSION_ID}/prod.list" > /etc/apt/sources.list.d/mssql-release.list && \
apt-get -qq update && \
ACCEPT_EULA=Y apt-get install -yqq msodbcsql18 unixodbc-dev && \
sed -i 's/ODBC Driver 18 for SQL Server/ms-sql/g' /etc/odbcinst.ini && \
apt-get clean && \
rm -rf /var/lib/apt/lists/*
USER emqx
Build a new image using the command
docker build -t emqx/emqx-enterprise:5.8.1-msodbc
.After building, you can use
docker image ls
to obtain a list of local images. You can also upload or save the image for later use.
Note
Check that the DSN Name in odbcinst.ini
should be ms-sql
if you install the msodbcsql18 driver using this example. You can change the DSN Name according to your needs.
Install and Configure FreeTDS as ODBC driver
This section introduces how to install and configure FreeTDS as an ODBC driver on some of the mainstream distributions.
Install and configure FreeTDS ODBC driver on MacOS:
$ brew install unixodbc freetds
$ vim /usr/local/etc/odbcinst.ini
# add the following lines
[ms-sql]
Description = ODBC for FreeTDS
Driver = /usr/local/lib/libtdsodbc.so
Setup = /usr/local/lib/libtdsodbc.so
FileUsage = 1
Install and configure FreeTDS ODBC driver on Centos:
$ yum install unixODBC unixODBC-devel freetds freetds-devel perl-DBD-ODBC perl-local-lib
$ vim /etc/odbcinst.ini
# add the following lines
[ms-sql]
Description = ODBC for FreeTDS
Driver = /usr/lib64/libtdsodbc.so
Setup = /usr/lib64/libtdsS.so.2
Driver64 = /usr/lib64/libtdsodbc.so
Setup64 = /usr/lib64/libtdsS.so.2
FileUsage = 1
Install and configure FreeTDS ODBC driver on Ubuntu (Take Ubuntu20.04 as an example, for other versions, please refer to the official ODBC documentation):
$ apt-get install unixodbc unixodbc-dev tdsodbc freetds-bin freetds-common freetds-dev libdbd-odbc-perl liblocal-lib-perl
$ vim /etc/odbcinst.ini
# add the following lines
[ms-sql]
Description = ODBC for FreeTDS
Driver = /usr/lib/x86_64-linux-gnu/odbc/libtdsodbc.so
Setup = /usr/lib/x86_64-linux-gnu/odbc/libtdsS.so
FileUsage = 1
Create a Connector
This section demonstrates how to create a Connector to connect the Sink to the Microsoft SQL server.
The following steps assume that you run both EMQX and Microsoft SQL Server on the local machine. If you have Microsoft SQL Server and EMQX running remotely, adjust the settings accordingly.
- Enter the EMQX Dashboard and click Integration -> Connectors.
- Click Create in the top right corner of the page.
- On the Create Connector page, select Microsoft SQL Server and then click Next.
- In the Configuration step, configure the following information:
- Connector name: Enter a name for the connector, which should be a combination of upper and lower-case letters and numbers, for example:
my_sqlserver
. - Server Host: Enter
127.0.0.1:1433
, or the URL if the Microsoft SQL Server is running remotely. - Database Name: Enter
master
. - Username: Enter
sa
. - Password: Enter the preset password
mqtt_public1
, or use the actual password. - SQL Server Driver Name: Enter
ms-sql
, as the DSN Name configured inodbcinst.ini
- Connector name: Enter a name for the connector, which should be a combination of upper and lower-case letters and numbers, for example:
- Advanced settings (optional): For details, see Features of Sink.
- Before clicking Create, you can click Test Connectivity to test if the connector can connect to the Microsoft SQL Server.
- Click the Create button at the bottom to complete the creation of the connector. In the pop-up dialog, you can click Back to Connector List or click Create Rule to continue creating rules with Sinks to specify the data to be forwarded to the Microsoft SQL Server and record client events. For detailed steps, see Create a Rule with Microsoft SQL Server Sink for Message Storage and Create a Rule with Microsoft SQL Server Sink for Events Recording.
Create a Rule with Microsoft SQL Server Sink for Message Storage
This section demonstrates how to create a rule in the Dashboard for processing messages from the source MQTT topic t/#
, and saving the processed data to the Microsoft SQL Server table dbo.t_mqtt_msg
via the configured Sink.
Go to EMQX Dashboard, click Integration -> Rules.
Click Create on the top right corner of the page.
Enter
my_rule
as the rule ID. To create a rule for message storage, enter the following statement in the SQL Editor, which means the MQTT messages under topict/#
will be saved to Microsoft SQL Server.Note: If you want to specify your own SQL syntax, make sure that you have included all fields required by the Sink in the
SELECT
part.sqlSELECT * FROM "t/#"
TIP
If you are a beginner user, click SQL Examples and Enable Test to learn and test the SQL rule.
Click the + Add Action button to define an action that will be triggered by the rule. With this action, EMQX sends the data processed by the rule to Microsoft SQL Server.
Select
Microsoft SQL Server
from the Type of Action dropdown list. Keep the Action dropdown with the defaultCreate Action
value. You can also select a Microsoft SQL Server Sink if you have created one. This demonstration will create a new Sink.Enter a name for the Sink. The name should be a combination of upper/lower case letters and numbers.
From the Connector dropdown box, select the
my_sqlserver
created before. You can also create a new Connector by clicking the button next to the dropdown box. For the configuration parameters, see Create a Connector.Configure the SQL Template for message storage, using the following SQL statement:
Note: This is a preprocessed SQL, so the fields should not be enclosed in quotation marks, and do not write a semicolon at the end of the statements.
sqlinsert into dbo.t_mqtt_msg(msgid, topic, qos, payload) values ( ${id}, ${topic}, ${qos}, ${payload} )
If a placeholder variable is undefined in the SQL template, you can toggle the Undefined Vars as Null switch above the SQL template to define the rule engine behavior:
Disabled (default): The rule engine can insert the string
undefined
into the database.Enabled: Allow the rule engine to insert
NULL
into the database when a variable is undefined.TIP
If possible, this option should always be enabled; disabling the option is only used to ensure backward compatibility.
Advanced settings (optional): For details, see Features of Sink.
Before clicking Create, you can click Test Connectivity to test that the Sink can be connected to the Microsoft SQL Server.
Click the Create button to complete the Sink configuration. A new Sink will be added to the Action Outputs.
Back on the Create Rule page, verify the configured information. Click the Create button to generate the rule.
You have now successfully created the rule for the Microsoft SQL Server Sink. You can see the newly created rule on the Integration -> Rules page. Click the Actions(Sink) tab and you can see the new Microsoft SQL Server Sink.
You can also click Integration -> Flow Designer to view the topology and you can see that the messages under topic t/#
are sent and saved to Microsoft SQL Server after parsing by rule my_rule
.
Create a Rule with Microsoft SQL Server for Events Recording
This section demonstrates how to create a rule for recording the clients' online/offline status and storing the events data to the Microsoft SQL Server table dbo.t_mqtt_events
via a configured Sink.
The steps are similar to those in Create a Rule with Microsoft SQL Server Sink for Message Storage expect for the SQL template and SQL rules.
The rule SQL statement for online/offline status recording is as follows.
SELECT
*,
floor(timestamp / 1000) as s_shift,
timestamp div 1000 as ms_shift
FROM
"$events/client_connected", "$events/client_disconnected"
The SQL template for events recording is as follows.
insert into dbo.t_mqtt_events(clientid, event_type, event_time) values ( ${clientid}, ${event}, DATEADD(MS, ${ms_shift}, DATEADD(S, ${s_shift}, '19700101 00:00:00:000') ) )
Test the Rules
Use MQTT X to send a message to topic t/1
to trigger an online/offline event.
mqttx pub -i emqx_c -t t/1 -m '{ "msg": "hello SQL Server" }'
Check the running statistics of the Microsoft SQL Server Sink.
- For the Sink used to store messages, there should be 1 new matching and 1 new outgoing message. Check whether the data is written into the
dbo.t_mqtt_msg
data table.
1> SELECT * from dbo.t_mqtt_msg
2> GO
id msgid topic qos payload arrived
----------- ---------------------------------------------------------------- ---------------------------------------------------------------------------------------------------- --- ---------------------------------------------------------------------------------------------------- -----------------------
1000000001 0005F995096D9466F442000010520002 t/1 0 { "msg": "Hello SQL Server" } 2023-04-18 04:49:47.170
(1 rows affected)
1>
- For the Sink used to record online/offline status, there should be 2 new events recorded: client connected and client disconnected. Check whether the status recording is written into the
dbo.t_mqtt_events
data table.
1> SELECT * from dbo.t_mqtt_events
2> GO
id clientid event_type event_time
----------- ---------------------------------------------------------------- ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- -----------------------
1000000001 emqx_c client.connected 2023-04-18 04:49:47.140
1000000002 emqx_c client.disconnected 2023-04-18 04:49:47.180
(2 rows affected)
1>