
Create Export Application

An Export Application is designed to compile and transfer a block of data from the Kelvin Platform to a third-party provider. Common uses are exporting to Amazon S3, Azure Data Lake Storage, or Databricks for ML analysis and training.

You can build for both x86_64 and arm64 devices.

create default application
$ kelvin app create <EXPORT_NAME>

This will give a response similar to this:

command output
[kelvin.sdk][2025-03-19 18:39:54][I] Refreshing metadata..
Please provide a name for the application: azure-data-lake-uploader

After providing the Export Application name (e.g. azure-data-lake-uploader):

command output
[kelvin.sdk][2025-03-18 09:42:30][I] Refreshing metadata..
[kelvin.sdk][2025-03-18 09:42:31][I] Creating new application "azure-data-lake-uploader"
[kelvin.sdk][2025-03-18 09:42:31][R] Successfully created new application: "azure-data-lake-uploader".
[kelvin.sdk][2025-03-18 09:42:31][I] Kelvin code samples are available at: https://github.com/kelvininc/app-samples

This will automatically create an Application bootstrap in a directory named azure-data-lake-uploader, populated with some default files and configurations.

Warning

The default files and configurations are set up for a Kelvin SmartApp™ Application. We will need to adapt them for an Export Application.

Folder Structure

You can now open the folder in your favorite IDE or editor and start to modify the files to create your Export Application.

default folder structure
$ cd azure-data-lake-uploader
$ tree ./
├── Dockerfile
├── app.yaml
├── main.py
├── requirements.txt
└── schemas
    ├── configuration.json
    └── parameters.json

Below is a brief description of each file.

app.yaml

The app.yaml is the main configuration file that holds both Application definitions as well as the deployment/runtime configuration.

The same file format is shared by Smart Apps, Docker Apps, Importers (Connectors), and Exporters.

On this page we are only focused on the Export options.

It is composed of the following sections:

spec_version key

The spec_version key is automatically injected and specifies the (latest) version of the Export Application JSON Schema, which both defines and validates the app.yaml structure.

spec_version
spec_version: 5.0.0

type

This defines the type for the application.

  • app: A Smart App that allows mapping inputs and outputs to data streams, sending control changes, recommendations, and data tags.
  • docker: A Docker application that does not connect to the platform's data streams.
  • importer: Connects to an external system to import data into the platform as well as receive control changes to act on the external system.
  • exporter: Connects to the platform to export data to an external system.
application type
type: exporter

info

The root section holds the Export Application's basic information, required to upload it to Kelvin's App Registry.

application info
name: azure-data-lake-uploader
title: Azure Data Lake Uploader
description: Uploads timeseries data to Azure Data Lake Gen2
version: 1.0.0

The name is the Export Application's unique identifier.

The title and description will appear on the Kelvin UI when creating a Connector once the Export Application is uploaded.

The version defines the version of this Export Application and is used in the Kelvin UI.

Info

The version should be bumped every time the Export Application gets an update, and before it gets uploaded to the App Registry.

flags

This is where you can enable or disable some of the Application's capabilities.

application flags
flags:
  enable_runtime_update:
    # enables configuration updates at runtime
    configuration: false

exporter_io

This is the main section. It defines the types and functions of the Data Streams that are allowed for this Export Application.

example Exporter_io
# definition of which kind of datastreams are allowed to be deployed
exporter_io:
  - name: default
    data_types:    # default: [number, string, boolean, object name]
      - number
      - string

    control: true  # the app allows control

ui_schemas

This is where the Export Application's configuration schemas for the Kelvin UI are defined.

The actual information is kept in a JSON file in the schemas folder of the project. The file location is defined in the app.yaml file like this:

application ui schemas
# optional to generate UI schemas
ui_schemas:

    # app configuration schema
    configuration: "schemas/configuration.json" # default: "schemas/configuration.json"

    # Exporter_io configuration schema
    io_configuration:
        default: "schemas/io_configuration/default.json" # default: "schemas/io.json"
        dynacard: "schemas/io_configuration/dyncard.json" # default: "schemas/io.json"

configuration.json

The configuration.json file will come with default blank schemas when first created.

Note

The configuration.json information is optional. If it is not provided, the Kelvin UI will display the configuration settings in a raw JSON or YAML format without verifying the structure or content before applying them to the Export Application.

default schemas/configuration.json
{
    "type": "object",
    "properties": {},
    "required": []
}

A filled-in configuration file would look something like this:

sample schemas/configuration.json
{
    "type": "object",
    "properties": {
      "upload_interval": {
        "type": "number",
        "default": 60,
        "title": "Upload Interval",
        "minimum": 0
      }
    },
    "required": ["upload_interval"]
}

This schema is rendered as a configuration form in the Kelvin UI.

default.json and dyncard.json

These files define the UI schemas for the object data types of each exporter_io entry.

If they are not defined, the default schemas/io.json is used.

sample schemas/io.json
{
    "type": "object",
    "properties": {
      "address": {
        "type": "number",
        "title": "Address",
        "minimum": 1,
        "maximum": 49999
      }
    },
    "required": ["address"]
}

defaults

This section holds the following main subsections:

Note

All items in the defaults section are optional.

  • system : Used to set different system requirements/constraints within the Export Application running environment, e.g. Resources, Environment Variables, Volumes, Ports, etc.
  • configuration : Holds the default global configuration values for the Export Application.
application defaults
defaults:
  system: {}
  configuration: {}

defaults/system section

The system section is [optional].

This is where developers can set the system settings that the Export Application needs to be able to function as intended.

This includes opening ports, setting environment variables, limiting resource usage, attaching volumes, and setting the privileged flag, which gives extended privileges on the host system.

application system defaults
defaults:
  system:
    environment_vars: []
    volumes: []

System Section Options

resources section

The resources section defines the reserved (requests) and maximum (limits) resources allocated to the Export Application:

  • Limits: The maximum resource usage enforced by the cluster. The Export Application will not be allowed to use more than the limit set.

  • Requests: The minimum resources allocated to the Export Application. These are reserved for the Export Application and cannot be used by other Applications. If extra resources are available, the Export Application can use more than the requested resources as long as it does not exceed the Limits.

You can read the full documentation about CPU and Memory resources in the Advanced section.

application resource defaults
defaults:
  system:
    resources:
      requests:   # Reserved
        cpu: 100m
        memory: 256Mi
      limits:     # Limits
        cpu: 200m
        memory: 512Mi

environment_vars section

The environment_vars section is used to define Environment Variables available within the Export Application container, e.g.:

application environmental variable defaults
defaults:
  system:
    environment_vars:
      - name: AZURE_ACCOUNT_NAME
        value: <% secrets.azure-account-name %>
      - name: AZURE_ACCOUNT_KEY
        value: <% secrets.azure-account-key %>
      - name: AZURE_STORAGE_CONTAINER
        value: <% secrets.azure-storage-container %>
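Inside the container these are ordinary environment variables. A minimal sketch of how the Export Application's Python code might read them (the helper name load_azure_settings is illustrative, not part of the Kelvin SDK):

```python
import os


def load_azure_settings() -> dict:
    """Collect the Azure credentials injected via defaults.system.environment_vars,
    failing fast with a clear message if any are missing."""
    required = ["AZURE_ACCOUNT_NAME", "AZURE_ACCOUNT_KEY", "AZURE_STORAGE_CONTAINER"]
    missing = [name for name in required if name not in os.environ]
    if missing:
        raise RuntimeError(f"Missing required environment variables: {', '.join(missing)}")
    return {name: os.environ[name] for name in required}
```

Failing fast at startup surfaces a misconfigured secret immediately instead of at the first upload attempt.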

volumes section

Mounted volumes are [optional] and their main purpose is to share and persist data generated or used by the Export Application in a specific place. They act like a shared folder between the Export Application and the host. Kelvin supports host directory volumes (such as folders or serial ports), persistent volumes, and file/text volumes:

application attached volume defaults
defaults:
  system:
    volumes:
      - name: data
        target: data.db
        type: persistent

ports section

The ports section is [optional] and used to define network port mappings, e.g.:

application open ports defaults
defaults:
  system:
    ports:
      - name: http
        type: host # Exposed on the host
        host:
          port: 80

      - name: opcua
        type: service # Exposed as a service for other containers
        service:
          port: 48010
          exposed_port: 30120
          exposed: true

privileged key

The privileged key is [optional] and used to grant extended privileges to the Export Application, allowing it to access any devices on the host, such as a Serial device:

application privileged defaults
defaults:
  system:
    privileged: true

defaults/configuration

These are the default global configuration values for the Export Application.

How these Configurations are displayed in the Kelvin UI can optionally be defined via ui_schemas, which links to a JSON file containing all the display information.

Note

Operations will have the option to change these at runtime from the Kelvin UI.

application app configuration defaults
defaults:
  configuration:
    upload_interval: 60

Python

The main.py is used as the entry point of the Export Application. When it runs, main.py is typically the first script executed, and it usually contains the main logic or orchestrates the flow of the Export Application. However, naming the file main.py is just a convention, not a requirement. The name helps developers quickly identify where the primary logic of the Export Application begins.

The code example generated by kelvin app create is designed for a Kelvin SmartApp™ and should be deleted and replaced.

Here is an example script that will compile the data from Kelvin Platform and export it to Azure Data Lake Gen2.

example main.py
import asyncio
from datetime import datetime

import aiofiles
import aiofiles.os
from kelvin.application import KelvinApp, filters

from timeseries import TimeseriesDataStore
from uploader import AzureDataLakeStorageUploader


async def upload(app: KelvinApp, data_store: TimeseriesDataStore, uploader: AzureDataLakeStorageUploader):

    # Create export dir
    await aiofiles.os.makedirs("export/", exist_ok=True)

    while True:
        batch_size = app.app_configuration.get("batch_size", 1000)
        upload_interval = app.app_configuration.get("upload_interval", 30)

        # Create filename
        export_file = f"export/{datetime.now().isoformat()}.parquet"

        try:
            # Export to parquet file
            _, chunk_size = await data_store.export_parquet(file_path=export_file, limit=batch_size)

            # Upload file if exists
            if await aiofiles.os.path.exists(export_file):
                await uploader.upload(file_path=export_file)

                # Only trim the data store if the upload was successful
                await data_store.trim(limit=batch_size)

                # Sleep only if the batch was not full
                if chunk_size < batch_size:
                    print("No more data to upload, waiting for next interval.")
                    await asyncio.sleep(upload_interval)
                else:
                    print("Chunk was full, continuing to process without sleeping.")
            else:
                print("No data to upload at this time.")
                await asyncio.sleep(upload_interval)

        except Exception as e:
            print(f"Error occurred during upload: {e}")
            await asyncio.sleep(upload_interval)
        finally:
            # Remove file if exists
            if await aiofiles.os.path.exists(export_file):
                await aiofiles.os.remove(export_file)

async def main() -> None:

    # Configure the Azure Data Lake Uploader
    uploader = AzureDataLakeStorageUploader()

    # Configure the Timeseries Database
    data_store = TimeseriesDataStore("data.db")
    await data_store.setup()

    # Creating instance of Kelvin App Client
    app = KelvinApp()

    # Connect the App Client
    await app.connect()

    # Create task to continuously upload data
    asyncio.create_task(upload(app=app, data_store=data_store, uploader=uploader))

    # Subscribe to the asset data streams
    async for msg in app.stream_filter(filters.is_asset_data_message):
        # Insert msg to local data store
        await data_store.insert(timestamp=msg.timestamp, asset=msg.resource.asset, datastream=msg.resource.data_stream, payload=msg.payload)


if __name__ == "__main__":
    asyncio.run(main())
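The script imports TimeseriesDataStore and AzureDataLakeStorageUploader from local modules that are not shown on this page. As a rough, dependency-free sketch of what the data store side could look like (illustrative only: it writes CSV instead of Parquet and uses the synchronous sqlite3 driver, which a real implementation would offload to a thread or replace with an async driver):

```python
import csv
import json
import sqlite3


class TimeseriesDataStore:
    """Buffers datastream messages locally until the next upload cycle."""

    def __init__(self, path):
        self._path = path
        self._conn = None

    async def setup(self):
        # Open the database and create the buffer table if needed
        self._conn = sqlite3.connect(self._path)
        self._conn.execute(
            "CREATE TABLE IF NOT EXISTS timeseries ("
            "id INTEGER PRIMARY KEY AUTOINCREMENT, "
            "timestamp TEXT, asset TEXT, datastream TEXT, payload TEXT)"
        )
        self._conn.commit()

    async def insert(self, timestamp, asset, datastream, payload):
        # Append one datastream message to the buffer
        self._conn.execute(
            "INSERT INTO timeseries (timestamp, asset, datastream, payload) "
            "VALUES (?, ?, ?, ?)",
            (timestamp.isoformat(), asset, datastream, json.dumps(payload)),
        )
        self._conn.commit()

    async def export_csv(self, file_path, limit):
        # Write up to `limit` of the oldest rows to file_path;
        # returns (file_path, number of rows written)
        rows = self._conn.execute(
            "SELECT timestamp, asset, datastream, payload FROM timeseries "
            "ORDER BY id LIMIT ?",
            (limit,),
        ).fetchall()
        with open(file_path, "w", newline="") as f:
            writer = csv.writer(f)
            writer.writerow(["timestamp", "asset", "datastream", "payload"])
            writer.writerows(rows)
        return file_path, len(rows)

    async def trim(self, limit):
        # Drop the oldest `limit` rows (the ones just uploaded)
        self._conn.execute(
            "DELETE FROM timeseries WHERE id IN "
            "(SELECT id FROM timeseries ORDER BY id LIMIT ?)",
            (limit,),
        )
        self._conn.commit()
```

Buffering locally and trimming only after a successful upload is what makes the export loop above safe to retry after a network failure.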

Supporting Files

The requirements.txt file lists all the dependencies a Python Application needs. It can be used to easily install all the required packages, ensuring the Export Application runs correctly.
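For the example above, the file would need to list at least the async file helpers, a Parquet writer, and an Azure Data Lake client, plus the Kelvin SDK. The entries below are a plausible sketch only; take the authoritative package names and versions from the Kelvin code samples repository:

sample requirements.txt

```
aiofiles
pyarrow                      # Parquet export
azure-storage-file-datalake  # Azure Data Lake Gen2 client
kelvin-python-sdk            # assumed name for the Kelvin application SDK
```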

The Dockerfile is a script used to define the instructions and configuration for building a Docker image. It specifies the base image, software installation, file copying, and other setup tasks needed to create a reproducible and isolated environment for running the Export Application in Docker containers.

default Dockerfile
FROM python:3.10-slim

ENV PYTHONUNBUFFERED=1
WORKDIR /opt/kelvin/app
COPY . /opt/kelvin/app
RUN pip install -r requirements.txt

ENTRYPOINT python main.py

Info

If main.py is not the intended entry point, it also needs to be replaced on the Dockerfile.

The .dockerignore file specifies which files and directories should be excluded when building the Export Application Docker image. It helps reduce the build context, resulting in a smaller, more efficient Docker image.