Jekyll2024-01-13T06:41:28+00:00https://skaska.pro/feed.xmlskaskaAI-passionate and digital-twin-amazed CTO, solution architect and team leadSergey Kovalev3 Layers of skills in extendable and adaptive LLM-chatbot2024-01-05T19:00:56+00:002024-01-05T19:00:56+00:00https://skaska.pro/3-layers-of-skills<h2 id="introduction">Introduction</h2>
<p>In the dynamic world of artificial intelligence, Large Language Model (LLM)-powered chatbots stand at the forefront of interactive technology. While foundational skills such as web browsing are commonplace in many chatbots, it is the realms of enhancement and core transformation skills that are truly revolutionizing the capabilities of these digital assistants.</p>
<p><img src="/assets/3-skill-layers/3_layers_of_skills.png" alt="3 skills layers" /></p>
<h2 id="enhancing-skills">Enhancing Skills</h2>
<p>Enhancing the base skill set via a chat interface is now a tangible reality. Innovations like <strong><code class="language-plaintext highlighter-rouge">GptEngineer</code></strong> demonstrate the feasibility of programming through chat-like interactions between the user and the LLM. This layer encompasses more than just crafting new base skills from the ground up; it involves refining and amalgamating existing skills into sophisticated, multifaceted tools. Here, the chat interface serves as a unique alternative to conventional Integrated Development Environments (IDEs), facilitating the creation and modification of code.</p>
<p><img src="/assets/3-skill-layers/enhancement_skills.png" alt="enhancement skills" /></p>
<p>This layer introduces the concept of self-improving skills, initially with user assistance. In a business context, these advanced capabilities are ideally suited for power users and developers.</p>
<h2 id="changing-the-core">Changing the ‘Core’</h2>
<p>Delving deeper, we can modify the foundational interaction layer between the application and the LLM, known as <a href="https://arxiv.org/abs/2305.05364">LLM programs</a>. For instance, consider the <code class="language-plaintext highlighter-rouge">consider_memo_storage</code> method from the AutoGen project:</p>
<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">def</span> <span class="nf">consider_memo_storage</span><span class="p">(</span><span class="bp">self</span><span class="p">,</span> <span class="n">comment</span><span class="p">):</span>
<span class="s">"""Determines if a user comment should be stored in the database."""</span>
<span class="c1"># Analyzing for a problem-solution context.
</span> <span class="n">response</span> <span class="o">=</span> <span class="bp">self</span><span class="p">.</span><span class="n">analyze</span><span class="p">(</span>
<span class="n">comment</span><span class="p">,</span>
<span class="s">"Does any part of the TEXT ask the agent to perform a task or solve a problem? Answer with just one word, yes or no."</span><span class="p">,</span>
<span class="p">)</span>
<span class="k">if</span> <span class="s">"yes"</span> <span class="ow">in</span> <span class="n">response</span><span class="p">.</span><span class="n">lower</span><span class="p">():</span>
<span class="c1"># Extracting actionable advice.
</span> <span class="n">advice</span> <span class="o">=</span> <span class="bp">self</span><span class="p">.</span><span class="n">analyze</span><span class="p">(</span>
<span class="n">comment</span><span class="p">,</span>
<span class="s">"Briefly copy any advice from the TEXT that may be useful for a similar but different task in the future. If no advice is present, respond with 'none'."</span><span class="p">,</span>
<span class="p">)</span>
<span class="p">...</span>
</code></pre></div></div>
<p>In this method, the LLM is initially consulted, followed by an interpretation of its response in binary terms (‘yes’/’no’). This is a basic form of an LLM program, involving a call-and-response mechanism. The <code class="language-plaintext highlighter-rouge">consider_memo_storage</code> method, spanning roughly 50 lines, represents well-structured code, yet its operational efficiency and alignment with our objectives remain hypotheses subject to verification.</p>
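<p>The call-and-response pattern can be sketched in a few lines. In the sketch below, <code>ask_llm</code> is a hypothetical stand-in for any chat-completion call (it is not AutoGen's actual API), wired to a deterministic stub so the control flow is visible:</p>

```python
def consider_storage(comment, ask_llm):
    """A minimal LLM program: one call, then a binary ('yes'/'no')
    interpretation of the response drives the control flow."""
    response = ask_llm(
        f"Does any part of the TEXT ask the agent to perform a task? "
        f"TEXT: {comment} Answer with just one word, yes or no."
    )
    if "yes" not in response.lower():
        return None
    advice = ask_llm(f"Copy any reusable advice from the TEXT: {comment}")
    return None if advice.strip().lower() == "none" else advice

# Deterministic stub standing in for a real model call.
def fake_llm(prompt):
    if "yes or no" in prompt:
        return "Yes"
    return "Prefer retries with exponential backoff."

print(consider_storage("Please fix the flaky upload job.", fake_llm))
# -> Prefer retries with exponential backoff.
```

<p>The branch on <code>"yes"</code> is exactly the hypothesis the article mentions: whether this interpretation of the model's free-text reply is reliable enough is something only testing can confirm.</p>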
<p><img src="/assets/3-skill-layers/core_transformation_skill.png" alt="core transformation skill" /></p>
<p>At this layer, we extend our exploration into self-improving LLM applications. By analyzing logs comprising method implementations, inputs, and outputs, we can iteratively refine the LLM program, potentially leveraging <a href="https://arxiv.org/abs/2309.03409">advanced LLMs</a> for optimization.</p>
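<p>One low-tech way to collect those logs, assuming the model is reachable as a plain callable (an assumption for illustration, not a specific framework's API): wrap every call so prompts and responses are recorded, producing the raw material for later, possibly LLM-driven, optimization of the program itself.</p>

```python
import json

class LoggedLLM:
    """Wraps a model callable and records every input/output pair,
    so the surrounding LLM program can be analyzed and refined later."""

    def __init__(self, llm):
        self.llm = llm
        self.log = []

    def __call__(self, prompt):
        response = self.llm(prompt)
        self.log.append({"prompt": prompt, "response": response})
        return response

    def dump(self):
        # Export the trace for offline analysis or optimizer prompts.
        return json.dumps(self.log, indent=2)

llm = LoggedLLM(lambda p: "yes")
llm("Does the TEXT ask the agent to perform a task? yes or no.")
print(len(llm.log))  # -> 1
```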
<h2 id="conclusion">Conclusion</h2>
<p>The exploration of the advanced skill spectrum in LLM-powered chatbots uncovers a world where these tools transcend basic functions. Enhancement skills empower chatbots to grow and adapt through interactive interfaces, stepping beyond the bounds of traditional programming. Core transformation skills take this further, honing the very essence of chatbot functionality. A key consideration emerges: determining which aspects of the application should remain constant throughout its lifecycle, amidst the continuous evolution of its capabilities. This question underscores the intricate balance between innovation and stability in the development of intelligent chatbot applications.</p>Sergey KovalevIntroductionFrom LLM-app to AI/LLM-drive-app2023-11-14T18:00:56+00:002023-11-14T18:00:56+00:00https://skaska.pro/from-llm-app-to-ai-driven-app<h1 id="introduction">Introduction</h1>
<p>In this article, we delve into the evolving landscape of AI-driven applications, particularly focusing on Large Language Models (LLMs). Let’s begin by defining key terms:</p>
<ul>
<li><strong>LLM App</strong>: An application that primarily utilizes Large Language Models for its core functionality, plus simple scripting for interaction with the LLM.</li>
<li><strong>AI-Driven App</strong>: An alternative approach to AI app development that uses LLMs both for core functionality and for control flow.</li>
</ul>
<p>Our journey starts with an analysis of David Shapiro’s demonstration of a medical application, specifically the <a href="https://github.com/daveshap/Medical_Intake"><strong>Medical Intake</strong></a> app. This type of application is what we refer to as an LLM-app. It is not a production-grade app, but it is a good starting point for our journey.</p>
<p><img src="/assets/llm-app/intake-app.png" alt="Medical intake app" /></p>
<p>To better grasp this example, we’ll distinguish between LLM interaction logic (or prompt engineering) and control logic (program flow). The Medical Intake app, for instance, uses a straightforward control logic:</p>
<ul>
<li>Engage in a conversation with a patient and condense the chat into <em>notes</em>.</li>
<li>Use these <em>notes</em> to generate multiple documents.</li>
</ul>
<p>Our aim is to demonstrate how integrating domain expertise can enhance the <em>control logic</em>, making the app more flexible and reliable.</p>
<h1 id="steps-to-transition-from-an-initial-llm-app-to-an-ai-driven-app">Steps to Transition from an Initial LLM-App to an AI-Driven App</h1>
<ol>
<li><strong>LLM App</strong>: The basic stage where the app’s functionality is primarily driven by LLM capabilities.</li>
<li><strong>Modularization</strong>: Decompose the intake process into discrete LLM-powered modules. This allows for more structured prompting, response verification, and streamlined testing.</li>
<li><strong>Workflow Implementation</strong>: Incorporating a workflow engine can significantly improve the app by adding monitoring, reliability, and other benefits associated with workflow/business process management systems (WF/BPMS). It also facilitates the integration of new processes.</li>
<li><strong>Evolution to Case Management</strong>: Drawing from extensive experience in BPMS integration, it’s evident that workflows often become more complex, evolving into Case Management Systems. In this scenario, the “case” is the patient intake process, encompassing various stages and actions.</li>
<li><strong>LLM-Driven Decision Making</strong>: The final step involves enabling the LLM to make case management decisions. While individual steps involve LLM interactions, a master LLM is introduced to orchestrate the entire process.</li>
</ol>
<h2 id="llm-app">LLM app</h2>
<p><img src="/assets/llm-app/llm-app.png" alt="LLM app" /></p>
<p>The entire functionality is encapsulated within a single Python script.</p>
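<p>The control logic of such a script fits in a handful of lines. The sketch below is a hypothetical reconstruction (the function and prompt names are invented, not taken from the Medical Intake repository): converse, condense the chat into notes, then fan the notes out into documents.</p>

```python
def run_intake(chat, patient_messages):
    """Single-script control logic: chat -> notes -> documents."""
    transcript = []
    for msg in patient_messages:
        transcript.append(("patient", msg))
        transcript.append(("assistant", chat(f"Reply to the patient: {msg}")))
    notes = chat("Condense this chat into intake notes:\n" + repr(transcript))
    return {
        "notes": notes,
        "referral": chat("Write a referral letter from these notes:\n" + notes),
        "summary": chat("Write a doctor-facing summary from these notes:\n" + notes),
    }

# Echo stub so the flow can be traced without a real model.
docs = run_intake(lambda p: p.splitlines()[0], ["I have a headache."])
print(sorted(docs))  # -> ['notes', 'referral', 'summary']
```

<p>Everything, including the control flow, lives in one function; the following sections pull this apart.</p>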
<h2 id="modularization">Modularization</h2>
<p><img src="/assets/llm-app/extracted-steps-app.png" alt="Extracted steps app" /></p>
<p>Evolving into a more manageable and testable structure is a logical next step.</p>
<h2 id="workflow">Workflow</h2>
<p><img src="/assets/llm-app/wf-engine-driven-app.png" alt="Workflow app" /></p>
<p>Introducing workflow engines or business process management solutions is the natural progression for step orchestration.</p>
<h2 id="adaptive-case-management">Adaptive case management</h2>
<p><img src="/assets/llm-app/acm-driven-app.png" alt="ACM app" /></p>
<p>Even though the initial process seems simple, the need for greater flexibility becomes apparent over time. Adaptive Case Management (ACM) addresses this by breaking down processes into smaller units, introducing the concept of a case with its lifecycle, and centralizing human expertise in managing these cases.</p>
<h2 id="ai">AI</h2>
<p><img src="/assets/llm-app/ai-driven-app.png" alt="AI app" /></p>
<p>Replacing the human component in ACM with an LLM, we focus on control flow rather than individual tasks. This LLM determines the next steps based on the current state and available actions. The control flow can be enhanced with expert system-style rules, additional information, or alternative data representations. Crucially, integrating human expertise remains vital for moderating AI behavior, adjusting prompts, and introducing new rules. This requires collaboration between AI specialists and domain experts.</p>
<p>Consider the simplified prompt template below. Given the global task, the current state (in the form of collected documents), and the available actions, the LLM has to come up with the next step to perform.</p>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{mission}
{goal}
Currently you are in the middle of the {process_name} process.
Below is list of activities you can use to complete the goal and already collected documents from the patient.
===
Activities
{activities}
===
Documents available
{documents}
===
Common happy path is to execute activities in the following order:
{happy_path}
You are free to choose any activities that you think will help you to complete the goal.
Provide name of the activity you want to execute and list of parameters to execute it.
</code></pre></div></div>
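<p>A sketch of how such a template can be wired up (the names and the reply format are illustrative assumptions, not from a specific framework): fill the placeholders, call the model, and parse the chosen activity out of the reply.</p>

```python
TEMPLATE = (
    "{mission}\n{goal}\n"
    "Currently you are in the middle of the {process_name} process.\n"
    "Activities\n{activities}\n"
    "Documents available\n{documents}\n"
    "Provide the activity you want to execute, as 'activity: <name>'."
)

def next_step(ask_llm, state):
    """Format the template, call the model, parse the chosen activity."""
    reply = ask_llm(TEMPLATE.format(**state))
    # Expect a line like 'activity: collect_history'.
    for line in reply.splitlines():
        if line.lower().startswith("activity:"):
            return line.split(":", 1)[1].strip()
    return None  # unparseable reply -> let the caller retry or escalate

state = {
    "mission": "Help the clinic.", "goal": "Complete patient intake.",
    "process_name": "intake", "activities": "collect_history, summarize",
    "documents": "none yet",
}
print(next_step(lambda p: "activity: collect_history", state))
# -> collect_history
```

<p>The <code>None</code> branch matters: a reply the program cannot parse is exactly the point where human moderation or a retry policy enters the loop.</p>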
<p>This approach to controlling the flow can be further extended with additional rules (an expert-system-style approach) and with other information, or with other ways of representing the same information (e.g., the log of actions taken can be presented as the results of those actions, or as action descriptions). Additional flexibility can and should be added by including a human expert to moderate AI behavior by changing prompts, introducing new rules, etc. In practice this calls for two experts: one AI-focused, with prompt-engineering experience, and one domain expert.</p>
<h1 id="ai-driven-app-architecture">AI-Driven App Architecture</h1>
<p>Key components of an AI-Driven App:</p>
<ul>
<li><strong>Core LLM</strong>: Retains the same LLM used in the initial app, now augmented with verification mechanisms and structured prompts to improve step reliability.</li>
<li><strong>Step Types</strong>: Introduces a variety of steps, including expert consultations and data retrieval from external systems.</li>
<li><strong>LLM for Step Management</strong>: A new LLM layer is implemented to determine the sequence of steps, starting as a simple expert system and evolving with the application.</li>
<li><strong>Hardcoded Framework</strong>: A foundational structure that encapsulates the concept of steps/actions and their interaction with the executor-LLM.</li>
<li><strong>Rules</strong>: Knowledge segments that enhance AI performance in complex scenarios.</li>
<li><strong>Structured Document Storage</strong>: An alternative to raw vector storage, facilitating better data management.</li>
</ul>
<p><img src="/assets/llm-app/arhi_blocks.png" alt="archi blocks" /></p>
<h1 id="conclusion">Conclusion</h1>
<p>As Demis Hassabis once remarked, delegating more of the initial task to neural networks leads to superior outcomes. However, not everyone has access to high-end computational resources like the <em>GH200</em> clusters. Therefore, understanding and exploring hybrid methodologies is crucial in developing efficient and effective AI.</p>Sergey KovalevIntroductionWorkforce agent2023-10-25T18:00:56+00:002023-10-25T18:00:56+00:00https://skaska.pro/workforce-agent<p><strong>Consider</strong> this as part 3 of the “Sequential Planner” series: <a href="/problems-of-common-sequential-planners">part 1</a>, <a href="/sequential-planner-v2">part 2</a>.</p>
<h2 id="idea">Idea</h2>
<p>Imagine having unlimited access to <strong>remote</strong> mid and junior-level knowledge workers. The only limitation? They don’t use Zoom. However:</p>
<ul>
<li>They are proficient with Jira and Confluence.</li>
<li>They are available 24/7.</li>
<li>They meticulously track their work and validate outcomes.</li>
<li>They seek assistance when needed.</li>
<li>… and much more.</li>
</ul>
<h3 id="interaction-scenario">Interaction Scenario</h3>
<p>Here’s an illustrative interaction scenario:</p>
<ul>
<li>A user converses with a <strong>manager agent</strong> (MA) to define a task.</li>
<li>The MA logs the task in Jira and assigns it to the appropriate <strong>worker agent</strong> (WA).</li>
<li>The WA drafts a strategy to accomplish the task, seeking clarifications or supplementary documents if necessary. This plan might manifest as sub-tasks within Jira.</li>
<li>Once the plan is set, the WA commences its execution, continually updating the status of each sub-task.</li>
<li>After completing all sub-tasks, the MA conducts a verification/validation. If all criteria are met, the status of the primary task is marked as <em>done</em>.</li>
</ul>
<h2 id="current-state-of-agent-development">Current State of Agent Development</h2>
<h3 id="simple">Simple</h3>
<p>Agents are often perceived as mere remote procedure calls rather than entities capable of planning and execution. Sometimes, they’re even externally orchestrated (e.g., check <strong>agentprotocol</strong>).</p>
<h3 id="planners">Planners</h3>
<p>To grasp the limitations of current planners and potential improvements, refer to <a href="/problems-of-common-sequential-planners">part 1</a> and <a href="/sequential-planner-v2">part 2</a>.</p>
<h3 id="reasoning">Reasoning</h3>
<p>CoT, ToT, GoT – these methods focus predominantly on reasoning, not on collaborative task resolution. Nonetheless, they can certainly serve as <em>low-level</em> approaches.</p>
<h2 id="main-hint-workflow-engines-analogy">Main Hint: Workflow Engines Analogy</h2>
<p>Indeed, workflows are repetitive processes. However, they offer a prime analogy for understanding task execution. Explore platforms like Camunda, Nintex, Azure Logic Apps, etc. Envision task setup akin to configuring a workflow, accounting for:</p>
<ul>
<li>Internal variables.</li>
<li>Sub-processes.</li>
<li>Updates to external systems.</li>
<li>And more.</li>
</ul>
<p>The task execution can be likened to running such a workflow – but just once.</p>
<h2 id="missing-ingredients">Missing Ingredients</h2>
<p>There are four pivotal components currently absent in state-of-the-art agent developments:</p>
<ul>
<li>An innovative task planning methodology, incorporating hierarchical task decomposition and the ask-plan-execute strategy.</li>
<li>Flexibility concerning multiple kernels/LLMs and intricate internal data types (entities, collections).</li>
<li>Resilience regarding external task and sub-task states and statuses.</li>
<li>Seamless integration with task management systems, wikis, and document portals.</li>
</ul>
<p><img src="/assets/sequential-planners/new_core.png" alt="New Agent Core" /></p>
<h2 id="agent-core">Agent Core</h2>
<h3 id="three-modules">Three Modules</h3>
<p>The core functionality of an agent can be divided into three modules:</p>
<ul>
<li><strong>Manager</strong>:
<ul>
<li>Engages with users or high-level systems.</li>
<li>Primarily aims to draft a detailed task description.</li>
</ul>
</li>
<li><strong>Planner</strong>:
<ul>
<li>Utilizes the detailed task description to devise an execution plan.</li>
<li>Can seek further information from the manager or report issues if the task plan is unfeasible due to insufficient skills or data.</li>
<li>Delivers a search tree alongside the execution plan, offering insights to the executor in case of execution issues.</li>
<li>The plan is uploaded to the task management system (TMS).</li>
</ul>
</li>
<li><strong>Executor</strong>:
<ul>
<li>Carries out the plan either autonomously or with the assistance of <em>simple agents</em>.</li>
<li>Updates task statuses in the TMS during execution. Some artifacts may also be generated in wikis, portals, or code repositories.</li>
</ul>
</li>
</ul>
<h3 id="planning-execution">Planning Execution</h3>
<p>Approach the agent as you would a junior developer: don’t anticipate exhaustive domain knowledge or flawless execution. Allow the agent to outline its approach to achieving the desired outcome (see <strong>Hierarchical task split</strong>). A hierarchical planner can either produce an executable plan or one of two errors: lack of functions or lack of information. These inadequacies can be interpreted as calls for assistance.</p>
<h3 id="tasks-wikis-etc">Tasks, Wikis, etc.</h3>
<p>While the conventional approach (except for MemGPT) retains the entire execution log in context, the proposition here is to use a wiki and/or task description to house intermediary data. The motivations are:</p>
<ul>
<li>It’s pivotal for parallel execution and multiple agents working on sub-tasks.</li>
<li>Agent activities are easily traceable, thanks to standard task management solutions and wikis.</li>
</ul>
<p>For this to be feasible, agents must understand tasks, sub-tasks, their interrelations, statuses, etc., as well as the structure of the wiki. This understanding is essential as different teams manage projects in varied ways.</p>
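<p>A minimal data model for that understanding might look as follows (the field names are assumptions for illustration, not a Jira schema):</p>

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class AgentTask:
    """What an agent needs to know about a task: identity, status,
    where its artifacts live, and how sub-tasks roll up."""
    key: str
    summary: str
    status: str = "todo"               # todo / in_progress / done
    wiki_page: Optional[str] = None    # intermediary data lives here, not in context
    subtasks: List["AgentTask"] = field(default_factory=list)

    def done(self) -> bool:
        # A task is done only when it and all of its sub-tasks are done.
        return self.status == "done" and all(t.done() for t in self.subtasks)

parent = AgentTask("T-1", "Write report", status="done",
                   subtasks=[AgentTask("T-2", "Collect data", status="done")])
print(parent.done())  # -> True
```

<p>Because state lives in the task tree and on wiki pages rather than in the context window, multiple agents can pick up sub-tasks in parallel and their progress stays traceable.</p>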
<h2 id="future-work">Future Work</h2>
<p>The way forward is clear: develop the <strong>Core</strong> for Workforce Agents.</p>Sergey KovalevConsider this as part 3 of the “Sequential Planner” series: part 1, part 2.Sequential planner v22023-10-25T05:00:56+00:002023-10-25T05:00:56+00:00https://skaska.pro/sequential-planner-v2<p>This is part 2 (<a href="/problems-of-common-sequential-planners">part 1</a>).</p>
<h2 id="initial-idea-for-plan-generation">Initial Idea for Plan Generation</h2>
<p>Create a planner that can:</p>
<ul>
<li>Confirm that the final result structure and acceptance criteria are met.</li>
<li>Respond to the calling system if some information or functionality is missing.</li>
<li>Use a top-to-bottom approach and split tasks into smaller ones if necessary.</li>
<li>Work with collections/lists and other data structures.</li>
</ul>
<h2 id="implementation">Implementation</h2>
<p>Main points:</p>
<ul>
<li>The user or invoking system provides the information necessary to formulate an extended version of the task description.</li>
<li>The planner consists of two modules: a task manager that interacts with the user/system and a planner that creates a plan and interacts with the task manager.</li>
</ul>
<p><img src="/assets/sequential-planners/planner_v2.png" alt="Sequential Planner v2" /></p>
<h3 id="extending-task-description">Extending Task Description</h3>
<p>The task description that is passed to the planner needs to have not only the task text but also:</p>
<ul>
<li>High-level values for the planner (i.e., mission, project context).</li>
<li>Input information description.</li>
<li>Resulting documents description.</li>
<li>Acceptance criteria.</li>
</ul>
<p>This extended description can be either the result of interaction with the user/system or generated from project description and documentation, common sense, etc.</p>
<h3 id="hierarchical-task-splitting">Hierarchical Task Splitting</h3>
<p>The start of the algorithm is almost the same as for a conventional sequential planner. The difference is the extended task description, but we can ignore it for now.</p>
<p><strong>Given</strong>:</p>
<ul>
<li>Task as a string.</li>
<li>Extended task description.</li>
<li>List of available functions in the form the planner can understand.</li>
</ul>
<p><strong>Algorithm</strong>:</p>
<ul>
<li>Try to solve the task using a conventional planner and existing functions. If solved => <strong>hurrah! Done</strong>.</li>
<li>Ask the planner to split the task into a few (5-10) stages, each with a detailed description (in the same form as the extended description of the initial task).</li>
<li>[Recursion]
<ul>
<li>For each stage, try to resolve it by a conventional planner or by splitting it into smaller stages.</li>
<li>If recursion works => fine!</li>
<li>If not, keep digging down until a set depth limit is reached.</li>
</ul>
</li>
</ul>
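<p>The recursion above can be sketched as a toy model, in which plain dictionaries stand in for the conventional planner (a direct skill lookup) and for the LLM's stage-splitting step:</p>

```python
def plan(task, skills, splits, depth=0, max_depth=3):
    """Hierarchical task split: try the 'conventional planner' first
    (here: a direct skill lookup), otherwise split and recurse."""
    if task in skills:
        return [skills[task]]           # solved => hurrah! done
    if depth >= max_depth or task not in splits:
        return None                     # missing skill or information => ask for help
    steps = []
    for stage in splits[task]:          # the stages proposed by the planner
        sub = plan(stage, skills, splits, depth + 1, max_depth)
        if sub is None:
            return None
        steps.extend(sub)
    return steps

skills = {"collect data": "run_query", "draft text": "write_draft"}
splits = {"write report": ["collect data", "draft text"]}
print(plan("write report", skills, splits))  # -> ['run_query', 'write_draft']
```

<p>In the real planner the <code>None</code> case would be split into the two failure modes listed below (missing function vs. missing information), and the explored stages would be kept as a tree of hypotheses.</p>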
<p><strong>Key points</strong>:</p>
<ul>
<li>Keep records of all explored steps (tree/graph of hypotheses).</li>
<li>Be creative and try splitting into stages multiple times (i.e., high <strong>temperature</strong>).</li>
<li>Distinguish between two problems for a solution not found (stage cannot be resolved with conventional planners):
<ul>
<li>Cannot apply conventional planner due to a lack of functions.</li>
<li>Additional information/document is required.</li>
</ul>
</li>
</ul>
<p>The main difference between the described approach and the <strong>[X]_of_Thoughts</strong> approaches (CoT, ToT, GoT, etc.) is that we are okay with introducing undefined steps and defining them later, instead of searching only in the space of available functions.</p>
<p><img src="/assets/sequential-planners/hierarchical_task_split.png" alt="simplified version of hierarchical task split algorithm" /></p>
<h3 id="example-a-simple-case--success-plan">Example A: Simple Case => Success Plan</h3>
<p>Below is a diagram describing a simple case that is identical to a common sequential planner:</p>
<ul>
<li>The task is successfully split into several activities/subtasks.</li>
<li>Each activity is mapped to an existing skill from the given skillset.</li>
<li>The input document is mapped to the input of <em>activity A</em> (skill A).</li>
<li>The output document is the output from the last <em>activity C</em> (skill C).</li>
</ul>
<p><img src="/assets/sequential-planners/simple_case_plan.png" alt="Simple Case Plan" /></p>
<h3 id="example-b-simple-case--problematic-plan">Example B: Simple Case => Problematic Plan</h3>
<p>This is a more complex case:</p>
<ul>
<li>The planner was unable to map the initial task into a sequence of skills.</li>
<li>The planner split the initial task into high-level activities.
<ul>
<li>Activity #1 and activity #2 were successfully mapped into a sequence of skills.</li>
<li>But there was a problem with activity #3 - it required additional information (document X).</li>
</ul>
</li>
<li>Though the planner tried to create a plan several times, <strong>document X</strong> was always required.</li>
<li>This request for <strong>document X</strong> was passed to the <strong>task manager</strong> module.</li>
</ul>
<p><img src="/assets/sequential-planners/multilevel_case_plan.png" alt="Bad Planning" /></p>
<h2 id="conclusion">Conclusion</h2>
<p>A high-level description of the algorithm for advanced planning was given, and initial ideas were implemented.</p>Sergey KovalevThis is part 2 (part 1).Sequential planner flaws2023-10-24T05:00:56+00:002023-10-24T05:00:56+00:00https://skaska.pro/problems-of-common-sequential-planners<blockquote>
<p><strong>Disclaimer #1:</strong> The insights shared in this article are based on experiments conducted with Semantic Kernel (version < 1.0). While Langchain exhibits similar challenges, I have not delved deeply into its workings.</p>
</blockquote>
<blockquote>
<p><strong>Disclaimer #2:</strong> The crux of this discussion revolves around devising a plan, rather than its execution.</p>
</blockquote>
<blockquote>
<p><strong>Disclaimer #3:</strong> Concepts such as CoT, ToT, GoT, etc., are not pertinent to this discussion.</p>
</blockquote>
<h2 id="the-sequential-planner-flow">The Sequential Planner Flow</h2>
<p>An overview of the current implementation of the sequential planner:</p>
<p><strong>Given:</strong></p>
<ul>
<li>A goal expressed as a string.</li>
<li>A list of available functions that the planner can interpret (SK => plugins).</li>
</ul>
<p><strong>Flow:</strong></p>
<ul>
<li>Extract a concise list of functions that may be relevant for the task at hand.</li>
<li>Formulate a prompt encompassing:
<ul>
<li>Descriptions of the selected functions.</li>
<li>Directions to craft a plan.</li>
<li>The specified goal.</li>
</ul>
</li>
<li>Engage the LLM with the crafted prompt to obtain the plan.</li>
</ul>
<h4 id="plan-example">Plan Example:</h4>
<p>Given the objective: “Summarize an input, translate it to French, and e-mail it to John Doe”, the following plan was devised:</p>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Steps:
- SummarizePlugin.Summarize input='$INPUT' => SUMMARY
- WriterPlugin.Translate input='$SUMMARY' => TRANSLATED_SUMMARY
- email.GetEmailAddress input='John Doe' => EMAIL_ADDRESS
- email.SendEmail input='$TRANSLATED_SUMMARY' email_address='$EMAIL_ADDRESS'
</code></pre></div></div>
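<p>A plan in this textual form is easy to post-process. The sketch below parses the step lines into structured records; it is an illustration of the format above, not Semantic Kernel's own parser.</p>

```python
import re

STEP = re.compile(r"-\s*(\w+)\.(\w+)\s+(.*?)(?:\s*=>\s*(\w+))?$")
ARG = re.compile(r"(\w+)='([^']*)'")

def parse_plan(text):
    """Turn "- Plugin.Function arg='...' => OUT" lines into dicts."""
    steps = []
    for line in text.splitlines():
        m = STEP.match(line.strip())
        if not m:
            continue  # skip the 'Steps:' header and blank lines
        plugin, func, args, out = m.groups()
        steps.append({"plugin": plugin, "function": func,
                      "args": dict(ARG.findall(args)), "output": out})
    return steps

plan_text = """Steps:
- SummarizePlugin.Summarize input='$INPUT' => SUMMARY
- email.SendEmail input='$SUMMARY' email_address='$EMAIL'"""
print(len(parse_plan(plan_text)))  # -> 2
```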
<h2 id="limitations-of-the-current-approach">Limitations of the Current Approach</h2>
<p>While this methodology suffices for rudimentary tasks with concise plans, it falters when addressing more intricate challenges. Some of the pitfalls include:</p>
<ul>
<li>Irrespective of the prompt instructions, the generated plan may inadvertently employ collections as variables.</li>
<li>The structure of the final solution remains ambiguous. Merely augmenting instructions is ineffective, given the absence of a verification mechanism.</li>
<li>Should a function crucial for achieving the goal be absent, the invoking system remains oblivious.</li>
<li>Similarly, if supplementary information is essential for goal accomplishment, the system remains uninformed.</li>
</ul>
<h2 id="a-glimmer-of-hope">A Glimmer of Hope</h2>
<p>Firstly, LLMs of the caliber of GPT-4 are equipped to devise algorithms to tackle almost any challenge (e.g., the 12 tasks delineated in the <a href="https://evals.alignment.org/blog/2023-08-01-new-report/">ARC report</a>). This includes the ability to decompose a task into more manageable sub-tasks. While the resultant algorithm may not always be optimal, leveraging multiple generations (with a non-zero temperature) could pave the way for satisfactory outcomes.</p>
<h2 id="proposed-solution">Proposed Solution</h2>
<p>The aspiration is to develop a planner capable of:</p>
<ul>
<li>Ensuring that the final result adheres to the defined structure and meets acceptance criteria.</li>
<li>Informing the invoking system in case certain information or functionality is lacking.</li>
<li>Employing a top-down strategy, further dissecting tasks as needed.</li>
<li>Seamlessly integrating with collections/lists and other data structures.</li>
</ul>Sergey KovalevDisclaimer #1: The insights shared in this article are based on experiments conducted with Semantic Kernel (version < 1.0). While Langchain exhibits similar challenges, I have not delved deeply into its workings.AI is not ‘slow kind of country’2023-10-17T05:00:56+00:002023-10-17T05:00:56+00:00https://skaska.pro/ai-not-slow-kind-of-country<p>Back in business - after a dramatic pause and with 3 projects behind me 😀
Below is a compilation of 3 short posts from <a href="https://www.linkedin.com/in/skovalev/recent-activity/all/">LinkedIn</a></p>
<h3 id="context">Context</h3>
<ul>
<li>Red queen race</li>
<li>AI adoption</li>
<li>LLM is junior</li>
</ul>
<h2 id="the-red-queens-ai-race">The Red Queen’s AI Race</h2>
<p>In the realm of tech, AI adoption feels reminiscent of the Red Queen’s race from “Alice in Wonderland” - indeed, we’re no longer in ‘a slow sort of country!’</p>
<p><img src="/assets/ai-transformation/red_queen_race.jpeg" alt="red queen race - AI plans" /></p>
<p><strong>Participate</strong></p>
<ul>
<li>Every individual in an organization must now harness the capabilities of AI tools and integrate them into their daily tasks.</li>
<li>In the upcoming month, the focus should shift to crafting in-house AI solutions to refine and elevate processes.</li>
<li>And by the next quarter, it’s time to weave AI seamlessly into product lines, take the helm of industry-wide AI initiatives, and seek inspiration from platforms like ChatGPT for a fresh wave of ideas!😀</li>
</ul>
<h2 id="embracing-ai">Embracing AI</h2>
<p>Many of us have dabbled with tools like ChatGPT, StableDiffusion, and other AI utilities. However, integrating AI at an enterprise or departmental level or weaving it into an application remains a challenging feat. The absence of a one-size-fits-all AI framework does complicate matters. Currently, AI projects often feel more akin to exploratory research than traditional software development. Presented here is a methodical approach to ease AI adoption. It’s not a rigid framework, but a flexible strategy to consider.</p>
<p><img src="/assets/ai-transformation/ai_adoption.jpeg" alt="AI Adoption Incrementally" /></p>
<p><strong>The AI Adoption Approach</strong>:</p>
<ol>
<li><strong>Identify Potential</strong>: Draft a list of use cases that can benefit from AI acceleration or enablement.</li>
<li><strong>Three Pillars of Focus</strong>: For each use case, center your strategy around three primary pillars: data, algorithm, and user experience (UX).
<ul>
<li><strong>Data</strong>: This encompasses the information that will be employed to train or underpin the AI, establishing it as a strategic asset.</li>
<li><strong>Algorithm</strong>: Instead of chasing the latest AI innovations (given the inundated state of AI news feeds), select a reliable, time-tested algorithm.</li>
<li><strong>UX</strong>: Develop AI-backed widgets for integration into your portal or application. Always have an administrative interface or dashboard in place to monitor performance and gauge user engagement.</li>
</ul>
</li>
<li><strong>Quality over Quickness</strong>: Prior to rolling out any solution (even to early adopters), it’s crucial to vet its reliability. Due to the unpredictable nature of AI, it’s advisable to employ a “red team” or an independent group to test and challenge the system’s security and reliability.</li>
</ol>
<p><strong>The Key</strong>: Begin with simplicity. A common mistake is to invest all your resources into perfecting one pillar at the outset. Such an approach often stalls projects. Instead, view each pillar as a container that you gradually fill over successive development cycles. For the first cycle, pour a modest amount into each container. Once you have a functioning system in place - which is often the hardest part organizationally - continue to incrementally enhance each pillar over time.</p>
<h2 id="llm-is-junior">LLM is junior</h2>
<p>While general large language models (LLMs) like Claude2, Bard, ChatGPT, and others are incredibly versatile, it’s essential to recognize that their performance on niche domain-specific tasks may not always match an expert’s acumen. Instead of expecting senior-level expertise, it might be more realistic to anticipate competence equivalent to a junior or mid-level professional. Strategies to maximize LLM effectiveness include:</p>
<ul>
<li>Simplifying tasks or dividing them into more manageable sub-tasks.</li>
<li>Giving illustrative examples for better understanding.</li>
<li>Actively ensuring the results’ accuracy.</li>
</ul>
<p><img src="/assets/ai-transformation/llm_is_junior.jpeg" alt="llm is junior" /></p>
<p><strong>Verification Techniques:</strong></p>
<ul>
<li>Integrate a verification step within your prompt. If the model’s response doesn’t satisfy the verification criteria, request reconsideration.</li>
<li>Design the LLM interaction to include multiple steps or loops for self-checks and validation.</li>
<li>Engage a secondary model or system with a distinct prompt to corroborate the initial LLM’s responses.</li>
</ul>
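<p>The looped self-check technique above can be sketched as a small helper. This is an illustrative sketch, not a specific chatbot API: the <code>llm</code> and <code>verify</code> callables are hypothetical stand-ins for a real model call and a real checker.</p>

```python
def ask_with_verification(llm, prompt, verify, max_retries=3):
    """Query an LLM, re-asking when the answer fails a verification check.

    `llm(prompt) -> str` and `verify(answer) -> (ok, feedback)` are
    caller-supplied callables (hypothetical stand-ins here).
    """
    answer = llm(prompt)
    for _ in range(max_retries):
        ok, feedback = verify(answer)
        if ok:
            return answer
        # Feed the failed answer and the checker's feedback back to the model
        answer = llm(
            f"{prompt}\nPrevious answer: {answer}\n"
            f"Problem: {feedback}\nPlease reconsider."
        )
    raise RuntimeError("answer failed verification after retries")
```

<p>The same shape also covers the "secondary model" technique: pass a second model's judgment, wrapped in a function, as <code>verify</code>.</p>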
<h2 id="book-is-book-not-a-knowledge">Book is book, not a knowledge</h2>
<p><a href="https://en.wikipedia.org/wiki/DIKW_pyramid">DIKW pyramid</a></p>
<blockquote>
<p>The DIKW pyramid, also known variously as the DIKW hierarchy, wisdom hierarchy, knowledge hierarchy, information hierarchy, information pyramid, and the data pyramid, refers loosely to a class of models for representing purported structural and/or functional relationships between data, information, knowledge, and wisdom. “Typically information is defined in terms of data, knowledge in terms of information, and wisdom in terms of knowledge”. The DIKW acronym comes from knowledge management. It demonstrates how deep understanding of a subject emerges, passing through four qualitative stages: “D” – data, “I” – information, “K” – knowledge and “W” – wisdom.</p>
</blockquote>
<p><img src="/assets/ai-transformation/book_is_a_book.png" alt="book is a book" /></p>
<p>In the world of Knowledge Management, let’s break down the differences between data, information, and knowledge in a more casual way. A common misunderstanding these days is thinking that just tossing documents into a digital storage bin and using fancy Retrieval Augmented Generation (RAG) with Large Language Models (LLM) will magically transform your system into a Knowledge Management powerhouse. Nope, it doesn’t work that way – you’re still squarely in the realm of Information Management.</p>
<p>To put it simply, when it comes to knowledge, think of it as <strong>“active”</strong> compared to information, which is more <strong>“passive”</strong>.
Now, let’s expand on this with some relatable examples:</p>
<p><strong>Knowledge Gets Things Done:</strong>
Knowledge isn’t just knowing stuff; it’s about having the know-how to get things done. Imagine you have a text with an algorithm that you understand and can execute. That text becomes knowledge because you can put it into action. You’re like a problem-solving wizard with that knowledge!</p>
<p><strong>Information is Like Clues:</strong>
But, if the same text lands in front of someone who doesn’t have the magic skills to run that algorithm, it’s just information. It’s like giving them a bunch of clues, but they don’t quite know how to use them.</p>
<p><strong>Data is the Raw Material:</strong>
Now, picture this: someone who not only can’t execute the algorithm but also doesn’t understand the language of that text. For them, it’s not even information; it’s just raw data. It’s like staring at a jumble of characters without a clue.
The point here is that in Knowledge Management, we’re not just about collecting heaps of data or storing loads of information. It’s about making sure people can take that info and do something useful with it. True Knowledge Management is about turning data into actionable information and eventually into practical know-how that drives results. It’s like having the right tools in your toolbox – it’s not about how many you have, but how well you can use them to build something amazing.</p>Sergey KovalevBack in business - after dramatic pause and 3 projects behind 😀 Below is compilation of 3 small posts from LinkedInSome thoughts on Architecture Decisions2021-12-20T05:45:56+00:002021-12-20T05:45:56+00:00https://skaska.pro/architecture-decisions-thoughts<h1 id="what-is-it">What is it</h1>
<p>An architecture decision (AD) is a software design choice that addresses a significant requirement.
From <a href="https://github.com/joelparkerhenderson/architecture-decision-record#what-is-an-architecture-decision-record">here</a></p>
<p>During a project the team makes multiple ADs.
Each decision is made at some point in time and within a specific context - project history, current functional requirements (i.e. a set of user stories), a set of non-functional requirements, product vision, clients in the pipeline etc. This context is subject to change over time.</p>
<h1 id="track-it">Track it</h1>
<p><a href="https://github.com/joelparkerhenderson/architecture-decision-record">ADR</a> is a common approach to tracking ADs</p>
<p>A very good example of ADR usage - <em>A Case Study - Konstantin Kudryashov - DDD Europe 2020</em></p>
<!-- Courtesy of embedresponsively.com -->
<div class="responsive-video-container">
<iframe src="https://www.youtube-nocookie.com/embed/x5YmBevdjVg" frameborder="0" webkitallowfullscreen="" mozallowfullscreen="" allowfullscreen=""></iframe>
</div>
<h1 id="evaluate-it">Evaluate it</h1>
<p>What does it mean to evaluate AD? When should it be done?
<img src="assets/architecture-decisions/architecture-decisions-process.png" alt="" /></p>
<h1 id="quadrants">Quadrants</h1>
<p><img src="assets/architecture-decisions/architecture-decisions-quadrant.png" alt="architecture decisions - effort vs business value" /></p>
<h2 id="magic">Magic</h2>
<p>The art of gathering low-hanging fruit.
Bringing in some new app, lib or pattern that has proved to work in similar domains, but is unfamiliar to the project team.
As an example check the article <a href="/not-only-sql">Sql is not everything you need</a></p>
<h2 id="safe-play">Safe play</h2>
<p><strong>No pain no gain</strong> or <strong>agile trap</strong> quadrant.
How is it <em>an agile trap</em>?
These are ADs made by looking only at the current needs of the project. Is that bad – no. But this kind of decision-making approach will probably make you start <em>v2.0</em> of your app much sooner than expected. Reason - no <strong>platform vision</strong> in the architecture.</p>
<h2 id="vision">Vision</h2>
<p>Hardest one, because it’s easy to fall into <em>overengineering</em>.</p>
<h2 id="overengineering">Overengineering</h2>
<p>Nothing is wrong with this quadrant, unless all decisions end up here.</p>
<h1 id="multiple-ads-not-one">Multiple ADs, not ONE</h1>
<p><img src="assets/architecture-decisions/architecture-decisions-portfolio.png" alt="multiple architecture decisions" /></p>
<h1 id="value-over-time">Value over time</h1>
<p>A good example is an AD from the <em>vision</em> quadrant. Rarely can you put an AD right into that quadrant from the very beginning. Initially these decisions tend to land in the <em>overengineering</em> quadrant first, and if the team is lucky and persistent, they will eventually move to <em>vision</em>.</p>
<p><img src="assets/architecture-decisions/architecture-decisions-value-evolution-over-time.png" alt="architecture decisions value evolution over time" /></p>
<h1 id="strategy">Strategy</h1>
<p>Let’s recap</p>
<ul>
<li>Safe play is the obvious choice for most companies. But that’s a mid-term trap.</li>
<li>To get some magic – networking is an answer, or bringing in external people with different domain expertise.</li>
<li>Vision – requires both technical and product components.</li>
</ul>
<p>The team should work with ADs as with any portfolio, using a <strong>risk-reward</strong> estimation approach, i.e. in general, accept some risk of <strong>overengineering</strong> but manage it.</p>
<p>Managing the risk of <em>overengineering</em> means</p>
<ul>
<li>mistakes will happen, but they are allowed</li>
<li>reevaluate ADs often</li>
<li>be transparent and honest with the team</li>
</ul>
<h1 id="guesstimate">Guesstimate</h1>
<p>Do not underestimate the element of luck in decision making - the team operates with incomplete information in a fast-changing world.</p>Sergey KovalevWhat is it An architecture decision (AD) is a software design choice that addresses a significant requirement. From hereSQL DB is not ‘the only thing you need’2021-02-14T04:45:56+00:002021-02-14T04:45:56+00:00https://skaska.pro/not-only-sql<h1 id="introduction">Introduction</h1>
<p>After facing some legacy systems lately, with complaints like ‘the sql db is a bottleneck’ and ‘we have a huge sql cluster’, usually accompanied by something like ‘we plan to switch to microservices’, an idea came to my mind: somehow the ‘microservices pattern’ is usually interpreted as separating ‘calculation’ or ‘processing’, but not as data storage separation.</p>
<p>An SQL database in a legacy solution is usually used for the storage, and sometimes processing, of</p>
<ul>
<li>Business data</li>
<li>‘Stateful activity’</li>
<li>Event logs (business, IoT, system)</li>
<li>Business logic execution</li>
</ul>
<p><img src="/assets/sql-spaghetti/usage_scenarios.png" alt="sql db usage scenarios" /></p>
<p>Let’s dive deeper into each scenario</p>
<h1 id="business-domain-data">Business domain data</h1>
<p>Business data – even though you can move it to some document storage, and sometimes that makes sense, just leave it where it is for now. There are usually better options to start optimization with.
</p>
<h1 id="stateful-activities">Stateful activities</h1>
<p>Any activity that will end one day but needs some persistence until that happy moment. Usually it’s a huge piece of work to do properly, but it looks pretty small until you dive deeper. Good news: there are already solutions to handle all the heavy load.</p>
<ul>
<li>User interaction requests. When system needs some input from user. Common cases for enterprise solutions are all sorts of business process automation tasks (‘approve document’, ‘fulfill form’ etc.).</li>
<li>Workflows (‘saga pattern’)</li>
<li>Message queuing</li>
<li>ETL artifacts. Some data is processed on regular basis and process is usually multistage, so intermediary results are stored in tables.</li>
</ul>
<p><img src="/assets/sql-spaghetti/stateful_activity.png" alt="event logs" /></p>
<h2 id="common-symptoms">Common symptoms</h2>
<ul>
<li>a table that is constantly being updated (the number of <strong>update</strong> operations is significant) and is ‘loosely coupled’ with other tables in the DB</li>
<li>a table with a ‘status’/‘processed’ field or ‘task’ in the table name</li>
</ul>
<table>
<thead>
<tr>
<th>Scenario</th>
<th>Symptoms</th>
<th>Solutions</th>
<th>App examples</th>
</tr>
</thead>
<tbody>
<tr>
<td>User interaction</td>
<td>tables refer to the ‘user data’ table and are in a permanent <em>update</em> process</td>
<td>BPMS</td>
<td>Camunda</td>
</tr>
<tr>
<td>WF, sagas</td>
<td>check for <em>common symptoms</em></td>
<td>Specialized WF solutions</td>
<td>Temporal, Cadence, Netflix Conductor</td>
</tr>
<tr>
<td>Queues</td>
<td>Look at solution code – one or several ‘workers’ is the key.</td>
<td>AMQP, Message bus</td>
<td>RabbitMq, Azure MessageBus</td>
</tr>
<tr>
<td>ETL</td>
<td>Same as queues (look for ‘workers’ in the code) but tables usually contain bigger records</td>
<td>Build datapipe or use actor approach for ‘stateful serverless’</td>
<td>Airflow, Luigi vs AKKA, Orleans</td>
</tr>
</tbody>
</table>
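<p>To make the contrast concrete, here is a toy sketch (standard library only; the table schema and message are made up) of the polled ‘task table’ symptom next to its queue-based equivalent, where the DB no longer carries transient state:</p>

```python
import queue
import sqlite3

# Anti-pattern: transient work tracked in a constantly-updated 'tasks' table,
# with workers polling on the 'status' column.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE tasks (id INTEGER PRIMARY KEY, payload TEXT, status TEXT)")
db.execute("INSERT INTO tasks (payload, status) VALUES ('send email', 'new')")
row = db.execute("SELECT id, payload FROM tasks WHERE status = 'new' LIMIT 1").fetchone()
db.execute("UPDATE tasks SET status = 'processed' WHERE id = ?", (row[0],))  # constant UPDATEs

# Queue-based equivalent: a worker consumes a message; no status column, no polling.
# In production the in-process queue would be RabbitMQ, Azure Service Bus, etc.
q = queue.Queue()
q.put("send email")
task = q.get()
```

<p>The point of the swap is that the ‘stateful activity’ moves out of the business database entirely - the broker owns delivery, retries and visibility, and the SQL DB stops accumulating update-heavy task tables.</p>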
<p> </p>
<h1 id="event-logs">Event logs</h1>
<p>Journals of ‘actual events’ that are not always well structured.</p>
<ul>
<li>IoT/IIoT data. A user interacts with some hardware and you log this event, or some device’s sensor sends data to the system – think of these as IoT data streams and handle the events in a ‘modern way’</li>
<li>Business events. An external system generates data that needs to be processed</li>
<li>System, application and solution events, user UI interaction events etc.</li>
</ul>
<p><img src="/assets/sql-spaghetti/log.png" alt="event logs" /></p>
<h2 id="common-symptoms-1">Common symptoms</h2>
<ul>
<li>huge (in terms of number of rows) tables that are constantly growing, with no links to other tables</li>
<li>‘append only’ tables that are periodically cleaned up (mostly manually)</li>
</ul>
<table>
<thead>
<tr>
<th>Scenario</th>
<th>Solution</th>
<th>App examples</th>
</tr>
</thead>
<tbody>
<tr>
<td>IoT</td>
<td>*simple processing</td>
<td>MongoDb, Kafka, Clickhouse</td>
</tr>
<tr>
<td>Business events</td>
<td>*simple processing</td>
<td>MongoDb, MessageBus, Data Warehouse</td>
</tr>
<tr>
<td>System events</td>
<td>Logging solutions</td>
<td>ELK, Prometheus</td>
</tr>
</tbody>
</table>
<h4 id="simple-processing"><em>*Simple processing</em></h4>
<ul>
<li>Store raw incoming events in NoSql Db (MongoDB) with TTL</li>
<li>Create data-pipe to process them (Airflow, Luigi, Spark) or go with RabbitMQ and console-apps</li>
<li>Save consumable results in Data Warehouse or OLAP-friendly DB (Clickhouse)</li>
</ul>
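<p>A minimal sketch of the three stages, with plain Python structures standing in for the stores and the data pipe (MongoDB, Airflow/Luigi and the OLAP store from the list above are replaced by in-memory stand-ins, and the event shape is hypothetical):</p>

```python
from collections import defaultdict
from datetime import datetime

# Stage 1: raw events land in a document store (a list stands in for MongoDB);
# TTL is emulated by dropping events older than the cutoff.
raw_events = [
    {"device": "sensor-1", "value": 21.5, "ts": datetime(2020, 1, 1, 10, 0)},
    {"device": "sensor-1", "value": 22.1, "ts": datetime(2020, 1, 1, 10, 5)},
    {"device": "sensor-2", "value": 19.8, "ts": datetime(2019, 6, 1, 0, 0)},  # expired
]
ttl_cutoff = datetime(2020, 1, 1)
fresh = [e for e in raw_events if e["ts"] >= ttl_cutoff]

# Stage 2: the data pipe groups readings per device (an Airflow/Luigi task stand-in).
grouped = defaultdict(list)
for e in fresh:
    grouped[e["device"]].append(e["value"])

# Stage 3: consumable per-device averages go to the OLAP-friendly store.
report = {dev: sum(vs) / len(vs) for dev, vs in grouped.items()}
```

<p>Each stage has a single responsibility, so any of the stand-ins can later be swapped for the real tool without touching the others.</p>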
<p><img src="/assets/sql-spaghetti/simple_pipeline.png" alt="simple processing pipeline" /></p>
<p>Check ‘stream processing’ approaches too, but they might be too much at this stage.</p>
<h1 id="stored-procedures-and-triggers">Stored procedures and triggers</h1>
<p>Last but not least. You have tons of stored procedures and triggers; in other words, a lot of the solution’s business logic lives there</p>
<p><img src="/assets/sql-spaghetti/stored_procedures.png" alt="simple processing pipeline" /></p>
<table>
<thead>
<tr>
<th>Component</th>
<th>Example</th>
<th>Solution</th>
</tr>
</thead>
<tbody>
<tr>
<td>Triggers</td>
<td>Start ‘next step’ of processing after ‘status’ is changed</td>
<td>check ‘stateful activities’ part of this article</td>
</tr>
<tr>
<td>Heavy aggregating procedures</td>
<td>Generating reports or spreadsheets for export</td>
<td>check datapipes</td>
</tr>
<tr>
<td>Scheduled heavy updating procedures</td>
<td>Monthly salary payments in big enterprise</td>
<td>BPMS/WF and/or immutable data pattern</td>
</tr>
</tbody>
</table>
<p><strong>(Very opinionated)</strong> usually, lots of stored procedures and/or triggers are pure evil.</p>
<h1 id="conclusion">Conclusion</h1>
<p>There are definitely more cases of using SQL DBs and other persistence approaches than mentioned in this short article.
There is nothing wrong with using a DB for all the things mentioned above, but there are specialized solutions that handle these tasks with</p>
<ul>
<li>More functionality. Even if you don’t need it now, it’s like a good old hammer – you’ll see many nails around.</li>
<li>Better tested, production-ready and better optimized</li>
<li>Community to support and developers to find</li>
</ul>Sergey KovalevIntroduction After facing some legacy systems lately and complains ‘sql db is bottleneck’ and ‘we have huge sql cluster’, usually accompanying with something like ‘we plan to switch to microservices’. Idea came up to my mind that somehow ‘microservices pattern’ is usually interpreted as separating ‘calculation’ or ’processing’ but not about data storage separation.Digital Twins: Implementation2020-11-11T02:00:00+00:002020-11-11T02:00:00+00:00https://skaska.pro/digital-twins-details<h1 id="digital-twins-implementation">Digital twins: implementation</h1>
<p>Let us dive deeper into Digital Twins implementation. The way I see the 3 big classes of Digital Twins is described <a href="digital-twins-3-types">here</a>.
First, we will look at the types of basic building blocks we need to implement Digital Twins in an IIoT solution</p>
<ul>
<li>digital twin instance implementation extension</li>
<li>management extension for digital twin instances</li>
<li>data flows</li>
</ul>
<p>Then we take a look at 2 projects and how these blocks can help solve real life scenarios.</p>
<h2 id="base-digital-twins">Base Digital Twins</h2>
<p>The goal is to <em>extend</em> (adapt?) the model into a simpler yet more powerful tool for modeling digital twins.</p>
<p><img src="assets/dt-implemetation/diti base.png" alt="digital twin implementation diagram" /></p>
<p>Types of blocks</p>
<ol>
<li>Devices/sensor level
<ul>
<li>Camera, ble/uwb anchor etc</li>
<li>Simple (on/off) or more complicated state-machine</li>
<li>Sending live state data every 1 second/minute etc</li>
</ul>
</li>
<li>Algorithmic Estimator
<ul>
<li>Classical deterministic or neural nets algorithm to estimate internal state of the object or process</li>
</ul>
</li>
<li>Probabilistic estimator
<ul>
<li>For example, estimating a BLE asset tag’s location</li>
<li>Telemetry as input</li>
<li>Particle filter</li>
<li>Keeps a probability distribution as an internal state</li>
<li>Outputs – best/mean/distribution</li>
</ul>
</li>
<li>Aggregator
<ul>
<li>Combine 2 or more streams of potentially different frequency and latency into one</li>
</ul>
</li>
<li>System estimator
<ul>
<li>Inputs are from estimators</li>
<li>Conflict resolution</li>
</ul>
</li>
</ol>
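<p>Block type 4, the aggregator, is the easiest to sketch: keep the latest value per input stream, whatever its frequency or latency, and emit a merged snapshot on every update. A minimal illustration (names are hypothetical):</p>

```python
from dataclasses import dataclass, field

@dataclass
class Aggregator:
    """Combines streams of different frequency by remembering the latest
    (value, timestamp) per stream and emitting a merged snapshot on update."""
    latest: dict = field(default_factory=dict)

    def update(self, stream_name, value, ts):
        # Record the newest observation for this stream and emit a snapshot
        self.latest[stream_name] = (value, ts)
        return self.snapshot()

    def snapshot(self):
        # Merged view: one current value per known stream
        return {name: val for name, (val, ts) in self.latest.items()}
```

<p>A 1 Hz BLE stream and a once-a-minute GPS stream can then feed the same instance, and every snapshot carries the freshest value from each.</p>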
<h2 id="management-extension">Management extension</h2>
<p>A management block (or extension) is used when we have more than one DT of the same type and one data flow that needs to be separated into several (one per DT) according to some rule.</p>
<p><img src="assets/dt-implemetation/diti manager.png" alt="digital twin manager implementation diagram" /></p>
<h2 id="data-flows">Data Flows</h2>
<p>We have 4 data flow types</p>
<ol>
<li>Telemetry from IoT devices</li>
<li>State and State Estimations from Digital Twins</li>
<li>Events (like errors)</li>
<li>Configuration and sync business data</li>
</ol>
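<p>One way to keep these four flow types explicit in code is a typed message envelope; a minimal sketch with hypothetical names:</p>

```python
from dataclasses import dataclass
from enum import Enum, auto

class FlowType(Enum):
    TELEMETRY = auto()  # from IoT devices
    STATE = auto()      # state and state estimations from digital twins
    EVENT = auto()      # e.g. errors
    CONFIG = auto()     # configuration and business-data sync

@dataclass
class Envelope:
    flow: FlowType
    source: str
    payload: dict

# Example: a telemetry message from one BLE anchor
msg = Envelope(FlowType.TELEMETRY, "ble-anchor-7", {"rssi": -67})
```

<p>Routing and monitoring can then branch on <code>flow</code> instead of guessing a message’s purpose from its payload.</p>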
<h2 id="example-1-rtls">Example #1. RTLS</h2>
<ul>
<li>Remote site with a ‘thin connection’ to the cloud</li>
<li>BLE-based RTLS infrastructure that produces ~10 Mb/sec IIoT data streams</li>
<li>On-site RTLS server that estimates the location of BLE tags</li>
<li>Smartphones are part of the solution and produce 2 data flows – real-time BLE-tag info and periodic GPS-based location</li>
<li>In the cloud all data is aggregated using information from the MES, which provides the employee-smartphone relation</li>
</ul>
<p>Simplified schema of main blocks of RTLS solution</p>
<p><img src="assets/dt-implemetation/tayshet rtls architecture.png" alt="RTLS solution architecture" /></p>
<h3 id="dt-managers">DT Managers</h3>
<p>Digital Twins part that contains</p>
<ul>
<li>DT manager for BLE-Tag Twin</li>
<li>DT-manager for Employee Twin</li>
</ul>
<p><img src="assets/dt-implemetation/dt only.png" alt="RTLS digital twins diagram" /></p>
<h3 id="dt-ble-tag">DT BLE Tag</h3>
<p>This DT ‘reflects’ the real-life object ‘ble tag’.</p>
<ul>
<li>Input stream – RSSI signals from BLE Anchors</li>
<li>Type – probabilistic estimator (particle filter).</li>
<li>Output – weighted mean estimation</li>
<li>Internal state – 3D location and on/off-site info</li>
</ul>
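<p>To illustrate the probabilistic-estimator idea, here is a deliberately simplified 1D particle filter: a tag on a 10 m corridor, with distance measurements (such as would be derived from RSSI) from two anchors. This is a sketch of the technique, not the production estimator; all numbers are made up:</p>

```python
import math
import random

def pf_update(particles, anchor_pos, measured_dist, noise=1.0, jitter=0.1):
    """One particle-filter step: weight particles by how well they explain the
    distance measured from one anchor, resample, then jitter (a crude stand-in
    for a motion model)."""
    weights = [
        math.exp(-((abs(p - anchor_pos) - measured_dist) ** 2) / (2 * noise ** 2))
        for p in particles
    ]
    total = sum(weights) or 1e-12
    weights = [w / total for w in weights]
    resampled = random.choices(particles, weights=weights, k=len(particles))
    return [p + random.gauss(0, jitter) for p in resampled]

def estimate(particles):
    # With uniform post-resampling weights, the weighted mean is the mean
    return sum(particles) / len(particles)

random.seed(42)
particles = [random.uniform(0, 10) for _ in range(2000)]  # unknown tag position
for _ in range(10):
    particles = pf_update(particles, anchor_pos=0.0, measured_dist=3.0)
    particles = pf_update(particles, anchor_pos=10.0, measured_dist=7.0)
# both anchors agree the tag is ~3 m from the left anchor
```

<p>The real DT works the same way, just in 3D and with an RSSI-to-distance model plus on/off-site logic layered on top.</p>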
<h3 id="dt-employee">DT Employee</h3>
<p>This DT ‘reflects’ the real-life object ‘employee’ who has a smartphone (it emulates a BLE Tag for indoor positioning and periodically sends a GPS location for outdoor positioning)</p>
<ul>
<li>2 input streams: 3D location estimation from ‘DT:BLE Tag’ and GPS signals</li>
<li>Type – aggregator – combines location information from sources of different periodicity into ‘global location’ info</li>
<li>Output - ‘global location’ info</li>
</ul>
<h2 id="example-2-video-processing">Example #2. Video processing</h2>
<p>The project is described in more detail <a href="ml-from-monolith-to-micro-services">here</a></p>
<p>Overall simplified solution architecture</p>
<p><img src="assets/dt-implemetation/svkl archi.png" alt="Multiple cameras video processing IIoT" /></p>
<h3 id="digital-twins-only">Digital Twins only</h3>
<p><img src="assets/dt-implemetation/svkl archi dt only.png" alt="Multiple cameras video processing IIoT" /></p>Sergey KovalevDigital twins: implementation Let us dive deeper into Digital Twins implementation. The way I see 3 big classes of Digital Twins is described here. First, we will look at types of basic building blocks we need to implement Digital Twins in IIoT solution digital twin instance implementation extension management extension for digital twin instances data flowsExport costs from Azure2020-10-24T02:00:00+00:002020-10-24T02:00:00+00:00https://skaska.pro/azure-costs-export<p>Suppose your organization uses more than one cloud provider - Azure, AWS and Atlassian for different functions. Financial department desperately needs more advanced cost analysis by teams/projects/etc.
The requirement is simple - make cost data accessible in some corporate BI (PowerBI, Tableau). In this article we will take a look at different approaches for exporting Azure costs.
The first resource you should get acquainted with is <a href="https://azure.microsoft.com/en-us/services/cost-management/">Azure Cost Management</a></p>
<h1 id="manual-export">Manual export</h1>
<p><img src="/assets/azure-costs/manual export dataflow.png" alt="manual export" /></p>
<p>Export costs manually for the last period from all subscriptions using <em>Cost management + Billing</em>.
Go to <em>Cost analysis</em> – set grouping options, periods and granularity.
Then go to <em>Download</em>, select <em>Excel</em>, press <em>Download Data</em> and you instantly get an Excel file with the data required.
Having a real piece of data lets you start a deeper conversation with the financial department – what to import into BI sources, and how. Some <em>tagging</em> and <em>resource</em> reorganization might be expected on the Azure side.</p>
<h1 id="automatic-export">Automatic export</h1>
<p><img src="/assets/azure-costs/azure cost management scheduled export and import.png" alt="automatic export" /></p>
<p>Once the requirements for the data are specified in more detail and everybody speaks the same language, the Excel data export can be automated.
Go to <em>Configuration:Exports</em> and create a new data export with the periodicity you need.
Create an Azure Function to import the data into the data source for BI and run it either periodically or with an event-based approach (which makes more sense).
To add some resilience, a <em>data release</em> approach can be implemented (read more <a href="/data-release-management">here</a>).</p>
<h1 id="cost-management-api">Cost management API</h1>
<p><img src="/assets/azure-costs/azure cost management API.png" alt="Azure Cost Management API" /></p>
<p>If the financial department is hungry for <em>rich daily data</em> - the <a href="https://docs.microsoft.com/en-us/rest/api/cost-management/">Azure Cost Management REST API</a> comes to the rescue.
The architecture becomes a little more complicated, yet it is rather straightforward.
3 Azure Functions connected via Service Bus are used</p>
<ol>
<li>Scheduled function. Creates tasks for data extraction for every subscription in the account</li>
<li>Queries data from the API and sends it to the bus in a single chunk</li>
<li>Ingests data received from the bus into storage (same function as in <em>Automatic Export</em>)</li>
</ol>
<h2 id="api-post-call-example">API POST call example</h2>
<h3 id="url">Url</h3>
<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>https://management.azure.com/subscriptions/{subscription-id}/
providers/Microsoft.CostManagement/query?api-version=2019-11-01
</code></pre></div></div>
<p>Authentication is required</p>
<h3 id="request-body">Request body</h3>
<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Usage"</span><span class="p">,</span><span class="w">
</span><span class="nl">"timeframe"</span><span class="p">:</span><span class="w"> </span><span class="s2">"WeekToDate"</span><span class="p">,</span><span class="w">
</span><span class="nl">"dataset"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"granularity"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Daily"</span><span class="p">,</span><span class="w">
</span><span class="nl">"aggregation"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"totalCost"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"PreTaxCost"</span><span class="p">,</span><span class="w">
</span><span class="nl">"function"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Sum"</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="nl">"grouping"</span><span class="p">:</span><span class="w"> </span><span class="p">[</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Dimension"</span><span class="p">,</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"ResourceGroup"</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Dimension"</span><span class="p">,</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"ServiceName"</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">]</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>
<h3 id="sample-response">Sample response</h3>
<div class="language-json highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="p">{</span><span class="w">
</span><span class="nl">"id"</span><span class="p">:</span><span class="w"> </span><span class="s2">"subscriptions/*****/providers/Microsoft.CostManagement/query/******"</span><span class="p">,</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"*****"</span><span class="p">,</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Microsoft.CostManagement/query"</span><span class="p">,</span><span class="w">
</span><span class="nl">"location"</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span><span class="w">
</span><span class="nl">"sku"</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span><span class="w">
</span><span class="nl">"eTag"</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span><span class="w">
</span><span class="nl">"properties"</span><span class="p">:</span><span class="w"> </span><span class="p">{</span><span class="w">
</span><span class="nl">"nextLink"</span><span class="p">:</span><span class="w"> </span><span class="kc">null</span><span class="p">,</span><span class="w">
</span><span class="nl">"columns"</span><span class="p">:</span><span class="w"> </span><span class="p">[</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"PreTaxCost"</span><span class="p">,</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Number"</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"UsageDate"</span><span class="p">,</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Number"</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"ResourceGroup"</span><span class="p">,</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"String"</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"ServiceName"</span><span class="p">,</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"String"</span><span class="w">
</span><span class="p">},</span><span class="w">
</span><span class="p">{</span><span class="w">
</span><span class="nl">"name"</span><span class="p">:</span><span class="w"> </span><span class="s2">"Currency"</span><span class="p">,</span><span class="w">
</span><span class="nl">"type"</span><span class="p">:</span><span class="w"> </span><span class="s2">"String"</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">],</span><span class="w">
</span><span class="nl">"rows"</span><span class="p">:</span><span class="w"> </span><span class="p">[</span><span class="w">
</span><span class="p">[</span><span class="w">
</span><span class="mi">0</span><span class="p">,</span><span class="w">
</span><span class="mi">20201025</span><span class="p">,</span><span class="w">
</span><span class="s2">"*****"</span><span class="p">,</span><span class="w">
</span><span class="s2">"azure app service"</span><span class="p">,</span><span class="w">
</span><span class="s2">"RUB"</span><span class="w">
</span><span class="p">],</span><span class="w">
</span><span class="p">[</span><span class="w">
</span><span class="mf">0.000075</span><span class="p">,</span><span class="w">
</span><span class="mi">20201025</span><span class="p">,</span><span class="w">
</span><span class="s2">"*****"</span><span class="p">,</span><span class="w">
</span><span class="s2">"bandwidth"</span><span class="p">,</span><span class="w">
</span><span class="s2">"RUB"</span><span class="w">
</span><span class="p">],</span><span class="w">
</span><span class="err">.......</span><span class="w">
</span><span class="p">]</span><span class="w">
</span><span class="p">}</span><span class="w">
</span><span class="p">}</span><span class="w">
</span></code></pre></div></div>
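<p>Before ingestion, the columnar <code>columns</code>/<code>rows</code> payload is convenient to flatten into per-row records. A minimal sketch (the sample below mirrors the response above, with a made-up resource group name in place of the masked values):</p>

```python
def rows_to_records(response):
    """Zip the Cost Management columnar payload into one dict per row."""
    cols = [c["name"] for c in response["properties"]["columns"]]
    return [dict(zip(cols, row)) for row in response["properties"]["rows"]]

sample = {
    "properties": {
        "columns": [{"name": "PreTaxCost"}, {"name": "UsageDate"},
                    {"name": "ResourceGroup"}, {"name": "ServiceName"},
                    {"name": "Currency"}],
        "rows": [[0, 20201025, "rg-demo", "azure app service", "RUB"],
                 [0.000075, 20201025, "rg-demo", "bandwidth", "RUB"]],
    }
}
records = rows_to_records(sample)
```

<p>The resulting records map one-to-one onto rows of a BI staging table, so the ingest function stays a thin loop.</p>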
<p>As you can see, the response is easy to understand and ingest into storage.
You can play with API Query <a href="https://docs.microsoft.com/en-us/rest/api/cost-management/query/usage">on Microsoft documentation site</a></p>Sergey KovalevSuppose your organization uses more than one cloud provider - Azure, AWS and Atlassian for different functions. Financial department desperately needs more advanced cost analysis by teams/projects/etc. Requirement is simple - make costs data accessible in some corporate BI (PowerBI, Tableau). In this article will take a look at different aproaches for exporting Azure costs. First resource you should get acquainted is Azure Cost Management