DagFlux Documentation

DagFlux is a desktop application for designing, automating, and running data workflows. You assemble a workflow as a graph of typed nodes and let the built-in AI assistant translate plain-language descriptions into the exact operations needed to move and reshape your data.

How DagFlux works in one minute

You start with a blank canvas and drop nodes onto it. Each node has a job:

Start node decides when the workflow runs (manually, on a schedule, when a file changes, or when an HTTP webhook is called).
Data Source node points at one input — a database table, a cloud bucket file, a local spreadsheet, a REST API, etc.
Transformer node modifies data from a single upstream source.
Join node merges data from two or more upstream sources.
Output node writes results to a file, a database, a cloud bucket, a vector store, or an AI analysis report.

When you describe what you want a node to do in plain language, DagFlux asks clarifying questions if needed, then generates the precise operation that will run at execution time. You see and approve that operation before the workflow is saved.

When the workflow runs, DagFlux executes nodes in order, logs every step, and records the result so you can inspect what happened.

Security & Privacy

Your credentials and AI provider keys stay encrypted on your machine and are never visible inside generated workflows or sent anywhere except to the service the connection actually targets.

Documentation map

Section	What it covers
Architecture	How DagFlux is structured, what runs locally, how runs are recorded
Plugins	How DagFlux is extended with connectors and AI skills
Connections	Saved data sources, credential storage, OAuth
AI providers	Which AI providers DagFlux supports and how it uses them
Nodes	What each node type does and how it behaves at run time
Chat prompts	How the AI assistant turns descriptions into operations
Triggers	Scheduled, file-based, and webhook automation

What DagFlux can do

In one workflow you can:

Pull data from any supported database (relational, document, or vector), cloud storage bucket, local file, spreadsheet service, or REST API.
Reshape it with AI-generated SQL, NoSQL operations, or custom code, either creating a new dataset or modifying the source in place.
Combine data from multiple sources with joins — including hybrid joins that mix relational, document, and vector sources, and joins that perform similarity search per row.
Write the result to a local file (in many formats), back into a database, into a cloud bucket, or into a vector index (with embeddings generated automatically).
Run the whole workflow on demand, on a schedule, when a file changes, or in response to an HTTP webhook.
Automatically generate an AI analysis report on the final output.

DagFlux ships with a curated set of integrations and can install more on demand from the plugin catalog.