Build Your Ontology

Ontology is TextQL’s unified semantic layer — a shared source of truth for metric definitions, entity relationships, and business logic that eliminates conflicting definitions across teams. This guide covers the full setup: getting content in, defining objects and metrics, configuring how Ana loads it, and reviewing and auditing changes.

Ontology editor showing Files, Graph, Reviews, and History tabs

Before You Start

The quality of your ontology at launch is a function of the institutional knowledge you bring to it. Before opening the editor, gather:

Metric definitions — how your team actually calculates revenue, churn, conversion, activation. If different teams use different definitions, collect all of them. Ontology is where you canonicalize.
Existing SQL and notebooks — recurring analyses your team runs are direct evidence of what Ana will need to produce. These are your ground truth.
Schema documentation — any notes on non-obvious joins, unreliable columns, or connector-specific quirks that wouldn’t be apparent from table names alone.
Tribal knowledge — meeting notes, Slack threads, recorded data reviews, onboarding docs, ad-hoc write-ups where your team has captured how numbers behave, which fields have edge cases, or why a given metric is calculated the way it is.

You don’t need to format any of this. Raw SQL, Notion links, Slack exports — drop it in. The goal is to start with signal, not structure.

Build incrementally, not exhaustively

Ontology is a living semantic layer. Launch with what you have, then let usage drive what gets added next. Every Thread surfaces gaps — a definition someone corrects mid-analysis, a join Ana had to infer, a rule that keeps coming up across Threads. Non-admin users propose enrichments as they work, and those come through an approval flow for you to review and commit. Don’t try to complete Ontology before your first Thread. Start with your highest-stakes metric definitions, get Ana running against real questions, and let the gaps tell you where to go next.

Step-by-Step Guide

There are three ways to populate Ontology — all give you access to the same full feature set:

Method 1: GitHub Integration

For teams with existing documentation or metric definitions already in Git.

Method 2: Create Files Directly

Build in the UI editor — no Git setup required.

Method 3: Build Through Threads

Let Ana capture knowledge as you run analyses. No admin access required.

Method 1: GitHub Integration

If your team already maintains business logic, metric definitions, or schema documentation in a Git repository, connect it directly. TextQL syncs the repository into Ontology: files appear as context Ana can read, and changes flow in both directions. This is the right choice when you want version control on ontology changes, multiple teams contributing through a PR workflow, or you already have a documentation repository.

Why connect Git

Edit outside TextQL — your team can edit ontology files directly in GitHub, VS Code, or any editor, and changes sync back
Use existing repos — if your organization already has a documentation repository, connect it and bring everything in without migrating manually
Pull requests — optionally route Ana’s proposed changes through GitHub PRs instead of the in-app Reviews flow
Works with any Git host — GitHub, GitLab, AWS CodeCommit, Bitbucket, or any standard Git remote

Connecting a repository

Click Connect Git in the top right of the Ontology editor. Choose your authentication method:

GitHub App
SSH Key
HTTPS Token (Recommended)

GitHub App: the easiest option for GitHub users. Click Install GitHub App and you’ll be redirected to GitHub to install the TextQL Library Sync app on your organization. Once installed, select the repository and branch to connect.

Connect to Git modal showing GitHub App and push mode options

HTTPS Token: use a personal access token for HTTPS-based authentication. Enter the repository URL and your token.

Finding your repository URL: On GitHub, click the green Code button, select the HTTPS tab, and copy the URL (e.g. https://github.com/your-org/your-repo.git).

HTTPS token connection form showing repository URL and token fields

Getting a GitHub Personal Access Token:

1. Go to GitHub → profile picture → Settings
2. In the left sidebar, scroll to Developer settings
3. Go to Personal access tokens → Fine-grained tokens
4. Click Generate new token, set a name and expiration, then under Repository permissions grant:
   - Contents: Read and write
   - Pull requests: Read and write
   - Metadata: Read-only
5. Copy the token immediately; GitHub won’t show it again
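Before pasting the repository URL into the connection form, it can help to confirm it has the HTTPS shape described above. The following is a minimal sketch: `parse_github_https_url` is a hypothetical helper written for this example, not part of TextQL or GitHub tooling, and it only handles github.com URLs.

```python
from urllib.parse import urlparse


def parse_github_https_url(url: str) -> tuple[str, str]:
    """Return (owner, repo) for a URL like https://github.com/your-org/your-repo.git.

    Hypothetical helper for illustration; raises ValueError for any other shape.
    """
    parsed = urlparse(url.strip())
    if parsed.scheme != "https" or parsed.netloc.lower() != "github.com":
        raise ValueError(f"not an HTTPS GitHub URL: {url!r}")

    # Path looks like "/your-org/your-repo.git"; drop empty segments.
    parts = [p for p in parsed.path.split("/") if p]
    if len(parts) != 2:
        raise ValueError(f"expected /<owner>/<repo>.git, got {parsed.path!r}")

    owner, repo = parts
    if repo.endswith(".git"):
        repo = repo[: -len(".git")]
    if not repo:
        raise ValueError(f"missing repository name in {url!r}")
    return owner, repo
```

An SSH-style URL such as `git@github.com:your-org/your-repo.git` is rejected by this check, which is a quick way to notice that the wrong tab was selected under the Code button.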

Push mode

When Ana or a team member makes a change, you choose how it gets written back to your repo:

Mode	How it works
Direct push	Changes are committed directly to the connected branch. No review required in GitHub.
Create PRs	Each change opens a pull request in your repo. Your team reviews and merges via your existing GitHub workflow.

Bidirectional sync

Once connected, the integration syncs in both directions:

TextQL → Git: Approved changes in Ontology are pushed to the repo (as commits or PRs, depending on your push mode)
Git → TextQL: Changes committed to the repo’s connected branch are pulled into Ontology automatically, typically within a few minutes

When you first connect a repo, Ontology imports the repository’s existing file structure. Folders, subfolders, and files are brought in as-is — you can then apply role-based access and auto-attach settings on top.

Method 2: Create Files Directly

You can skip Git entirely. The Ontology editor has a built-in file editor: create folders, write files, and manage everything directly inside TextQL without connecting a repository.

Navigating the editor

The Ontology editor has four tabs:

Files — the main file browser. Create folders, add files, and edit content.
Graph — a visual map of all files in your Files tab. Shows how your defined objects, metrics, and relationships connect to each other.
Reviews — proposed changes from Ana waiting for approval.
History — a full audit log of every change made to Ontology.

The Files tab is the primary surface. Click any folder to open it and see its files. The main panel shows the selected file with three sub-tabs: Preview (rendered content), Source (raw text), and Properties (auto-attach and access settings).

Creating a file

Click + New at the top of the file browser. A dropdown gives you three options:

Create new file — opens the editor to write a file directly
Upload files — upload an existing file from your machine (PDFs, images, CSVs, etc.)
Create new folder — add a new folder to organize your files

When creating a file, give it a name and write your content directly in the editor. Add an optional commit message to describe what the file is; it shows up in the History tab. Click Create to save. The file is immediately available to Ana.

File types

Type	Extension	Best for
Markdown	`.md`	Business rules, instructions, metric definitions, formatting preferences
Ontology graph	`.tql`	Formally defined objects, metrics, and links — compiled directly into queries. Created through the Graph tab, not by hand.
Python	`.py`	Reusable scripts Ana can call — API clients, custom calculations, transformations
CSV	`.csv`	Reference tables, lookup data, territory mappings, static datasets
Text	`.txt`	Free-form notes or plain text reference
PDF / Image	`.pdf` `.png`	Brand guides, ERDs, onboarding decks, visual reference material

Markdown is the most common. Use it for anything you’d explain to a new analyst in plain language: how revenue is calculated, which accounts to exclude, what the pipeline stages mean.

.tql files are the other major type. They store formal object, metric, and link definitions: the structured model Ana compiles directly into queries. You don’t normally write these by hand; Ana authors them through Threads and they are stored automatically in the Files tab. See the .tql section below for details.

Python files unlock something more powerful: Ana can call these scripts directly during analysis, whether for hitting internal APIs, applying company-specific business logic, or running any calculation that would take too long to re-derive each time.
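As an illustration of the kind of script that could live in Ontology as a `.py` file, here is a minimal sketch of a fiscal-calendar helper. It assumes a fiscal year running July 1 to June 30 and labels each fiscal year by the calendar year in which it ends. The function names and that labeling convention are assumptions made for the example, not TextQL APIs.

```python
from datetime import date

# Assumption for this sketch: the fiscal year starts on July 1.
FISCAL_YEAR_START_MONTH = 7


def fiscal_quarter(d: date) -> int:
    """Return the fiscal quarter (1-4): Q1=Jul-Sep, Q2=Oct-Dec, Q3=Jan-Mar, Q4=Apr-Jun."""
    months_into_year = (d.month - FISCAL_YEAR_START_MONTH) % 12
    return months_into_year // 3 + 1


def fiscal_year(d: date) -> int:
    """Return the fiscal year, labeled by the calendar year in which it ends (assumption)."""
    return d.year + 1 if d.month >= FISCAL_YEAR_START_MONTH else d.year


def fiscal_label(d: date) -> str:
    """Return a label such as 'FY2025 Q1' for display in an analysis."""
    return f"FY{fiscal_year(d)} Q{fiscal_quarter(d)}"
```

Keeping the rule in one script means every analysis that needs fiscal periods applies the same boundaries instead of re-deriving them from a prose description.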

PDFs and images cannot be set to auto-attach for performance reasons. To have Ana always reference one, add a pointer in your ANA.md: “At the start of every Thread, read data-dictionary.pdf.”

Organizing with folders

Organize by business domain, not file type. Each folder should represent a context Ana might need:

ANA.md                          ← org-wide entry point
Finance/
  README.md                     ← what's in this folder and when Ana should use it
  revenue-definitions.md
  fiscal-calendar.md
  exclusion-rules.md
Go-To-Market/
  README.md
  pipeline-definitions.md
  crm-field-glossary.md
Product/
  README.md
  engagement-metrics.md
Connectors/
  snowflake-notes.md
  salesforce-notes.md

A few principles:

One topic per file. A focused revenue-definitions.md is more useful than a finance-misc.md that grows unbounded. Atomic files are easier to version, permission, and retrieve.
Index every folder. A short README.md at the top of each folder — what’s here, when to use it — lets Ana navigate a large ontology without loading every file inside it.
Treat it like code: stale definitions cause real errors. Unlike human-facing docs that get skipped, Ana follows instructions precisely. An outdated revenue definition produces wrong numbers silently. Update Ontology when business logic changes.
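If your ontology is checked out locally (for example through the Git integration), the "index every folder" principle can be checked mechanically. The sketch below is one possible approach; `folders_missing_readme` is a hypothetical helper, and it assumes the index file is always named exactly `README.md`.

```python
from pathlib import Path


def folders_missing_readme(root: Path) -> list[str]:
    """Return folders under root (relative, sorted) that hold files but no README.md."""
    missing = []
    for folder in sorted(p for p in root.rglob("*") if p.is_dir()):
        relative = folder.relative_to(root)
        # Skip hidden folders such as .git and anything inside them.
        if any(part.startswith(".") for part in relative.parts):
            continue
        has_files = any(child.is_file() for child in folder.iterdir())
        if has_files and not (folder / "README.md").is_file():
            missing.append(relative.as_posix())
    return missing
```

Run against the example layout above, a check like this would report `Connectors`, since that folder contains notes but no index file.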

ANA.md: the org-wide entry point

ANA.md is a special filename. Any file named ANA.md anywhere in your ontology is automatically loaded into every single chat, for every user, regardless of role or connector. No configuration needed.

Use ANA.md for rules that apply universally across your org:

# Org Instructions

- Always cite the date range and source tables in your analysis
- Exclude test accounts (account_type = 'test') from all revenue calculations
- Fiscal year runs July 1 – June 30: Q1=Jul-Sep, Q2=Oct-Dec, Q3=Jan-Mar, Q4=Apr-Jun

## Navigation

| When asked about | Go to |
|---|---|
| Revenue, ARR, MRR, bookings | `Finance/revenue-definitions.md` |
| Pipeline, deals, CRM data | `Go-To-Market/pipeline-definitions.md` |
| Active users, engagement | `Product/engagement-metrics.md` |
| Data source quirks (Snowflake) | `Connectors/snowflake-notes.md` |

Keep ANA.md short and universal. Role-specific or connector-specific instructions belong in their own files with targeted auto-attach settings.
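Because Ana follows the navigation table literally, a path that no longer exists sends her to a file she cannot load. For teams that keep Ontology in a local checkout, a small check like the sketch below can catch that early. The function names are hypothetical, and the sketch assumes paths in the table are written in backticks, end in `.md`, and are relative to the folder containing ANA.md.

```python
import re
from pathlib import Path

# Backticked relative paths ending in .md, e.g. `Finance/revenue-definitions.md`
_PATH_IN_BACKTICKS = re.compile(r"`([^`]+\.md)`")


def navigation_targets(markdown: str) -> list[str]:
    """Return every backticked .md path found in markdown table rows."""
    targets = []
    for line in markdown.splitlines():
        if not line.lstrip().startswith("|"):
            continue  # only look at table rows
        targets.extend(_PATH_IN_BACKTICKS.findall(line))
    return targets


def broken_navigation_targets(ana_md: Path) -> list[str]:
    """Return referenced paths that do not exist next to the given ANA.md."""
    root = ana_md.parent
    text = ana_md.read_text(encoding="utf-8")
    return [t for t in navigation_targets(text) if not (root / t).is_file()]
```

For example, if a table row points at `GTM/pipeline-definitions.md` but the folder is actually named `Go-To-Market/`, that path is reported as broken.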

Method 3: Build Through Threads (Non-Admin Friendly)

Every analysis has the potential to make Ontology better. As you work in Threads with Ana, you’ll surface definitions worth saving, rules that should apply everywhere, or data quirks Ana figured out mid-analysis. You can ask Ana to propose any of those as additions to Ontology — nothing you submit affects anyone else until an admin approves it.

What you can propose

Any plain-language knowledge that would help Ana answer questions more accurately:

A metric definition or calculation rule (“exclude refunded orders from revenue”)
A fiscal calendar or date convention (“Q1 starts February 1”)
A data quirk or known issue (“the status column in orders uses 1/0, not true/false”)
A business rule or exclusion (“don’t include test accounts in any analysis”)
A clarification about what a table or column represents
A correction to an existing definition you know is wrong

How to ask

At any point in a Thread, just say it in plain language:

“Save what you learned about this connector to Ontology.”
“Add a rule that we always exclude refunds from net revenue.”
“The churn calculation we just worked out — can you save that?”
“Update the Finance folder with the new fiscal year start date.”
“What did you learn today that would be useful to save?”

Ana knows the structure of your ontology — which folders exist, what files are already in them — and places new content in the right location. She’ll draft the change and show you exactly what she’s proposing before anything is submitted.

The approval flow

Ana never writes directly to Ontology. Every change goes through a review step first. After Ana drafts a proposed change, you see it in the Thread with two options:

Approve — applies the change immediately
Discard — throws it away, nothing is saved

If you have Editor access to the target folder, you can approve directly in the Thread. If not, Ana routes it to the Reviews tab for an admin to approve. See Role & Access for how folder access levels are configured. Once approved, the change is live for every user on your team.

Ontology Reviews tab showing a pending patch with diff and Approve/Reject buttons

Read the diff before approving. An incorrect rule will be applied to every future Thread for every user who loads that file.

What Ana captures well

Schema quirks — non-obvious join keys, unexpected column encodings, table-specific behavior discovered during analysis
Business definitions — if you correct Ana mid-Thread (“actually, ‘active users’ means last 30 days, not 7”), she can save that immediately
Query patterns — complex logic that comes up repeatedly for a specific connector or metric
Org rules — anything you find yourself repeating across Threads

After a few rounds of this, Ontology builds itself around the work your team is actually doing.

Metric Query — Templated SQL Files (.tql)

.tql files are authored by Ana through Threads and stored automatically in the Files tab. To update a .tql file, ask Ana to revise it during a conversation — the result goes through the standard approval flow before going live. You can also edit manually in the Files tab, which commits directly to Main without going through Review.

TextQL has two SQL generation modes:

Mode	How it works	Output
Text-to-SQL	LLM writes SQL from the natural language question	Flexible, probabilistic
Ontology metric query	Ana pulls the relevant `.tql` file and executes it directly — no LLM in the SQL path	Deterministic, consistent

High-stakes metrics — ARR, churn, conversion, activation — should not be re-derived on every question. The LLM resolves intent and supplies parameters; your pre-authored SQL handles execution.

Key Concepts

Every .tql query is structured around five elements:

Element	What it does
Core fact object	The primary entity the query is built around (e.g. `Sale`, `Customer`)
Metrics	The quantifiable measurements to compute — sum, count, average, or custom expressions
Dimensions	Attributes used to slice and group metrics — time, region, category, status
Filters	Criteria that include or exclude rows — date ranges, status values, thresholds
Orders	Sorting applied to results — e.g. descending by revenue to surface top performers

TQL — the Query Language

TQL is a typed SQL templating language. A .tql file declares typed parameters, encodes business logic, and produces consistent SQL regardless of how it’s invoked.

Plain SQL template: use this when the logic is essentially SQL with a few direct parameters:

params {
  -- Inclusive lower bound in ISO 8601 format.
  created_after: Timestamp?
  -- Country code. Examples: "US", "CA", "BR"
  country: String = "US"
}

SELECT *
FROM transactions
WHERE country = ${country}
  AND created_at >= ${created_after}
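To make the idea of typed parameters with defaults concrete, here is a rough Python analogy using the standard-library `sqlite3` module. It is not how TextQL compiles or executes `.tql` files. The sample rows, the `run_transactions_query` function, and the choice to skip the date filter when `created_after` is omitted are all assumptions for illustration.

```python
import sqlite3
from typing import Optional


def run_transactions_query(
    conn: sqlite3.Connection,
    country: str = "US",                   # mirrors `country: String = "US"`
    created_after: Optional[str] = None,   # mirrors `created_after: Timestamp?`
) -> list[tuple]:
    """Run a fixed query; callers supply parameter values, never SQL text."""
    sql = "SELECT id, country, created_at FROM transactions WHERE country = :country"
    params = {"country": country}
    if created_after is not None:
        # Assumption: an omitted nullable parameter means "no lower bound".
        sql += " AND created_at >= :created_after"
        params["created_after"] = created_after
    sql += " ORDER BY id"
    return conn.execute(sql, params).fetchall()


# Tiny in-memory dataset so the sketch runs on its own.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE transactions (id INTEGER, country TEXT, created_at TEXT)")
conn.executemany(
    "INSERT INTO transactions VALUES (?, ?, ?)",
    [
        (1, "US", "2024-01-05T00:00:00"),
        (2, "US", "2024-03-01T00:00:00"),
        (3, "CA", "2024-02-10T00:00:00"),
    ],
)
```

Calling `run_transactions_query(conn)` uses the default country, while passing `created_after="2024-02-01T00:00:00"` narrows the result. In both cases the SQL text itself stays fixed, which is the property the template approach is after.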

Expression body: use this when you need branching, reusable fragments, or dimension-driven SQL structure:

params {
  metrics: Set<"revenue" | "gmv" | "order_count">
  dimensions: Set<"buyer" | "seller" | "category"> = []
  filters: List<FilterInput> = []
}

let
  metric_frags = matchSet metrics {
    "revenue"      -> sql"SUM(t.revenue) AS revenue"
    "gmv"          -> sql"SUM(t.gmv) AS gmv"
    "order_count"  -> sql"COUNT(*) AS order_count"
  }
  dim_entries = matchSet dimensions {
    "buyer"    -> { expr = sql"buyer_u.name",  join = sql"JOIN users buyer_u ON t.buyer_user_id = buyer_u.id" }
    "seller"   -> { expr = sql"seller_u.name", join = sql"JOIN users seller_u ON t.seller_user_id = seller_u.id" }
    "category" -> { expr = sql"p.category",    join = sql"JOIN products p ON t.product_id = p.id" }
  }
  joins         = concatSep " " (map (\d -> d.join) dim_entries)
  select_dims   = concatSep ", " (map (\d -> d.expr) dim_entries)
  select_metrics = concatSep ", " metric_frags
  select_expr   = if isEmpty select_dims then select_metrics else sql"${select_dims}, ${select_metrics}"
  group_clause  = wrap "GROUP BY " "" select_dims
in sql''
  SELECT ${select_expr}
  FROM transactions t
  ${joins}
  ${group_clause}
''

This pattern — metrics, dimensions, and filters params combined with matchSet and filterWhere — is the standard semantic-view pattern. It gives callers a compact, stable API while keeping all SQL logic in authored code.
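For readers more familiar with general-purpose languages, the selection logic of the expression above can be approximated in Python. This is an analogy only: it does not reproduce TQL semantics, it leaves out filters entirely, and `build_query` plus the rule that at least one metric is required are assumptions made for the sketch.

```python
# Allowed labels and their SQL fragments (mirrors the matchSet arms above).
METRICS = {
    "revenue": "SUM(t.revenue) AS revenue",
    "gmv": "SUM(t.gmv) AS gmv",
    "order_count": "COUNT(*) AS order_count",
}
DIMENSIONS = {
    "buyer": ("buyer_u.name", "JOIN users buyer_u ON t.buyer_user_id = buyer_u.id"),
    "seller": ("seller_u.name", "JOIN users seller_u ON t.seller_user_id = seller_u.id"),
    "category": ("p.category", "JOIN products p ON t.product_id = p.id"),
}


def build_query(metrics, dimensions=()) -> str:
    """Assemble SQL from validated labels; callers never pass SQL text."""
    metrics, dimensions = set(metrics), set(dimensions)
    unknown = (metrics - set(METRICS)) | (dimensions - set(DIMENSIONS))
    if unknown:
        raise ValueError(f"unknown labels: {sorted(unknown)}")
    if not metrics:
        raise ValueError("at least one metric is required")  # assumption

    # Iterate the dicts rather than the inputs so output order is stable.
    select_metrics = ", ".join(sql for name, sql in METRICS.items() if name in metrics)
    chosen = [DIMENSIONS[name] for name in DIMENSIONS if name in dimensions]
    select_dims = ", ".join(expr for expr, _ in chosen)
    joins = " ".join(join for _, join in chosen)

    select_expr = f"{select_dims}, {select_metrics}" if select_dims else select_metrics
    parts = [f"SELECT {select_expr}", "FROM transactions t"]
    if joins:
        parts.append(joins)
    if select_dims:
        parts.append(f"GROUP BY {select_dims}")
    return "\n".join(parts)
```

The caller only chooses labels from a closed set. Joins and grouping follow from those choices, so a join is added only when a dimension actually needs it.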

Supported Parameter Types

Type	JSON shape	Notes
`Int`	integer	Non-integer values are rejected
`Float`	number	Integers also accepted
`String`	string
`Bool`	boolean
`Date`	string	Document expected format in `--` comments
`Timestamp`	string	Same as `Date`
`Set<"...">`	array of strings	Values validated against allowed labels
`List<T>`	array	Elements validated recursively
`FilterInput`	object	Used inside `List<FilterInput>` for dynamic filtering

For the complete language reference — nullability, defaults, matchSet, filterWhere, concatSep, wrap, and common gotchas — see the TQL Reference.
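The JSON shapes in the table can be read as validation rules. The sketch below expresses a subset of them in Python for illustration. `is_valid` is a hypothetical function, not a TextQL API, and details the table does not spell out (for example, treating `1.0` as a non-integer, or rejecting booleans for numeric types) are assumptions.

```python
from typing import Any, Iterable


def is_valid(type_name: str, value: Any, labels: Iterable[str] = ()) -> bool:
    """Check a JSON-decoded value against a subset of the parameter types above."""
    if type_name == "Int":
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(value, int) and not isinstance(value, bool)
    if type_name == "Float":
        # Integers are also accepted for Float.
        return isinstance(value, (int, float)) and not isinstance(value, bool)
    if type_name in ("String", "Date", "Timestamp"):
        # Date and Timestamp arrive as strings; format checks are out of scope here.
        return isinstance(value, str)
    if type_name == "Bool":
        return isinstance(value, bool)
    if type_name == "Set":
        allowed = set(labels)
        return isinstance(value, list) and all(
            isinstance(v, str) and v in allowed for v in value
        )
    raise ValueError(f"unsupported type in this sketch: {type_name}")
```

For instance, `is_valid("Set", ["revenue", "gmv"], labels={"revenue", "gmv", "order_count"})` passes, while a list containing `"profit"` fails because the label is not in the allowed set.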

Walkthrough

TQL File Built in Chat
Ontology Graph
Clicking into a TQL File

Dynamic Loading — How Ana Uses Your Ontology

Ana doesn’t load your entire ontology into every Thread. She navigates it — reading files selectively based on what’s relevant to the conversation, the user’s role, and the connector they’re using.

Auto-attached files

Auto-attached files load automatically at the start of every applicable chat. Set this on a file’s Properties tab.

File properties showing Auto Attach settings

When auto-attach is on, you can scope it so the file only loads when the right conditions are met:

Condition	When the file is auto-loaded
Always attach	Every Thread, for every user
By role	Only when the user has a specific role (e.g., Go-To-Market)
By DB connector	Only when a specific database connector is active
By API connector	Only when a specific API connector is active

Conditions can be combined. For example, auto-attach the Finance fiscal calendar only for Finance-role users on the Snowflake connector.

Keep the always-on list short. Every auto-attached file adds to the context of every Thread it applies to. Files that are only sometimes relevant are better left on-demand, so they load only when the conversation calls for them.

On-demand files

On-demand files have no auto-attach conditions set. Ana finds and loads them during the Thread based on what the user is asking. Most files in your ontology should be on-demand. The primary way to guide on-demand loading is a navigation table — add one to ANA.md or a folder-level README.md and Ana will know exactly which file to load when a topic comes up:

When asked about	Go to
“DAU” / “Daily active users”	`metrics/daily_active_users.md`
“New user signups” / “Cumulative users”	`metrics/new_user_signups.md`
“How many tool calls?”	`metrics/tool_calls.md`

When a user asks “how many daily active users did we have this week?”, Ana matches the trigger and loads metrics/daily_active_users.md directly — without pulling in anything else.

Practical example

File	Auto-attach setting
`ANA.md`	Always (org-wide instructions + navigation pointers)
`Organization Context/brand-kit.md`	On-demand
`Finance/fiscal-calendar.md`	Finance role
`Go-To-Market/pipeline-definitions.md`	Go-To-Market role
`Connectors/snowflake-schema-notes.md`	Snowflake connector
`Marketing/utm-conventions.md`	On-demand

A Finance analyst chatting with Snowflake gets ANA.md + fiscal-calendar.md + snowflake-schema-notes.md auto-loaded. Ana finds brand-kit.md and utm-conventions.md on demand. The Go-To-Market files are invisible to this user entirely.
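The scenario above can be modeled in a few lines of Python to show how the conditions interact. This is a conceptual sketch, not TextQL's implementation: `AutoAttachRule` and `auto_loaded` are hypothetical names, and the sketch assumes that combined conditions must all match.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AutoAttachRule:
    """Hypothetical model of one file's auto-attach setting."""
    path: str
    always: bool = False
    role: Optional[str] = None        # required role, if any
    connector: Optional[str] = None   # required connector, if any

    def applies(self, user_roles: set, active_connector: str) -> bool:
        if self.always:
            return True
        if self.role is None and self.connector is None:
            return False  # on-demand file: never auto-loaded
        role_ok = self.role is None or self.role in user_roles
        connector_ok = self.connector is None or self.connector == active_connector
        return role_ok and connector_ok  # assumption: combined conditions are ANDed


RULES = [
    AutoAttachRule("ANA.md", always=True),
    AutoAttachRule("Organization Context/brand-kit.md"),
    AutoAttachRule("Finance/fiscal-calendar.md", role="Finance"),
    AutoAttachRule("Go-To-Market/pipeline-definitions.md", role="Go-To-Market"),
    AutoAttachRule("Connectors/snowflake-schema-notes.md", connector="Snowflake"),
    AutoAttachRule("Marketing/utm-conventions.md"),
]


def auto_loaded(rules, user_roles, active_connector):
    """Return the paths that would be attached at the start of a Thread."""
    return [r.path for r in rules if r.applies(user_roles, active_connector)]
```

Calling `auto_loaded(RULES, {"Finance"}, "Snowflake")` yields `ANA.md`, the fiscal calendar, and the Snowflake schema notes, which matches the walkthrough above.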

History & Version Control

Every change to Ontology is tracked — who made it, when, and exactly what changed.

The Reviews tab

The Reviews tab is where Ana’s proposed changes wait for approval. Each review shows:

Who proposed it — the user (via Ana) who initiated the change
The source thread — a link to the Thread where the change was proposed
The diff — a line-by-line view of what’s being added or removed
The target file — which file will be affected

Click Approve to commit the change immediately, or Reject to discard it. Admins can always approve. Users with Editor access on the target folder can also approve patches for files within that folder. See Role & Access for how access levels are configured.

The History tab

The History tab shows a chronological log of every change applied to Ontology — approved patches, manual edits, file uploads, deletions, and imports.

Ontology History tab showing a list of changes with diffs

Each entry shows the file changed, who changed it, when, and the full diff. If a change introduced something incorrect, click the entry to see the old content and create a new patch to revert it. If your ontology is connected to a Git repository, all history is also reflected in your repo’s commit log.

What to Build First

If you’re starting from scratch, prioritize in this order:

1. ANA.md with org-wide rules and a navigation table
2. One folder per major business domain with a README.md in each
3. Metric definitions as .md files: revenue, ARR, churn, conversion first
4. Objects and links (via Ana chat) for the 3–5 tables your team queries most
5. .tql metric queries for calculations that need deterministic, structurally enforced SQL
6. Role-scoped files and auto-attach configuration once the baseline is stable
