# Using Flowbaby Tools with Custom Agents
Flowbaby Chat Memory provides Language Model Tools that allow GitHub Copilot and custom agents to autonomously access workspace memory. These tools appear in VS Code's "Configure Tools" dialog and can be referenced in custom agent configurations.
## Quick Start
1. **Enable tools via the Configure Tools UI:**
   - Open Copilot chat → click the "Tools" icon → "Configure Tools"
   - Find the "Flowbaby" section
   - Toggle "Store Memory in Flowbaby" and "Retrieve Flowbaby Memory"
   - Toggle tools on/off individually (disabled by default for privacy)
2. **Use in chat:**
   - Type `#flowbaby` to see autocomplete suggestions
   - Select `#flowbabyStoreSummary` or `#flowbabyRetrieveMemory`
   - Tools appear only when enabled via Configure Tools
3. **Transparency:**
   - All tool invocations are logged in the Output channel ("Flowbaby Agent Activity")
   - The Configure Tools UI provides visual feedback for tool state
## Available Tools
### Store Memory Tool (`#flowbabyStoreSummary`)
Stores conversation summaries in Flowbaby knowledge graph.
**Parameters:**

- `topic` (required): Summary title
- `context` (required): Summary description
- `decisions` (optional): Key decisions made
- `rationale` (optional): Reasoning behind decisions
- `metadata` (optional): Plan ID, status, etc.
### Retrieve Memory Tool (`#flowbabyRetrieveMemory`)
Searches Flowbaby knowledge graph for relevant memories.
**Parameters:**

- `query` (required): Natural language search query
- `maxResults` (optional): Max results to return (default: 3, max: 10)
**Returns:** Both narrative markdown and structured JSON for agent parsing.
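Because the tool response mixes narrative markdown with a fenced JSON block, an agent that wants the structured half has to extract it first. The helper below is a hypothetical sketch (not part of the extension API) of one way to do that; it assumes the structured payload always arrives inside a fenced `json` code block, as shown later in this guide.

```typescript
// Hypothetical helper (not part of the extension API): pull the structured
// JSON payload out of the combined narrative + JSON tool response.
function extractStructuredResponse(toolOutput: string): unknown | null {
  // The structured payload is emitted inside a fenced json code block;
  // build the fence markers dynamically to keep this snippet embeddable.
  const open = '`'.repeat(3) + 'json';
  const close = '`'.repeat(3);
  const start = toolOutput.indexOf(open);
  if (start === -1) return null;
  const bodyStart = start + open.length;
  const end = toolOutput.indexOf(close, bodyStart);
  if (end === -1) return null;
  try {
    return JSON.parse(toolOutput.slice(bodyStart, end));
  } catch {
    return null; // Malformed JSON: fall back to the narrative text only
  }
}
```

If parsing fails, the narrative markdown is still usable on its own, so returning `null` rather than throwing keeps the agent flow resilient.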
## Transparency
When agents use Flowbaby, you see:
- Output Channel: All tool invocations logged in "Flowbaby Agent Activity"
- Configure Tools UI: Visual feedback for which tools are enabled/disabled
- Chat Autocomplete: `#flowbaby*` commands only appear when tools are enabled
- Workspace Logs: Detailed bridge and ingestion logs are stored under `.flowbaby/logs` in each workspace
# Agent Integration Guide: Flowbaby Memory

**Version:** 1.1
**Last Updated:** 2025-11-22
**Plan:** 019 - Rebranding to Flowbaby
## Overview
The Flowbaby Memory extension provides commands for GitHub Copilot agents and third-party VS Code extensions to store and retrieve structured conversation summaries. This enables:
- Agent continuity: Agents can maintain context across sessions without manual capture
- Multi-agent collaboration: Different agents can share memory via a common knowledge base
- Custom workflows: Extensions can build custom agent memory patterns
## Security and Privacy

### Configure Tools Authorization Model
⚠️ IMPORTANT: Flowbaby tools are controlled exclusively through VS Code's Configure Tools UI. When you enable tools there, they become available to GitHub Copilot and all extensions in the workspace.
**Why Configure Tools?**
- VS Code native mechanism for tool authorization
- Users explicitly opt-in via clear UI
- Tools can be enabled/disabled per workspace
- Follows VS Code best practices for Language Model Tools
**Recommendations:**
- ✅ Enable tools only in workspaces with trusted extensions
- ✅ Review installed extensions before enabling
- ✅ Inspect audit logs regularly (Output > "Flowbaby Agent Activity")
- ❌ Do NOT enable in untrusted or public workspaces
- ❌ Do NOT enable if workspace contains sensitive data
### Audit Logging
All agent ingestion and retrieval attempts are logged:
- **Output Channel:** Output > "Flowbaby Agent Activity"
  - Real-time log of all agent commands
  - Shows timestamp, agent name (if provided), topic, and result
  - Example: `[Agent Ingest] 2025-11-19T08:12:44Z - Agent: GitHub Copilot - Topic: Plan 015 Implementation - Status: success`
- **Audit Log File:** `.flowbaby/agent_audit.log`
  - Structured JSON log for programmatic analysis
  - Format: `{"timestamp": "2025-11-19T08:12:44Z", "command": "ingestForAgent", "agentName": "GitHub Copilot", "topicDigest": "a1b2c3d4", "result": "success", "errorCode": null}`
  - Topic digest: first 8 characters of a SHA-256 hash (for privacy)
## Configuration

### Tool Authorization
Flowbaby tools are controlled through VS Code's Configure Tools UI:
- Open Copilot chat → Click "Tools" (⚙️) → "Configure Tools"
- Find "Store Memory in Flowbaby" and "Retrieve Flowbaby Memory"
- Toggle tools on/off (disabled by default)
No workspace settings required for authorization.
### LLM API Key (Required)

Configure the LLM API key in your workspace `.env`:

```
LLM_API_KEY=sk-...
```

Without this, ingestion fails with error code `MISSING_API_KEY`.
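An agent can pre-check for the key and warn the user up front instead of surfacing `MISSING_API_KEY` after a failed ingestion. `hasLlmApiKey` below is a hypothetical helper operating on the `.env` file's text; it only checks that an uncommented `LLM_API_KEY=` line has a non-empty value, not that the key is actually valid.

```typescript
// Pre-flight check (illustrative): scan the workspace .env text for an
// LLM_API_KEY entry before attempting ingestion, so the agent can warn the
// user instead of hitting a MISSING_API_KEY error after the fact.
function hasLlmApiKey(envText: string): boolean {
  return envText
    .split(/\r?\n/)
    .some(line => /^\s*LLM_API_KEY\s*=\s*\S+/.test(line));
}
```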
## Ingesting Memories from Agents
**Command:** `Flowbaby.ingestForAgent`

**Signature:** `(requestJson: string) => Promise<string>`

- Input: JSON string containing a `FlowbabyIngestRequest` payload
- Output: JSON string containing a `FlowbabyIngestResponse` result
### TypeScript Example (Minimal)

```typescript
import * as vscode from 'vscode';

// Minimal payload (required fields only)
const payload = {
  topic: "User Question About Async",
  context: "User asked how to use async/await in TypeScript. Agent explained event loop and provided code example.",
  metadata: {
    topicId: "async-question-001",
    createdAt: new Date().toISOString(),
    updatedAt: new Date().toISOString()
  }
};

try {
  const responseJson = await vscode.commands.executeCommand<string>(
    'Flowbaby.ingestForAgent',
    JSON.stringify(payload)
  );
  const response = JSON.parse(responseJson);
  if (response.success) {
    console.log(`✅ Ingested ${response.ingested_chars} characters`);
    console.log(`   Topic ID: ${response.metadata.topic_id}`);
    console.log(`   Duration: ${response.ingestion_duration_sec.toFixed(2)}s`);
  } else {
    console.error(`❌ Ingestion failed: ${response.error}`);
    console.error(`   Error code: ${response.errorCode}`);
  }
} catch (error) {
  console.error('Exception during ingestion:', error);
}
```
### TypeScript Example (Full Payload)

```typescript
import * as vscode from 'vscode';

// Full payload with all optional fields
const payload = {
  topic: "Plan 015 Implementation Strategy",
  context: "User discussed agent ingestion command design with architect. Covered TypeScript schema, validation, and access control.",
  decisions: [
    "Use VS Code commands as primary surface",
    "Implement workspace-global access model",
    "Embed metadata in enriched text"
  ],
  rationale: [
    "Commands are accessible to Copilot agents",
    "VS Code doesn't expose caller identity",
    "Flowbaby 0.3.8 doesn't expose DataPoint class"
  ],
  openQuestions: [
    "Should topic_id be hash-based or UUID?",
    "How to handle cross-workspace memory sync?"
  ],
  nextSteps: [
    "Implement TypeScript validation",
    "Create test agent extension",
    "Add audit logging"
  ],
  references: [
    "Plan 015 documentation",
    "VS Code Extension API docs"
  ],
  timeScope: "2025-11-19T08:00:00Z to 2025-11-19T09:30:00Z (15 turns)",
  metadata: {
    topicId: "plan-015-implementation",
    sessionId: "session-2025-11-19-001",
    planId: "015",
    status: "Active",
    createdAt: "2025-11-19T08:00:00Z",
    updatedAt: "2025-11-19T09:30:00Z"
  },
  agentName: "GitHub Copilot" // Optional, for audit logs
};

const responseJson = await vscode.commands.executeCommand<string>(
  'Flowbaby.ingestForAgent',
  JSON.stringify(payload)
);

const response = JSON.parse(responseJson);
if (!response.success) {
  throw new Error(`Ingestion failed: ${response.error} (${response.errorCode})`);
}
console.log('Summary ingested successfully:', response.metadata.topic_id);
```
### Error Handling

The command returns structured errors for programmatic handling:

```typescript
const response = JSON.parse(responseJson);

if (!response.success) {
  switch (response.errorCode) {
    case 'INVALID_PAYLOAD':
      console.error('Payload validation failed:', response.error);
      // Fix payload and retry
      break;
    case 'MISSING_API_KEY':
      vscode.window.showErrorMessage(
        'LLM_API_KEY not found. Add it to your workspace .env file.',
        'Open Docs'
      );
      break;
    case 'BRIDGE_TIMEOUT':
      vscode.window.showErrorMessage('Flowbaby ingestion timed out. Try again later.');
      break;
    default:
      console.error('Unknown error:', response.error);
  }
}
```
### Common Error Codes

| Error Code | Description | Remediation |
|---|---|---|
| `INVALID_PAYLOAD` | Payload failed schema validation | Check `response.error` for field details |
| `MISSING_API_KEY` | `LLM_API_KEY` not in workspace `.env` | Add API key to `.env` file |
| `INVALID_WORKSPACE_PATH` | Workspace path invalid or inaccessible | Verify workspace exists |
| `BRIDGE_TIMEOUT` | Python bridge exceeded timeout | Retry; check bridge logs |
| `FLOWBABY_ERROR` | Flowbaby library threw exception | Check Output channel for details |
| `429_FLOWBABY_BACKLOG` | Background queue full (5 operations max) | Wait 30-60s for queue to clear, then retry |
## Async Ingestion Behavior (v0.3.3+)

### Overview

Starting in v0.3.3, the `Flowbaby.ingestForAgent` command operates asynchronously to avoid blocking agent workflows. Previously, ingestion took 60-90 seconds and blocked the agent until completion. With async mode:
- **Agent response:** <10 seconds (returns after data staging)
- **Background processing:** 60-90 seconds (knowledge graph construction)
- **Notification:** User receives a completion/failure toast when done
### Timing Expectations

**Ingestion Flow Timeline:**

- 0–5s: Extension receives the `ingestForAgent` command
- 5–10s: Python bridge stages data (`flowbaby.add()`); command returns `success`
- 10–100s: Background subprocess builds the knowledge graph (`flowbaby.cognify()`)
- ~100s: User receives a notification (success or failure toast)

**What This Means for Agents:**
- ✅ Agents can acknowledge ingestion immediately (<10s)
- ✅ Agents don't block waiting for graph construction
- ✅ Multiple ingestion requests can be queued
- ⚠️ Memory is not immediately searchable after command returns (60-90s delay)
### Response Fields for Async Mode

The `FlowbabyIngestResponse` includes fields that indicate async processing:

```typescript
interface FlowbabyIngestResponse {
  success: boolean;
  staged?: boolean;             // true if background processing queued
  operationId?: string;         // UUID for tracking background operation
  ingested_chars: number;
  staging_duration_sec: number; // Time for add() only (excludes cognify)
  // ... other fields
}
```
**Example Response:**

```json
{
  "success": true,
  "staged": true,
  "operationId": "a1b2c3d4-5678-90ab-cdef-1234567890ab",
  "ingested_chars": 1234,
  "staging_duration_sec": 6.2,
  "metadata": { "topic_id": "plan-017-async" }
}
```
### Checking Background Operation Status

To query the status of a background operation:

```typescript
const statusJson = await vscode.commands.executeCommand<string>(
  'Flowbaby.backgroundStatus',
  operationId // optional: a specific operation, or omit for all
);
const status = JSON.parse(statusJson);
console.log(`Operation ${operationId}: ${status.status}`);
// status: 'pending' | 'running' | 'completed' | 'failed' | 'terminated' | 'unknown'
```
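A later example in this guide calls a `waitForCompletion()` helper, but the extension does not provide one. Here is one possible sketch: it polls an injected status function (which in practice would wrap `Flowbaby.backgroundStatus`) until a terminal state or timeout. The callback-based signature is an assumption made here so the helper stays portable and testable outside VS Code.

```typescript
// Sketch of a waitForCompletion helper (hypothetical, not part of the
// extension API). getStatus would typically wrap Flowbaby.backgroundStatus
// for a specific operationId.
type OperationStatus =
  'pending' | 'running' | 'completed' | 'failed' | 'terminated' | 'unknown';

async function waitForCompletion(
  getStatus: () => Promise<OperationStatus>,
  opts: { timeout: number; pollInterval: number }
): Promise<OperationStatus> {
  const deadline = Date.now() + opts.timeout;
  while (Date.now() < deadline) {
    const status = await getStatus();
    // Stop polling once the operation reaches a terminal state
    if (status === 'completed' || status === 'failed' || status === 'terminated') {
      return status;
    }
    await new Promise(resolve => setTimeout(resolve, opts.pollInterval));
  }
  return 'unknown'; // Timed out without reaching a terminal state
}
```

Given the ~60-90s cognify window documented above, a timeout of about 120s with a 5s poll interval is a reasonable starting point.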
### Handling 429_FLOWBABY_BACKLOG

Async mode enforces concurrency limits to prevent resource exhaustion:

- **Max concurrent:** 2 `cognify()` processes running simultaneously
- **Max queued:** 3 pending operations in a FIFO queue
- **Total capacity:** 5 operations (2 running + 3 queued)
When capacity is exceeded, `ingestForAgent` returns an error:

```json
{
  "success": false,
  "errorCode": "429_FLOWBABY_BACKLOG",
  "error": "Background queue full (5 operations max). Wait 30-60s and retry."
}
```
**Recommended Agent Behavior:**

```typescript
const response = JSON.parse(responseJson);
if (response.errorCode === '429_FLOWBABY_BACKLOG') {
  // Option 1: Inform the user and defer ingestion
  vscode.window.showWarningMessage(
    'Memory storage queue full. Capturing conversation later when capacity available.'
  );

  // Option 2: Retry after a delay (aggressive, may annoy the user)
  setTimeout(() => retryIngestion(payload), 60000); // 60s

  // Option 3: Discard this ingestion attempt (acceptable for low-priority captures)
  console.log('Skipping ingestion due to backlog');
}
```
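Option 2 above references a `retryIngestion()` function without defining it. A hypothetical sketch of that retry loop is shown below; the injected `ingest` callback is an assumption standing in for a wrapper around `Flowbaby.ingestForAgent`, and the linear 60s/120s/180s backoff is just one reasonable schedule given the documented 30-60s queue-drain time.

```typescript
// Sketch of a backlog-aware retry helper (hypothetical, not part of the
// extension). Retries only on 429_FLOWBABY_BACKLOG; any other failure is
// returned immediately so the caller can handle it.
interface IngestResult { success: boolean; errorCode?: string; }

async function ingestWithBackoff(
  ingest: () => Promise<IngestResult>, // e.g. wraps Flowbaby.ingestForAgent
  maxAttempts = 3,
  baseDelayMs = 60_000
): Promise<IngestResult> {
  let result: IngestResult = { success: false, errorCode: 'NOT_ATTEMPTED' };
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    result = await ingest();
    if (result.success || result.errorCode !== '429_FLOWBABY_BACKLOG') {
      return result; // Success, or a non-backlog error we should not retry
    }
    // Backlog: wait 60s, 120s, ... before retrying
    await new Promise(resolve => setTimeout(resolve, baseDelayMs * (attempt + 1)));
  }
  return result;
}
```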
### Race Condition: Retrieve Before Cognify Completes

**Scenario:** The agent stores a memory at T+0s, then immediately retrieves at T+5s. Background `cognify()` has not completed yet (still running at ~T+30s).

**Behavior:** Retrieval will not find the newly staged memory because the knowledge graph has not been built yet.
**Mitigation Strategies:**

1. **Inform the user about the delay (recommended):**

   ```typescript
   if (response.staged) {
     vscode.window.showInformationMessage(
       'Memory staged – processing will finish in ~1–2 minutes. You\'ll get a notification when it\'s done.'
     );
   }
   ```

2. **Delay retrieval (if the agent controls both store and retrieve):**

   ```typescript
   // Store memory
   const ingestResponse = await ingestForAgent(payload);

   // Wait for background processing (polling approach)
   if (ingestResponse.staged) {
     await waitForCompletion(ingestResponse.operationId, { timeout: 120000, pollInterval: 5000 });
   }

   // Now retrieve
   const retrieveResponse = await retrieveForAgent(query);
   ```

3. **Accept eventual consistency (best UX):**
   - Don't block the agent on `cognify()` completion
   - The user will see the memory in future retrieval attempts (after ~T+90s)
   - This matches the async philosophy: don't block the workflow for background processing
## Using Flowbaby Tools with GitHub Copilot and Custom Agents (Plan 016)

### Overview

Flowbaby Memory provides two Language Model Tools that appear in VS Code's "Configure Tools" dialog:
- `flowbaby_storeMemory` (`#flowbabyStoreSummary`) - Store conversation summaries
- `flowbaby_retrieveMemory` (`#flowbabyRetrieveMemory`) - Retrieve relevant memories
These tools allow Copilot and custom agents to autonomously access workspace memory through the standard VS Code tools UI.
### Recommended Tool Cadence

- **Retrieve before reasoning:** Kick off each turn (or whenever the user references prior work) by calling `#flowbabyRetrieveMemory` with a natural-language description of the goal. Reviewing context before planning reduces contradictions and surfaces existing decisions.
- **Store after meaningful progress:** Use `#flowbabyStoreSummary` once a task finishes, a decision is made, or a debugging session concludes. Think of it as a state checkpoint recorded every time the agent would normally summarize work back to the user.
- **Batch noise, not signal:** Minor clarifications or single-turn answers generally do not warrant their own summary—capture them inside the next substantial summary instead so retrieval stays precise.
- **Close every session:** End-of-day or pre-handoff recaps greatly improve continuity; default to capturing one summary even if no major event occurred.
These expectations mirror the guidance embedded in the tool metadata (e.g., 300–1500 character context, 0–5 decision bullets) and are advisory rather than enforced.
### Tool Discovery

**In Configure Tools UI:**
- Open any Copilot chat
- Click the "Tools" button (⚙️ icon near input box)
- Click "Configure Tools"
- Find "Flowbaby Memory" tools in the list
- Toggle tools on/off individually
**In Chat (`#` Autocomplete):**

- Type `#flowbaby` in chat to see autocomplete suggestions
- Select `#flowbabyStoreSummary` or `#flowbabyRetrieveMemory`
- Tool description appears in the autocomplete preview
**In Custom Agent `.agent.md` Files:**

```markdown
---
name: Memory-Aware Code Assistant
description: Copilot assistant with access to workspace memory
tools: ['search', 'flowbabyStoreSummary', 'flowbabyRetrieveMemory']
---

You are a code assistant with access to workspace-specific memory.

When the user asks about past decisions or implementations:
1. Use #flowbabyRetrieveMemory to search for relevant context
2. Ground your answer in the retrieved memories
3. If no memories exist, use your training data but clarify it's not workspace-specific

When the user completes an important implementation or makes a decision:
1. Offer to store a summary using #flowbabyStoreSummary
2. Include topic, context, and key decisions in the summary
```
### Store Memory Tool

**Tool Name:** `flowbaby_storeMemory`
**Reference Name:** `flowbabyStoreSummary` (for `#` autocomplete and `.agent.md` files)
**Icon:** `$(database)`

**Input Schema:**

```typescript
{
  topic: string;        // Required: Summary title
  context: string;      // Required: Summary description
  decisions?: string[]; // Optional: Key decisions
  rationale?: string[]; // Optional: Reasoning
  metadata?: {          // Optional: Metadata (auto-generated if omitted)
    plan_id?: string;
    status?: 'Active' | 'Superseded' | 'DecisionRecord';
  }
}
```
**Length guidance (advisory):** Summaries land best when the `context` field is roughly 300–1500 characters describing the goal, actions taken, and outcome. Keep `decisions`/`rationale` to 0–5 concise bullets each. The extension will not reject longer payloads, but downstream agents assume those ranges when budgeting tokens.
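Since these ranges are advisory, an agent can lint its own payload before storing. The checker below is an illustrative sketch (not part of the extension) that flags, without rejecting, summaries outside the 300–1500 character context range or with more than 5 decision/rationale bullets.

```typescript
// Advisory pre-check (illustrative, not part of the extension): flag
// summaries that fall outside the ranges documented above. Nothing is
// rejected; this only helps agents keep payloads within the expected budget.
interface LengthWarning { field: string; message: string; }

function checkSummaryShape(payload: {
  context: string;
  decisions?: string[];
  rationale?: string[];
}): LengthWarning[] {
  const warnings: LengthWarning[] = [];
  if (payload.context.length < 300 || payload.context.length > 1500) {
    warnings.push({
      field: 'context',
      message: 'context is outside the advisory 300-1500 character range'
    });
  }
  for (const field of ['decisions', 'rationale'] as const) {
    const items = payload[field];
    if (items && items.length > 5) {
      warnings.push({ field, message: `${field} has more than 5 bullets` });
    }
  }
  return warnings;
}
```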
**Example Usage in Chat:**

```
User: "Remember that we decided to use Redis for caching"

Agent: I'll store that decision. #flowbabyStoreSummary {
  "topic": "Redis Caching Decision",
  "context": "Team decided to use Redis for session caching to improve performance",
  "decisions": ["Use Redis for session cache", "Deploy as Docker container"],
  "metadata": {"status": "Active"}
}
```

**Tool Response:**

```json
{
  "success": true,
  "summary_id": "redis-caching-decision",
  "ingested_chars": 245,
  "duration_ms": 1234
}
```
### Retrieve Memory Tool

**Tool Name:** `flowbaby_retrieveMemory`
**Reference Name:** `flowbabyRetrieveMemory` (for `#` autocomplete and `.agent.md` files)
**Icon:** `$(search)`

**Input Schema:**

```typescript
{
  query: string;       // Required: Natural language search query
  maxResults?: number; // Optional: Max results (default: 3, max: 10)
}
```
**Example Usage in Chat:**

```
User: "How did we implement caching?"

Agent: Let me check our memory. #flowbabyRetrieveMemory {
  "query": "caching implementation",
  "maxResults": 3
}
```
**Tool Response (Structured + Narrative):**

The tool returns BOTH a narrative markdown summary AND verbatim JSON:

````markdown
# Retrieved Memories (1 result)

## 1. Redis Caching Decision

**Context:** Team decided to use Redis for session caching to improve performance

**Decisions:**
- Use Redis for session cache
- Deploy as Docker container

**Metadata:**
- Topic ID: redis-caching-decision
- Plan: 015
- Created: 2025-11-19T08:00:00Z
- Relevance Score: 0.923

---

## Structured Response (JSON)

```json
{
  "entries": [
    {
      "summaryText": "Team decided to use Redis for session caching...",
      "topic": "Redis Caching Decision",
      "topicId": "redis-caching-decision",
      "planId": "015",
      "createdAt": "2025-11-19T08:00:00Z",
      "score": 0.923,
      "decisions": ["Use Redis for session cache", "Deploy as Docker container"]
    }
  ],
  "totalResults": 1,
  "tokensUsed": 142
}
```
````
**Why Both Formats?**
- Narrative: Human-readable, agents can quote directly in responses
- JSON: Structured data for agent parsing, auditing, and further processing
### Tool Lifecycle

Both tools register at extension activation. VS Code manages tool enablement through the Configure Tools UI:

**Tools Enabled (via Configure Tools)** → tools appear in:
- Configure Tools UI (checkboxes checked)
- `#` autocomplete in chat
- Available for agent invocation

**Tools Disabled (via Configure Tools)** → tools hidden from:
- `#` autocomplete
- Agent invocation (VS Code won't route calls)
- Configure Tools shows checkboxes unchecked

**Implementation Note:** Tools register unconditionally at activation. VS Code's Configure Tools UI is the sole authorization mechanism.
### Rate Limits and Throttling

- **Retrieval concurrency:** `FlowbabyContextProvider` executes up to 2 retrievals at once (a user setting can lower this; any value above 5 is clamped). Additional requests queue up to 5 deep before being rejected.
- **Requests per minute:** By default only 10 retrievals may start per 60-second window. Raising the workspace setting above 30 still clamps at 30/min to stay within architectural guarantees.
- **Ingestion backlog:** Async ingestion shares the same 2-active / 3-queued limits via `BackgroundOperationManager`. When the queue fills, `ingestForAgent` returns `429_FLOWBABY_BACKLOG` without staging the summary.
Both retrieval and ingestion communicate throttling with HTTP-style 429 messages:
| Surface | Error Code | When it fires | Recommended agent behavior |
|---|---|---|---|
| Retrieval | `RATE_LIMIT_EXCEEDED` (surfaced as `429_AGENT_THROTTLED`) | More than `flowbabyMemory.agentAccess.rateLimitPerMinute` requests in the last 60s | Wait 2s, retry; increase delay exponentially up to ~15s on repeated 429s |
| Retrieval | `QUEUE_FULL` (also surfaced as `429_AGENT_THROTTLED`) | >2 concurrent requests and 5 queued | Inform user the queue is saturated, then retry with exponential backoff |
| Ingestion | `429_FLOWBABY_BACKLOG` | 2 background `cognify()` jobs + 3 queued (max 5) | Offer to retry later or skip a low-priority summary |
```typescript
async function safeRetrieve(request: FlowbabyContextRequest) {
  for (let attempt = 0; attempt < 4; attempt++) {
    const response = await provider.retrieveContext(request);
    if (!('error' in response)) {
      return response;
    }
    if (response.error === AgentErrorCode.RATE_LIMIT_EXCEEDED ||
        response.error === AgentErrorCode.QUEUE_FULL) {
      const delayMs = Math.min(15000, 2000 * 2 ** attempt);
      await new Promise(resolve => setTimeout(resolve, delayMs));
      continue;
    }
    throw new Error(response.message);
  }
  throw new Error('Exceeded retry budget after repeated 429_AGENT_THROTTLED responses');
}
```
Always surface a user-facing notice when throttling occurs so developers know why the agent paused.
### Transparency Indicators

When agents use Flowbaby tools, you see:

- **Output Channel:** Output > "Flowbaby Agent Activity"
  - Real-time log of all tool invocations
  - Shows timestamp, tool name, query/topic, and result
  - Example: `[Tool Invocation] 2025-11-19T08:12:44Z - flowbaby_retrieveMemory called`
- **Configure Tools UI:** Visual feedback for tool state
  - Checkboxes show which tools are enabled/disabled
  - Clear UI for enabling/disabling per workspace
- **Confirmation Messages (optional):**
  - Tools may show confirmation prompts before execution
  - Depends on the user's trust settings for agents
  - Example: "Store this conversation summary in Flowbaby knowledge graph?"
## Retrieving Memories for Agents

**Command:** `Flowbaby.retrieveForAgent`

**Signature:** `(requestJson: string) => Promise<string>`

- Input: JSON string containing a `FlowbabyContextRequest` payload
- Output: JSON string containing a `FlowbabyContextResponse` or `AgentErrorResponse`
### TypeScript Example

```typescript
import * as vscode from 'vscode';

// Minimal retrieval request
const request = {
  query: "How did we implement caching?",
  maxResults: 3 // Optional, defaults to 3
};

try {
  const responseJson = await vscode.commands.executeCommand<string>(
    'Flowbaby.retrieveForAgent',
    JSON.stringify(request)
  );
  const response = JSON.parse(responseJson);

  // Check for error
  if ('error' in response) {
    console.error(`❌ Retrieval failed: ${response.message}`);
    console.error(`   Error code: ${response.error}`);
    return;
  }

  // Success - process results
  console.log(`✅ Retrieved ${response.entries.length} memories`);
  console.log(`   Total tokens: ${response.tokensUsed}`);

  response.entries.forEach((entry, idx) => {
    console.log(`\n--- Memory ${idx + 1} ---`);
    console.log(`Topic: ${entry.topic || 'Untitled'}`);
    console.log(`Score: ${entry.score.toFixed(3)}`);
    console.log(`Summary:\n${entry.summaryText.substring(0, 200)}...`);
    if (entry.decisions && entry.decisions.length > 0) {
      console.log(`Decisions: ${entry.decisions.length}`);
    }
    if (entry.topicId) {
      console.log(`Topic ID: ${entry.topicId}`);
    }
  });
} catch (error) {
  console.error('Exception during retrieval:', error);
}
```
### Retrieval Error Codes

| Error Code | Description | Remediation |
|---|---|---|
| `INVALID_REQUEST` | Invalid query or malformed JSON | Check request structure |
| `RATE_LIMIT_EXCEEDED` (`429_AGENT_THROTTLED`) | Too many requests per minute (default 10/min, max 30/min) | Wait 2–5s and retry with exponential backoff |
| `QUEUE_FULL` (`429_AGENT_THROTTLED`) | Too many concurrent/queued requests (2 active, 5 queued) | Retry with exponential backoff; inform user the queue is saturated |
| `BRIDGE_TIMEOUT` | Python bridge exceeded timeout | Retry; check bridge logs |
## Schema Reference

### FlowbabyIngestRequest

```typescript
interface FlowbabyIngestRequest {
  // Required fields
  topic: string;             // Summary title
  context: string;           // Summary description
  metadata: SummaryMetadata; // Metadata (see below)

  // Optional fields
  decisions?: string[];      // Key decisions
  rationale?: string[];      // Rationale items
  openQuestions?: string[];  // Open questions
  nextSteps?: string[];      // Next steps
  references?: string[];     // References/links
  timeScope?: string;        // Time scope description
  agentName?: string;        // Caller hint for audit logs
}
```
### SummaryMetadata

```typescript
interface SummaryMetadata {
  // Required fields
  topicId: string;    // Unique identifier
  createdAt: string;  // ISO 8601 timestamp
  updatedAt: string;  // ISO 8601 timestamp

  // Optional fields
  sessionId?: string; // Session identifier
  planId?: string;    // Plan/project identifier
  status?: 'Active' | 'Superseded' | 'DecisionRecord';
}
```
### FlowbabyIngestResponse

```typescript
interface FlowbabyIngestResponse {
  success: boolean;

  // On success
  ingested_chars?: number;         // Character count
  timestamp?: string;              // Ingestion timestamp
  metadata?: {                     // Metadata confirmation
    topic_id: string;
    session_id?: string;
    plan_id?: string;
    status: string;
    created_at: string;
    updated_at: string;
  };
  ingestion_duration_sec?: number; // Duration in seconds
  ingestion_metrics?: Record<string, number>;

  // On failure
  error?: string;                  // Error message
  errorCode?: string;              // Error code
}
```
### FlowbabyContextRequest

```typescript
interface FlowbabyContextRequest {
  // Required fields
  query: string;               // Natural language search query

  // Optional fields
  maxResults?: number;         // Max results (default: 3)
  maxTokens?: number;          // Max tokens (default: 2000)
  includeSuperseded?: boolean; // Include superseded summaries (default: false)
  halfLifeDays?: number;       // Recency decay half-life in days
  contextHints?: string[];     // Contextual hints
}
```
### FlowbabyContextResponse

```typescript
interface FlowbabyContextResponse {
  success: boolean;

  // On success
  entries: FlowbabyMemoryEntry[];
  totalResults: number;
  tokensUsed: number;

  // On failure
  error?: string;     // Error message
  errorCode?: string; // Error code
}

interface FlowbabyMemoryEntry {
  summaryText: string;  // Summary text
  topic?: string;       // Topic (optional)
  topicId?: string;     // Topic ID (optional)
  planId?: string;      // Plan ID (optional)
  createdAt?: string;   // Creation timestamp (optional)
  score?: number;       // Relevance score (optional)
  decisions?: string[]; // Decisions made (optional)
}
```
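Since `Flowbaby.retrieveForAgent` can return either a context response or an `AgentErrorResponse`, a type guard keeps the branching tidy. The guard below is an illustrative convenience, not extension API; it mirrors the `'error' in response` check used in the retrieval examples above, with the `AgentErrorResponse` shape assumed from those examples.

```typescript
// Illustrative type guard: narrow the parsed retrieval result to an error
// response. Mirrors the `'error' in response` checks used in this guide.
interface AgentErrorResponse { error: string; message?: string; }

function isAgentError(response: unknown): response is AgentErrorResponse {
  return typeof response === 'object'
    && response !== null
    && 'error' in response
    && typeof (response as AgentErrorResponse).error === 'string';
}
```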
## Audit Logging
All tool invocations are logged for transparency. Check the Output channel:
- Open View → Output
- Select "Flowbaby Agent Activity" from dropdown
- View real-time logs of all tool calls with timestamps and results
## Best Practices

### Topic ID Generation

**Recommendation:** Use descriptive, human-readable topic IDs

```typescript
// ✅ Good: Descriptive and unique
topicId: "plan-015-implementation-2025-11-19"
topicId: "user-question-async-programming-001"

// ❌ Bad: Generic or non-unique
topicId: "summary-1"
topicId: "conversation"
```

**Alternative:** Hash-based IDs for guaranteed uniqueness

```typescript
import * as crypto from 'crypto';

function generateTopicId(topic: string, timestamp: string): string {
  const hash = crypto.createHash('sha256')
    .update(`${topic}-${timestamp}`)
    .digest('hex');
  return hash.substring(0, 16); // First 16 chars
}

const topicId = generateTopicId("Plan 015 Implementation", new Date().toISOString());
// Result: "a1b2c3d4e5f67890"
```
### When to Ingest

**DO ingest:**
- After multi-turn conversations (≥3 turns)
- When key decisions are made
- When user explicitly requests memory storage
- At session end for continuity
**DON'T ingest:**
- After every single turn (too noisy)
- For trivial queries ("What's 2+2?")
- For sensitive/private data without user consent
- When agent access is disabled (check first)
### Batching vs Real-Time

**Real-time ingestion (after each conversation):**
- ✅ Immediate availability for the next session
- ❌ Higher latency (30-40s per ingestion)
- ❌ More API calls

**Batch ingestion (end of session):**
- ✅ Lower latency impact on the user
- ✅ Can aggregate multiple topics
- ❌ Delayed availability

**Recommendation:** Real-time for important decisions, batch for routine conversations.
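The batch pattern can be sketched as a small session buffer: the agent accumulates lightweight topic notes during the session and flushes one aggregated payload at session end. `SessionSummaryBuffer` is a hypothetical helper, not part of the extension; the recap topic and `topicId` naming are assumptions for illustration.

```typescript
// Hypothetical session buffer (not part of the extension): accumulate topic
// notes during a session, then emit one ingest-ready payload at session end
// instead of ingesting after every conversation.
interface TopicNote { topic: string; context: string; }

class SessionSummaryBuffer {
  private notes: TopicNote[] = [];

  add(note: TopicNote): void {
    this.notes.push(note);
  }

  // Collapse buffered notes into a single ingest-ready payload; returns null
  // when nothing was captured, and clears the buffer on flush.
  flush(sessionId: string): { topic: string; context: string; metadata: object } | null {
    if (this.notes.length === 0) return null;
    const payload = {
      topic: `Session recap (${this.notes.length} topics)`,
      context: this.notes.map(n => `${n.topic}: ${n.context}`).join('\n'),
      metadata: {
        topicId: `session-recap-${sessionId}`,
        createdAt: new Date().toISOString(),
        updatedAt: new Date().toISOString()
      }
    };
    this.notes = [];
    return payload;
  }
}
```

The flushed payload could then be passed to `Flowbaby.ingestForAgent` exactly like the examples earlier in this guide.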
## Testing Your Integration

### 1. Enable Tools via Configure Tools UI
- Open Copilot chat
- Click "Tools" (⚙️) → "Configure Tools"
- Enable "Store Memory in Flowbaby" and "Retrieve Flowbaby Memory"
### 2. Create Test Script

```typescript
// test-agent-ingestion.ts
import * as vscode from 'vscode';

export async function testIngestion() {
  const payload = {
    topic: "Test Agent Ingestion",
    context: "Testing the agent ingestion API",
    metadata: {
      topicId: "test-001",
      createdAt: new Date().toISOString(),
      updatedAt: new Date().toISOString()
    }
  };

  const responseJson = await vscode.commands.executeCommand<string>(
    'Flowbaby.ingestForAgent',
    JSON.stringify(payload)
  );

  const response = JSON.parse(responseJson);
  console.log('Ingestion result:', response);

  if (response.success) {
    vscode.window.showInformationMessage(`✅ Test passed: Ingested ${response.ingested_chars} chars`);
  } else {
    vscode.window.showErrorMessage(`❌ Test failed: ${response.error}`);
  }
}
```
### 3. Verify in Output Channel

- Open the Output panel (View > Output)
- Select "Flowbaby Agent Activity" from the dropdown
- Look for a log entry like: `[Agent Ingest] <timestamp> - Agent: <your-agent> - Topic: Test Agent Ingestion - Status: success`
### 4. Check Audit Log

```shell
cat .flowbaby/agent_audit.log | grep test-001
```

Expected output:

```json
{"timestamp": "2025-11-19T08:12:44Z", "command": "ingestForAgent", "agentName": "Test Agent", "topicDigest": "a1b2c3d4", "result": "success", "errorCode": null}
```
## Troubleshooting

### Command not found

**Issue:** `vscode.commands.executeCommand` throws "command not found"

**Solution:** Verify the Flowbaby Memory extension is installed and activated

```typescript
const extension = vscode.extensions.getExtension('cognee.flowbaby');
if (!extension) {
  throw new Error('Flowbaby Memory extension not installed');
}
await extension.activate();
```
### Tools not appearing in chat

**Issue:** `#flowbaby*` commands don't appear in autocomplete

**Solution:** Enable tools via the Configure Tools UI

- Open Copilot chat
- Click "Tools" (⚙️) → "Configure Tools"
- Find "Store Memory in Flowbaby" and "Retrieve Flowbaby Memory"
- Toggle checkboxes to enable
- Return to chat and type `#flowbaby` to verify autocomplete
### Payload validation fails

**Issue:** Error code `INVALID_PAYLOAD`

**Solution:** Check `response.error` for specific field errors

```typescript
if (response.errorCode === 'INVALID_PAYLOAD') {
  console.error('Validation errors:', response.error);
  // Example error: 'Field "topic" is required and must be a non-empty string'
}
```
### Ingestion times out

**Issue:** Error code `BRIDGE_TIMEOUT`, or the command never returns

**Solution:**

- Check that the LLM API key is valid
- Check network connectivity
- Verify the Flowbaby installation: `pip list | grep cognee`
- Check bridge logs in the Output channel
## Advanced: Using the Validation Helper

The extension exports a validation helper for pre-flight checks:

```typescript
import { validateIngestRequest } from './validation/summaryValidator';

const payload = {
  topic: "Test",
  context: "Testing",
  metadata: {
    topicId: "test-001",
    createdAt: new Date().toISOString(),
    updatedAt: new Date().toISOString()
  }
};

const validation = validateIngestRequest(payload);
if (!validation.valid) {
  console.error('Validation failed:', validation.errors);
  // Don't call the command; fix the payload first
  return;
}

// Validation passed; proceed with the command
const responseJson = await vscode.commands.executeCommand<string>(
  'Flowbaby.ingestForAgent',
  JSON.stringify(payload)
);
```
## References

- Bridge Contract: `extension/bridge/INGEST_CONTRACT.md`
- TypeScript Types: `extension/src/types/agentIntegration.ts`
- Validation Helper: `extension/src/validation/summaryValidator.ts`
- Plan 015: `agent-output/planning/015-agent-ingestion-command.md`
- VS Code Commands API: https://code.visualstudio.com/api/references/vscode-api#commands