deep_search_r.json

Overview

deep_search_r.json defines a comprehensive Multi-Agent Deep Research Agent configuration designed for executing structured, multi-step research investigations. This agent is specialized for professional domains such as sales, marketing, policy, or consulting. Its core functionality is to orchestrate multiple specialized sub-agents to collaboratively:

Discover high-quality, authoritative URLs (sources).
Extract and structure detailed content from these URLs.
Synthesize the extracted content into a polished, strategic research report resembling consulting firm deliverables (e.g., McKinsey-style reports).

The file encodes a detailed agent workflow, component parameters, failure recovery strategies, quality gates, and example processes, supporting multilingual capabilities and adaptive resource allocation.

Detailed Explanation of Components

The file uses a JSON format describing a DSL (Domain Specific Language) specification of the multi-agent pipeline, including the following key components:

Top-Level Fields

id: Numeric identifier (6).
title: Localized titles (en and zh) describing the agent as "Deep Research".
description: Localized descriptions explaining the purpose of the agent.
canvas_type: "Agent" indicating this is an agent orchestration.
dsl: Core pipeline definition describing components, globals, and graph topology.
avatar: Base64 image representing the agent visually.

Key Components in `dsl.components`

Agent:NewPumasLick — The Lead Agent ("Deep Research Agent")
- Role: Strategy Research Director coordinating the entire multi-agent research.
- Parameters:
  - llm_id: Uses "qwen-max@Tongyi-Qianwen" model for reasoning.
  - max_tokens: 4096 tokens per round.
  - max_retries: 3 attempts per subtask.
  - max_rounds: Up to 3 rounds of interaction.
  - presence_penalty and frequency_penalty enabled with values 0.5.
  - prompts: Takes the user query as input with role user.
  - sys_prompt: Extensive detailed prompt specifying mission, execution framework, research process, query classification, quality gates, adaptive strategies, language adaptation, and execution examples.
- Downstream components: Sends output to Message:OrangeYearsShine.
- Upstream: Receives from begin.
- Functionality: Orchestrates the research process across three stages:
  - URL discovery via Web Search Specialist.
  - Content extraction via Content Deep Reader.
  - Strategic report generation via Research Synthesizer.
- Usage example:
```
{
  "sys.query": "How will AI transform healthcare by 2030?"
}
```
  The Lead Agent breaks down this query, classifies it, allocates tasks, manages sub-agents, and synthesizes final report content.
Agent:FreeDucksObey — Web Search Specialist
- Role: Expert in discovering high-quality URLs.
- Parameters:
  - Uses "qwen-plus@Tongyi-Qianwen" model.
  - Executes only one search round (max_rounds=1).
  - Uses web search tools (e.g., TavilySearch) exclusively.
  - Does not extract content — strictly URL discovery.
  - Enforces quality criteria avoiding paywalled, low-authority, or outdated sources.
  - Outputs exactly 5 URLs with extraction guidance.
  - Includes detailed system prompt specifying mission, process, search strategy, quality criteria, output format, and collaboration protocols.
- Downstream: Passes URLs to Content Deep Reader.
- Usage example:
  - For query "AI regulation finance 2024", it performs multiple web searches, evaluates URLs, and outputs 5 prioritized links with extraction focus.
Agent:WeakBoatsServe — Content Deep Reader
- Role: Extracts structured and comprehensive content from the URLs provided by the Web Search Specialist.
- Parameters:
  - Uses "moonshot-v1-auto@Moonshot" LLM.
  - Allows up to 3 rounds per URL extraction.
  - Uses web extracting tools (e.g., TavilyExtract).
  - Extracts full text, key statistics, findings, expert quotes, supporting data, methodology, and credibility scores.
  - Includes fallback strategies for extraction failures.
  - Outputs standardized EXTRACTED_CONTENT for each URL.
  - System prompt defines mission, process, extraction schema, quality assessment, failure handling, and collaboration.
- Downstream: Passes structured content to Research Synthesizer.
- Usage example:
```
EXTRACTED_CONTENT:
URL: https://www.sec.gov/ai-guidance-2024
TITLE: "SEC Guidance on AI in Financial Services - March 2024"
FULL_TEXT: "The Securities and Exchange Commission (SEC) today announced comprehensive guidance..."
KEY_STATISTICS: 65% of banks now use AI, $2.3B investment in 2024
MAIN_FINDINGS: New compliance framework requires AI audit trails...
...
```
Agent:SwiftToysTell — Research Synthesizer
- Role: Final integration specialist that synthesizes extracted content into executive-grade, consulting-style strategic reports.
- Parameters:
  - Uses "moonshot-v1-128k@Moonshot" model for deep analysis.
  - Supports multiple rounds to refine analysis.
  - Receives ANALYSIS_INSTRUCTIONS from Lead Agent.
  - Produces reports adhering strictly to analysis framework (e.g., McKinsey-style or BCG-style).
  - Enforces output constraints: no raw data, no intermediate summaries, only polished final reports.
  - System prompt includes mission, process, integration strategy, report structure templates, quality standards, synthesis techniques, and output formatting.
- Downstream: Final output sent to Message.
- Usage example:
  - Converts extracted content into sections: Executive Summary, Analysis, Recommendations.
  - Provides actionable insights, risk assessment, and implementation roadmap.
Message:OrangeYearsShine
- Role: Outputs the final content generated by Agent:NewPumasLick to the user interface or downstream system.
- Parameters: Contains the content from the Lead Agent.
- Upstream: Connected to Agent:NewPumasLick.
Begin
- Role: Entry point into the pipeline.
- Downstream: Starts the Lead Agent component.

Tools Used

TavilySearch: Web search API/tool used by Web Search Specialist for URL discovery.
TavilyExtract: Web extraction tool used by Content Deep Reader to retrieve webpage content.
Other internal or MCP (Model Context Protocol) connections support enhanced search and extraction capabilities.

Global Variables

sys.query: The user input query string.
sys.conversation_turns: Number of conversation turns (default 0).
sys.user_id, sys.files: Metadata for session management.

Important Implementation Details and Algorithms

Multi-Agent Orchestration: The Lead Agent (Agent:NewPumasLick) organizes a three-stage pipeline, delegating to specialized agents for URL discovery, content extraction, and report synthesis.
Research Process and Query Classification: The Lead Agent follows a methodical research process:
1. Assessment and breakdown of the user's query.
2. Classification of the query into one of three types: Depth-first, Breadth-first, or Straightforward.
3. Development of a detailed research plan allocating tasks to sub-agents.
4. Methodical execution with continuous monitoring and adaptive iteration.
Quality Gates: After each stage, quality is assessed against criteria such as number and diversity of URLs found, extraction success rate, and report completeness.
Adaptive Strategy: Resources and agents are allocated depending on query complexity, with failure recovery strategies (e.g., fallback to metadata extraction if content extraction fails).
Language Adaptation: The system auto-detects user language and adjusts source selection, pipeline instructions, and cultural context accordingly.
Output Constraints: Final reports must be actionable, evidence-based, and follow the detailed format specified by the Lead Agent. Intermediate data or raw content is never exposed downstream.

Interaction with Other Parts of the System

User Interaction: The system receives a user query (sys.query), triggering the pipeline.
Sub-Agents: Each sub-agent is an LLM-powered component or tool with a focused responsibility, collaborating in a pipeline fashion.
Tools Integration: Utilizes specialized external APIs/tools (TavilySearch and TavilyExtract) for web searching and content extraction.
Output Delivery: Final synthesized reports are delivered via the Message:OrangeYearsShine component, likely interfacing with UI or downstream consumers.
MCP Connections: Agents leverage Model Context Protocol for enhanced coordination and tool usage.

Usage Example (End-to-End)

User inputs query: "What are the main factors driving cryptocurrency market volatility?"
Lead Agent analyzes the query, classifies it as Depth-first, and develops a plan involving multiple perspectives.
Web Search Specialist discovers 5 high-quality URLs covering technical, regulatory, and psychological aspects.
Content Deep Reader extracts full content from these URLs with structured metadata.
Research Synthesizer integrates the content into a ~2000-word McKinsey-style strategic report with actionable insights.
Final report is delivered through the Message component to the user.

Visual Diagram

flowchart TD
    Begin --> LeadAgent[Agent:NewPumasLick<br/>Lead Agent]
    LeadAgent -->|Dispatch URLs| WebSearch[Agent:FreeDucksObey<br/>Web Search Specialist]
    LeadAgent -->|Dispatch URLs| ContentReader[Agent:WeakBoatsServe<br/>Content Deep Reader]
    LeadAgent -->|Receive Extracted Content| ResearchSynth[Agent:SwiftToysTell<br/>Research Synthesizer]
    LeadAgent -->|Send Final Report| Message[Message:OrangeYearsShine]
    
    WebSearch -->|URLs| ContentReader
    ContentReader -->|Extracted Content| ResearchSynth

Diagram Explanation:

The Begin node triggers the Lead Agent.
The Lead Agent coordinates three sub-agents:
- Web Search Specialist to find URLs.
- Content Deep Reader to extract content from URLs.
- Research Synthesizer to compile the final report.
Outputs flow from each agent back to the Lead Agent, which then sends the final report to the Message component for user delivery.

Summary

deep_search_r.json defines a multi-agent research orchestration framework that transforms complex user queries into high-quality strategic reports by leveraging specialized subagents for web discovery, extraction, and synthesis. It embodies a robust research methodology, adaptive execution, quality controls, and multilingual support, making it suitable for professional research tasks in business and consulting contexts.