Master Orchestrator
Smart routing, actor chaining, and AI lead generation pipeline lifecycle management
The Master Orchestrator is the brain of the LeadsLogix AI lead generation pipeline. It accepts any input format (CSV, Excel, JSON, company names, domains), auto-detects the optimal processing strategy, and routes data through the B2B sales intelligence pipeline. For small batches (<=50 companies), all 12 stages execute inline for speed. For larger datasets, it chains specialized actor engines via the sales intelligence API. Full checkpoint/resume ensures no work is lost on crash.
Pipeline Stages
Each stage executes automatically, escalating only when needed.
Input Resolution
Priority-based input parsing for the lead enrichment platform: inline company array -> existing Apify Dataset -> KV Store file -> HTTP URL download. Smart column detection for CSV/Excel.
Route Planning
Analyzes batch size and input type for AI lead generation. <=50 companies: inline 12-stage processing. >50 companies: actor-chain delegation via Actor.call() through the sales intelligence API.
Pipeline Execution
4 modes for B2B sales intelligence: Full (all 5 engines), Discovery Only, Enrich Only (skip discovery), Verify Only (email verification API). Stage skipping via configuration.
Checkpoint Management
Progress saved to KV Store every 50 items and on graceful shutdown. Resume skips already-processed items for reliable AI lead generation. Format: checkpoint-{actor}-{run_id}.
Result Aggregation
Collects outputs from all actor stages in the lead enrichment platform. Merges partial results. Computes pipeline-level statistics for the web scraping pipeline.
Delivery
Final output to Apify Named Dataset. Optional webhook POST with completion status, record count, and download URL via the sales intelligence API.
Input Resolution
Priority-based input parsing for the lead enrichment platform: inline company array -> existing Apify Dataset -> KV Store file -> HTTP URL download. Smart column detection for CSV/Excel.
Route Planning
Analyzes batch size and input type for AI lead generation. <=50 companies: inline 12-stage processing. >50 companies: actor-chain delegation via Actor.call() through the sales intelligence API.
Pipeline Execution
4 modes for B2B sales intelligence: Full (all 5 engines), Discovery Only, Enrich Only (skip discovery), Verify Only (email verification API). Stage skipping via configuration.
Checkpoint Management
Progress saved to KV Store every 50 items and on graceful shutdown. Resume skips already-processed items for reliable AI lead generation. Format: checkpoint-{actor}-{run_id}.
Result Aggregation
Collects outputs from all actor stages in the lead enrichment platform. Merges partial results. Computes pipeline-level statistics for the web scraping pipeline.
Delivery
Final output to Apify Named Dataset. Optional webhook POST with completion status, record count, and download URL via the sales intelligence API.
Input Resolution
Priority-based input parsing for the lead enrichment platform: inline company array -> existing Apify Dataset -> KV Store file -> HTTP URL download. Smart column detection for CSV/Excel.
Route Planning
Analyzes batch size and input type for AI lead generation. <=50 companies: inline 12-stage processing. >50 companies: actor-chain delegation via Actor.call() through the sales intelligence API.
Pipeline Execution
4 modes for B2B sales intelligence: Full (all 5 engines), Discovery Only, Enrich Only (skip discovery), Verify Only (email verification API). Stage skipping via configuration.
Checkpoint Management
Progress saved to KV Store every 50 items and on graceful shutdown. Resume skips already-processed items for reliable AI lead generation. Format: checkpoint-{actor}-{run_id}.
Result Aggregation
Collects outputs from all actor stages in the lead enrichment platform. Merges partial results. Computes pipeline-level statistics for the web scraping pipeline.
Delivery
Final output to Apify Named Dataset. Optional webhook POST with completion status, record count, and download URL via the sales intelligence API.
Key Capabilities
Smart Routing for AI Lead Generation
Automatic decision between inline processing (fast, single-process) and actor-chain delegation (scalable, distributed). Threshold-based routing with manual override for the B2B sales intelligence pipeline.
4 Pipeline Modes
Full: Discovery -> Contact -> Email -> AI -> Export. Discovery Only: just find companies and domains. Enrich Only: skip discovery for known domains. Verify Only: email verification API stage alone.
Actor-Chain Communication
Inter-actor data flows via Apify Named Datasets. State coordination via Named Key-Value Stores. Each actor reads input from the previous actor's output dataset in the lead enrichment platform.
Checkpoint/Resume
Every 50 processed items, a checkpoint is written. On crash or restart, the AI lead generation orchestrator skips already-processed items and continues from the last checkpoint.
Sales Intelligence API Input Auto-Detection
Accepts company name arrays, CSV files, Excel files, public URLs, Apify Datasets, and KV Store keys. Column mapping handles non-standard headers automatically for B2B sales intelligence.
Webhook Notifications for AI Outbound Automation
POST to any URL on pipeline completion. Payload includes: status, total records, processing time, output dataset ID, and download links for the sales intelligence API.
Accepted Inputs
- Company names array (strings or objects)
- Existing Apify Dataset ID
- CSV/Excel file in KV Store
- Public HTTP URL to CSV/Excel
- JSON array of company objects
Configuration
- Pipeline Mode: full / discovery_only / enrich_only / verify_only
- Workers: 1-20 concurrent (default 5)
- Max Results: limit total output
- Skip Stages: array of stage names to skip
- Resume: enable checkpoint resume
- Proxy: Apify proxy configuration
- Webhook URL: completion notification endpoint
See It In Action
Frequently Asked Questions
Everything you need to know about our platform.
Still have questions?
Our team can walk you through the pipeline, pricing, and your use case.