LeadsLogix
Pricing
Platform
5 Layers

Scraping Hierarchy

Each layer is tried in order. The system only escalates to the next (more expensive) layer when the current one can't achieve the required confidence threshold. This minimizes browser renders and maximizes throughput.

1

Static Layer

HTTP + Regex + Schema.org

Fast HTTP requests with BeautifulSoup parsing. Extracts emails, phone numbers, and structured data (JSON-LD, schema.org microdata) from raw HTML. Handles 60-70% of company websites that serve static content.

Techniques

  • httpx/requests with anti-detection headers
  • Regex patterns for email and phone extraction
  • JSON-LD and schema.org structured data parsing
  • Meta tag extraction (OG, Twitter cards)
  • Sitemap.xml parsing for page discovery

Escalation Rule

Escalates if confidence < threshold or page returns empty/minimal content

2

JS Data Layer

__NEXT_DATA__, React Props, API Endpoints

Extracts data from JavaScript-rendered frameworks without launching a browser. Parses __NEXT_DATA__ (Next.js), data-react-props attributes, inline JSON payloads, and discovers internal API endpoints that serve structured data.

Techniques

  • __NEXT_DATA__ JSON extraction (Next.js SSR)
  • data-react-props attribute parsing (React)
  • Inline script JSON payload detection
  • XHR/fetch API endpoint discovery
  • GraphQL introspection endpoint probing

Escalation Rule

Escalates if no JS data structures found or extracted data is incomplete

3

Structural Layer

Page Classification & Priority Crawling

Classifies discovered pages by type (/team, /about, /contact, /leadership, /staff) and crawls them in priority order. Budget-aware: max 15 pages per company with intelligent page selection.

Techniques

  • URL pattern classification (team, about, contact, careers)
  • Internal link graph analysis
  • Sitemap-guided page discovery
  • Priority scoring (team pages > about > contact > other)
  • Cross-page contact accumulation

Escalation Rule

Escalates if classified pages don't contain extractable contact data

4

Semantic Layer

4-Method Contact Extraction

Deep content analysis using 4 cascading extraction methods. Combines structured data parsing, visual layout analysis, proximity heuristics, and social profile matching to find decision makers.

Techniques

  • JSON-LD person/organization extraction
  • Team card CSS pattern detection (photo + name + title)
  • Heuristic proximity analysis (name near email/phone within DOM distance)
  • LinkedIn profile URL extraction and matching
  • Company general email separated from personal contacts

Escalation Rule

Escalates if semantic methods find < 2 contacts with confidence > 0.5

5

Browser Layer

Playwright Stealth Rendering

Full Playwright browser rendering for JavaScript-heavy SPAs that resist all other methods. Budget-capped at 3 browser renders per company to control costs. Handles click-to-reveal content, infinite scroll, and modal-based team directories.

Techniques

  • Playwright stealth mode with anti-detection
  • Click-to-reveal email/phone interaction
  • Infinite scroll handling for team directories
  • Modal and accordion content expansion
  • Screenshot-based fallback for heavily protected sites

Escalation Rule

Final layer -- if browser rendering fails, the company is marked for manual review

15
Max pages per company
3
Max browser renders
120s
Max runtime per company
FAQ

Frequently Asked Questions

Everything you need to know about our platform.

Still have questions?

Our team can walk you through the pipeline, pricing, and your use case.

Talk to us

Related Pipeline Pages

12-Stage Enrichment Pipeline

Platform

Full lead enrichment platform architecture

/platform/enrichment

7 Intelligence Modules

Platform

OSINT modules powering B2B sales intelligence

/platform/intelligence

Contact Intelligence Engine

Platform

Actor engine using the web scraping pipeline

/platform/actors/contact-intelligence-engine

Data Enrichment

Solution

Lead enrichment platform for CRM data

/solutions/data-enrichment

LeadsLogix vs Clay

Compare

Web scraping pipeline vs manual waterfall enrichment

/compare/vs-clay

All Features

Features

Full AI lead generation platform capabilities

/features
LeadsLogix

AI-native sales intelligence platform. Find, enrich, verify, and activate decision-maker contacts at scale.

LinkedInGitHubX / TwitterFacebookInstagramTrustpilotYouTubeCommunity

Stay ahead with sales intelligence insights

Weekly strategies, product updates, and industry intel. No spam.

Products

  • Sales Intelligence
  • Sales Intel Dashboard
  • Lead Generation
  • Lead Gen Dashboard
  • Data Enrichment
  • Enrichment Dashboard
  • Email Marketing
  • Email Dashboard
  • Company Data
  • Email Verification
  • All Products

Platform

  • B2B Platform
  • B2B Discovery Engine
  • Contact Intelligence
  • Email Intelligence
  • AI Qualification Engine
  • Email Infrastructure
  • Contact Extraction
  • Data Integrity
  • Website Crawling
  • B2B Discovery Actors
  • Master Orchestrator
  • Export Center
  • Autonomous Research
  • Pipeline DAG

Services

  • Email List Building
  • Cold Email Lists
  • Cold Email Software
  • Outreach Data Prep
  • Email Verification API
  • Managed Cold Email
  • Email Append Service
  • Sales Intelligence Platform
  • Prospecting Software
  • All Services

Industries

  • Healthcare
  • SaaS
  • Fintech
  • Manufacturing
  • Ecommerce
  • Cybersecurity
  • Real Estate
  • All Industries

Resources

  • Resource Hub
  • Free Tools
  • Glossary
  • Use Cases
  • New Market Entry
  • B2B Prospecting Workflow
  • Product Discovery Research
  • AI Qualification Model
  • B2B Sales Statistics
  • Email Marketing Statistics
  • Cold Email Benchmarks
  • API Documentation

Company

  • About
  • Contact
  • Pricing
  • Free Data Sample
  • Request Custom Data
  • Platform
  • Security
  • Trust Center
  • Integrations
Regional
United StatesUnited KingdomCanadaAustraliaIndiaGermanyFranceJapanSouth KoreaChinaBrazilMexicoUAESaudi ArabiaSingaporeIndonesiaThailandTurkeyNetherlandsSpainItalySwedenSouth AfricaRussia & CISNorth AmericaSouth AmericaEuropean UnionAsiaAPAC RegionMiddle EastAfrica
Compare
vs Apollo.iovs ZoomInfovs Clearbitvs Clayvs Lushavs Cognismvs Seamless.AIvs Hunter.iovs RocketReachvs Snov.iovs UpLeadvs Lead411
SOC 2 Ready
AES-256 Encryption
GDPR Compliant
CAN-SPAM

© 2026 LeadsLogix LLC. All rights reserved.

Privacy PolicyTerms of ServiceCookie Settings
hello@leadslogix.com