Oregon Counselor Directory STAGING
Engineering Case Study

How I Built Oregon Counselor Directory

From raw web scraping to a high-performance, geolocation-aware directory. A look inside the 2026 HealthTech stack.

1

The Scraper Pipeline

The core challenge was acquiring clean, verified data from trusted sources. I built a multi-stage pipeline that ran more than 15,000 scrapes: custom console JavaScript and BeautifulSoup handle data extraction, and Playwright handles navigation on JavaScript-heavy pages.
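A paginated scrape with de-duplication and CSV export can be sketched in Python as follows. This is a minimal illustration, not the production pipeline: `fetch_page` is a hypothetical callable that returns the rows found on one results page, and `license_no` is an assumed unique field used as the de-duplication key.

```python
import csv
import io
from typing import Callable

Row = dict[str, str]


def scrape_all(fetch_page: Callable[[int], list[Row]], start: int, last: int) -> list[Row]:
    """Collect rows from pages start..last (inclusive), skipping duplicates."""
    all_data: list[Row] = []
    seen: set[str] = set()
    for page_no in range(start, last + 1):
        for row in fetch_page(page_no):
            key = row["license_no"]  # hypothetical unique field
            if key in seen:
                continue  # the same provider appeared on an earlier page
            seen.add(key)
            all_data.append(row)
    return all_data


def to_csv(rows: list[Row]) -> str:
    """Serialize rows to CSV text, using the first row's keys as the header."""
    if not rows:
        return ""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

Keeping the `seen` set outside the page loop is what makes the scrape resumable in spirit: rows that reappear after a restart or an overlapping page are dropped rather than duplicated in the export.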

Key Metrics

  • Up-to-Date Data via 15,000+ Scrapes
  • PostgreSQL (Supabase) Backend
  • Dual-Domain Deploy Sync
// Console JavaScript DOM Traversal & CSV Export
(async function scrapeRemainingLPC() {
  console.log('%c🚀 CONTINUING LPC SCRAPE FROM CURRENT PAGE (4301+ of 11,615)', 'color:#0a0;font-size:16px;font-weight:bold');

  let allData = [];       // rows collected so far
  const seen = new Set(); // keys of rows already collected, for de-duplication
  // ... automated pagination logic ...
})();

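from urllib.parse import quote_plus
from bs4 import BeautifulSoup

async def enrich_provider(page, name, city, state):
    # Search Bing, defeat tracking
    query = quote_plus(f"{name} {city} {state}")
    search_url = f"https://www.bing.com/search?q={query}"
    await page.goto(search_url)
    # Decode b64 tracking links (the result selector is an assumption)
    a = BeautifulSoup(await page.content(), "html.parser").select_one("li.b_algo h2 a")
    if a is None:
        return None, []
    link = decode_bing_url(a.get('href'))  # helper defined elsewhere in the pipeline
    # Rip organic phone numbers via NLP from the practice site's visible text
    await page.goto(link)
    site_text = BeautifulSoup(await page.content(), "html.parser").get_text(" ")
    phones = extract_valid_phones(site_text)  # helper defined elsewhere in the pipeline
    return link, phones  # return shape is an assumption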

Fig 1.5: OSINT Playwright Enrichment

2

OSINT Data Enrichment

Proxy directories intentionally mask provider phone numbers and hide clinic websites behind redirects. I work around this with a Playwright and NLP pipeline that automatically searches Bing, decodes Base64 tracking links, and extracts the visible text from private-practice websites.
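The two helper steps named above can be sketched as follows. Both functions are illustrative assumptions rather than the pipeline's actual code. The decoder assumes the tracking link carries the target in a `u` query parameter formatted as `a1` followed by URL-safe Base64, and a plain regular expression stands in for the NLP phone extraction, accepting only ten-digit North American numbers.

```python
import base64
import re
from urllib.parse import parse_qs, urlparse


def decode_bing_url(href: str) -> str:
    """Return the target URL hidden in a tracking link, or href unchanged."""
    token = parse_qs(urlparse(href).query).get("u", [""])[0]
    if not token.startswith("a1"):  # assumed prefix; not a tracking link
        return href
    payload = token[2:]
    payload += "=" * (-len(payload) % 4)  # restore stripped Base64 padding
    try:
        return base64.urlsafe_b64decode(payload).decode("utf-8")
    except (ValueError, UnicodeDecodeError):
        return href


# Area code and exchange must start with 2-9; no digits may touch either end.
PHONE_RE = re.compile(
    r"(?<!\d)(?:\+?1[\s.-]?)?\(?([2-9]\d{2})\)?[\s.-]?([2-9]\d{2})[\s.-]?(\d{4})(?!\d)"
)


def extract_valid_phones(text: str) -> list[str]:
    """Return unique phone numbers in order of appearance, normalized."""
    phones: list[str] = []
    seen: set[str] = set()
    for area, exchange, line in PHONE_RE.findall(text):
        number = f"({area}) {exchange}-{line}"
        if number not in seen:
            seen.add(number)
            phones.append(number)
    return phones
```

Falling back to the original `href` on any decode failure keeps the enrichment step from crashing on links that do not follow the assumed format.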

Playwright · Base64 Decode · NLP Parsing
// Actively processing peer community feedback
const latestFeedback = [
    {
        user: "Oregon LCSW",
        insight: "OHP status gets buried in standard directories.",
        action: "Implemented prominent 'OHP Green' engine badges."
    },
    {
        user: "Eugene LPC",
        insight: "Stop making patients sift through generic modalities.",
        action: "Filtered categories to high-intent medical browsing."
    }
];

// Iterative design loop: Build -> Review -> Refine

Fig 2.0: Continuous Iteration from the Field

3

Collaboration & Testing

Technology is only as useful as the problem it solves. Instead of building in a silo, I collaborated directly with licensed peers across Oregon—gathering manual feedback and conducting user testing to shape every piece of the directory's workflow and design.

UX Strategy: Peer Reviewed

Development: Iterative Testing

4

Premium Design Language

Clinical directories are often cold and complex. Oregon Counselor Directory uses a custom HealthTech Glassmorphism design system that pairs high-contrast typography (Playfair Display) with a functional 8px grid.

OHP Green Engine

Specific color-coding (Sage & Emerald) for Oregon Health Plan providers, making low-cost care easily visible.

Adaptive Funnels

Therapists take center stage; experimental "Treatments" are hidden deeper to ensure high-intent medical browsing.

Shadow Profiles

A dual-database approach that maps scraped public data into claimable "Shadow Profiles" for provider conversion.
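One way to picture the mapping from scraped records to claimable profiles is the sketch below. It is an assumption-laden illustration: the field names, the `license_no` key, and the `claim` function are hypothetical, and an in-memory dictionary stands in for the profile store that the dual-database design would keep separate from the raw scraped data.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class ScrapedRecord:
    """A row of public data as it came out of the scraper (hypothetical fields)."""
    license_no: str
    name: str
    city: str


@dataclass
class ShadowProfile:
    """An unclaimed listing built from public data until a provider claims it."""
    license_no: str
    name: str
    city: str
    claimed_by: Optional[str] = None

    @property
    def is_claimed(self) -> bool:
        return self.claimed_by is not None


def build_shadow_profiles(records: Iterable[ScrapedRecord]) -> dict[str, ShadowProfile]:
    """Create one shadow profile per license number; the first record wins."""
    profiles: dict[str, ShadowProfile] = {}
    for rec in records:
        if rec.license_no not in profiles:
            profiles[rec.license_no] = ShadowProfile(rec.license_no, rec.name, rec.city)
    return profiles


def claim(profiles: dict[str, ShadowProfile], license_no: str, account_id: str) -> bool:
    """Attach a provider account to an unclaimed profile; False if not possible."""
    profile = profiles.get(license_no)
    if profile is None or profile.is_claimed:
        return False
    profile.claimed_by = account_id
    return True
```

Keeping the scraped record immutable and the profile mutable mirrors the split in the description: the public data stays as collected, while the claimable layer tracks ownership.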

Recent Shipping & Engineering Roadmap

Stripe Monetization & Onboarding Live

Multi-tier subscriptions (Growth & Pro), secure provider dashboard with auto-save, and clean dynamic URL routing. Now live in production.

2

Dynamic SEO Sitemap & NLP Tools

Building out automated AI tools to rewrite bios and generate local search pages for "Therapist in [City]".
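The local search page part of this roadmap item could start from something like the sketch below, which groups providers by city and produces one "Therapist in [City]" entry per city. The `/therapists/<city-slug>` URL scheme, the `example.com` base URL, and the provider dictionary keys are all assumptions for illustration; the bio rewriting step is not covered.

```python
import re
from collections import defaultdict


def slugify(text: str) -> str:
    """Lowercase the text and collapse non-alphanumeric runs into hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")


def city_pages(providers: list[dict[str, str]], base_url: str = "https://example.com") -> list[dict]:
    """Build one local search page entry per city, sorted by city name."""
    by_city: dict[str, list[str]] = defaultdict(list)
    for provider in providers:
        by_city[provider["city"]].append(provider["name"])

    pages = []
    for city in sorted(by_city):
        pages.append({
            "title": f"Therapist in {city}",
            "url": f"{base_url}/therapists/{slugify(city)}",  # hypothetical URL scheme
            "providers": sorted(by_city[city]),
        })
    return pages
```

The resulting `url` values are the kind of entries a dynamic sitemap would list, one per city that has at least one provider.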

3

Review Engine

A HIPAA-compliant patient feedback loop to verify quality of care across the network.

The Media Vault

Go behind the scenes and explore the raw, unhinged, and high-volume marketing experiments driving growth for the directory.