How I Built Oregon Counselor Directory
From raw web scraping to a high-performance, geolocation-aware directory. A look inside the 2026 HealthTech stack.
The Scraper Pipeline
The core challenge was acquiring clean, verified data from trusted sources. I built a multi-stage pipeline that has run more than 15,000 scrapes: custom console JavaScript and BeautifulSoup handle data extraction, and Playwright handles navigation on JavaScript-heavy sites.
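The last stage of a pipeline like this, merging scraped pages into one de-duplicated CSV, can be sketched in a few lines of Python. This is a minimal illustration, not the production code: the field names (`name`, `license_number`, `city`), the choice of `license_number` as the de-duplication key, and the sample rows are all hypothetical.

```python
import csv
import io

# Hypothetical record fields; the real pipeline's schema is not shown here.
FIELDS = ["name", "license_number", "city"]


def dedupe_records(pages):
    """Flatten scraped pages into one list, keeping the first record per license number."""
    seen = set()
    unique = []
    for page in pages:
        for record in page:
            # Normalize the key so "c1001 " and "C1001" count as the same provider.
            key = record["license_number"].strip().upper()
            if key in seen:
                continue
            seen.add(key)
            unique.append(record)
    return unique


def to_csv(records):
    """Serialize records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Made-up sample rows: two scraped pages that overlap on one provider.
pages = [
    [{"name": "A. Rivera", "license_number": "C1001", "city": "Eugene"}],
    [
        {"name": "A. Rivera", "license_number": "c1001 ", "city": "Eugene"},
        {"name": "B. Chen", "license_number": "C1002", "city": "Portland"},
    ],
]
csv_text = to_csv(dedupe_records(pages))
```

Keying on a normalized identifier rather than the whole row means a re-scraped page with slightly different whitespace or casing does not create a duplicate listing.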
Key Metrics
- Up-to-Date Data via 15,000+ Scrapes
- PostgreSQL (Supabase) Backend
- Dual-Domain Deploy Sync
// Console JavaScript DOM Traversal & CSV Export
(async function scrapeRemainingLPC() {
  console.log('%c🚀 CONTINUING LPC SCRAPE FROM CURRENT PAGE (4301+ of 11,615)', 'color:#0a0;font-size:16px;font-weight:bold');
  let allData = [];       // rows accumulated for the CSV export
  const seen = new Set(); // guards against duplicate records
  // ... automated pagination logic ...
})();
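from urllib.parse import quote_plus

async def enrich_provider(page, name, city, state):
    # Search Bing, defeat tracking
    query = quote_plus(f"{name} {city} {state}")  # assumed query format; not shown in this excerpt
    search_url = f"https://www.bing.com/search?q={query}"
    await page.goto(search_url)
    # Decode b64 tracking links
    link = decode_bing_url(a.get('href'))  # `a`: a result anchor from the parsed page (parsing elided)
    # Rip organic phone numbers via NLP
    phones = extract_valid_phones(site_text)  # `site_text`: text of the provider's site (fetch elided)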
Fig 1.5: OSINT Playwright Enrichment
OSINT Data Enrichment
Proxy directories intentionally mask provider phone numbers and trap clinic websites behind redirects. I work around this with a Playwright and NLP pipeline that searches Bing, decodes the Base64 tracking links in the results, and extracts the organic text from private practice DOMs.
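The two helper steps named in Fig 1.5 can be sketched as follows. Both functions are hypothetical reimplementations, not the directory's actual code. The decoder assumes the commonly observed Bing redirect format, where the destination sits in a `u` query parameter as `a1` followed by URL-safe Base64; that format is an observation, not a documented API, and may change. The phone extractor swaps the NLP step for a plain regular expression over US-style numbers.

```python
import base64
import re
from urllib.parse import parse_qs, urlparse


def decode_bing_url(href: str) -> str:
    """Return the destination hidden in a Bing redirect link, or href unchanged."""
    token = parse_qs(urlparse(href).query).get("u", [""])[0]
    if not token.startswith("a1"):  # assumed marker for a Base64 payload
        return href
    payload = token[2:]
    payload += "=" * (-len(payload) % 4)  # restore stripped Base64 padding
    try:
        return base64.urlsafe_b64decode(payload).decode("utf-8")
    except ValueError:  # covers binascii.Error and UnicodeDecodeError
        return href


# US-style numbers: optional +1, area code and exchange starting with 2-9.
PHONE_RE = re.compile(
    r"(?<!\d)(?:\+?1[\s.-]?)?\(?([2-9]\d{2})\)?[\s.-]?([2-9]\d{2})[\s.-]?(\d{4})(?!\d)"
)


def extract_valid_phones(text: str) -> list:
    """Return unique phone numbers found in text, normalized to (AAA) EEE-LLLL."""
    phones = []
    for area, exchange, line in PHONE_RE.findall(text):
        number = f"({area}) {exchange}-{line}"
        if number not in phones:
            phones.append(number)
    return phones
```

Returning the original `href` on any decoding failure keeps the enrichment loop running when a result is already a direct link or the redirect format differs from the assumed one.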
// Actively processing peer community feedback
const latestFeedback = [
  {
    user: "Oregon LCSW",
    insight: "OHP status gets buried in standard directories.",
    action: "Implemented prominent 'OHP Green' engine badges."
  },
  {
    user: "Eugene LPC",
    insight: "Stop making patients sift through generic modalities.",
    action: "Filtered categories to high-intent medical browsing."
  }
];
// Iterative design loop: Build -> Review -> Refine
Fig 2.0: Continuous Iteration from the Field
Collaboration & Testing
Technology is only as useful as the problem it solves. Instead of building in a silo, I collaborated directly with licensed peers across Oregon—gathering manual feedback and conducting user testing to shape every piece of the directory's workflow and design.
UX Strategy
Peer Reviewed
Development
Iterative Testing
Premium Design Language
Clinical directories are often cold and complex. Oregon Counselor Directory uses a custom HealthTech Glassmorphism design system that pairs high-contrast typography (Playfair Display) with a functional 8px grid.
OHP Green Engine
Specific color-coding (Sage & Emerald) for Oregon Health Plan providers, making low-cost care easily visible.
Adaptive Funnels
Therapists take center stage; experimental "Treatments" are hidden deeper to ensure high-intent medical browsing.
Shadow Profiles
A dual-database approach that maps scraped public data into claimable "Shadow Profiles" for provider conversion.
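One way to picture the mapping from scraped rows to claimable profiles is the sketch below. It is an assumption-laden illustration: the `ShadowProfile` fields, the `claimed_by` account id, and both helper functions are hypothetical and do not reflect the directory's actual schema.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ShadowProfile:
    """Unclaimed listing built from scraped public data (hypothetical schema)."""
    license_number: str
    name: str
    city: str
    claimed_by: Optional[str] = None  # provider account id once claimed

    @property
    def is_claimed(self) -> bool:
        return self.claimed_by is not None


def to_shadow_profile(scraped: dict) -> ShadowProfile:
    """Map one scraped row onto the profile schema, normalizing whitespace and case."""
    return ShadowProfile(
        license_number=scraped["license_number"].strip(),
        name=" ".join(scraped["name"].split()),
        city=scraped["city"].strip().title(),
    )


def claim(profile: ShadowProfile, account_id: str) -> ShadowProfile:
    """Attach a provider account to an unclaimed profile; refuse double claims."""
    if profile.is_claimed:
        raise ValueError("profile already claimed")
    profile.claimed_by = account_id
    return profile
```

Keeping the claim state on the profile itself means a listing stays visible in the directory whether or not a provider has converted yet.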
Recent Shipping & Engineering Roadmap
Stripe Monetization & Onboarding Live
Multi-tier subscriptions (Growth & Pro), secure provider dashboard with auto-save, and clean dynamic URL routing. Now live in production.
Dynamic SEO Sitemap & NLP Tools
Building out automated AI tools to rewrite bios and generate local search pages for "Therapist in [City]".
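A sitemap for the "Therapist in [City]" pages could be generated along these lines. This is a sketch only: the `example.com` domain is a placeholder, and the `/therapist-in-<city>` URL pattern and both function names are assumptions rather than the site's real routing.

```python
import re
import xml.etree.ElementTree as ET

BASE_URL = "https://example.com"  # placeholder domain, not the real site


def slugify(city: str) -> str:
    """Lowercase a city name and join its alphanumeric words with hyphens."""
    return "-".join(re.findall(r"[a-z0-9]+", city.lower()))


def build_sitemap(cities) -> str:
    """Return sitemap XML with one <url> entry per unique, non-empty city slug."""
    urlset = ET.Element("urlset", xmlns="http://www.sitemaps.org/schemas/sitemap/0.9")
    slugs = sorted({slugify(c) for c in cities} - {""})
    for slug in slugs:
        url = ET.SubElement(urlset, "url")
        # Assumed URL pattern for the local search pages.
        ET.SubElement(url, "loc").text = f"{BASE_URL}/therapist-in-{slug}"
    return ET.tostring(urlset, encoding="unicode")


sitemap_xml = build_sitemap(["Eugene", "Grants Pass", "eugene"])
```

De-duplicating on the slug rather than the raw city name prevents two sitemap entries for the same page when the source data varies in casing.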
Review Engine
A HIPAA-compliant patient feedback loop to verify quality of care across the network.
The Media Vault
Go behind the scenes and explore the raw, unhinged, and high-volume marketing experiments driving growth for the directory.