superporn hard-blocks the VPS IP: Cloudflare returns 403 for every TLS impersonation profile, so HTML ingest routes through a Bright Data residential proxy (BRIGHTDATA_PROXY_URL, parsed in config). This is the first scraper to use a proxy: an optional _proxy on the browse base is threaded into browser_get.

Metadata comes from the JSON-LD VideoObject (title/desc/uploadDate/thumb/duration) plus the pornstar and category chips. superporn double-encodes HTML entities, so titles are unescaped twice. Thumbnails fetch fine from the VPS (no proxy).

Playback stays off-proxy: the <source> mp4 token is IP-bound to the fetcher, so resolve happens phone-side via WebView (extractor superporncom -> _vps_blocked_fallback), same as porndoe.

Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>
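The JSON-LD parsing and double unescaping described in the commit message could look roughly like the sketch below. It is an illustration under stated assumptions, not the repository's code: the names `extract_video_object` and `unescape_twice`, the regex-based script lookup, and the output field names are all hypothetical.

```python
import html
import json
import re

# Matches <script type="application/ld+json"> blocks (hypothetical helper regex).
_JSONLD_RE = re.compile(
    r'<script[^>]*type=["\']application/ld\+json["\'][^>]*>(.*?)</script>',
    re.DOTALL | re.IGNORECASE,
)


def unescape_twice(text):
    """Decode HTML entities twice, for sources that double-encode them."""
    return html.unescape(html.unescape(text))


def extract_video_object(page_html):
    """Return metadata from the first JSON-LD VideoObject, or None if absent."""
    for match in _JSONLD_RE.finditer(page_html):
        try:
            data = json.loads(match.group(1))
        except json.JSONDecodeError:
            continue  # skip malformed blocks instead of failing the whole page
        candidates = data if isinstance(data, list) else [data]
        for item in candidates:
            if isinstance(item, dict) and item.get("@type") == "VideoObject":
                return {
                    # Text fields are assumed to be double-encoded.
                    "title": unescape_twice(item.get("name", "")),
                    "description": unescape_twice(item.get("description", "")),
                    # Remaining fields are passed through unchanged.
                    "upload_date": item.get("uploadDate"),
                    "thumbnail": item.get("thumbnailUrl"),
                    "duration": item.get("duration"),
                }
    return None
```

Unescaping twice is only safe when the source encodes twice consistently. A title that legitimately contains a literal `&amp;` would be over-decoded, so the second pass is best kept specific to the one connector that needs it.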
| Name |
|---|
| api |
| connectors |
| extractors |
| models |
| normalize |
| resolve |
| scheduler |
| templates |
| __init__.py |
| auth.py |
| config.py |
| db.py |
| ingest.py |
| main.py |