TPDB taxonomy emits numbered-duplicate tags (name "Bubble Butt2"); slugify yields "bubble-butt2" (no separator before digit), so resolve_tag created a separate tag alongside "bubble-butt". Tube scenes inherited the dup via scene-merge → 75 pairs, ~10k scene_tags on the wrong tag.

- resolve_tag: canonicalize "<base>2" -> "<base>" when base exists (handles current + future; trailing-"2"+alpha guard leaves milf-30/teen18 intact)
- scripts/merge_dup2_tags.py: one-off bulk merge (scene_tags + movie_tags + blacklist) and taxonomy-count refresh

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

| | | |
|---|---|---|
| __init__.py | ||
| movie_match.py | ||
| movie_resolver.py | ||
| movie_score.py | ||
| performer_resolver.py | ||
| scene_match.py | ||
| scene_merge.py | ||
| scene_resolver.py | ||
| scene_score.py | ||
| scoring.py | ||
| studio_resolver.py | ||
| studio_title_parser.py | ||
| tag_resolver.py | ||
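
The canonicalization rule from the commit message can be sketched roughly as below. This is an illustration only, not the code in tag_resolver.py: `slugify` here is a simplified stand-in, `canonical_slug` and the `existing` set of known slugs are hypothetical names, and the guard is read as "strip a trailing 2 only when the character before it is a letter and the base slug already exists".

```python
import re

# Assumed reading of the guard: a trailing "2" directly preceded by a letter.
# Slugs such as "milf-30" or "teen18" never match, so they are left intact.
_DUP2 = re.compile(r"^(?P<base>.*[a-z])2$")


def slugify(name: str) -> str:
    """Simplified stand-in for the project's slugify (an assumption)."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def canonical_slug(slug: str, existing: set) -> str:
    """Map "<base>2" to "<base>" when "<base>" is already a known tag slug."""
    match = _DUP2.match(slug)
    if match and match.group("base") in existing:
        return match.group("base")
    return slug


# Hypothetical set of slugs already present in the tag table.
existing = {"bubble-butt", "milf-30", "teen18"}

print(canonical_slug(slugify("Bubble Butt2"), existing))  # folds into the base tag
print(canonical_slug("teen18", existing))                 # unchanged
```

Requiring the base slug to exist keeps the rule conservative: a tag whose name legitimately ends in 2 is only folded when an unnumbered twin is already present.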