feat(scripts): add sportstime-parser data pipeline
Complete Python package for scraping, normalizing, and uploading sports schedule data to CloudKit. Includes: - Multi-source scrapers for NBA, MLB, NFL, NHL, MLS, WNBA, NWSL - Canonical ID system for teams, stadiums, and games - Fuzzy matching with manual alias support - CloudKit uploader with batch operations and deduplication - Comprehensive test suite with fixtures - WNBA abbreviation aliases for improved team resolution - Alias validation script to detect orphan references All 5 phases of data remediation plan completed: - Phase 1: Alias fixes (team/stadium alias additions) - Phase 2: NHL stadium coordinate fixes - Phase 3: Re-scrape validation - Phase 4: iOS bundle update - Phase 5: Code quality improvements (WNBA aliases) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
8
sportstime_parser/__init__.py
Normal file
8
sportstime_parser/__init__.py
Normal file
@@ -0,0 +1,8 @@
|
||||
"""SportsTime Parser - Sports data scraper and CloudKit uploader."""
|
||||
|
||||
__version__ = "0.1.0"
|
||||
__author__ = "SportsTime Team"
|
||||
|
||||
from .cli import run_cli
|
||||
|
||||
__all__ = ["run_cli", "__version__"]
|
||||
Reference in New Issue
Block a user