Open Sports Data
Open-access sports data repositories for research, analytics, and AI model development. Unlike APIs that serve live data on demand, these datasets are downloadable collections of structured sports event data, statistics, and game logs — freely available with no authentication required.
The rise of AI-powered sports analytics has created enormous demand for high-quality training data. Open datasets like StatsBomb's event data and the SCORE Network's curated repositories give researchers, students, and independent developers access to the same granular data that professional clubs use — without the six-figure data licensing fees.
These resources are particularly valuable for building and benchmarking machine learning models: predicting match outcomes, evaluating player performance, optimizing in-game tactics, and training large language models on sports-specific corpora. They are also the foundation of most published sports analytics research.
DevLocker.dev tracks open datasets alongside APIs and MCP Servers because they represent a distinct and important layer of the sports-tech infrastructure stack — the raw material that powers everything from academic papers to production AI systems.
The most comprehensive freely accessible historical basketball database covering NBA and WNBA player statistics, team records, game logs, and advanced metrics from 1946 to present. Includes box scores, play-by-play, salary data, draft history, and award records. Widely used by sports analysts, data scientists, and AI researchers.
Freely available structured ball-by-ball data for international and T20 League cricket matches, including IPL, BBL, PSL, and all major ICC tournaments. Available in JSON, YAML, and CSV formats. Covers men's and women's internationals from 2005 onwards. No registration or API key required — direct download from cricsheet.org.
Comprehensive football (soccer) statistics database covering player, team, and league stats across the top 5 European leagues and major international competitions. Powered by StatsBomb data for advanced metrics including expected goals (xG), progressive passes, and pressures. Covers men's and women's competitions from 1888 onwards.
Comprehensive open NFL datasets maintained by the nflverse community. Includes play-by-play, player stats, schedules, rosters, draft picks, and advanced metrics. Available as CSV and Parquet files with a Python library (nfl_data_py). Covers 1999 to present, updated weekly during the NFL season.
Free, open public domain football (soccer) data in JSON format covering the English Premier League, Bundesliga, Primera División, Serie A, Ligue 1, and more. Includes match schedules, results, team and player data, and stadium information. No API key, no registration, no cost.