Skip to main content
Version: 0.0.54

NBA dataset loaders

flowchart LR
raw["scrape / raw"] --> enrich["enrich"] --> rel["release asset"] --> load["load_*()"]

Automation status

DatasetRelease tagPipeline
load_nba_pbpespn_nba_pbp
load_nba_player_boxscoreespn_nba_player_boxscores
load_nba_scheduleespn_nba_schedules
load_nba_team_boxscoreespn_nba_team_boxscores
load_nba_game_rostersespn_nba_game_rosters
load_nba_officialsespn_nba_officials
load_nba_shotsespn_nba_shots
load_nba_standingsespn_nba_standings
load_nba_stats_schedulesnba_stats_schedules

load_nba_pbp

Release: espn_nba_pbp · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_pbp/play_by_play_{season}.parquet

Returns

col_nametype
idFloat64
sequence_numberString
type_idInt32
type_textString
textString
away_scoreInt32
home_scoreInt32
period_numberInt32
period_display_valueString
clock_display_valueString
scoring_playBoolean
score_valueInt32
shooting_playBoolean
coordinate_x_rawFloat64
coordinate_y_rawFloat64
seasonInt32
season_typeInt32
away_team_idInt32
away_team_nameString
away_team_mascotString
away_team_abbrevString
away_team_name_altString
home_team_idInt32
home_team_nameString
home_team_mascotString
home_team_abbrevString
home_team_name_altString
home_team_spreadFloat64
game_spreadFloat64
home_favoriteBoolean
game_spread_availableBoolean
game_idInt32
qtrInt32
timeString
clock_minutesInt32
clock_secondsFloat64
halfString
game_halfString
lead_qtrInt32
lead_game_halfString
start_quarter_seconds_remainingInt32
start_half_seconds_remainingInt32
start_game_seconds_remainingInt32
game_play_numberInt32
end_quarter_seconds_remainingInt32
end_half_seconds_remainingInt32
end_game_seconds_remainingInt32
periodInt32
team_idInt32
athlete_id_1Int32
athlete_id_2Int32
athlete_id_3Int32
lag_qtrInt32
lag_game_halfString
coordinate_xFloat64
coordinate_yFloat64
game_dateDate
game_date_timeDatetime(time_unit='us', time_zone='America/New_York')
type_abbreviationString
load_nba_pbp(seasons=2024)

load_nba_player_boxscore

Release: espn_nba_player_boxscores · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_player_boxscores/player_box_{season}.parquet

Returns

col_nametype
game_idInt32
seasonInt32
season_typeInt32
game_dateDate
game_date_timeDatetime(time_unit='us', time_zone='America/New_York')
athlete_idInt32
athlete_display_nameString
team_idInt32
team_nameString
team_locationString
team_short_display_nameString
minutesFloat64
field_goals_madeInt32
field_goals_attemptedInt32
three_point_field_goals_madeInt32
three_point_field_goals_attemptedInt32
free_throws_madeInt32
free_throws_attemptedInt32
offensive_reboundsInt32
defensive_reboundsInt32
reboundsInt32
assistsInt32
stealsInt32
blocksInt32
turnoversInt32
foulsInt32
plus_minusString
pointsInt32
starterBoolean
ejectedBoolean
did_not_playBoolean
activeBoolean
athlete_jerseyString
athlete_short_nameString
athlete_headshot_hrefString
athlete_position_nameString
athlete_position_abbreviationString
team_display_nameString
team_uidString
team_slugString
team_logoString
team_abbreviationString
team_colorString
team_alternate_colorString
home_awayString
team_winnerBoolean
team_scoreInt32
opponent_team_idInt32
opponent_team_nameString
opponent_team_locationString
opponent_team_display_nameString
opponent_team_abbreviationString
opponent_team_logoString
opponent_team_colorString
opponent_team_alternate_colorString
opponent_team_scoreInt32
load_nba_player_boxscore(seasons=2024)

load_nba_schedule

Release: espn_nba_schedules · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_schedules/nba_schedule_{season}.parquet

Returns

col_nametype
idInt32
uidString
dateString
attendanceFloat64
time_validBoolean
neutral_siteBoolean
conference_competitionBoolean
recentBoolean
start_dateString
notes_typeString
notes_headlineString
type_idInt32
type_abbreviationString
status_clockFloat64
status_display_clockString
status_periodFloat64
status_type_idInt32
status_type_nameString
status_type_stateString
status_type_completedBoolean
status_type_descriptionString
status_type_detailString
status_type_short_detailString
format_regulation_periodsFloat64
home_idInt32
home_uidString
home_locationString
home_nameString
home_abbreviationString
home_display_nameString
home_short_display_nameString
home_colorString
home_alternate_colorString
home_is_activeBoolean
home_logoString
home_scoreInt32
home_winnerBoolean
away_idInt32
away_uidString
away_locationString
away_nameString
away_abbreviationString
away_display_nameString
away_short_display_nameString
away_colorString
away_alternate_colorString
away_is_activeBoolean
away_logoString
away_scoreInt32
away_winnerBoolean
game_idInt32
seasonInt32
season_typeInt32
venue_idInt32
venue_full_nameString
venue_address_cityString
venue_address_stateString
venue_capacityFloat64
venue_indoorBoolean
status_type_alt_detailString
game_jsonBoolean
game_json_urlString
game_date_timeDatetime(time_unit='us', time_zone='America/New_York')
game_dateDate
PBPBoolean
team_boxBoolean
player_boxBoolean
load_nba_schedule(seasons=2024)

load_nba_team_boxscore

Release: espn_nba_team_boxscores · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_team_boxscores/team_box_{season}.parquet

Returns

col_nametype
game_idInt32
seasonInt32
season_typeInt32
game_dateDate
game_date_timeDatetime(time_unit='us', time_zone='America/New_York')
team_idInt32
team_uidString
team_slugString
team_locationString
team_nameString
team_abbreviationString
team_display_nameString
team_short_display_nameString
team_colorString
team_alternate_colorString
team_logoString
team_home_awayString
team_scoreInt32
team_winnerBoolean
assistsInt32
blocksInt32
defensive_reboundsInt32
fast_break_pointsString
field_goal_pctFloat64
field_goals_madeInt32
field_goals_attemptedInt32
flagrant_foulsInt32
foulsInt32
free_throw_pctFloat64
free_throws_madeInt32
free_throws_attemptedInt32
offensive_reboundsInt32
points_in_paintString
stealsInt32
team_turnoversInt32
technical_foulsInt32
three_point_field_goal_pctFloat64
three_point_field_goals_madeInt32
three_point_field_goals_attemptedInt32
total_reboundsInt32
total_technical_foulsInt32
total_turnoversInt32
turnover_pointsString
turnoversInt32
opponent_team_idInt32
opponent_team_uidString
opponent_team_slugString
opponent_team_locationString
opponent_team_nameString
opponent_team_abbreviationString
opponent_team_display_nameString
opponent_team_short_display_nameString
opponent_team_colorString
opponent_team_alternate_colorString
opponent_team_logoString
opponent_team_scoreInt32
largest_leadString
load_nba_team_boxscore(seasons=2024)

load_nba_game_rosters

Release: espn_nba_game_rosters · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_game_rosters/game_rosters_{season}.parquet

Returns

col_nametype
seasonInt32
game_idString
team_idInt32
team_slugString
team_abbreviationString
team_display_nameString
home_awayString
athlete_idInt32
athlete_uidString
athlete_guidString
athlete_display_nameString
athlete_short_nameString
athlete_first_nameString
athlete_last_nameString
athlete_jerseyString
athlete_positionString
athlete_headshotString
starterBoolean
did_not_playBoolean
activeBoolean
ejectedBoolean
reasonString
load_nba_game_rosters(seasons=2002)

load_nba_officials

Release: espn_nba_officials · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_officials/officials_{season}.parquet

Returns

col_nametype
seasonInt32
game_idInt32
official_full_nameString
official_display_nameString
official_positionString
official_position_idInt32
official_orderInt32
load_nba_officials(seasons=2002)

load_nba_shots

Release: espn_nba_shots · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_shots/shots_{season}.parquet

Returns

col_nametype
game_idInt32
seasonInt32
period_numberInt32
clock_display_valueString
team_idInt32
athlete_id_1Int32
athlete_id_2Int32
type_idInt32
type_textString
scoring_playBoolean
score_valueInt32
coordinate_xFloat64
coordinate_yFloat64
coordinate_x_rawFloat64
coordinate_y_rawFloat64
load_nba_shots(seasons=2002)

load_nba_standings

Release: espn_nba_standings · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_nba_standings/standings_{season}.parquet

Returns

col_nametype
seasonInt32
group_idString
group_nameString
group_abbreviationString
group_short_nameString
team_idInt32
team_uidString
team_slugString
team_locationString
team_nameString
team_abbreviationString
team_display_nameString
team_short_display_nameString
team_colorString
team_alternate_colorString
team_logoString
stat_nameString
stat_display_nameString
stat_short_display_nameString
stat_descriptionString
stat_abbreviationString
stat_typeString
display_valueString
valueFloat64
load_nba_standings(seasons=2002)

load_nba_stats_schedules

Release: nba_stats_schedules · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/nba_stats_schedules/schedule_{season}-26.parquet

Returns

col_nametype
game_dateDate
game_idString
game_codeString
game_statusInt32
game_status_textString
game_sequenceInt32
game_date_estString
game_time_estString
game_date_time_estString
game_date_utcString
game_time_utcString
game_date_time_utcString
away_team_timeString
home_team_timeString
dayString
month_numInt32
week_numberInt32
week_nameString
if_necessaryString
series_game_numberString
game_labelString
game_sub_labelString
series_textString
arena_nameString
arena_stateString
arena_cityString
postponed_statusString
branch_linkString
game_subtypeString
is_neutralBoolean
home_team_idInt32
home_team_nameString
home_team_cityString
home_team_tricodeString
home_team_slugString
home_team_winsInt32
home_team_lossesInt32
home_team_scoreInt32
home_team_seedInt32
away_team_idInt32
away_team_nameString
away_team_cityString
away_team_tricodeString
away_team_slugString
away_team_winsInt32
away_team_lossesInt32
away_team_scoreInt32
away_team_seedInt32
seasonInt32
league_idString
season_type_idString
season_type_descriptionString
load_nba_stats_schedules(seasons=2025)