Version: 0.0.55

MBB dataset loaders

flowchart LR
raw["scrape / raw"] --> enrich["enrich"] --> rel["release asset"] --> load["load_*()"]

Automation status

| Dataset | Release tag | Pipeline |
| --- | --- | --- |
| load_mbb_pbp | espn_mens_college_basketball_pbp | |
| load_mbb_player_boxscore | espn_mens_college_basketball_player_boxscores | |
| load_mbb_schedule | espn_mens_college_basketball_schedules | |
| load_mbb_team_boxscore | espn_mens_college_basketball_team_boxscores | |
| load_mbb_shots | espn_mens_college_basketball_shots | |
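Each release tag corresponds to one parquet asset per season. The sketch below shows how the asset URL for a given season could be assembled from the release tag and the file-name patterns listed in the "Release" lines on this page. The `asset_url` helper and the `ASSETS` mapping are hypothetical names used for illustration; they are not part of the package.

```python
# Base URL and file-name patterns copied from the "Release" lines on this page.
BASE_URL = "https://github.com/sportsdataverse/sportsdataverse-data/releases/download"

# release tag -> asset file-name pattern
ASSETS = {
    "espn_mens_college_basketball_pbp": "play_by_play_{season}.parquet",
    "espn_mens_college_basketball_player_boxscores": "player_box_{season}.parquet",
    "espn_mens_college_basketball_schedules": "mbb_schedule_{season}.parquet",
    "espn_mens_college_basketball_team_boxscores": "team_box_{season}.parquet",
    "espn_mens_college_basketball_shots": "shots_{season}.parquet",
}


def asset_url(release_tag: str, season: int) -> str:
    """Hypothetical helper: build the download URL for one season's parquet asset."""
    filename = ASSETS[release_tag].format(season=season)
    return f"{BASE_URL}/{release_tag}/{filename}"


print(asset_url("espn_mens_college_basketball_pbp", 2024))
```

Whether an asset actually exists for a given season depends on what has been published under that release tag.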

load_mbb_pbp

Release: espn_mens_college_basketball_pbp · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_mens_college_basketball_pbp/play_by_play_{season}.parquet

Returns

| col_name | type |
| --- | --- |
| game_play_number | Int32 |
| id | Float64 |
| sequence_number | Int32 |
| type_id | Int32 |
| type_text | String |
| text | String |
| away_score | Int32 |
| home_score | Int32 |
| period_number | Int32 |
| period_display_value | String |
| clock_display_value | String |
| scoring_play | Boolean |
| score_value | Int32 |
| team_id | Int32 |
| athlete_id_1 | Int32 |
| wallclock | String |
| shooting_play | Boolean |
| game_id | Int32 |
| season | Int32 |
| season_type | Int32 |
| home_team_id | Int32 |
| home_team_name | String |
| home_team_mascot | String |
| home_team_abbrev | String |
| home_team_name_alt | String |
| away_team_id | Int32 |
| away_team_name | String |
| away_team_mascot | String |
| away_team_abbrev | String |
| away_team_name_alt | String |
| game_spread | Float64 |
| home_favorite | Boolean |
| game_spread_available | Boolean |
| home_team_spread | Float64 |
| half | Int32 |
| time | String |
| clock_minutes | Int32 |
| clock_seconds | Int32 |
| home_timeout_called | Boolean |
| away_timeout_called | Boolean |
| lead_period | Int32 |
| lead_half | Int32 |
| start_period_seconds_remaining | Int32 |
| start_game_seconds_remaining | Int32 |
| end_period_seconds_remaining | Int32 |
| end_game_seconds_remaining | Int32 |
| lag_period | Int32 |
| lag_half | Int32 |
| athlete_id_2 | Int32 |
| game_date | Date |
| game_date_time | Datetime(time_unit='us', time_zone='America/New_York') |
| coordinate_x_raw | Float64 |
| coordinate_y_raw | Float64 |
| coordinate_x | Float64 |
| coordinate_y | Float64 |
| media_id | String |

load_mbb_pbp(seasons=2024)
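Because the schema above is fairly wide, a consumer may want to confirm that a loaded frame still carries the documented columns and types before relying on them. The following is a minimal sketch of such a check over plain name-to-type mappings. `schema_diff` and `EXPECTED_PBP` are hypothetical names, only a subset of the columns is listed, and the `actual` mapping is a hand-written stand-in rather than the schema of a real frame.

```python
# Subset of the documented load_mbb_pbp schema (column name -> dtype name as listed above).
EXPECTED_PBP = {
    "game_id": "Int32",
    "season": "Int32",
    "type_text": "String",
    "scoring_play": "Boolean",
    "score_value": "Int32",
    "game_date": "Date",
}


def schema_diff(expected, actual):
    """Hypothetical helper: return (missing columns, columns whose dtype name differs)."""
    missing = sorted(name for name in expected if name not in actual)
    mismatched = {
        name: (expected[name], actual[name])
        for name in expected
        if name in actual and actual[name] != expected[name]
    }
    return missing, mismatched


# Stand-in for a real frame's schema. With a real frame, this mapping would be
# built from the frame's own schema (how to do that is an assumption left out here).
actual = {
    "game_id": "Int32",
    "season": "Int64",
    "type_text": "String",
    "scoring_play": "Boolean",
    "score_value": "Int32",
}

missing, mismatched = schema_diff(EXPECTED_PBP, actual)
print(missing)      # ['game_date']
print(mismatched)   # {'season': ('Int32', 'Int64')}
```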

load_mbb_player_boxscore

Release: espn_mens_college_basketball_player_boxscores · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_mens_college_basketball_player_boxscores/player_box_{season}.parquet

Returns

| col_name | type |
| --- | --- |
| game_id | Int32 |
| season | Int32 |
| season_type | Int32 |
| game_date | Date |
| game_date_time | Datetime(time_unit='us', time_zone='America/New_York') |
| athlete_id | Int32 |
| athlete_display_name | String |
| team_id | Int32 |
| team_name | String |
| team_location | String |
| team_short_display_name | String |
| minutes | Float64 |
| field_goals_made | Int32 |
| field_goals_attempted | Int32 |
| three_point_field_goals_made | Int32 |
| three_point_field_goals_attempted | Int32 |
| free_throws_made | Int32 |
| free_throws_attempted | Int32 |
| offensive_rebounds | Int32 |
| defensive_rebounds | Int32 |
| rebounds | Int32 |
| assists | Int32 |
| steals | Int32 |
| blocks | Int32 |
| turnovers | Int32 |
| fouls | Int32 |
| points | Int32 |
| starter | Boolean |
| ejected | Boolean |
| did_not_play | Boolean |
| active | Boolean |
| athlete_jersey | String |
| athlete_short_name | String |
| athlete_headshot_href | String |
| athlete_position_name | String |
| athlete_position_abbreviation | String |
| team_display_name | String |
| team_uid | String |
| team_slug | String |
| team_logo | String |
| team_abbreviation | String |
| team_color | String |
| team_alternate_color | String |
| home_away | String |
| team_winner | Boolean |
| team_score | Int32 |
| opponent_team_id | Int32 |
| opponent_team_name | String |
| opponent_team_location | String |
| opponent_team_display_name | String |
| opponent_team_abbreviation | String |
| opponent_team_logo | String |
| opponent_team_color | String |
| opponent_team_alternate_color | String |
| opponent_team_score | Int32 |

load_mbb_player_boxscore(seasons=2024)
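The player box score lists made and attempted counts but no shooting percentages, so those have to be derived. A minimal sketch follows, using a hand-written row whose keys match the column names above. The `pct` helper and the sample values are hypothetical, and returning `None` for zero attempts is a design choice made here, not documented behaviour.

```python
def pct(made, attempted):
    """Return made / attempted as a fraction, or None if it cannot be computed."""
    if made is None or not attempted:
        return None  # missing value or zero attempts
    return made / attempted


# Hypothetical row; keys follow the load_mbb_player_boxscore columns.
row = {
    "field_goals_made": 7,
    "field_goals_attempted": 14,
    "three_point_field_goals_made": 2,
    "three_point_field_goals_attempted": 5,
    "free_throws_made": 0,
    "free_throws_attempted": 0,
}

fg_pct = pct(row["field_goals_made"], row["field_goals_attempted"])
three_pct = pct(row["three_point_field_goals_made"], row["three_point_field_goals_attempted"])
ft_pct = pct(row["free_throws_made"], row["free_throws_attempted"])

print(fg_pct, three_pct, ft_pct)  # 0.5 0.4 None
```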

load_mbb_schedule

Release: espn_mens_college_basketball_schedules · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_mens_college_basketball_schedules/mbb_schedule_{season}.parquet

Returns

| col_name | type |
| --- | --- |
| id | Int32 |
| uid | String |
| date | String |
| attendance | Float64 |
| time_valid | Boolean |
| neutral_site | Boolean |
| conference_competition | Boolean |
| recent | Boolean |
| start_date | String |
| notes_type | String |
| notes_headline | String |
| type_id | Int32 |
| type_abbreviation | String |
| venue_id | Int32 |
| venue_full_name | String |
| venue_address_city | String |
| venue_address_state | String |
| venue_capacity | Float64 |
| venue_indoor | Boolean |
| status_clock | Float64 |
| status_display_clock | String |
| status_period | Float64 |
| status_type_id | Int32 |
| status_type_name | String |
| status_type_state | String |
| status_type_completed | Boolean |
| status_type_description | String |
| status_type_detail | String |
| status_type_short_detail | String |
| format_regulation_periods | Float64 |
| home_id | Int32 |
| home_uid | String |
| home_location | String |
| home_name | String |
| home_abbreviation | String |
| home_display_name | String |
| home_short_display_name | String |
| home_color | String |
| home_alternate_color | String |
| home_is_active | Boolean |
| home_venue_id | Int32 |
| home_logo | String |
| home_conference_id | Int32 |
| home_score | Int32 |
| home_winner | Boolean |
| away_id | Int32 |
| away_uid | String |
| away_location | String |
| away_name | String |
| away_abbreviation | String |
| away_display_name | String |
| away_short_display_name | String |
| away_color | String |
| away_alternate_color | String |
| away_is_active | Boolean |
| away_venue_id | Int32 |
| away_logo | String |
| away_conference_id | Int32 |
| away_score | Int32 |
| away_winner | Boolean |
| game_id | Int32 |
| season | Int32 |
| season_type | Int32 |
| status_type_alt_detail | String |
| groups_id | Int32 |
| groups_name | String |
| groups_short_name | String |
| groups_is_conference | Boolean |
| tournament_id | Int32 |
| game_json | Boolean |
| game_json_url | Boolean |
| game_date_time | Datetime(time_unit='us', time_zone='America/New_York') |
| game_date | Date |
| PBP | Boolean |
| team_box | Boolean |
| player_box | Boolean |

load_mbb_schedule(seasons=2024)
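The schedule ends with the Boolean columns `PBP`, `team_box`, and `player_box`. Assuming these indicate whether the corresponding dataset has rows for a game (the page does not say so explicitly), they could be used to pick `game_id` values before joining against the other loaders. The sketch below works on hand-written rows; `games_with_pbp` is a hypothetical helper.

```python
def games_with_pbp(schedule_rows):
    """Return game_id values for completed games flagged as having play-by-play.

    Assumes PBP=True means play-by-play rows exist for the game.
    """
    return [
        row["game_id"]
        for row in schedule_rows
        if row.get("status_type_completed") and row.get("PBP")
    ]


# Hypothetical rows; keys follow the load_mbb_schedule columns.
rows = [
    {"game_id": 1001, "status_type_completed": True, "PBP": True},
    {"game_id": 1002, "status_type_completed": True, "PBP": False},
    {"game_id": 1003, "status_type_completed": False, "PBP": None},
]

print(games_with_pbp(rows))  # [1001]
```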

load_mbb_team_boxscore

Release: espn_mens_college_basketball_team_boxscores · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_mens_college_basketball_team_boxscores/team_box_{season}.parquet

Returns

| col_name | type |
| --- | --- |
| game_id | Int32 |
| season | Int32 |
| season_type | Int32 |
| game_date | Date |
| game_date_time | Datetime(time_unit='us', time_zone='America/New_York') |
| team_id | Int32 |
| team_uid | String |
| team_slug | String |
| team_location | String |
| team_name | String |
| team_abbreviation | String |
| team_display_name | String |
| team_short_display_name | String |
| team_color | String |
| team_alternate_color | String |
| team_logo | String |
| team_home_away | String |
| team_score | Int32 |
| team_winner | Boolean |
| assists | Int32 |
| blocks | Int32 |
| defensive_rebounds | Int32 |
| field_goal_pct | Float64 |
| field_goals_made | Int32 |
| field_goals_attempted | Int32 |
| flagrant_fouls | Int32 |
| fouls | Int32 |
| free_throw_pct | Float64 |
| free_throws_made | Int32 |
| free_throws_attempted | Int32 |
| largest_lead | String |
| offensive_rebounds | Int32 |
| steals | Int32 |
| team_turnovers | Int32 |
| technical_fouls | Int32 |
| three_point_field_goal_pct | Float64 |
| three_point_field_goals_made | Int32 |
| three_point_field_goals_attempted | Int32 |
| total_rebounds | Int32 |
| total_technical_fouls | Int32 |
| total_turnovers | Int32 |
| turnovers | Int32 |
| opponent_team_id | Int32 |
| opponent_team_uid | String |
| opponent_team_slug | String |
| opponent_team_location | String |
| opponent_team_name | String |
| opponent_team_abbreviation | String |
| opponent_team_display_name | String |
| opponent_team_short_display_name | String |
| opponent_team_color | String |
| opponent_team_alternate_color | String |
| opponent_team_logo | String |
| opponent_team_score | Int32 |

load_mbb_team_boxscore(seasons=2024)
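Each team box score row carries both `team_score` and `opponent_team_score`, so a scoring margin can be derived per row without a self-join. The sketch below assumes one row per team per game, which the paired team and opponent columns suggest but the page does not state. The `add_margin` helper, the `margin` key, and the sample rows are hypothetical.

```python
def add_margin(team_box_rows):
    """Return copies of the rows with a hypothetical 'margin' key added."""
    out = []
    for row in team_box_rows:
        margin = row["team_score"] - row["opponent_team_score"]
        out.append({**row, "margin": margin})
    return out


# Hypothetical rows for one game; keys follow the load_mbb_team_boxscore columns.
rows = [
    {"game_id": 2001, "team_id": 10, "opponent_team_id": 11,
     "team_score": 78, "opponent_team_score": 70, "team_winner": True},
    {"game_id": 2001, "team_id": 11, "opponent_team_id": 10,
     "team_score": 70, "opponent_team_score": 78, "team_winner": False},
]

with_margin = add_margin(rows)
print([r["margin"] for r in with_margin])  # [8, -8]
```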

load_mbb_shots

Release: espn_mens_college_basketball_shots · asset https://github.com/sportsdataverse/sportsdataverse-data/releases/download/espn_mens_college_basketball_shots/shots_{season}.parquet

Returns

| col_name | type |
| --- | --- |
| game_id | Int32 |
| season | Int32 |
| period_number | Int32 |
| clock_display_value | String |
| team_id | Int32 |
| athlete_id_1 | Int32 |
| athlete_id_2 | Int32 |
| type_id | Int32 |
| type_text | String |
| scoring_play | Boolean |
| score_value | Int32 |
| coordinate_x | Float64 |
| coordinate_y | Float64 |
| coordinate_x_raw | Float64 |
| coordinate_y_raw | Float64 |

load_mbb_shots(seasons=2025)
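As a sketch of how the shots columns might be aggregated, the example below tallies makes and attempts per shooter. It rests on assumptions the page does not confirm: that each row is one shot attempt, that `athlete_id_1` is the shooter, and that `scoring_play` is true for made shots. `shooting_by_athlete` and the sample rows are hypothetical.

```python
from collections import defaultdict


def shooting_by_athlete(shot_rows):
    """Tally made shots and attempts per athlete_id_1 (assumed to be the shooter)."""
    totals = defaultdict(lambda: {"made": 0, "attempts": 0})
    for row in shot_rows:
        shooter = row["athlete_id_1"]
        if shooter is None:
            continue  # skip rows without an identified shooter
        totals[shooter]["attempts"] += 1
        if row["scoring_play"]:
            totals[shooter]["made"] += 1
    return dict(totals)


# Hypothetical rows; keys follow the load_mbb_shots columns.
rows = [
    {"athlete_id_1": 7, "scoring_play": True},
    {"athlete_id_1": 7, "scoring_play": False},
    {"athlete_id_1": 7, "scoring_play": True},
    {"athlete_id_1": 9, "scoring_play": False},
    {"athlete_id_1": None, "scoring_play": False},
]

summary = shooting_by_athlete(rows)
print(summary)
```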