SeanLahman.com

Download Lahman’s Baseball Database

The updated version of the database contains complete batting and pitching statistics from 1871 to 2022, plus fielding statistics, standings, team stats, managerial records, post-season data, and more. For more details on the latest release, please read the documentation.

The database can be used on any platform, but please be aware that this is not a standalone application. It is a database that requires Microsoft Access or some other database management software to be useful.
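For readers without Microsoft Access, one possible route is to load a file from the comma-delimited version into SQLite using Python's standard library. The following is only a minimal sketch under stated assumptions: the helper name `load_csv_into_sqlite`, the table name `Batting`, and the column names in the sample are illustrative placeholders, and the sample rows are made-up values rather than real statistics. Check the documentation for the actual file and column names in the release you download.

```python
import csv
import io
import sqlite3


def load_csv_into_sqlite(conn, table_name, csv_file):
    """Create table_name from an open CSV file, using its header row as column names.

    Hypothetical helper for illustration. Columns are created without types,
    so every value is stored as text exactly as it appears in the file.
    Returns the number of columns found in the header.
    """
    reader = csv.reader(csv_file)
    header = next(reader)  # first row holds the column names
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
    # Every remaining row must have the same number of fields as the header.
    conn.executemany(f'INSERT INTO "{table_name}" VALUES ({placeholders})', reader)
    conn.commit()
    return len(header)


if __name__ == "__main__":
    # Placeholder rows standing in for a real file; these are not real statistics.
    sample = io.StringIO("playerID,yearID,HR\nexample01,2022,10\nexample02,2022,3\n")
    conn = sqlite3.connect(":memory:")
    load_csv_into_sqlite(conn, "Batting", sample)
    # Values are stored as text, so cast before sorting numerically.
    query = "SELECT playerID, HR FROM Batting ORDER BY CAST(HR AS INTEGER) DESC"
    for row in conn.execute(query):
        print(row)
```

To use a downloaded file instead of the in-memory sample, open it with `open(path, newline="")` and pass the file object as `csv_file`. Connecting to a file path rather than `":memory:"` would keep the resulting database on disk.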

Download Latest Versions 

This release includes playing statistics through the end of the 2022 season.

  • 2022 – comma-delimited version [Baseball Databank]
  • 2022 – MS Access version

Limited Use License

This database is copyright 1996-2022 by Sean Lahman.

This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.  For details see: http://creativecommons.org/licenses/by-sa/3.0/

Thanks to Ted Turocy of the Chadwick Baseball Bureau, who for several years has done the heavy lifting to make the annual updates possible. Ted also hosts a version of the data on GitHub, for folks who are inclined to interface with it that way.

Chris Dalzell and his team maintain an R package and library available through GitHub. Thanks to Chris, Michael Friendly, Dennis Murphy, and Martin Monkman for their ongoing work.

Nat Dunn of Webucator has produced MySQL and SQLite versions, as well as a series of Python scripts for working with the data, all available on GitHub.


Please help support the Baseball Archive. The database is free, but there are real costs associated with maintaining it and making it available for download. The more popular this site becomes, the more expensive it is to keep things going. Please consider making a donation as a show of your support. Like the PBS folks say, we need your support if we’re going to survive. Click here for more information.


Download Previous Versions

Some third-party applications don’t work with newer versions of the database. For that reason, we’re making some earlier versions available for download. Please be advised that no support exists for these versions. All questions about using the database with third-party applications should be directed to the makers of that software.

Through 2021 season

2021 – comma-delimited version [Baseball Databank]
2021 – MS Access version

Through 2020 season

2020 – MS Access version
2020 – comma-delimited version [Baseball Databank]
2020 – R Package

Through 2019 season

2019 – MS Access version
2019 – comma-delimited version
2019 – R Package
2019 – MySQL version
2019 – SQLite version

Through 2018 season

2018 – comma-delimited version
2018 – SQL version
2018 – MS Access version

2017

2017 – Microsoft Access version
2017 – comma-delimited version
2017 – SQL version

2016

2016 – Microsoft Access version
2016 – comma-delimited version
2016 – SQL version

2015

2015 – Microsoft Access version
2015 – comma-delimited version
2015 – SQL version

2014

2014 – comma-delimited version
2014 – Microsoft Access version
2014 – SQL version

2013

2013 – comma-delimited version
2013 – Microsoft Access version
2013 – SQL version

2012

2012 Version – Microsoft Access
2012 Version – comma-delimited version
2012 Version – SQL version

2011
Version 5.9.1 – Microsoft Access
Version 5.9.1 – comma-delimited version
Version 5.9.1 – SQL version

2010
Version 5.8 – Microsoft Access
Version 5.8 – comma-delimited version

2009
Version 5.7 – Microsoft Access
Version 5.7 – comma-delimited version

2008
Version 5.6 – Access
Version 5.6 – comma-delimited version

2007
Version 5.5 – Access
Version 5.5 – comma-delimited version

2006
Version 5.4 – Access
Version 5.4 – comma-delimited version
Version 5.4 – spreadsheet version

2005
Version 5.3 – Access 2000
Version 5.3 – Access 97
Version 5.3 – comma-delimited version
Version 5.3 – spreadsheet version

2004
Version 5.2 – Access 2000
Version 5.2 – Access 97
Version 5.2 – comma-delimited version
Version 5.2 – spreadsheet version

2003
Version 5.1 – Access 2000
Version 5.1 – Access 97
Version 5.1 – comma-delimited version

2002
Version 5.0 – Access 2000
Version 5.0 – Access 97
Version 5.0 – comma-delimited version

2001
Version 4.5 – Access 2000
Version 4.5 – Access 97
Version 4.5 – comma-delimited version

2000
Version 4.0 – Access 97

1999
Version 3.0 – comma-delimited version

© 2023 SeanLahman.com