mpfoley73/baseball-survival

This project is a statistical analysis of the association between longevity (life expectancy) and fame in Major League Baseball. It is inspired by Redelmeier and Singh's study of survival in Academy Award winners, which found that winning an Academy Award was associated with 3.9 years of additional life expectancy. In this setting, induction into the Baseball Hall of Fame replaces the Academy Award as the marker of fame.

What the project does

  • Evaluate data sources. At least two data sources can support this project. Retrosheet publishes a single text file of players that is comprehensive and easy to work with, but somewhat limited in the number of attributes. Lahman's Baseball Database is an entire data model. It has an associated R package and was also used in a similar study by Abel and Kruger.
  • Review literature. The two studies referenced above lay the foundation that this study builds upon. In addition, Redelmeier and Singh recently published an update to their Academy Award study in PLoS ONE.
    • abel_kruger_2005 (html, Rmd). Reproduction of the Abel and Kruger (2005) study (incomplete; I'm struggling with the data setup).
    • saint_onge_2008 (html, Rmd). Review of Major League Baseball Players' Life Expectancies (2008). Partially reproduces the 2008 analysis using the Lahman data, re-fits their discrete-time logistic regression models with data updated through the 2021 season, and fits an alternative Cox proportional hazards model.
  • Groundwork. The Abel and Kruger study is so similar to this one, both in subject and in data source, that reproducing their results will be a good exercise in working with the data.
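
The discrete-time logistic regression mentioned in the saint_onge_2008 item above is usually fit to data in person-period form, with one row per player per year at risk and a 0/1 event indicator. The sketch below illustrates only that reshaping step. It is written in Python for illustration rather than taken from the project's R notebooks, and the function name, column names, and example values are all hypothetical.

```python
def person_periods(player_id, entry_age, exit_age, died):
    """Expand one player into one row per year of age at risk.

    entry_age: age when follow-up starts (hypothetical definition)
    exit_age:  age at death, or at last follow-up if still alive
    died:      True if the player died at exit_age, False if censored
    """
    rows = []
    for age in range(entry_age, exit_age + 1):
        rows.append({
            "player_id": player_id,
            "age": age,
            # The event flag is 1 only in the final period of a player who died.
            "event": int(died and age == exit_age),
        })
    return rows


# Made-up example: followed from age 30, died at age 33.
rows = person_periods("p001", 30, 33, died=True)
# Made-up example: followed from age 30, still alive at age 32 (censored).
censored_rows = person_periods("p002", 30, 32, died=False)
```

A logistic regression of `event` on `age` and other covariates over the stacked rows then estimates the yearly probability of death given survival to that year.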

Why the project is useful

I'm not a professional in aging research, so this project is unlikely to make a meaningful contribution to the literature on survival. However, it will be an end-to-end analysis with accessible data and code that analysts can reference and learn from for their own survival studies. It includes Kaplan-Meier survival curve exploration and Cox proportional hazards regression analysis.
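
For readers new to survival analysis, here is a minimal sketch of the Kaplan-Meier (product-limit) estimator behind the survival curves mentioned above. It is a plain-Python illustration with made-up ages, not the project's code. At each distinct death time it multiplies the running survival probability by one minus the share of at-risk subjects who died, and censored subjects leave the risk set without counting as deaths.

```python
from collections import Counter


def kaplan_meier(durations, events):
    """Return [(time, survival_probability)] at each distinct event time.

    durations: observed follow-up times (e.g. age at death or last follow-up)
    events:    1 if death was observed at that time, 0 if right-censored
    """
    deaths = Counter(t for t, e in zip(durations, events) if e)
    exits = Counter(durations)  # every subject leaves the risk set at their own time
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d:
            # Product-limit step: S(t) = S(previous) * (1 - d / n_at_risk)
            survival *= 1 - d / at_risk
            curve.append((t, survival))
        at_risk -= exits[t]
    return curve


# Made-up ages at death (died = 1) or at last follow-up (died = 0).
ages = [62, 70, 70, 75, 81, 88]
died = [1, 1, 0, 1, 0, 1]
curve = kaplan_meier(ages, died)
```

Comparing two groups, such as Hall of Fame inductees and other players, means computing one curve per group. A Cox proportional hazards model then estimates how covariates shift the hazard.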

Who maintains and contributes to the project

I work on this project independently. The project is under development.

References

Baseball with R blog.

Ernest L. Abel & Michael L. Kruger (2005) The Longevity of Baseball Hall of Famers Compared to Other Players, Death Studies, 29:10, 959-963, DOI: 10.1080/07481180500299493. PDF.

Lahman. Sean Lahman's R Package.

Lemez S, Baker J. Do Elite Athletes Live Longer? A Systematic Review of Mortality and Longevity in Elite Athletes. Sports Med Open. 2015;1(1):16. doi:10.1186/s40798-015-0024-x. PDF.

Redelmeier, D. A., & Singh, S. M. (2001). Survival in Academy Award–winning actors and actresses. Annals of Internal Medicine, 134(10), 955-962. PDF.

Redelmeier DA, Singh SM (2022) Long-term mortality of academy award winning actors and actresses. PLoS ONE 17(4): e0266563. https://doi.org/10.1371/journal.pone.0266563

Retrosheet.

Saint Onge JM, Rogers RG, Krueger PM. Major League Baseball Players' Life Expectancies. Soc Sci Q. 2008;89(3):817-830. doi:10.1111/j.1540-6237.2008.00562.x. HTML.

Sean Lahman's Baseball Database.

Smith G. The Baseball Hall of Fame is not the kiss of death. Death Stud. 2011 Nov-Dec;35(10):949-55. doi: 10.1080/07481187.2011.553337. PMID: 24501860. PDF.

Sylvestre M-P, Huszti E, Hanley J. Do Oscar Winners Live Longer than Less Successful Peers? A Reanalysis of the Evidence. Ann Intern Med. 2006;145:361-3; discussion 392. doi:10.7326/0003-4819-145-5-200609050-00009. PDF.

Survival Analysis. My supervised machine learning notes on survival analysis.
