Predicting which prospects will be successful in the major leagues is hard. With so many factors to consider, even the best prospect rankings are littered with a mix of successes and failures. Performance is just one part of the equation, although isolating it has a few advantages.
Purely stat-based projections provide a more objective measure of value, with subjectivity only lying in choices made when constructing the model. This isn’t to say that performance is the only measure that should be considered; rather it should be used in conjunction with scouting reports that contain information where the system is blind. A computer can also constantly churn out thousands of updated player projections, whereas asking any person to do this would surely drive them insane. I’ll talk a bit about the methodology of the model, but for those just interested in the projections, they can be found toward the bottom of this article. For the sake of mobile readers, the rankings in this article are condensed to a simplified Top 100, but there will also be a spreadsheet linked with more detailed projections.
To start, I wanted to find which minor league statistics were statistically significant in predicting major league performance. For hitters, I narrowed it down to just six: Age, Iso, K%, BB%, Spd, and wRC+. These are standardized and input into a series of logistic models trained on historical minor league data. Since there is some collinearity between the input variables, an L2 regularization penalty is applied. Multiple seasons are factored in, but recent performance and statistics at higher minor league levels are heavily weighted.
The output of the model changed over several iterations, but I eventually settled on average WAR over a player’s best three seasons before age 30. Total WAR over the same time span was also considered, though I found better results using a three-year peak because of factors like playing time and injuries adding noise to the data. It’s also worth noting here that since these are WAR-based outputs, this list isn’t strictly a fantasy list.
The logistic models predict the probability of a hitter’s peak ending up in seven different categories: never playing in the MLB, under 0.5 WAR, 0.5-1.5 WAR, 1.5-2.5 WAR, 2.5-3.5 WAR, 3.5-4.5 WAR, and more than 4.5 WAR. The full probability charts for each player can be found in the spreadsheet link at the bottom of the article, with percentages indicating the cumulative probability a player has of reaching each threshold. The xWAR column is calculated as the mean of these probabilities.
A few key principles the model abides by from the training sample:
- Age (relative to level), K%, and Iso are the most predictive minor league statistics
- High strikeout rates in the low minors are extremely concerning
- High strikeout rates in the high minors are a red flag but less important if the player is young for the level
- Walk rates are less important in the low minors and more important in the high minors
- Players who are very slow tend to have minimal defensive value, and players who are fast tend to end up at premium positions
These may seem obvious, but quantifying how much each of these matters answers the question of how to weight different factors.
There are a couple of weaknesses with the model. If I were aiming for perfection on the first try, I’d never end up releasing it. Firstly, speed does an OK job as a proxy for defensive ability, but it’s flawed in many cases, especially for evaluating catchers. I tested adding positions as inputs, but they had minimal predictive effect since players often change positions as they approach the majors. Eventually I’d like to add some more defensive statistics. The other weak point is projecting college players in the low minors. Recently drafted college players tend to be relatively old for their levels until they reach Double-A, so they generally don’t get great projections until then. I’d also like to add some categorical variables for a player’s pre-professional background. And of course, there has to be pitching projections as well—being on Pitcher List and all.
But without further ado, here are the Top 100 hitting prospects based on the projections:
Wander Franco unsurprisingly ranks No. 1 overall. He checks off every box in the model inputs and continues to put up elite numbers as an 18-year-old in High-A. I mainly want to focus on players where the projections differ from traditional rankings though, starting with No. 2 and No. 3.
Dylan Carlson has been getting some attention this year for his breakout campaign, but he’s really loved by the projections. At just 20 years old in Double-A, Carlson has put up a 143 wRC+ with strikeout and walk rates of 19.4% and 10.7%. With 17 home runs and 13 steals, he’s very good at everything even if there isn’t one elite tool.
There’s a couple of things not to like with Drew Waters, but his age and combination of power and speed place him at No. 3. The good is a 155 wRC+ with 47 extra-base hits and 13 steals as a 20-year-old in Double-A—like Carlson. However, he’s also got a 26.1% strikeout rate and .451 BABIP. Waters does profile as a high-BABIP hitter with plenty of doubles and speed, and good minor league hitters also tend to put up very high BABIPs. The strikeout rate is more concerning, though since he’s so young for Double-A he gets somewhat of a pass.
The rest of the Top 10 is fairly tame until we get to Trent Grisham. He’s massively improved his stock with a 152 wRC+ in Double-A and a 160 wRC+ in Triple-A while walking nearly as much as he’s struck out. Totaling 23 homers and 12 stolen bases so far, he’s a potential five-category producer in fantasy if he can find playing time in the crowded Brewers outfield.
For the sake of brevity, the last two players I’ll touch on are the pair of Blue Jays catchers at No. 14 and No. 15: Alejandro Kirk and Gabriel Moreno. They’re both young for their levels and have put up outstanding offensive numbers with little fanfare. Scouts don’t love the 20-year-old Kirk’s build at 5’9″, 225 pounds but he compares to Willians Astudillo of the Twins with a little less contact ability but more power upside and a high walk rate. Moreno, 19, looks the opposite of Kirk at 5’11”, 165 pounds, though he’s similarly put up a 155 wRC+ with a 9.4% strikeout rate in A-ball.
Full projections of all 1,780 rookie-eligible minor league hitters with at least 100 plate appearances this season and 200 plate appearances in their career can be found here. In future articles, I’d like to also release historical projections and scores for the test predictions when I can spend more words discussing them.
(Photo by Cliff Welch/Icon Sportswire)
