More on the Challenge
I have to admit, just because I haven’t been writing about it, thoughts of this challenge haunt me all day long. I have always put pressure on myself, but this is getting a bit out of hand. I go to bed and lie awake thinking about possibly tweaking my figures a bit, and then I wake up and don’t exactly remember what I thought about the night before.
Regardless, I’m getting closer. I would like to note that I ran multiple regressions on figuring plate appearrances from previous years’ totals, and it looks like Marcel is pretty close. This is good to know, but it is also a double-edged sword.
Fact is, I was hoping to come up with something better, but Tango is right. Pretty much all projection systems are about the same amount right and the same amount wrong. Problem is that if I base mine on what Marcel is doing, I am going to correlate to Marcel at something like .990, which is ridiculous, I think. I want to come up with something different that ends up somewhere in the middle of the pack of all these other projection systems. Anything in the top half of the table would be considered a victory for me. However, judging by the scoring system, where only the top 5 out of 25 people or so score points in each “league”, I would just like to earn some points.
Fact is, about 600 batters go to the plate at least once every year, and I have to figure out who all of them are going to be. I have noticed that the number has been slowly rising since 1990, which is as far back as I have checked. I bet that has something to do with the Kenny Lofton phenomenon, where a player could contribute, but no one signs him because of age and saving money. Then, you end up with three different utility players sharing time between the minors and the majors making up for where Kenny would have played. Is it good or bad for baseball, I don’t know, but there’s probably a bunch of reasons I’m not even thinking of (like expansion). By the way, about 500 of those 600 played the year before that so the trick will be coming up with the other 100, and the 10% (see Evan Longoria) that actually get some good playing time.
Anyway. My plate appearrance correlation is higher and my average error is lower than that of Marcel at the moment, but I am lower in correlation in things like home runs, and RBIs. However, my average error is right on, where Marcel ends up being off across the board. The problem is that I have no idea if what I have is beating Marcel or losing. It’s not really important either way, but if I could regularly beat Marcel, I would at least have a chance in the challenge.
So where am I at? Well, I am pretty happy with my plate appearrance formula. It seems that either the one where I take half of last season and add in a couple tenths of a three year average and then add 100, or the one where I take 65% of last season and add in 10% of the season before and add about 85 both beat Marcel in correlation, average error and absolute error. After that, it gets tougher.
I’ve been messing with the weights, and I have found that my HR correlation jumped up when I used a 9/6/4 (Marcel uses a 5/4/3). I regress to the mean 1000 PA, which ends up being much less than Marcel, because my multipliers are higher. Marcel’s multipliers add up to 12 and he does 1200 PAs. Mine add up to 19 and I use 1000, so it’s about half.
But I am only happy with Home Runs at the moment. I would like to have a different multiplier for each statistic that I need. I am aware the formula only calls for HR + SB + (H - .27*AB) + R/3 + RBI/3, but I am going to need to add in 2B and 3B and SB and CS, as my R and RBI statistics are based on regressions of those statistics.
I think where I am going to end up is that I come up with a different multiplier for each and every statistic, and then I use my regression formulas for R and RBI. I then average my “projections” based on multipliers and the regression projections to come up with a third number for R and RBI. I will not do this for HR, SB or H, as those statistics are not based on other factors, such as the batter in front or behind the person in question.
And then, I will tackle pitching. Man, am I running out of time. I hope that I a) don’t embarrass myself and b) that this happens again next year, because I have thoroughly enjoyed thinking about this so much and I believe that I can come up with something in the next year that will have a shot.
But we’ll have to wait and see.