OK, so this is the first post( of many ) of the GD2 Atypical series. In these posts I’ll be documenting some of the discrepancies I’ve found( and battled to fix in my personal DB ) in the data provided by the GD2 XML files. Also, please note that these entries will only include data from the Mexican Winter and Summer leagues, but inconsistencies may años appear in data provided for other leagues.
I first came through this error while I was querying my DB few days back. I got this result. So the first thing that came to my mind was: How can I player appear twice in the same event number? Furthermore, why did I only get 84 rows( first ten shown below ) out of a table that stores 20k+ records. Uncommon data maybe? You bet.
game inning part_of_the_inning event player_name appearances
gid_2017_10_11_navwin_culwin_1 2 bottom 165 Wilson, Alexis 2
gid_2017_10_12_herwin_obrwin_1 9 top 623 Cardona, Jose 2
gid_2017_10_13_herwin_navwin_1 9 bottom 693 Madero, Diego 2
gid_2017_10_15_mocwin_culwin_1 12 top 721 Garcia, Sergio 2
gid_2017_10_15_mxcwin_mazwin_1 8 bottom 562 Valenzuela, Roberto 2
gid_2017_10_17_culwin_mxcwin_1 8 bottom 676 German, Leo 2
gid_2017_10_19_culwin_mxcwin_1 5 bottom 360 Hernandez, Hector 2
gid_2017_10_19_navwin_obrwin_1 8 top 574 Zazueta, Christian 2
gid_2017_10_20_herwin_obrwin_1 9 top 522 Cardona, Jose 2
Analyzing the first record. I went to this game, searched down for event 165 and I was able to see that Alexis Wilson,id 656029 appeared twice in that event:
<runner id="455378" start="2B" end="" event="Double" event_num="165"/>
<runner id="656029" start="" end="2B" event="Double" event_num="165"/>
<runner id="656029" start="2B" end="" event="Double" event_num="165" score="T" rbi="T" earned="T"/>
<runner id="459441" start="" end="2B" event="Double" event_num="165"/>
According to Gameday( line 2 in above gist ), Alex Wilson hit a double although he was not at bat. Actually, the man batting during that event was Maxwell Leon and guess what, he hit a double. So based on this, we can conclude that Gameday is wrong and that the first runner record showing id 656029 in event 165, should hold Maxwell’s player id.
To end this post I must say that although I can’t prove it, I strongly believe that these discrepancies are caused by some bug related to pinch-runner events. If you don’t believe me, check out this file. It contains the event ids having the same runners twice and the event ids when those runners entered the game as PR. Both event ids are very close to each other in most of the cases.