A fundamental question we need to answer when creating a model is what we are going to ask it for. Ideally we would like to just ask the model to provide us with the winner of the race since this is what really matters. It is not as simple as this though. Remember that racing is a chaotic event and it is impossible to predict the winner with absolute accuracy. More than this we as horse players are not just trying to pick the winner of the race instead we are trying to capitalize in our opinion by been more precise in our opinions than the public.
In theory if we could ask the model to provide us with the probability of each horse to win the race our job as bettors would have been as easy as to comparer this probability against the odds offered by the pool and bet only the overlays. In real world it does not work like this as we have no clue of what the final odds are going to be, even our bet is affecting the pools which means than even if we were able to find the real probability we would not have been able to make a profit using it in this way. More than this what we are going to estimate as a real probability most of the times will be close to what is offered by the public and in the times there will be a large discrepancy the public will be correct most of the times as our model would have been missing some information known to the other bettors.
This does not mean that developing such a model is useless. It can be helpful not only as a confirmation of the validity of the whole methodology of model creation but more than this it can serve as a starting point to more sophisticated models that potentially can show profitability in the long run. Most likely the creation of this type of model is the best start and we will discuss it extensible later although we now that this will not be enough to beat the game.
Another opinion we can ask a model, is to provide us an opinion about what is the likelihood of this particular race to produce a high odds winner. In other words now what we are asking for is a binary value serving as an indicator of what kind of a winner we should be looking for. Obviously this is a much easier question than the previous although its answer remains open to multiple interpretations. Note though, that this answer can now initiate a completely new process of research trying to identify the winner or winners using a subset of the starters of the race.
Note that our question to the model does not necessary has to do with the final outcome of the race. For example we can very well ask the model to identify the horse that will take the lead, or how fast a race will go or anything else.
Based in the infinite set of questions we can ask a model for it becomes obvious that asking the right question will become a critical factor for its success and we will talk about this later to a larger extend.