No! This is a great question, that I think highlights the complexity of even specifying, let alone evaluating, strategy involving pair splits.
Let's clarify some definitions. First, by "zero-memory," I mean (although others may mean something different, for the purpose of this discussion I want to clarify what *I* mean) a strategy that specifies whether to stand/hit/double/split/surrender, as a function of the current player hand and dealer up card... and *only* the current hand and up card. That is, a zero-memory strategy only "knows" the cards in the current hand, not whether that hand was part of the initial deal, or is one of multiple split hands, etc.
Second, "CDZ-" is just one common conventionally-understood notation referring to a *particular* zero-memory strategy, that is determined as follows: (1) compute the composition-dependent strategy that optimizes EV... temporarily ignoring/prohibiting the possibility of pair splitting. (Note that by prohibiting pair splitting, we can all agree on an efficient means of computing this strategy, and furthermore, we can all truly claim that the corresponding EV is actually optimal among all possible composition-dependent strategies.) Then (2) compute the EVs for splitting all possible pairs, assuming that we (a) split and resplit at every opportunity, and (b) use the strategy already computed in step (1) for any other hands encountered "post-split." And finally (3) compute overall EV for the round, where for each initially dealt hand we choose the playing strategy that maximizes the EV computed in step (1) or (2) as appropriate.
Coming back to this 1D example, in the CDZ- strategy we hit 6-2 vs. dealer 5. That CDZ- strategy also specifies player actions for all other possible hands and dealer up cards. Now, let's call "CDZ*" (note the asterisk) the strategy that dictates, "Follow CDZ- in all situations... except that when you encounter 6-2 vs. dealer 5, always double down instead of always hitting." (Note that CDZ* is not any sort of standard notation, I just made it up for the purpose of this discussion.)
We can efficiently compute the overall expected return from playing this modified CDZ* strategy. And we happen to find that this expected return is greater than the expected return from CDZ-. That is, abusing notation somewhat, the above example demonstrates that, for this shoe and these rules, E[CDZ-]<E[CDZ*] (note that inequality is strict).
However, finally getting to your question
, there is yet another third strategy of interest, called CDZ (note there are no minus signs, asterisks, or other qualifiers), that is common conventionally-understood notation referring to the playing strategy that yields the maximum possible overall expected return (for the given shoe subset, in this example a full single deck)-- that is, maximum EV subject to the constraint that it is zero-memory.
What is this CDZ strategy? I don't know. In this specific single-deck S17 example, maybe it's CDZ*. But maybe not-- how do we know that we can't further improve overall EV by making *two* changes to CDZ-, or three, or four, etc., instead of just the *single* modification to strategy with 6-2 vs. 5? For example, I searched for examples like this one by evaluating CDP1 strategy (details are for another post, but essentially relaxing the zero-memory constraint to allow a different strategy pre vs. post-split), and applying individual differences to CDZ- (post- *and* pre-split). But instead of just trying *singleton* subsets of this collection of candidate modifications, it's possible that other subsets of modifications might "collaborate" to improve overall EV further still.
In other words, in this case, we know E[CDZ-]<E[CDZ*] (from explicit calculation), and we know E[CDZ*]<=E[CDZ] (by definition, that is, E[S]<=E[CDZ] for *all* possible zero-memory strategies S), and so by transitivity we know that E[CDZ-]<E[CDZ] (that is, all of the available CAs that we know about are suboptimal, hence the minus sign). But we *don't* know whether E[CDZ*]=E[CDZ].
E
Bookmarks