The reason that don't make the player wait 'til the next opp. to sub is that a team MUST have five players barring injury or DQ. 3-1-1
The rationale for not stopping the layup by Team A to penalize Team B for only having 4 is that you don't also want to penalize Team A when they have done nothing wrong. See Casebook play 10.4.1C
So in short if A has no immediate advantageous situation other than B only having 4 players, you stop the game right away and give the T.
|