1. True or false: Voting ensemble models always perform better than any of their constituent classifiers.
2. What is the rationale for using propensity averaging rather than a voting ensemble?
3. For a binary target, how is the propensity for a positive response calculated?