Regularizing axis-aligned ensembles via data rotations that favor simpler learners

Blaser Rico; Fryzlewicz Piotr

摘要

To overcome the inherent limitations of axis-aligned base learners in ensemble learning, several methods of rotating the feature space have been discussed in the literature. In particular, smoother decision boundaries can often be obtained from axis-aligned ensembles by rotating the feature space. In the present paper, we introduce a low-cost regularization technique that favors rotations which produce compact base learners. The restated problem adds a shrinkage term to the loss function that explicitly accounts for the complexity of the base learners. For example, for tree-based ensembles, we apply a penalty based on the median number of nodes and the median depth of the trees in the forest. Rather than jointly minimizing prediction error and model complexity, which is computationally infeasible, we first generate a prioritized weighting of the available feature rotations that promotes lower model complexity and subsequently minimize prediction errors on each of the selected rotations. We show that the resulting ensembles tend to be significantly more dense, faster to evaluate, and competitive at generalizing in out-of-sample predictions.

机译：为了克服集合学习中轴对齐基础学习者的固有局限，在文献中讨论了几种旋转特征空间的方法。特别地，通过旋转特征空间，通常可以从轴对齐的集合获得更平稳的决策边界。在本文中，我们介绍了一种低成本的正则化技术，这些技术有利于产生紧凑基础学习者的旋转。所重构的问题为丢失函数添加了缩小术语，以明确地占基础学习者的复杂性的损失函数。例如，对于基于树的集合，我们根据森林中的节点数量和树木中位深度的中位数申请惩罚。不是共同最小化预测误差和模型复杂性，这是计算方式不可行的，而是首先生成可用特征旋转的优先权加权，该优先级加权促进更低的模型复杂性，并且随后最小化每个所选择的旋转上的预测误差。我们表明，由此产生的集合倾向于更密集，更快地评估，并在样本外预测中的推广方面更快。

Regularizing axis-aligned ensembles via data rotations that favor simpler learners

摘要

著录项

相关主题

期刊订阅