REC: fast sparse regression-based multicategory classification

Zhang Chong; Lu Xiaoling; Zhu Zhengyuan; Hu Yin; Singh Darshan; Jones Corbin; Liu Jinze; Prins Jan F.; Liu Yufeng


Abstract

Recent advances in technology enable researchers to gather and store enormous data sets with ultra-high dimensionality. In bioinformatics, microarray and next-generation sequencing technologies can produce data with tens of thousands of predictors of biomarkers, while the corresponding sample sizes are often limited. For classification problems, both to predict new observations accurately and to better understand the effect of predictors on classification, it is desirable, and often necessary, to train the classifier with variable selection. In the literature, sparse regularized classification techniques have been popular because they perform classification and variable selection simultaneously. Despite their success, such sparse penalized methods can be computationally slow when the dimension of the problem is ultra-high. To overcome this challenge, we propose a new sparse REgression based multicategory Classifier (REC). Our method uses a simplex to represent the different categories of the classification problem. A major advantage of REC is that the optimization can be decoupled into smaller, independent sparse penalized regression problems, which can then be solved in parallel. Consequently, REC is computationally very fast. Moreover, REC can provide class conditional probability estimates. Simulated examples and applications to microarray and next-generation sequencing data suggest that REC is very competitive with several existing methods.
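The idea in the abstract (simplex-coded classes, one independent sparse regression per simplex coordinate, probabilities read off the fitted values) can be illustrated with a small sketch. This is not the authors' implementation. The function names (simplex_vertices, lasso_cd, fit_rec, predict_proba, predict), the particular regular-simplex construction, the plain coordinate-descent lasso solver without an intercept, the penalty value lam=0.02, the toy data, and the clipped probability formula are all assumptions made for illustration. Threads are used only to show that the k-1 fits are independent of one another.

```python
import math
from concurrent.futures import ThreadPoolExecutor

def simplex_vertices(k):
    """Return k unit vectors in R^(k-1) with equal pairwise angles (assumed coding)."""
    d = k - 1
    verts = [[1.0 / math.sqrt(d)] * d]
    shift = -(1.0 + math.sqrt(k)) / d ** 1.5
    scale = math.sqrt(k / d)
    for j in range(d):
        v = [shift] * d
        v[j] += scale
        verts.append(v)
    return verts

def soft_threshold(z, t):
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0

def lasso_cd(X, y, lam, n_iter=200):
    """Minimise (1/2n)||y - X b||^2 + lam * ||b||_1 by coordinate descent (no intercept)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X b, starting from b = 0
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # covariance of column j with the partial residual (column j's own fit added back)
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new
    return beta

def fit_rec(X, labels, k, lam):
    """Fit k-1 independent lasso regressions, one per simplex coordinate."""
    W = simplex_vertices(k)
    targets = [[W[c][m] for c in labels] for m in range(k - 1)]
    with ThreadPoolExecutor() as pool:  # the fits share no state
        betas = list(pool.map(lambda t: lasso_cd(X, t, lam), targets))
    return W, betas

def predict_proba(x, W, betas):
    """If f(x) = sum_j p_j W_j, then p_j = ((k-1) <f, W_j> + 1) / k; clip and renormalise."""
    k = len(W)
    f = [sum(b * xi for b, xi in zip(beta, x)) for beta in betas]
    scores = [sum(fm * wm for fm, wm in zip(f, w)) for w in W]
    raw = [max(0.0, ((k - 1) * s + 1.0) / k) for s in scores]
    total = sum(raw)
    return [r / total for r in raw]

def predict(x, W, betas):
    probs = predict_proba(x, W, betas)
    return max(range(len(probs)), key=probs.__getitem__)

# Toy data (made up): features 0-2 track the class, features 3-4 are noise.
X = [
    [1.0, 0.0, 0.0, 0.1, -0.1],
    [0.9, 0.1, 0.0, -0.1, 0.1],
    [0.0, 1.0, 0.0, 0.1, 0.1],
    [0.1, 0.9, 0.0, -0.1, -0.1],
    [0.0, 0.0, 1.0, -0.1, 0.1],
    [0.0, 0.1, 0.9, 0.1, -0.1],
]
labels = [0, 0, 1, 1, 2, 2]
W, betas = fit_rec(X, labels, k=3, lam=0.02)
print([predict(x, W, betas) for x in X])
```

In a real high-dimensional setting one would more likely hand each coordinate's regression to an optimized lasso solver running in a separate process or on a separate machine; the sketch only shows why that decoupling is possible.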


Bibliographic details

Source
Statistics and Its Interface | 2017, Issue 2 | 11 pages
Authors
Zhang Chong; Lu Xiaoling; Zhu Zhengyuan; Hu Yin; Singh Darshan; Jones Corbin; Liu Jinze; Prins Jan F.; Liu Yufeng
Author affiliations

Univ Waterloo Dept Stat & Actuarial Sci Waterloo ON N2L 3G1 Canada;

Renmin Univ China Ctr Appl Stat Sch Stat Beijing Peoples R China;

Iowa State Univ Dept Stat Ames IA USA;

Sage Bionetworks Seattle WA USA;

Univ North Carolina Chapel Hill Dept Comp Sci Chapel Hill NC USA;

Univ North Carolina Chapel Hill Dept Biol Chapel Hill NC USA;

Univ Kentucky Dept Comp Sci Lexington KY 40506 USA;

Univ North Carolina Chapel Hill Dept Comp Sci Chapel Hill NC USA;

Univ North Carolina Chapel Hill Dept Stat & Operat Res UNC Lineberger Comprehens Canc Ctr Dept Genet Dept Biostat Carolina Ctr Genome Sci Chapel Hill NC USA;
Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Statistics
Keywords
LASSO; Parallel computing; Probability estimation; Simplex; Variable selection

