首页> 外文学位 >Bolstering CART and Bayesian variable selection methods for classification.

【24h】

Bolstering CART and Bayesian variable selection methods for classification.

机译：支持CART和贝叶斯变量选择方法进行分类。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

An important problem in many areas is exploring the relationship between object categories and their observational characteristics. In particular, it is important to understand which measurements are related to a specific category. One way of tackling this sort of discriminant problem is by a nonparametric method known as Classification and Regression Trees (CART). In this thesis, a stochastic step is added to the CART algorithm and an annealing schedule is used to find 'optimal' models. Two approaches to model selection are proposed to avoid overfitting problems.; For the problems with high dimensional and collinear data sets, we propose a Bayesian variable selection approach to multinomial probit models. Motivated by the binary probit model with latent variables, we build a multivariate extension to the case of more than two categories and use latent variables to specialize the general distributional setting to the linear model with Gaussian errors. We then apply Bayesian variable selection techniques that adopt natural conjugate prior distributions. A posteriori we integrate some of the parameters out and do inference on the marginal distribution of single models by using MCMC methods and truncated normal or student-t sampling techniques to draw multivariate vectors. We apply the methodology to problems in both chemometrics and functional genomics, first to a dataset with three wheats and 100 near infra-red absorbance as regressors, then to two datasets involving microarray data.

机译：在许多领域中，一个重要的问题是探索物体类别与其观测特征之间的关系。特别重要的是要了解哪些测量值与特定类别有关。解决这类判别问题的一种方法是通过称为分类和回归树（CART）的非参数方法。在本文中，将随机步骤添加到CART算法中，并使用退火时间表查找“最佳”模型。提出了两种模型选择方法，以避免过度拟合问题。对于高维和共线数据集的问题，我们提出了多项式概率模型的贝叶斯变量选择方法。受具有潜在变量的二进制概率模型的启发，我们针对两个以上类别的情况构建了多元扩展，并使用潜在变量将一般分布设置专门化为具有高斯误差的线性模型。然后，我们应用采用自然共轭先验分布的贝叶斯变量选择技术。后验我们将某些参数整合出来，并通过使用MCMC方法和截断法线或学生t采样技术绘制多元向量来推断单个模型的边际分布。我们将该方法应用于化学计量学和功能基因组学方面的问题，首先将其应用到具有三个小麦和100个近红外吸收率的回归数据集，然后再应用到涉及微阵列数据的两个数据集。

著录项

作者
Sha, Naijun.;
展开▼
作者单位

Texas A&M University.;

展开▼
授予单位 Texas A&M University.;
学科 Statistics.
学位 Ph.D.
年度 2002
页码 87 p.
总页数 87
原文格式 PDF
正文语种 eng
中图分类统计学;
关键词

相似文献

外文文献
中文文献
专利

1. Bayesian variable variable selection in non-homogeneous hidden Markov models through an evolutionary Monte Carlo method [J] . Spezia Luigi Computational statistics & data analysis . 2020,第1期

机译：通过进化蒙特卡罗方法在非同质隐马尔可夫模型中的贝叶斯变量变量选择
2. Methods and Tools for Bayesian Variable Selection and Model Averaging in Normal Linear Regression [J] . Forte Anabel, Garcia-Donato Gonzalo, Steel Mark International statistical review . 2018,第2期

机译：正态线性回归中贝叶斯变量选择和模型平均的方法和工具
3. Post hoc Analysis for Detecting Individual Rare Variant Risk Associations Using Probit Regression Bayesian Variable Selection Methods in Case-Control Sequencing Studies (vol 40, pg 461, 2016) [J] . Catalona W. J. Genetic epidemiology. . 2017,第7期

机译：在病例对照测序研究中检测单个稀有变体风险关联的探测单个稀有变体风险关联的情况（Vol 40，PG 461,2016）
4. Comparison of Variable Parameter Muskingum-Cunge and Variable Parameter McCarthy-Muskingum Routing Methods [C] . Muthiah Perumal, Bhabagrahi Sahoo World environmental and water resources congress . 2012

机译：可变参数Muskingum-Cunge和可变参数McCarthy-Muskingum路由方法的比较
5. Bayesian Biclustering on Discrete Data: Variable Selection Methods [D] . Guo, Lei. 2013

机译：离散数据的贝叶斯聚类：变量选择方法
6. Identification of significant genes in genomics using Bayesian variable selection methods [O] . Eugene Lin, Lung-Cheng Huang 2008

机译：使用贝叶斯变量选择方法鉴定基因组学中的重要基因
7. Bayesian Variable Selection Methods for Matched Case-Control Studies [O] . Josephine Asafu-Adjei, Mahlet G. Tadesse, Brent Coull, 2017

机译：贝叶斯变量选择方法匹配案例控制研究

Bolstering CART and Bayesian variable selection methods for classification.

摘要

著录项

相似文献

相关主题

期刊订阅