Online Row Sampling

机译：在线行抽样

页面导航

摘要
著录项
相似文献
相关主题

摘要

Finding a small spectral approximation for a tall n x d matrix A is a fundamental numerical primitive. For a number of reasons, one often seeks an approximation whose rows are sampled from those of A. Row sampling improves interpretability, saves space when A is sparse, and preserves row structure, which is especially important, for example, when A represents a graph.However, correctly sampling rows from A can be costly when the matrix is large and cannot be stored and processed in memory. Hence, a number of recent publications focus on row sampling in the streaming setting, using little more space than what is required to store the outputted approximation [Kelner Levin 2013] [Kapralov et al. 2014].Inspired by a growing body of work on online algorithms for machine learning and data analysis, we extend this work to a more restrictive online setting: we read rows of A one by one and immediately decide whether each row should be kept in the spectral approximation or discarded, without ever retracting these decisions. We present an extremely simple algorithm that approximates A up to multiplicative error epsilon and additive error delta using O(d log d log (epsilon ||A||_2^2/delta) / epsilon^2) online samples, with memory overhead proportional to the cost of storing the spectral approximation. We also present an algorithm that uses O(d^2) memory but only requires O(d log (epsilon ||A||_2^2/delta) / epsilon^2) samples, which we show is optimal.Our methods are clean and intuitive, allow for lower memory usage than prior work, and expose new theoretical properties of leverage score based matrix approximation.

机译：为一个高n x d矩阵A找到一个小的光谱近似值是一个基本的数值本原。由于多种原因，人们通常会寻找一种近似值，该近似值是从A的行中进行采样的。行采样提高了可解释性，在A稀疏时节省了空间，并保留了行结构，这尤其重要，例如，当A代表图形时但是，当矩阵很大并且无法在内存中存储和处理时，从A正确采样行可能会耗费大量资金。因此，许多最近的出版物集中于流式设置中的行采样，所使用的空间比存储输出的近似值所需的空间少[Kelner Levin 2013] [Kapralov等。 2014]。受机器学习和数据分析在线算法工作量不断增长的启发，我们将这项工作扩展到更具限制性的在线设置：我们逐一读取A行，并立即决定是否应将每一行保留在频谱近似或丢弃，而无需撤消这些决策。我们提出了一种非常简单的算法，使用在线样本O（d log d log（epsilon || A || _2 ^ 2 / delta / epsilon ^ 2）/ epsilon ^ 2）将A近似为乘积误差epsilon和加性误差delta。存储光谱近似值的成本。我们还提出了一种使用O（d ^ 2）内存但仅需要O（d log（epsilon || A || _2 ^ 2 / delta）/ epsilon ^ 2）样本的算法，我们证明这是最优的。我们的方法是简洁直观，比以前的工作占用更少的内存，并公开了基于杠杆得分的矩阵近似的新理论属性。

著录项

作者
Cohen Michael B.; Musco Cameron; Pachocki Jakub;
展开▼
作者单位

展开▼
年度 2016
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Online Row Sampling [J] . Michael B. Cohen, Cameron Musco, Jakub Pachocki LIPIcs : Leibniz International Proceedings in Informatics . 2016,第29期

机译：在线行采样
2. Row recon IV: nutrient deficiencies sampling for healthiness--foliar sampling can help fruit growers detect nutrient levels in their trees [J] . Brian Sparks Western Fruit Grower . 2001,第4期

机译：行侦查IV：为健康而进行的养分缺乏采样-叶面采样可以帮助果农检测树木中的养分水平
3. Rapid location and online detection of plate material defects with multi-row crossed antenna pairs in the case of material movement [J] . Gao Chong, Li En, Su Qi, Journal of Electromagnetic Waves and Applications . 2018,第7a9期

机译：在材料运动的情况下，使用多排交叉天线对的板材缺陷的快速位置和在线检测
4. Online Testing of a Row-Stationary Convolution Accelerator [C] . Mohammad Rasoul Roshanshah, Katayoon Basharkhah, Zainalabedin Navabi IEEE European Test Symposium . 2021

机译：在线测试行固定卷积加速器
5. Examining the Effectiveness of Online Mindfulness and Nature Interventions on Improving Cognitive Functioning in a Trauma Exposed Sample [D] . Bartel, Alisa. 2018

机译：检查在线思想和自然干预的有效性，以改善创伤暴露样本中的认知功能
6. Innovative Recruitment Using Online Networks: Lessons Learned From an Online Study of Alcohol and Other Drug Use Utilizing a Web-Based Respondent-Driven Sampling (webRDS) Strategy [O] . José A. Bauermeister, Marc A. Zimmerman, Michelle M. Johns, -1

机译：使用在线网络进行创新招聘：使用基于Web的响应者驱动抽样（webRDS）策略从酒精和其他毒品使用的在线研究中吸取的教训
7. Online mixted sampling: An application in hidden populations Muestreo mixto online: Una aplicación en poblaciones ocultas Online mixted sampling: An application in hidden populations [O] . María Tatiana Gorjup, Fabiola Baltar 2012

机译：在线混合采样：在隐性人群中的应用在线混合采样：在隐性人群中的应用在线混合采样：在隐性人群中的应用
8. Unsteady Aerodynamics of a Rotating Compressor Blade Row in Incompressible Flow. Volume 1. Experimental Facilities, Procedures and Sample Data [R] . Hardin, L. W., Carta, F. O. 1985

机译：不可压缩流动中旋转压缩机叶片排的非定常空气动力学。第1卷。实验设施，程序和样本数据

Online Row Sampling

摘要

著录项

相似文献

相关主题

期刊订阅