首页> 外文会议>10th IEEE International Conference on Data Mining Workshops >Knowledge File System -- A Principled Approach to Personal Information Management
【24h】

Knowledge File System -- A Principled Approach to Personal Information Management

机译:知识文件系统-个人信息管理的原则方法

获取原文

摘要

The Knowledge File System (KFS) is a smart virtual file system that sits between the operating system and the file system. Its primary functionality is to automatically organize files in a transparent and seamless manner so as to facilitate easy retrieval. Think of the KFS as a personal assistant, who can file every one of you documents into multiple appropriate folders, so that when it comes time for you to retrieve a file, you can easily find it among any of the folders that are likely to contain it. Technically, KFS analyzes each file and hard links (which are simply pointers to a physical file on POSIX file systems) it to multiple destination directories (categories). The actual classification can be based on a combination of file content analysis, file usage analysis, and manually configured rules. Since the KFS organizes files using the familiar file/folder metaphor, it enjoys 3 key advantages against desktop search based solutions such as Googleȁ9;s Desktop Search, namely 1) usability, 2) portability, and 3) compatibility. The KFS has been prototyped using the FUSE (File system in User space) framework on Linux. Apache Lucerne was used to provide traditional desktop search capability in the KFS. A machine learning text classifier was used as the KFS content classifier, complimenting the customizable rule-based KFS classification framework. Lastly, an embedded database is used to log all file access to support file-usage classification.
机译:知识文件系统(KFS)是位于操作系统和文件系统之间的智能虚拟文件系统。它的主要功能是以透明和无缝的方式自动组织文件,以便于轻松检索。可以将KFS视为私人助理,他可以将您的每个文档都归档到多个适当的文件夹中,这样,当您需要检索文件时,您可以在可能包含以下文件的任何文件夹中轻松找到该文件它。从技术上讲,KFS分析每个文件和硬链接(它们只是指向POSIX文件系统上的物理文件的指针),并将其链接到多个目标目录(类别)。实际分类可以基于文件内容分析,文件使用情况分析和手动配置的规则的组合。由于KFS使用熟悉的文件/文件夹隐喻来组织文件,因此与基于桌面搜索的解决方案(例如Googleȁ9的桌面搜索)相比,它具有3个主要优势,即1)可用性,2)可移植性和3)兼容性。 KFS已使用Linux上的FUSE(用户空间中的文件系统)框架进行了原型设计。 Apache Lucerne用于在KFS中提供传统的桌面搜索功能。机器学习文本分类器用作KFS内容分类器,补充了可自定义的基于规则的KFS分类框架。最后,嵌入式数据库用于记录所有文件访问以支持文件使用分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号