Detection of Speaker Identities from Cochannel Speech Signal

PALLAVI INGALE; SANJAY NALBALWAR

Abstract

Supervised speech segregation of a cochannel speech signal becomes easier if models of predetermined speakers are used instead of models for the whole population. We propose a signal-to-signal ratio (SSR) independent method that detects speaker identities from a cochannel speech signal using speaker-specific features. The proposed Kekre's Transform Cepstral Coefficient (KTCC) features are robust acoustic features for speaker identification. A text-independent speaker identification system identifies speakers in short segments of the test signal, with a Gaussian mixture model (GMM) classifier performing the identification. We compare the proposed method with a system that uses conventional Mel Frequency Cepstral Coefficient (MFCC) features. The experiments use spontaneous speech utterances from candidates, rather than the utterances of the speech separation challenge (SSC) corpus, which follow a command-like structure with a unique grammatical structure and a limited word list. Identification is performed on short segments of the cochannel mixture, and the two speakers identified in the largest number of segments are selected as the two speakers detected for that mixture. With KTCC features, an average speaker detection accuracy of 93.56% is achieved for two-speaker cochannel mixtures. The method produces the best results for cochannel speaker identification despite being text independent. Speaker identification performance is also evaluated for various test segment lengths; KTCC features outperform the conventional MFCC features in the speaker identification task even when the speech segment is very short.
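
The segment-level voting step described in the abstract can be illustrated with a minimal sketch. It assumes that each short segment has already been scored against every enrolled speaker model (for example, a GMM log-likelihood per speaker); the function name `detect_two_speakers`, the speaker labels, and the score values are hypothetical and are not taken from the paper.

```python
from collections import Counter


def detect_two_speakers(segment_scores):
    """Return the (up to) two speakers identified in the most segments.

    segment_scores: list of dicts, one per short segment, mapping a
    speaker label to that speaker model's score (higher is better).
    This input format is an assumption made for illustration.
    """
    # Step 1: identify one speaker per segment (best-scoring model).
    per_segment_winner = [max(scores, key=scores.get) for scores in segment_scores]
    # Step 2: count how many segments each speaker won.
    votes = Counter(per_segment_winner)
    # Step 3: keep the two speakers with the most segment wins.
    return [speaker for speaker, _ in votes.most_common(2)]


# Hypothetical scores for five segments and three enrolled speakers.
example_scores = [
    {"spk_a": -41.2, "spk_b": -44.0, "spk_c": -50.3},
    {"spk_a": -47.5, "spk_b": -40.1, "spk_c": -49.9},
    {"spk_a": -39.8, "spk_b": -43.3, "spk_c": -52.0},
    {"spk_a": -46.0, "spk_b": -41.7, "spk_c": -48.2},
    {"spk_a": -40.5, "spk_b": -45.1, "spk_c": -42.9},
]
detected = detect_two_speakers(example_scores)
print(detected)
```

Because the decision counts segment wins rather than comparing raw scores across speakers, it does not depend directly on the relative level of the two talkers, which is in keeping with the SSR-independent goal stated above. Ties in the vote count are resolved here by first occurrence, which is a choice of this sketch and not something the abstract specifies.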

Bibliographic details

Source
WSEAS Transactions on Signal Processing | 2018, Issue 1 | 7 pages
Authors
PALLAVI INGALE; SANJAY NALBALWAR
Affiliations

Dr. Babasaheb Ambedkar Technological University;

Dr. Babasaheb Ambedkar Technological University;

Indexing information
Original format: PDF
Language: eng
Chinese Library Classification: Acoustics
Keywords
Detection of speaker identities; Text independent speaker identification; Cochannel speech; KTCC;
