Smith – “Digital Signal Processing”

The best explanation of FFT I have found so far.

http://www.dspguide.com/

 

Posted in Uncategorized | Leave a comment

DSP Blogs

http://www.dsprelated.com/blogs.php

Posted in Uncategorized | Leave a comment

NEVER-ENDING LEARNING SYSTEM FOR ON-LINE SPEAKER DIARIZATION

Konstantin Markov and Satoshi Nakamura

National Institute of Information and Communications Technology
ATR Spoken Language Communication Research Labs., Japan

ABSTRACT
In this paper, we describe newhigh-performanceon-line speaker
diarization system which works faster than real-time and
has very low latency. It consists of several modules including
voice activity detection, novel speaker detection, speaker gender
and speaker identity classi􀂿cation. Allmodules share a set
of Gaussian mixturemodels (GMM) representing pause,male
and female speakers, and each individual speaker. Initially,
there are only three GMMs for pause and two speaker genders,
trained in advance from some data. During the speaker
diarization process, for each speech segment it is decidedwhether
it comes from a new speaker or from already known
speaker. In case of a new speaker, his/her gender is identi􀂿ed,
and then, from the corresponding gender GMM, a new GMM
is spawned by copying its parameters. This GMM is learned
on-line using the speech segment data and from this point it
is used to represent the new speaker. All individual speaker
models are produced in this way. In the case of an old speaker,
s/he is identi􀂿ed and the correspondingGMMis again learned
on-line. In order to prevent an unlimited grow of the speaker
model number, those models that have not been selected as
winners for a long period of time are deleted from the system.
This allows the system to be able to perform its task
inde􀂿nitely in addition to being capable of self-organization,
i.e. unsupervised adaptive learning, and preservation of the
learned knowledge, i.e. speakers. Such functionalities are attributed
to the so called Never-Ending Learning systems. For
evaluation, we used part of the TC-STAR database consisting
of European Parliament Plenary speeches. The results show
that this system achieves a speaker diarization error rate of
4.6% with latency of at most 3 seconds.
Index Terms— Speaker diarization, Speaker segmentation,
On-line GMM learning, Never-ending learning.

Posted in Uncategorized | Leave a comment

Why Crunch Mode Doesn’t Work: 6 Lessons

There’s a bottom-line reason most industries gave up crunch mode over 75 years ago: It’s the single most expensive way there is to get the work done.

by Evan Robinson

http://archives.igda.org/articles/erobinson_crunch.php

 

Posted in Uncategorized | Leave a comment

Software Carpentry

Back from the Software Carpentry workshop, 1st – 5th November 2010, London, SOAS.

http://software-carpentry.org/

Verdict: Jolly useful.

Should advertise it in the  CSTR!

Posted in Uncategorized | Leave a comment

Noise Robust Voice Activity Detection Using Features Extracted From the Time-Domain Autocorrelation Function

from Proceedings of the Interspeech 2010 

http://www.interspeech2010.org/

I like that, how will it perform using  RT09 recordings?

Continue reading

Posted in Uncategorized | Leave a comment

Hello world!

Welcome to meetingdiarisation.WordPress.com.

View these blog to follow my simple thoughts and feel free to leave your comments.

Lang may yer lum reek

Erich

Posted in Uncategorized | Leave a comment