Proceedings
Kohler, J., Larson, M., de Jong, F.M.G., Kraaij, W., Ordelman R.J.F. (eds.) Proceedings of the ACM SIGIR Workshop on Searching Spontaneous Conversational Speech, Centre for Telematics and Information Technology, Enschede, 2008. sscs08_proceedings.pdf
Workshop Report
Kohler, J., Larson, M., de Jong, F.M.G., Kraaij, W., Ordelman R.J.F. Spoken Content Retrieval: Searching Spontaneous Conversational Speech, SIGIR Forum, December 2008, Volume 42 Number 2, pp. 67-76. 2008d_sigirforum_kohler.pdf


These talks were part of the complete workshop program:
Speech-based methods in the video
search mix:
In a large-scale, commercial application speech recognition and language technology serve to support video search. Speaker: T. Davis
Hybrid word-subword decoding for
spoken term detection A hybrid recognition system
directly produces lattices containing both words and
subwords. Using multigram models and searching for
in-vocabulary and out-of-vocabulary terms in separate
steps makes possible performance gains on a spoken
term detection task. Speaker: M. Fapso
Fast Approximate Spoken Term
Detection from Sequence of Phonemes A phoneme
recognition approach to spoken term detection is used
to achieve a smaller index size and faster detection
speed. Recognizer error is compensated with a
probabilistic model based on word pronunciation and
the recognizer's phoneme confusion matrix. Speaker:
M. Fapso
Cluster-based Model Fusion for
Spontaneous Speech Retrieval Training: Topics
(queries) in the collection are clustered. The best
weighting scheme for combination of retrieval models
is determined for each cluster. Test topics are
classified into the topic cluster and retrieval is
performed using the corresponding weighting scheme.
Speaker: D. Inkpen
Combination of Multiple Speech
Transcription Methods for Vocabulary Independent
Search: Two algorithms are presented that combine
speech transcripts generated using different word and
sub-word speech recognition methods. The approach
tackles the challenge that out-of-vocabulary terms
present in a spoken term detection task. Speaker J.
Mamou
Advances in the Fraunhofer IAIS
Audiomining System Performance improvements in
vocabulary-independent spoken term detection are
achieved by improved acoustic models and a hybrid word
and syllable search system. In the improved system, it
is no longer necessary to accommodate recognition
error by allowing the match between query word and
syllable transcript to be inexact. Speaker: D.
Schneider
Using Term Clouds to Represent
Segment-Level Semantic Content of Podcasts:
Structured surrogates, which prove useful for the
semantic representation of spoken audio in the user
interface, are created automatically. TextTiling
techniques applied to speech transcripts generate
divide the audio into topical segments and each
segment is represented by a mini-term-cloud derived
from the speech transcript. Speaker M. Tsagkias

