Thank you to all for a successful SSCS2008 Workshop.
Proceedings
Kohler, J., Larson, M., de Jong, F.M.G., Kraaij, W., Ordelman R.J.F. (eds.) Proceedings of the ACM SIGIR Workshop on Searching Spontaneous Conversational Speech, Centre for Telematics and Information Technology, Enschede, 2008. sscs08_proceedings.pdf

Workshop Report
Kohler, J., Larson, M., de Jong, F.M.G., Kraaij, W., Ordelman R.J.F. Spoken Content Retrieval: Searching Spontaneous Conversational Speech, SIGIR Forum, December 2008, Volume 42 Number 2, pp. 67-76. 2008d_sigirforum_kohler.pdf

SSCS2008 Future Research Directions PanelspacerSSCS2008 Panelists discuss future of speech searchspacerSSCS2008 Discussion of speech search

These talks were part of the
complete workshop program:

T. Davis Speech-based methods in the video search mix:
In a large-scale, commercial application speech recognition and language technology serve to support video search. Speaker: T. Davis

M. Fapso Hybrid word-subword decoding for spoken term detection A hybrid recognition system directly produces lattices containing both words and subwords. Using multigram models and searching for in-vocabulary and out-of-vocabulary terms in separate steps makes possible performance gains on a spoken term detection task. Speaker: M. Fapso

M. Fapso Fast Approximate Spoken Term Detection from Sequence of Phonemes A phoneme recognition approach to spoken term detection is used to achieve a smaller index size and faster detection speed. Recognizer error is compensated with a probabilistic model based on word pronunciation and the recognizer's phoneme confusion matrix. Speaker: M. Fapso

D. Inkpen Cluster-based Model Fusion for Spontaneous Speech Retrieval Training: Topics (queries) in the collection are clustered. The best weighting scheme for combination of retrieval models is determined for each cluster. Test topics are classified into the topic cluster and retrieval is performed using the corresponding weighting scheme. Speaker: D. Inkpen

J. Mamou Combination of Multiple Speech Transcription Methods for Vocabulary Independent Search: Two algorithms are presented that combine speech transcripts generated using different word and sub-word speech recognition methods. The approach tackles the challenge that out-of-vocabulary terms present in a spoken term detection task. Speaker J. Mamou

D. Schneider Advances in the Fraunhofer IAIS Audiomining System Performance improvements in vocabulary-independent spoken term detection are achieved by improved acoustic models and a hybrid word and syllable search system. In the improved system, it is no longer necessary to accommodate recognition error by allowing the match between query word and syllable transcript to be inexact. Speaker: D. Schneider

M. Tsagkias Using Term Clouds to Represent Segment-Level Semantic Content of Podcasts: Structured surrogates, which prove useful for the semantic representation of spoken audio in the user interface, are created automatically. TextTiling techniques applied to speech transcripts generate divide the audio into topical segments and each segment is represented by a mini-term-cloud derived from the speech transcript. Speaker M. Tsagkias

Chorusspaceramispacerlogo_mesh