Paper – Augmented Segmentation and Visualization for Presentation Videos

February 16, 2008 by justin

Today I read a paper titled “Augmented Segmentation and Visualization for Presentation Videos”

The abstract is:
We investigate methods of segmenting, visualizing, and indexing presentation videos by separately considering audio and visual data.

The audio track is segmented by speaker, and augmented with key phrases which are extracted using an Automatic Speech Recognizer (ASR).

The video track is segmented by visual dissimilarities and augmented by representative key frames.

An interactive user interface combines a visual representation of audio, video, text, and key frames, and allows the user to navigate a presentation video.

We also explore clustering and labeling of speaker data and present preliminary results.

« Forever Questing in a Blizzard

Listening – Third »

This is the place where I gather my thoughts that don’t fit anywhere else.

Consisting of snark, humour, observations, half-baked ideas, quips, quotes (from me) and just me generally making an arse of myself.

Maybe an occasional status update on my life.

Don’t take anything you read here too seriously. It’s full of thoughts and writing that make you cringe. It’s the unfiltered internal monologue we all have in our heads. Sometimes that voice in your head says stupid shit.

And if you are offended by anything you read here, then you probably stumbled on this website by accident.