Next Scheduled Talk
Title Joint Multi-Layer Segmentation and Reconstruction for 3D-TV Content Production
Time & Venue Printing House Hall - 14:30 31st March 2011
In this talk, I will present a technique which, under these challenging conditions, is able to efficiently compute a high-quality scene representation via graph-cut optimisation of an energy function combining multiple image cues. Robustness is achieved by jointly optimising scene segmentation and multiple view reconstruction in a view-dependent manner with respect to each input camera. Joint optimisation prevents propagation of errors from segmentation to reconstruction as is often the case with sequential approaches. View-dependent processing increases tolerance to errors in through-the-lens calibration compared to global approaches.
Experimental results will be presented with a variety of challenging outdoor scenes captured with manually operated broadcast cameras as well as several indoor scenes with natural background. These datasets will be used to evaluate the accuracy of the technique for high quality segmentation and reconstruction and demonstrate its application for 3D-TV content production. Particularly, two main applications will be considered: free-viewpoint video, which gives a user the ability to freely control the viewpoint from which a video is rendered, and 3D video, which augments a conventional 2D video with depth information.
Upcomming Talks, Last Year's Talks
Previous Talks
23rd March 2011
Speaker Viliam RapcanTitle Can changes in speech predict cognitive decline?
16th March 2011
Speaker Kangyu PanTitle CELLSNAKE : A new active contour technique for cell/fibre segmentation
2nd March 2011
Speaker Finian KellyTitle Effects of Ageing on Long-Term Speaker Verification
On a cohort of 13 adult speakers, using a conventional verification system, longitudinal testing of each speaker is carried out across a 30-40 year range. A progressive degradation in verification score is observed as the time span between the training and test material increases. Above a time span of 5 years, this degradation exceeds the range of normal inter-session variability. The age of the speaker at the time of training is shown to influence the rate at which the verification scores degrade. Our results suggest that the verification score drop-off accelerates for speakers over the age of 60. The implications of these findings for speaker verification will be discussed along with directions of future work.
9th Februaury 2011
Speaker Claire MastersonTitle Binaural Impulse Response Rendering for Immersive Audio
reproduction. This includes a method for the factorisation of datasets of head related impulse responses (HRIRs) using a least squares approach as well as a number of regularisation strategies to enable for more psychoacoustically meaningful, initial-condition independent results to be obtained for various types of HRIR data. A technique for the spatial interpolation of room impulse responses using dynamic time warping and tail synthesis will also be covered. The incorporation of both techniques into an overall spatial audio system using the virtual loudspeaker approach will be described.
2nd February 2011
Speaker Damien KellyTitle Voxel-based Viterbi Active Speaker Tracking (V-VAST) with Best View Selection for Video Lecture Post-production
26th January 2011
Speaker Luca CappellettaTitle Improved Visual Features for Audio-visual Speech Recognition
19th January 2011
Speaker Felix RaimbaultTitle Stereo Video Inpainting
14th December 2010
Speaker Andrew HinesTitle Speech Intelligibility Prediction using a Simulated Performance Intensity Function
7th December 2010
Speaker Mohamed AhmedTitle Reflection Detection in Image Sequences
Reflections in image sequences consist of several layers superimposed over each other. This phenomenon causes many image processing techniques to fail as they assume the presence of only one layer at each examined site e.g. motion estimation and object recognition. Reflections can arise by mixing any two images and hence detecting them automatically remains a hard problem that was not addressed before. This work presents an automated technique for detecting reflections in image sequences by analyzing motion trajectories of feature points. We generate sparse and dense detection maps and our results show high detection rate with rejection to pathological motion, occlusion, and motion blur.
12th October 2010
Speaker Bruno NicolettiTitle Developing VFX for Film and Video on GPUs
In the visual effects world, London-based award-winning firm The Foundry is renowned for its software. Bruno Nicoletti, founder and CTO of The Foundry, speed-talked through a tour of the companys tools and software, demonstrating to an audience with a healthy population of VFX artists and developers how GPUs are changing the industry in Developing GPU-Enabled Visual Effects for Film and Video.
Foundry technology has been used in a host of blockbusters, such as Avatar, Harry Potter, The Dark Knight and many, many others, and its Nuke compositing software has been used for everything from the fantastic (CGI castles) to the mundane (complexion correction).
As a leader in the industry, Nicoletti has an invaluable perspective on the changes that GPUs are making in VFX. GPUs are reducing rendering times and allowing VFX to be involved more pervasively in all stages of production, in effect blurring the line between post production and production.
The popularity of utilizing the power of GPUs in the visual effects (VFX) industry continues to gain momentum. Major film production studios that historically have been CPU-based for VFX are not only utilizing GPUs, they are starting to replace their CPU-based rendering systems with GPU-based one.
This transition to GPU in VFX, however, requires some legwork, particularly when it comes to the complex image processing algorithms in VFX software. This (along with The Foundrys solution) was the subject of the second half of Nicolettis talk.
With hundreds of effects and millions of lines of code in its software, The Foundry was faced with having to rewrite everything to exploit GPUs while maintaining separate algorithms for CPUs. Faced with the prospect of writing and debugging two sets of complex algorithms, The Foundry created something theyre calling Blink (although Nicoletti used its internal code name of RIP, or Righteous Image Processing).
Blink wraps image processing up into a high level C++ API. It lets programmers run kernels on the CPU for debugging, and then those kernels can be translated to spit out GPU CUDA. Nicoletti showed several coding examples and wrapped by showing examples of a motion estimation function run on an Intel Xeon 5504 versus an NVIDIA Quadro 5000. The speed difference was extraordinary (from 5fps to more than 200fps), which augurs for increased demand for VFX on GPU and Blink.
Bruno Nicoletti has worked in visual effects since graduating with a degree in Computer Science and Mathematics from Sydney University in 1987. He has worked at production companies, creating visual effects for broadcast and film, as well as at commercial software companies, developing software to sell into visual effects companies. In his career he has developed 2D image processing software, 3D animation, rendering and modelling tools, often before any equivalent tools were commercially available. In 1996 he started The Foundry to develop visual effects plug-ins and oversaw it's initial growth. The Foundry now develops and sells a range of applications and plugins for VFX which are used in may feature films and TV programmes. Now CTO, he acts as senior engineer at the company and is overseeing the effort to move The Foundry's software to a new image processing frameworks that can exploit CPUs and GPUs to yield dramatic speed improvements.
Upcoming Speakers
| Date | Speaker(s) |
|---|---|
| 7th February | Claire Masterson |
