record – LABS

Thesis Info

LABS ID: 00295

Thesis Title: Expressive Sound Synthesis for Animation

Author: Cécile Picard-Limpens

E-mail: cecile.picard AT sophia.inria.fr

2nd Author

3rd Author

Degree: Ph.D.

Year: 2009

Number of Pages: 175

University: Université de Nice-Sophia Antipolis

Thesis Supervisor: George Drettakis

Supervisor e-mail: george.drettakis AT sophia.inria.fr

Other Supervisor(s): François Faure, Nicolas Tsingos

Language(s) of Thesis: English

Department / Discipline: Automatic, Signal Processing, Audio

Copyright Ownership: none

Languages Familiar to Author: English, French

URL where full thesis can be found: www-sop.inria.fr/reves/Basilic/2009/Pic09/Thesis_CPicard_AftDS.pdf

Keywords: sound modeling, virtual reality, physically based modeling, contact modeling, modal analysis, granular synthesis, adaptive simulation

Abstract: 200-500 words: The main objective of this thesis is to provide tools for an expressive and real-time synthesis of sounds resulting from physical interactions of various objects in a 3D virtual environment. Indeed, these sounds, such as collisions sounds or sounds from continuous interaction between surfaces, are difficult to create in a pre-production process since they are highly dynamic and vary drastically depending on the interaction and objects. To achieve this goal, two approaches are proposed; the first one is based on simulation of physical phenomena responsible for sound production, the second one is based on the processing of a recordings database. According to a physically based point of view, the sound source is modeled as the combination of an excitation and a resonator. We first present an original technique to model the interaction force for continuous contacts, such as rolling. Visual textures of objects in the environment are reused as a discontinuity map to create audible position-dependent variations during continuous contacts. We then propose a method for a robust and exible modal analysis to formulate the resonator. Besides allowing to handle a large variety of geometries and proposing a multi-resolution of modal parameters, the technique enables us to solve the problems of coherence between physics simulation and sound synthesis that are frequently encountered in animation. Following a more empirical approach, we propose an innovative method that consists in bridging the gap between direct playback of audio recordings and physically based synthesis by retargetting audio grains extracted from recordings according to the output of a physics engine. In an off-line analysis task, we automatically segment audio recordings into atomic grains and we represent each original recording as a compact series of audio grains. During interactive animations, the grains are triggered individually or in sequence according to parameters reported from the physics engine and/or user-defined procedures. Finally, we address fracture events which commonly appear in virtual environments, especially in video games. Because of their complexity that makes a purely physical-based model prohibitively expensive and an empirical approach impracticable for the large variety of micro-events, this thesis opens the discussion on a hybrid model and the possible strategies to combine a physically based approach and an empirical approach. The model aims at appropriately rendering the sound corresponding to the fracture and to each specific sounding sample when material breaks into pieces.