Cinematographic Camera Diffusion Model
Abstract
Designing effective camera trajectories in virtual 3D environments is a challenging task even for experienced animators. Despite an elaborate film grammar, forged through years of experience, that enables the specification of camera motions through cinematographic properties (framing, shot sizes, angles, motions), there remain endless possibilities when deciding how to place and move cameras relative to characters. Navigating these possibilities is part of what makes the problem complex. While numerous techniques have been proposed in the literature (optimization-based solving, encoding of empirical rules, learning from real examples, etc.), the results either lack variety or ease of control. In this paper, we propose a cinematographic camera diffusion model that uses a transformer-based architecture to handle temporality and exploits the stochasticity of diffusion models to generate diverse, high-quality trajectories conditioned on high-level textual descriptions. We extend this work by integrating keyframing constraints and the ability to blend naturally between motions using latent interpolation, thereby increasing the degree of control available to designers. We demonstrate the strengths of this text-to-camera-motion approach through qualitative and quantitative experiments and gather feedback from professional artists.
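As an illustration of the latent-interpolation idea mentioned in the abstract, the sketch below blends two diffusion latents with spherical linear interpolation (slerp), a common choice for interpolating Gaussian noise samples. This is a minimal sketch, not the paper's implementation: the latent shape (per-frame camera features) and the final decoding step through the diffusion model are assumptions.

```python
import numpy as np

def slerp(z0: np.ndarray, z1: np.ndarray, t: float) -> np.ndarray:
    """Spherical linear interpolation between two latent tensors of equal shape."""
    z0f, z1f = z0.ravel(), z1.ravel()
    # Angle between the two latents, clipped for numerical stability.
    cos_theta = np.clip(
        np.dot(z0f, z1f) / (np.linalg.norm(z0f) * np.linalg.norm(z1f)),
        -1.0, 1.0,
    )
    theta = np.arccos(cos_theta)
    if theta < 1e-6:  # Nearly parallel latents: fall back to linear interpolation.
        return (1.0 - t) * z0 + t * z1
    return (np.sin((1.0 - t) * theta) * z0 + np.sin(t * theta) * z1) / np.sin(theta)

# Hypothetical example: blend two sampled camera-motion latents.
# Shape (frames, channels) is illustrative, e.g. per-frame position + orientation features.
rng = np.random.default_rng(0)
z_a = rng.standard_normal((120, 7))
z_b = rng.standard_normal((120, 7))
z_mid = slerp(z_a, z_b, 0.5)  # denoise/decode z_mid to obtain the blended trajectory
```

Slerp follows the great circle between the two samples, which keeps the interpolated latent at a norm typical of Gaussian noise; plain linear interpolation would shrink it toward the origin and push it off the data manifold the diffusion model was trained on.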
Main file
CineCamDiff_AuthorVer.pdf (43.04 MB)
teaser.jpeg (5.05 MB)
Origin: Files produced by the author(s)