Computer Vision - ECCV 2024

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXIV

Roth, Stefan; Leonardis, Ales; Ricci, Elisa; Sattler, Torsten; Russakovsky, Olga; Varol, Guel

Springer International Publishing AG

10/2024

488

Softcover

9783031730382

Pre-release - ships 15 to 20 days after publication

Description not available.
Depth-guided NeRF Training via Earth Mover's Distance.
INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding.
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks.
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time.
Diagnosing and Re-learning for Balanced Multimodal Learning.
Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration.
Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders.
BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion.
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views.
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning.
Discovering Unwritten Visual Classifiers with Large Language Models.
LITA: Language Instructed Temporal-Localization Assistant.
MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain.
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs.
Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data.
AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation.
CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection.
SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging.
Minimalist Vision with Freeform Pixels.
All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation.
LatentEditor: Text Driven Local Editing of 3D Scenes.
Single-Photon 3D Imaging with Equi-Depth Photon Histograms.
Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision.
Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models.
POET: Prompt Offset Tuning for Continual Human Action Adaptation.
Domain Generalization of 3D Object Detection by Density-Resampling.
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers.
artificial intelligence;computer networks;computer systems;computer vision;education;Human-Computer Interaction (HCI);image analysis;image coding;image processing;image reconstruction;image segmentation;learning;machine learning;object recognition;pattern recognition;reconstruction;signal processing;software engineering