Computer Vision - ECCV 2024

Computer Vision - ECCV 2024

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXV

Roth, Stefan; Sattler, Torsten; Leonardis, Ales; Varol, Guel; Ricci, Elisa; Russakovsky, Olga

Springer International Publishing AG

11/2024

497

Mole

9783031732256

15 a 20 dias

Descrição não disponível.
MONTAGE: Monitoring Training for Attribution of Generative Diffusion Models.- Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations.- Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination.- Self-supervised visual learning from interactions with objects.- OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation.- BAFFLE: A Baseline of Backpropagation-Free Federated Learning.- Sequential Representation Learning via Static-Dynamic Conditional Disentanglement.- OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects.- 3R-INN: How to be climate friendly while consuming/delivering videos?.- Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction.- Towards Robust Full Low-bit Quantization of Super Resolution Networks.- Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking.- Diverse Text-to-3D Synthesis with Augmented Text Embedding.- Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation.- LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang.- Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks.- AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems.- iHuman: Instant Animatable Digital Humans From Monocular Videos.- SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation.- Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier.- Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering.- Solving the inverse problem of microscopy deconvolution with a residual Beylkin-Coifman-Rokhlin neural network.- Face Reconstruction Transfer Attack as Out-of-Distribution Generalization.- FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models.- Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems.- Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation.- PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects.
artificial intelligence;computer networks;computer systems;computer vision;education;Human-Computer Interaction (HCI);image analysis;image coding;image processing;image reconstruction;image segmentation;learning;machine learning;object recognition;pattern recognition;reconstruction;signal processing;software engineering