Search Paper
  • Home
  • Login
  • Categories
  • Post URL
  • Academic Resources
  • Contact Us

 

CerberusDet: Unified Multi-Dataset Object Detection

google+
Views: 359                 

Author :  Irina Tolstykh, Mikhail Chernyshov, Maksim Kuprashevich

Affiliation :  SALUTEDEV

Country :  Uzbekistan

Category :  Artificial Intelligence

Volume, Issue, Month, Year :  -, -, September, 2024

Abstract :


Conventional object detection models are usually limited by the data on which they were trained and by the category logic they define. With the recent rise of Language-Visual Models, new methods have emerged that are not restricted to these fixed categories. Despite their flexibility, such Open Vocabulary detection models still fall short in accuracy compared to traditional models with fixed classes. At the same time, more accurate data-specific models face challenges when there is a need to extend classes or merge different datasets for training. The latter often cannot be combined due to different logics or conflicting class definitions, making it difficult to improve a model without compromising its performance. In this paper, we introduce CerberusDet, a framework with a multi-headed model designed for handling multiple object detection tasks. Proposed model is built on the YOLO architecture and efficiently shares visual features from both backbone and neck components, while maintaining separate task heads. This approach allows CerberusDet to perform very efficiently while still delivering optimal results. We evaluated the model on the PASCAL VOC dataset and Objects365 dataset to demonstrate its abilities. CerberusDet achieved state-of-the-art results with 36% less inference time. The more tasks are trained together, the more efficient the proposed model becomes compared to running individual models sequentially. The training and inference code, as well as the model, are available as open-source.

Keyword :  computer vision, detection, object detection, Multi-Task Learning, Multi-Dataset Learning, Computer Vision, YOLO, Parameter Sharing, Representation Similarity Analysis, Computational Efficiency, Multi

URL :  https://arxiv.org/abs/2407.12632

User Name : mvkuprashevich
Posted 06-08-2025 on 09:57:41 AEDT



Related Research Work

  • Augmented And Synthetic Data In Artificial Intelligence
  • Nohumansrequired: Autonomous High-quality Image Editing Triplet Mining
  • Gigacheck: Detecting Llm-generated Content
  • Saliency-guided Detr For Moment Retrieval And Highlight Detection

About Us | Post Cfp | Share URL Main | Share URL category | Post URL
All Rights Reserved @ Call for Papers - Conference & Journals