Use YoloR with swin transformer as backbone.
@leondgarse I am trying to run inference using YOLOR with a Swin Transformer backbone, but I am getting the following results. What could be the issue?
from keras_cv_attention_models import swin_transformer_v2, yolor
from keras_cv_attention_models import test_images
from keras_cv_attention_models.coco import data

# Build the Swin Transformer V2 backbone and wrap it in a YOLOR detector
bb = swin_transformer_v2.SwinTransformerV2Small_window16(input_shape=(256, 256, 3), num_classes=1000)
model = yolor.YOLOR(backbone=bb)

# Run inference on a bundled test image
imm = test_images.dog_cat()
preds = model(model.preprocess_input(imm))
bboxs, labels, confidences = model.decode_predictions(preds)[0]

# Draw the decoded boxes on the image
data.show_image_with_bboxes(imm, bboxs, labels, confidences)
resulting output
Issue Analytics
- Created a year ago
- Comments:10 (5 by maintainers)
Top GitHub Comments
If you really want, this yolor_swin.h5 is a model trained for 10 epochs.
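The reply points to a trained checkpoint, which suggests the model in the question was run without trained detection weights. A minimal sketch of loading such a checkpoint and keeping only confident detections might look like the following. The helper names `keep_confident` and `run_with_trained_weights`, the `min_conf` threshold, and the local file path are hypothetical, and it is an assumption that the checkpoint matches the backbone variant and input shape used in the question.

```python
import os


def keep_confident(bboxes, labels, confidences, min_conf=0.3):
    """Keep only detections whose confidence is at least min_conf.

    Hypothetical helper, not part of keras_cv_attention_models.
    """
    return [
        (box, label, conf)
        for box, label, conf in zip(bboxes, labels, confidences)
        if float(conf) >= min_conf
    ]


def run_with_trained_weights(weights_path="yolor_swin.h5", min_conf=0.3):
    """Rebuild the model from the question, load trained weights, and run inference.

    Assumption: the checkpoint was saved from a model with the same backbone
    variant and input shape as below. If it was not, load_weights will fail.
    """
    # Imported here so keep_confident can be used without TensorFlow installed
    from keras_cv_attention_models import swin_transformer_v2, test_images, yolor

    if not os.path.exists(weights_path):
        raise FileNotFoundError(f"Checkpoint not found: {weights_path}")

    bb = swin_transformer_v2.SwinTransformerV2Small_window16(
        input_shape=(256, 256, 3), num_classes=1000
    )
    model = yolor.YOLOR(backbone=bb)
    model.load_weights(weights_path)  # standard Keras weight loading

    imm = test_images.dog_cat()
    preds = model(model.preprocess_input(imm))
    bboxes, labels, confidences = model.decode_predictions(preds)[0]
    return imm, keep_confident(bboxes, labels, confidences, min_conf)
```

If most detections fall below the threshold even after loading the checkpoint, that would point to a mismatch between the checkpoint and the model definition rather than to the decoding step.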
This is a command I’ve just tested. Detailed usage of coco_train_script.py is explained in COCO training and evaluating. Here is a test result after running only 9 epochs: