Roadmap of MMDetection #2931

hellock · 2020-06-07T17:07:13Z

We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here.

You can either:

Suggest a new feature by leaving a comment.
Vote for a feature request with 👍 or against it with 👎. (Remember that developers are busy and cannot respond to all feature requests, so vote for the one you want most!)
Tell us that you would like to help implement one of the features in the list or review the PRs. (This is the greatest thing to hear!)

V2.4 (August)

ResNeSt. (#2959)
YOLOv3. (#3083)
SOLO. (#3447)
YOLACT. (#3456)
Mosaic augmentation. (#3389)
Cutout augmentation. (#3521)
Batch inference support.

V2.3 (July)

~~ResNeSt. (#2959)~~ (delayed to V2.4)
CornerNet. (#2796, #2840, #3036)
~~YOLOv3. (#3083)~~ (delayed to V2.4)
DIoU/CIoU loss. (#3151)
ONNX support for single-stage detectors. (#3075)
~~Batch inference support. (#1833)~~ (delayed to V2.4)
LVIS v1

V2.2 (June)

~~ResNeSt.~~ (delayed to V2.3)
DetectoRS. (#3064)
PointRend. (#2752)
~~CornerNet.~~ (delayed to V2.3)
Dynamic R-CNN. (#3040)
Generalized Focal Loss (#3097)
Refactoring of Anchor-free detectors. (#2867)

daavoo · 2020-06-08T08:20:23Z

I would be interested in adding support for EfficientNet (backbone, #764) and EfficientDet (detector, #1827) architectures along with trained models.

I will also be willing to contribute this feature myself.

mmeendez8 · 2020-06-09T10:02:14Z

It would be interesting to add batch inference support. I've seen a couple of issues related to this (#2703 #1833 #1659) and some details in the source code, and it looks like it has been left as future work (see https://github.com/open-mmlab/mmdetection/blob/master/mmdet/models/detectors/base.py#L118), so it could be a good moment.

I'd be glad to help with this too.

hiyyg · 2020-06-09T10:32:38Z

How about adding the support for yolov4?

leehao178 · 2020-06-10T02:25:07Z

Hi authors
Can you add DOTA dataset support?
ex:more bbox parameters!
https://captain-whu.github.io/DOTA/dataset.html

Thank you so much!!

zehuichen123 · 2020-06-10T10:50:48Z

Any plans for knowledge distillation for object detection, for example FitNet (https://github.com/TuSimple/simpledet/blob/master/models/KD/README.md), so that users can customize their distillation loss functions based on their requirements?

hellock · 2020-06-11T04:16:32Z

> I would be interested in adding support for EfficientNet (backbone, #764) and EfficientDet (detector, #1827) architectures along with trained models.
>
> I will also be willing to contribute this feature myself.

Looking forward to it. Is there an expected time for EfficientNet and EfficientDet? We need to plan whether to release them in V2.2 (June 30) or V2.3 (July 31).

hellock · 2020-06-11T04:19:22Z

> It would be interesting to add batch inference support. I've seen a couple of issues related to this (#2703 #1833 #1659) and some details in the source code, and it looks like it has been left as future work (see https://github.com/open-mmlab/mmdetection/blob/master/mmdet/models/detectors/base.py#L118), so it could be a good moment.
>
> I'd be glad to help with this too.

Thanks! We created a PR (#1833) to support batch inference a few months ago, and will continue working on it to make it compatible with V2.0. You are still welcome to contribute any other new features.

hellock · 2020-06-11T04:26:58Z

> How about adding the support for yolov4?

If there are no community contributors helping with that, we will first add YOLO v3 in V2.3 (July 31).

ZwwWayne · 2020-06-11T04:29:56Z

> Hi authors
> Can you add DOTA dataset support?
> ex:more bbox parameters!
> https://captain-whu.github.io/DOTA/dataset.html
>
> Thank you so much!!

For now we do not have a plan for that, but there is an mmdet-based project doing it. You might be interested in https://github.com/dingjiansw101/AerialDetection.

daavoo · 2020-06-12T07:53:20Z

> > I would be interested in adding support for EfficientNet (backbone, #764) and EfficientDet (detector, #1827) architectures along with trained models.
> > I will also be willing to contribute this feature myself.
>
> Looking forward to it. Is there an expected time for EfficientNet and EfficientDet? We need to plan whether to release them in V2.2 (June 30) or V2.3 (July 31).

I think that probably EfficientNet for version v2.2 and EfficientDet for v2.3

JosonChan1998 · 2020-06-12T16:45:53Z

Hi authors!
Any plan for AC-FPN?
(https://arxiv.org/pdf/2005.11475.pdf)

SuzaKrish · 2020-06-12T20:44:01Z

Could you add some help about how to actually build modules out of these existing ones?

ZwwWayne · 2020-06-13T04:23:44Z

Hi @SuzaKrish ,
Tutorials have already been added here. Is there anything unclear or insufficient?

SuzaKrish · 2020-06-13T14:35:06Z

Thanks! So the tutorials explain how the base is supposed to be formed, right? How do we then finally call it to run the model? Also, while going over the config files, I was unsure where to find these modules in the main repo. Could you include a part in the README describing which folders contain which components (like head, model structure, base, etc.)?

ZwwWayne · 2020-06-15T14:27:04Z

Hi @SuzaKrish ,
This documentation might help you.

hiyyg · 2020-06-15T16:14:41Z

How about supporting coco-like evaluation for custom datasets without coco json annotation file? For instance, by converting the custom dataset to coco format internally during evaluation like https://github.com/facebookresearch/detectron2/blob/master/detectron2/evaluation/coco_evaluation.py#L72.
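
A minimal sketch of what such an internal conversion could look like, in plain Python. The input layout (`filename`, `width`, `height`, `boxes` as corner coordinates plus a zero-based label) and the function name `to_coco_dict` are assumptions made for illustration; they are not part of mmdetection or detectron2. Only the output keys follow the COCO annotation format, where `bbox` is `[x, y, width, height]`.

```python
def to_coco_dict(samples, class_names):
    """Build an in-memory COCO-style ground-truth dict.

    `samples` is a hypothetical custom format: a list of dicts with
    `filename`, `width`, `height`, and `boxes`, where each box is
    (x1, y1, x2, y2, label) and `label` is a zero-based class index.
    """
    # COCO category ids are conventionally positive, so shift by one.
    categories = [{"id": i + 1, "name": name} for i, name in enumerate(class_names)]
    images, annotations = [], []
    ann_id = 1
    for img_id, sample in enumerate(samples, start=1):
        images.append({
            "id": img_id,
            "file_name": sample["filename"],
            "width": sample["width"],
            "height": sample["height"],
        })
        for x1, y1, x2, y2, label in sample["boxes"]:
            w, h = x2 - x1, y2 - y1
            annotations.append({
                "id": ann_id,
                "image_id": img_id,
                "category_id": label + 1,
                "bbox": [x1, y1, w, h],  # COCO uses [x, y, width, height]
                "area": w * h,
                "iscrowd": 0,
            })
            ann_id += 1
    return {"images": images, "annotations": annotations, "categories": categories}


# Hypothetical custom dataset with one image and one box of class "person".
samples = [{"filename": "a.jpg", "width": 640, "height": 480,
            "boxes": [(10, 20, 40, 60, 1)]}]
coco_gt = to_coco_dict(samples, ["car", "person"])
```

The resulting dict could then be handed to a COCO-style evaluator in place of a JSON annotation file; how it is loaded depends on the evaluator being used.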

SuzaKrish · 2020-06-16T12:26:51Z

> documentation

Thank you! This is helpful! :) @ZwwWayne

shawn-xj · 2020-06-17T08:24:04Z

Is there any plan for supporting some light-weight networks like BiSeNet, DFANet?

JIEMIN1995 · 2020-06-18T06:14:56Z

Any plans to support lightweight backbones such as MobileNetV2 and MobileNetV3?

ElectronicElephant · 2020-06-18T14:33:06Z

> > How about adding the support for yolov4?
>
> If there are no community contributors helping with that, we will first add YOLO v3 in V2.3 (July 31).

Hello, @WenqiangX and I are glad to help implement YOLO v3, and perhaps v4 as well, if we feel good about it. We are from MVIG, SJTU and have spent quite a lot of time studying all kinds of YOLOv3 implementations, especially the one from gluon-cv, which is probably one of the best re-implementations of YOLOv3.

Basically, we plan to continue the work of #1695. Broadly, we plan to do the following:

Refactor the backbone with your new ConvModule (as mentioned in #1695 (comment))
Solve the problem of not being able to use distributed training
Introduce some great ideas from gluon-cv

If you are happy with this, could you find time to check whether there is any major defect in #1695, so that we can save some review time in the future?

BTW, I don't know if the license from Western Digital company will be a big issue.

hiyyg · 2020-06-19T00:03:44Z

Any plan on adding vovnet backbone, such as https://github.com/aim-uofa/AdelaiDet/tree/master/configs/FCOS-Detection/vovnet?

hellock · 2020-06-19T17:41:48Z

> > > How about adding the support for yolov4?
> >
> > If there are no community contributors helping with that, we will first add YOLO v3 in V2.3 (July 31).
>
> Hello, @WenqiangX and I are glad to help implement YOLO v3, and perhaps v4 as well, if we feel good about it. We are from MVIG, SJTU and have spent quite a lot of time studying all kinds of YOLOv3 implementations, especially the one from gluon-cv, which is probably one of the best re-implementations of YOLOv3.
>
> Basically, we plan to continue the work of #1695. Broadly, we plan to do the following:
>
> Refactor the backbone with your new ConvModule (as mentioned in #1695 (comment))
> Solve the problem of not being able to use distributed training
> Introduce some great ideas from gluon-cv
>
> If you are happy with this, could you find time to check whether there is any major defect in #1695, so that we can save some review time in the future?
>
> BTW, I don't know if the license from Western Digital company will be a big issue.

Thanks, and glad to know you are willing to help! We can have further discussion in that PR and may expect YOLOv3 in V2.3. The copyright is OK as long as it is licensed under Apache-2.0.

chuong98 · 2020-06-22T22:53:22Z

Can we add SpineNet (CVPR2020)?
The code is provided by @lucifer443 https://github.com/lucifer443/SpineNet-Pytorch
As said in the repo, I don't have enough servers to train it, but I am willing to adjust the code and open a pull request.

I only trained with protocol B. Training SpineNet takes lots of time, for example I took 7 days to train SpineNet-49 with 8 TITAN V gpus.

ZwwWayne · 2020-06-23T06:08:17Z

> Can we add SpineNet (CVPR2020)?
> The code is provided by @lucifer443 https://github.com/lucifer443/SpineNet-Pytorch
> As said in the repo, I don't have enough servers to train it, but I am willing to adjust the code and open a pull request.
>
> I only trained with protocol B. Training SpineNet takes lots of time, for example I took 7 days to train SpineNet-49 with 8 TITAN V gpus.

PRs are welcome.

tianq01 · 2020-06-23T09:21:39Z

any plan for ONNX support? e.g. two stage faster-rcnn
thanks.

mathmanu · 2020-06-29T04:40:43Z

"Bag of Freebies for Training Object Detection Neural Networks"

Describe the feature
Is it possible to support the training improvements described in the following paper:
"Bag of Freebies for Training Object Detection Neural Networks"
Zhi Zhang, Tong He, Hang Zhang, Zhongyue Zhang, Junyuan Xie, Mu Li
https://arxiv.org/pdf/1902.04103.pdf

Motivation
It seems the "Bag of Freebies" provides a significant accuracy improvement (+5%) in a single-shot detector such as YOLOv3. It improves Faster-RCNN by up to 1.7%. It is likely that these features will improve other detectors as well.

Related resources
https://medium.com/apache-mxnet/gluoncv-0-3-a-new-horizon-564326364e16
https://gluon-cv.mxnet.io/model_zoo/detection.html

Additional context
I think implementing these features would give accuracy lift to several object detectors in mmdetection.

Note: Added here as per the suggestion in #3124

Thanks,

mathmanu · 2020-07-01T11:41:36Z

"Objects as Points (CenterNet)" is a popular, high accuracy Object Detector.

"Objects as Points"
Xingyi Zhou, Dequan Wang, Philipp Krähenbühl
https://arxiv.org/abs/1904.07850
Source code: https://github.com/xingyizhou/CenterNet
(This detector is referred to as CenterNet in the paper. But there is another detector that calls itself CenterNet - so I am using both names to avoid confusion)

In CVPR2020, the second place solution in the 2D Object detection track of the Waymo open challenges used this detector as one component:
https://waymo.com/open/challenges/
https://waymo.com/open/challenges/2d-detection/
"2nd Place Solution for Waymo Open Dataset Challenge - 2D Object Detection"
Sijia Chen∗ Yu Wang∗ Li Huang Runzhou Ge Yihan Hu Zhuangzhuang Ding Jie Liao, Horizon Robotics Inc.
https://arxiv.org/pdf/2006.15507.pdf

It is also worth noting that this CenterNet inspired AFDet, which in turn was the basis for the 1st place solution for 3D object detection:
"1st Place Solution for Waymo Open Dataset Challenge - 3D Detection and Domain Adaptation"
Zhuangzhuang Ding∗ Yihan Hu∗ Runzhou Ge∗ Li Huang Sijia Chen Yu Wang Jie Liao, Horizon Robotics
https://arxiv.org/pdf/2006.15505.pdf
"AFDet: Anchor Free One Stage 3D Object Detection"
Runzhou Ge∗ Zhuangzhuang Ding∗ Yihan Hu∗ Yu Wang Sijia Chen Li Huang Yuan Li, Horizon Robotics
https://arxiv.org/pdf/2006.12671.pdf

So, having this "Objects as Points (CenterNet)" Detector in mmdetection is highly desirable. Kindly add it if possible.

YAOYI626 · 2020-07-01T13:06:37Z

"Objects as Points (CenterNet)" is a popular, high accuracy Object Detector.

Objects as Points
Xingyi Zhou, Dequan Wang, Philipp Krähenbühl
https://arxiv.org/abs/1904.07850
Source code: https://github.com/xingyizhou/CenterNet
(This detector is referred to as CenterNet in the paper. But there is another detector that calls itself CenterNet - so I am using both names to avoid confusion)

In CVPR2020, the second place solution in the 2D Object detection track of the Waymo open challenges used this detector as one component:
https://waymo.com/open/challenges/
https://waymo.com/open/challenges/2d-detection/

2nd Place Solution for Waymo Open Dataset Challenge - 2D Object Detection
Sijia Chen∗ Yu Wang∗ Li Huang Runzhou Ge Yihan Hu Zhuangzhuang Ding Jie Liao, Horizon Robotics Inc.
https://arxiv.org/pdf/2006.15507.pdf

Having this "Objects as Points (CenterNet)" Detector in mmdetection is highly desirable. Kindly add it if possible.

Hey @mathmanu, maybe this PR will help you. But it is not officially supported and only works with mmdet v1.x. Would you mind merging it with the newer version? @hellock @ZwwWayne I strongly believe our community wants this detector in the model zoo.

ZwwWayne · 2020-07-03T06:06:36Z

Hi @mathmanu , @mathmanu ,
Thanks for your kind suggestion. Due to limited developers and resources, we are not going to implement those methods in the near future, but PRs are welcome.

ElectronicElephant · 2020-07-03T11:52:36Z

Hi @hellock @xvjiarui ,

The main work of our YOLOv3 implementation was done several days ago. It would be nice if you could find time to review the code. Also, I don't have many GPUs, so I haven't tested it with other backbones like ResNet yet. (But it should work in my design.)

Btw, I think if mm-detection is aimed at both academia and production, then it should contain both light-weight and heavy-weight backbones / models. Would you like me to add YOLOv3-tiny after YOLOv3 is merged?

hellock · 2020-07-03T14:52:20Z

> Hi @hellock @xvjiarui ,
>
> The main work of our YOLOv3 implementation was done several days ago. It would be nice if you could find time to review the code. Also, I don't have many GPUs, so I haven't tested it with other backbones like ResNet yet. (But it should work in my design.)
>
> Btw, I think if mm-detection is aimed at both academia and production, then it should contain both light-weight and heavy-weight backbones / models. Would you like me to add YOLOv3-tiny after YOLOv3 is merged?

Thanks for your great work! The review is ongoing. YOLOv3-tiny would definitely be welcome.

zhjw0927 · 2020-07-06T09:08:47Z

Please release a TT100K benchmark.
For anchors and augmentation, there is no benchmark for follow-up research.

hyz-xmaster · 2020-07-08T12:00:35Z

I would suggest improving the stability of training performance. I have run into the same problem as #2773 when training the detectors on the COCO dataset. There is generally a performance gap of up to +0.2 or -0.2 even when using the same config file and the same seed. This is a bit annoying because you do not know whether a performance gain or drop is due to better parameters or just some randomness.
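
Until runs are reproducible, one rough way to judge a result is to compare the observed gain against the spread of repeated baseline runs. The sketch below is only an illustration of that heuristic, not a statistical test and not part of mmdetection; the function name `exceeds_run_noise` and all mAP values are hypothetical.

```python
from statistics import mean


def exceeds_run_noise(baseline_runs, candidate_runs):
    """Return True if the mean gain is larger than the baseline's run-to-run spread.

    Both arguments are lists of mAP values from repeated trainings
    with the same config and seed (hypothetical numbers below).
    """
    noise = max(baseline_runs) - min(baseline_runs)  # spread among identical runs
    gain = mean(candidate_runs) - mean(baseline_runs)
    return abs(gain) > noise


baseline = [36.3, 36.5, 36.4]        # three runs of the unchanged config
small_change = [36.5, 36.6, 36.4]    # gain is within the 0.2 spread
large_change = [37.0, 37.2, 37.1]    # gain is well outside the spread

within_noise = exceeds_run_noise(baseline, small_change)
real_gain = exceeds_run_noise(baseline, large_change)
```

With only a handful of runs this is a coarse check, so a gain close to the spread should still be treated with caution.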

mathmanu · 2020-07-11T04:52:59Z

I thought of bringing the following announcement to your notice, as we are discussing features to be included:

TensorFlow 2 meets the Object Detection API
https://blog.tensorflow.org/2020/07/tensorflow-2-meets-object-detection-api.html
"Over the last year we’ve been migrating our TF Object Detection API models to be TensorFlow 2....
A suite of TF2 compatible (Keras-based) models; ...., as well as a few new architectures for which we will only maintain TF2 implementations: (1) CenterNet - a simple and effective anchor-free architecture based on the recent Objects as Points paper by Zhou et al, and (2) EfficientDet ...."

I think it shows how important and how much awaited these detectors are: Objects as Points and EfficientDet.

manhongnie · 2020-07-12T08:24:23Z

I hope you will expand your ONNX support; otherwise it is not possible to deploy models trained with mmdetection. If you have succeeded with the ONNX export steps, please provide the ONNX model.

hellock · 2020-07-12T17:16:47Z

> I thought of bringing the following announcement to your notice, as we are discussing features to be included:
>
> TensorFlow 2 meets the Object Detection API
> https://blog.tensorflow.org/2020/07/tensorflow-2-meets-object-detection-api.html
> "Over the last year we’ve been migrating our TF Object Detection API models to be TensorFlow 2....
> A suite of TF2 compatible (Keras-based) models; ...., as well as a few new architectures for which we will only maintain TF2 implementations: (1) CenterNet - a simple and effective anchor-free architecture based on the recent Objects as Points paper by Zhou et al, and (2) EfficientDet ...."
>
> I think it shows how important and how much awaited these detectors are: Objects as Points and EfficientDet.

We have heard the community's voice for CenterNet, and will increase its priority in our roadmap. Hopefully we will introduce it in mmdet V2.4.

michaelschleiss · 2020-07-16T11:12:09Z

I would love to see Test Time Augmentation for Single Stage Detectors. #509

zhongqiu1245 · 2020-08-01T01:55:45Z

Hi, dear authors
Thank you for your amazing job!

Could you support BorderDet?
The original single-point feature can be directly optimized by using the border features, and the SOTA object detection algorithm BorderDet is proposed based on BorderAlign.

Motivation
Ex1. Many sliding-window object detectors (such as FCOS, SSD, RetinaNet) adopt the feature maps on the points of the grid to generate the bounding box predictions, but lack explicit border information for accurate localization.
Ex2. If those detectors used the border information, their performance would improve.
Ex3. There is a recent paper, "BorderDet: Border Feature for Dense Object Detection". It gives a detailed explanation of how to use border information to improve the performance of FCOS (mAP: 38.6 vs. 41.4) and has achieved good results.

Related resources
official code: https://github.com/Megvii-BaseDetection/BorderDet

zhongqiu1245 · 2020-08-06T02:40:31Z

Hi, dear authors
Thank you for your amazing job!
Could you support FPT?

They use a Transformer in FPN to get FPT (FPN + Transformer). FPT improves box-AP by 8.5% for object detection and mask-AP by 6.0% for instance segmentation over the baseline on MS-COCO test-dev.

Related resources
official code: https://github.com/ZHANGDONG-NJUST/FPT
paper: https://arxiv.org/abs/2007.09451

bluesky314 · 2020-08-11T15:19:32Z

This library is so good. What is stopping widespread usage is the lack of tutorials for beginners. If you guys spent some time preparing tutorials it would really help the library.

ZwwWayne · 2020-08-12T03:21:26Z

> This library is so good. What is stopping widespread usage is the lack of tutorials for beginners. If you guys spent some time preparing tutorials it would really help the library.

Hi @bluesky314 ,
Thanks for your kind advice. For the tutorials and docs, we have made some progress, e.g., documentation with 4 tutorials and colab tutorials. We are still working on that to make it easier for users to start. Could you be more specific? For example, what tutorials do you think would be valuable but are missing for now? We will try to complete them in our next release.

bluesky314 · 2020-08-13T03:57:26Z

Off the top of my head I can think of:
Creating Hooks
Using Hooks (like tensorboard, wandb, etc.)
Adding losses
Weighting/tweaking losses differently
Modifying the train loop

zeakey · 2020-08-22T04:18:37Z

Any plan for contour-based instance segmentation methods like PolarMask (https://arxiv.org/abs/1909.13226), DeepSnake (https://arxiv.org/abs/2001.01629) and Dense RepPoints (https://arxiv.org/pdf/1912.11473v3)?

wenmengzhou · 2020-08-24T15:04:56Z

Any plan for jit support? some related issues #1504 #2856

hellock changed the title ~~Roadmap of MMDetection :fire: :fire: :fire:~~ Roadmap of MMDetection Jun 7, 2020

hellock pinned this issue Jun 7, 2020

hellock added good first issue community discussion community help wanted labels Jun 7, 2020

yhcao6 unpinned this issue Jun 8, 2020

yhcao6 pinned this issue Jun 8, 2020

daavoo mentioned this issue Jun 17, 2020

Add efficientnet #3061

Open

hellock mentioned this issue Jul 31, 2020

Will support BorderDet？ #3430

Closed
