https://arxiv.org/abs/1506.02640 You Only Look Once: Unified, Real-Time Object Detection

https://arxiv.org/abs/1512.02325 SSD: Single Shot MultiBox Detector

https://github.com/facebookresearch/deepmask

https://devblogs.nvidia.com/parallelforall/detectnet-deep-neural-network-object-detection-digits/

https://arxiv.org/abs/1606.02147v1 ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation

https://arxiv.org/pdf/1611.08588v1.pdf PVANet: Lightweight Deep Neural Networks for Real-time Object Detection

In object detection, reducing computational cost is as important as improving accuracy for most practical usages. This paper proposes a novel network structure, which is an order of magnitude lighter than other state-of-the-art networks while maintaining the accuracy. Based on the basic principle of more layers with less channels, this new deep neural network minimizes its redundancy by adopting recent innovations including C.ReLU and Inception structure. We also show that this network can be trained efficiently to achieve solid results on well-known object detection benchmarks: 84.9% and 84.2%mAP on VOC2007 and VOC2012 while the required compute is less than 10% of the recent ResNet-101.

http://mi.eng.cam.ac.uk/projects/segnet/#code SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling

https://arxiv.org/abs/1602.03409 Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

https://arxiv.org/abs/1703.07431v1 IOD-CNN: Integrating Object Detection Networks for Event Recognition

https://github.com/BichenWuUCB/squeezeDet

https://github.com/tensorflow/models/tree/master/object_detection

http://www.erogol.com/online-hard-example-mining-pytorch/

https://arxiv.org/abs/1804.06215v2 DetNet: A Backbone network for Object Detection There has been little work discussing on the backbone feature extractor specifically designed for the object detection. More importantly, there are several differences between the tasks of image classification and object detection. 1. Recent object detectors like FPN and RetinaNet usually involve extra stages against the task of image classification to handle the objects with various scales. 2. Object detection not only needs to recognize the category of the object instances but also spatially locate the position. Large downsampling factor brings large valid receptive field, which is good for image classification but compromises the object location ability.