Perception: prepare code for multi-scale camera object detection #8649

KaWaiTsoiBaidu · 2019-06-04T21:13:58Z

Change the code to accept multi-scale camera object detection outputs from neural networks. This is preparation work for the Yolo-v3 object detection network.

techoe · 2019-06-04T22:54:13Z

Please fix the lint error.

techoe · 2019-06-05T00:00:02Z

modules/perception/camera/lib/obstacle/detector/yolo/region_output.cu

@@ -126,21 +129,25 @@ __global__ void get_object_kernel(int n,
    float scale = obj_data[loc_index];
    float cx = (w + sigmoid_gpu(loc_data[offset_loc + 0])) / width;
    float cy = (h + sigmoid_gpu(loc_data[offset_loc + 1])) / height;
-    float hw = exp(loc_data[offset_loc + 2]) * anchor_data[2 * c] / width * 0.5;
+    float hw = exp(max(-10.0f, min(loc_data[offset_loc + 2], 5.0f))) *


Please move -10.0f and 5.0f to constexpr values in the header file.
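A minimal host-side sketch of what this suggestion could look like. The names `kMinExpInput`, `kMaxExpInput`, `ClampedExp`, and `HalfWidth` are hypothetical and do not come from the PR. The added line in the diff is truncated, so the rest of the expression is assumed to follow the shape of the removed line.

```cpp
#include <algorithm>
#include <cmath>

// Hypothetical constant names; the thread does not show the ones committed.
constexpr float kMinExpInput = -10.0f;
constexpr float kMaxExpInput = 5.0f;

// Clamp the raw network output before exponentiating, so an outlier value
// cannot make exp() overflow or collapse the box size to zero.
inline float ClampedExp(float x) {
  return std::exp(std::max(kMinExpInput, std::min(x, kMaxExpInput)));
}

// Half-width of a box, assuming the same shape as the removed line:
// exp(raw) * anchor_width / feature_map_width * 0.5.
inline float HalfWidth(float raw, float anchor_w, int width) {
  return ClampedExp(raw) * anchor_w / static_cast<float>(width) * 0.5f;
}
```

With the bounds named in one place, the kernel and any host-side reference code can share them instead of repeating the literals.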

techoe · 2019-06-05T00:00:42Z

modules/perception/camera/lib/obstacle/detector/yolo/region_output.cu

-
+  int num_anchor_per_scale = num_anchor;
+  if (multi_scale){
+    num_anchor_per_scale /= 3;


Please define 3 as a constexpr in the header file as well.

techoe · 2019-06-05T00:02:12Z

modules/perception/camera/lib/obstacle/detector/yolo/region_output.cu

+    const float *ori_data = ori_data_vec[i];
+    const float *dim_data = dim_data_vec[i];
+    const float *anchor_data = yolo_blobs.anchor_blob->gpu_data()
+                               + num_anchor_per_scale * 2 * i;


static_cast(num_anchor_per_scale * 2 * i)

This is an index, so an integer should be used here.
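A small sketch of the per-scale anchor indexing discussed in this thread and the previous one. The names `kNumScales`, `kValuesPerAnchor`, `AnchorsPerScale`, and `AnchorsForScale` are hypothetical. The sketch assumes each anchor occupies two consecutive floats, which matches the `anchor_data[2 * c]` access and the `anchors_.size() / 2` expression in the diffs.

```cpp
// Hypothetical names for the literals 3 and 2 that appear in the diff.
constexpr int kNumScales = 3;        // number of detection scales
constexpr int kValuesPerAnchor = 2;  // two floats stored per anchor

// Number of anchors that belong to a single scale.
inline int AnchorsPerScale(int num_anchor, bool multi_scale) {
  return multi_scale ? num_anchor / kNumScales : num_anchor;
}

// Pointer to the first anchor value of the given scale. The offset is an
// element count, so it stays an integer and is added to the pointer directly.
inline const float* AnchorsForScale(const float* anchors,
                                    int num_anchor_per_scale, int scale) {
  return anchors + num_anchor_per_scale * kValuesPerAnchor * scale;
}
```

For example, with 9 anchors split over 3 scales, scale 1 starts 6 floats into the anchor array.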

techoe · 2019-06-05T00:04:36Z

modules/perception/camera/lib/obstacle/detector/yolo/region_output.cu

-    std::vector<float> conf_score(cpu_cls_data + k * num_candidates,
-                                  cpu_cls_data + (k + 1) * num_candidates);
+    std::vector<float> conf_score(cpu_cls_data + k * all_scales_num_candidates,
+                            cpu_cls_data + (k + 1) * all_scales_num_candidates);


Align the indentation of all arguments on the following lines with the first argument, e.g.
std::vector<float> conf_score(
    cpu_cls_data + k * all_scales_num_candidates,
    cpu_cls_data + (k + 1) * all_scales_num_candidates);

techoe · 2019-06-05T00:05:23Z

modules/perception/camera/lib/obstacle/detector/yolo/yolo_obstacle_detector.cc

  int obj_size =
-      output_height * output_width * static_cast<int>(anchors_.size()) / 2;
+    output_height_scale1 * output_width_scale1 *
+    static_cast<int>(anchors_.size()) / 2;


How about
int obj_size = (output_height_scale1 * output_width_scale1 *
                static_cast<int>(anchors_.size())) / 2;
to reduce truncation error?

Here, output_height_scale1 and output_width_scale1 are also integers, so maybe it does not matter?
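For reference, a small sketch of the integer arithmetic in question. In C++, `*` and `/` have the same precedence and associate left to right, so the unparenthesized expression already divides the full product. Only dividing the anchor count first could truncate early. The function names are hypothetical and the operands are plain `int` stand-ins for the values in the diff.

```cpp
// Form written in the diff: evaluated as ((h * w) * n) / 2.
constexpr int ObjSizeUnparenthesized(int h, int w, int n) {
  return h * w * n / 2;
}

// Form suggested in review: explicit parentheses, same evaluation order.
constexpr int ObjSizeParenthesized(int h, int w, int n) {
  return (h * w * n) / 2;
}

// Dividing first truncates n / 2 before multiplying, which can differ
// when n is odd.
constexpr int ObjSizeDivideFirst(int h, int w, int n) {
  return h * w * (n / 2);
}

static_assert(ObjSizeUnparenthesized(3, 5, 5) == ObjSizeParenthesized(3, 5, 5),
              "both forms divide the full product");
static_assert(ObjSizeDivideFirst(3, 5, 5) != ObjSizeParenthesized(3, 5, 5),
              "early division truncates for odd n");
```

When the anchor list holds an even number of values, as it does when every anchor contributes two floats, all three forms agree.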

techoe · 2019-06-05T00:10:33Z

modules/perception/camera/lib/obstacle/detector/yolo/yolo_obstacle_detector.cc

+      (output_height_scale1 * output_width_scale1 +
+       output_height_scale2 * output_width_scale2 +
+       output_height_scale3 * output_width_scale3) *
+      static_cast<int>(anchors_.size()) / 2 / 3;


Please change /2/3 to one constexpr float.

Here, all the operands in the expression are integers, so I created two constexpr values, one for 2 and one for 3, and used them here as you suggested above, instead of a constexpr float for 2/3.
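A sketch of the integer-only version described in this reply. The constant and function names are hypothetical, since the thread does not show the committed ones. Note that `/ 2 / 3` divides by 6, so a single float constant equal to 2/3 would not be a drop-in replacement.

```cpp
// Hypothetical names for the two integer constants mentioned in the reply.
constexpr int kValuesPerAnchor = 2;  // floats stored per anchor
constexpr int kNumScales = 3;        // number of detection scales

// Total number of candidate boxes over three feature maps, following the
// expression in the diff: the summed grid cells times the anchors per scale.
constexpr int TotalObjSize(int h1, int w1, int h2, int w2, int h3, int w3,
                           int num_anchor_values) {
  return (h1 * w1 + h2 * w2 + h3 * w3) * num_anchor_values /
         kValuesPerAnchor / kNumScales;
}
```

For example, grids of 2x2, 4x4, and 8x8 cells with 18 anchor values (9 anchors, 3 per scale) give 84 cells times 3 anchors, or 252 candidates.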

KaWaiTsoiBaidu requested review from weidezhang, xiaoxq, techoe, yuliangguo and gchen-apollo June 4, 2019 21:13

techoe changed the title ~~prepare code for multi-scale camera object detection~~ Perception: prepare code for multi-scale camera object detection Jun 4, 2019

prepare code for multi-scale camera object detection

c73a98a

KaWaiTsoiBaidu force-pushed the multi-scale_camera_obj_detection branch from 07b743d to c73a98a Compare June 4, 2019 23:01

techoe suggested changes Jun 5, 2019

add constexpr for const value

df9889e

techoe approved these changes Jun 6, 2019

techoe merged commit 97eb919 into ApolloAuto:master Jun 6, 2019
