Thursday, November 19, 2020

Nvidia TLT Analysis

DetectPostprocessor::parseBoundingBox
networkInfo 640x368

outputCoverageLayerIndex
outputCoverageBuffer [4 40 23] confidence for each of the 4 classes

outputBBoxLayerIndex
outputBboxBuffer [16 40 23] numClasses*4 channels: x1, y1, x2, y2 for each class

targetShape [40 23]
gridSize 40*23
strideX 16
strideY 16
gcCenters0 [40] (0.5 16.5 32.5...624.5)/35
gcCenters1 [23] (0.5 16.5 32.5...352.5)/35
numClasses 4
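
The stride and grid-center values above can be reproduced with a few lines. This is a minimal Python sketch, not the DeepStream C++ code itself: the function name `grid_centers` and its parameters are hypothetical, and it assumes the stride is the network size divided by the grid size (rounded up) and that the normalization constant is the 35 shown in the notes.

```python
def grid_centers(net_size, grid_size, bbox_norm=35.0):
    """Return (stride, normalized cell centers) along one axis.

    Hypothetical helper: assumes stride = ceil(net_size / grid_size)
    and center_i = (i * stride + 0.5) / bbox_norm, matching the
    values listed in the notes.
    """
    stride = -(-net_size // grid_size)  # integer ceiling division
    centers = [(i * stride + 0.5) / bbox_norm for i in range(grid_size)]
    return stride, centers


# Network input 640x368 with a 40x23 output grid, as in the notes.
stride_x, gc_centers0 = grid_centers(640, 40)
stride_y, gc_centers1 = grid_centers(368, 23)

# Un-normalized centers of the last cell on each axis.
last_x = gc_centers0[-1] * 35.0
last_y = gc_centers1[-1] * 35.0
```

With these inputs both strides come out as 16, and the last un-normalized centers are 624.5 and 352.5, which agrees with the ranges written next to gcCenters0 and gcCenters1.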

ClassifyPostprocessor::parseAttributesFromSoftmaxLayers
m_OutputLayerInfo[1]
m_OutputLayerInfo[1].inferDims[12 1 1] probability for each of the 12 classes
numClasses=12
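
A [12 1 1] softmax output is just a vector of 12 probabilities, so turning it into a label comes down to picking the largest entry. The Python sketch below illustrates that idea only; `pick_class` and its `threshold` parameter are hypothetical names, and the threshold behavior is an assumption rather than something taken from the notes.

```python
def pick_class(probabilities, threshold=0.5):
    """Return (class_index, probability) for the most likely class.

    Hypothetical helper: returns None when the best probability is
    below `threshold` (an assumed cut-off, not from the notes).
    """
    best = max(range(len(probabilities)), key=probabilities.__getitem__)
    if probabilities[best] < threshold:
        return None
    return best, probabilities[best]


# Example: 12 class probabilities with class 3 dominating.
probs = [0.3 / 11] * 12
probs[3] = 0.7
result = pick_class(probs)

# A flat distribution has no entry above the threshold.
uniform_result = pick_class([1.0 / 12] * 12)
```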


Reference: BBox Ground Truth Generator
cov: Batch_size, Num_classes, image_height/16, image_width/16
bbox: Batch_size, Num_classes * 4, image_height/16, image_width/16 (where 4 is the number of coordinates per cell)
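
As a quick check of the two shape formulas, here is a small Python sketch that plugs in the 640x368 network size and 4 classes from the notes. `output_shapes` is a hypothetical helper, and it assumes the image dimensions divide evenly by 16. Note that the reference orders the dimensions height before width, while the notes above list the grid as 40 by 23, i.e. width first.

```python
def output_shapes(batch_size, num_classes, image_height, image_width, stride=16):
    """Return (cov_shape, bbox_shape) following the reference layout.

    Hypothetical helper: assumes height and width are multiples of
    `stride`, and 4 coordinates per grid cell for the bbox tensor.
    """
    grid_h = image_height // stride
    grid_w = image_width // stride
    cov = (batch_size, num_classes, grid_h, grid_w)
    bbox = (batch_size, num_classes * 4, grid_h, grid_w)
    return cov, bbox


# 640 wide by 368 high, 4 classes, batch size 1.
cov_shape, bbox_shape = output_shapes(1, 4, 368, 640)
```

For this input the coverage tensor has 4 channels and the bbox tensor has 16, both on a 23 by 40 (height by width) grid.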

