TuneContext
FittingGenerator → SpaceGeneratorFittingGenerator EvolutionarySearch → SearchStrategyEvolutionarySearch TuneContext → TuneContext → TuneContextInitialize
JSONDatabase → DatabaseJSONDatabase XGBModel → FeatureExtractor → PerStoreFeature → FeatureExtractorPerStoreFeauture MeasureCallback → MeasureCallbackDefault TaskScheduler → GradientBased → TaskSchedulerGradientBased
/src/tir/schedule/trace.cc
Array<ObjectRef> TranslateInputRVs(
const Array<ObjectRef>& inputs,
const std::unordered_map<const Object*, const Object*>& rv_map)
将调度指令的输入对象根据替换映射进行转换
ssh -p 25378 root@connect.bjb1.seetacloud.com 1.43512 1.36580
G1 rtx_3080 rtx4090
heron 0.03438 ms 5316s 0.015757 ms 3641
pibot 0.03384 ms 2235s 0.013777 ms 782
pytorch 0.510898 ms 1.953404 ms
G2 rtx_3080 rtx4090
heron 0.0014123 ms 2667 0.537718 ms 5100
pibot 0.0014334 ms 701 0.543329 ms 3565
pytorch 4.481762 ms 0.837625 ms
G3 rtx_3080 rtx4090
heron 0.010030 ms 1217 0.008226 ms 1236
pibot 0.009695 ms 1497 0.008218 ms 1026
pytorch 0.031238 ms 0.094816 ms
B1 rtx4090 rtx3080
heron 0.005173 ms 943 0.01365 ms 2699
pibot 0.005134 ms 881 0.01367 ms 1989
pytorch 0.042630 ms 0.083946 ms
B2 rtx4090
heron 0.005090 ms 2962 0.01402 ms 3388
pibo 0.005088 ms 1168 0.01365 ms 747
pytorch 0.024005 ms 0.050248 ms
B7 rtx4090
heron 0.053552 ms 5512 0.15041 ms 4917
pibo 0.053502 ms 3701 0.12068 ms 1323
pytorch 0.191892 ms 0.093483 ms
B8 rtx4090
heron 0.13958 ms 587
pibo 0.13949 ms 639
pytorch 0.203576 ms 0.159782 ms
B9 rtx4090
heron 0.13874 ms 1356 0.18474 ms 991
pibo 0.13735 ms 936 0.17858 ms 814
pytorch 0.19095 ms 0.166690 ms
C2 rtx3080
heron 0.040300 ms 341 0.011743 ms 5844
pibot 0.040653 ms 255 0.011532 ms 2727
pytorch 0.474093 ms 1.299625 ms
C3 rtx3080
heron 0.073979 ms 3845 0.036221 ms 2180
pibot 0.070862 ms 1400 0.039619 ms 2748
pytorch 0.120934 ms 0.121908 ms
C4
heron 0.013020 ms 735 0.007596 ms 4349
pibot 0.013155 ms 760 0.007115 ms 1260
pytorch 0.054416 ms 0.090722 ms
C5
heron 0.085828 ms 897 0.043186 ms 4888
pibot 0.088222 ms 1101 0.041430 ms 2576
pytorch 0.119524 ms 0.086970 ms
C7
heron 0.468694 ms 3939 0.336159 ms 1084
pibot 0.469949 ms 2121 0.336310 ms 580
pytorch 0.246989 ms 0.115868 ms
C8 heron 1.62734 ms 6345 0.765019 ms 14926 pibot 1.61799 ms 12566 0.722327 ms 10866 pytorch 1.438680 ms 0.760748 ms
C10
heron 1.273648 ms 5523 4.878418 ms 14446
pibot 1.232779 ms 4146 4.901200 ms 8578
pytorch 1.568065 ms 0.720168 ms
C11 heron 0.241642 ms 6217 pibot 0.202731 ms 3768 pytorch 0.930382 ms 0.441475 ms
1.309535 ms
G4 rtx_3080 rtx4090
heron 0.049233 ms 224 0.019519 ms 1912
pibot 0.049520 ms 244 0.019339 ms 1016
pytorch 0.104069 ms
G5 rtx_3080
heron 0.020117 ms 2255 0.013658 ms 926
pibot 0.019932 ms 1791 0.013728 ms 2193
pytorch 0.024846 ms
B3 rtx4090 heron 0.007050 ms 4019 0.01710 ms 664 pibo 0.006269 ms 1381 0.01710 ms 686 pytorch 0.02924 ms B4 rtx4090 heron 0.005636 ms 1574 0.01729 ms 747 pibo 0.005790 ms 3248 0.01748 ms 932 pytorch 0.02714 ms
C9
heron
pibot
pytorch 0.804250 ms 0.355089 ms
N H W CI CO KH KW stride padding dilation C7 16,224,224,3,64,3,3,1,1,1 C8 16,224,224,64,64,3,3,1,1,1 C10 16,112,112,128,128,3,3,1,1,1 C2 16,56,56,64,64,1,1,1,0,1 C3 16,28,28,128,128,3,3,1,1,1 C4 16,28,28,128,256,1,1,2,0,1
AMOS
_, C, H, W, K, _, R, S, _, stride, padding, dilation, _ 1, 3, 224, 224, 63, 1, 3, 3, 1, 1, 1, 1, 1 1, 64, 224, 224, 64, 1, 3, 3, 1, 1, 1, 1, 1 1, 128, 112, 112, 128, 1, 3, 3, 1, 1, 1, 1, 1 1, 64, 56, 56, 64, 1, 1, 1, 1, 1, 0, 1, 1 1, 128, 28, 28, 128, 1, 3, 3, 1, 1, 1, 1, 1 1, 128, 28, 28, 256, 1, 1, 1, 1, 2, 0, 1, 1
C2D AMOS Batch= 16 0.0823072016615084 0.2856333884812913 0.022220556887290806 1.5902389659442726 9.67448223 5.39098462
B1 12,512,64,512 B2 12,512,512,64 B7 16,512,768,768 B8 192,512,64,512 B9 192,512,512,64
BMM AMOS 0.07377967765776008 0.07971049660219552 0.20670990629139072 0.3435732154867257 0.6294092247992863
G1 1024,1024,1024 G2 4096,4096,4096 G3 32,2048,1000
GEMM AMOS Cost of gemm-float16-float16-layer-(32, 2048, 1000) is 0.037194 ms 0.07731810187079802 1.9382828080495358 0.03719387727609179
16, 224, 224, 3 , 64 , 3, 3, 1, 1, 1 16, 224, 224, 64 , 64 , 3, 3, 1, 1, 1 16, 112, 112, 64 , 128 , 3, 3, 1, 1, 1 16, 112, 112, 128, 128 , 3, 3, 1, 1, 1 16, 56, 56, 128 , 256 , 3, 3, 1, 1, 1 16, 56, 56, 256 , 256 , 3, 3, 1, 1, 1 16, 28, 28, 256 , 512 , 3, 3, 1, 1, 1 16, 28, 28, 512 , 512 , 3, 3, 1, 1, 1 16, 14, 14, 512 , 512 , 3, 3, 1, 1, 1 16, 7, 7, 512 , 512 , 3, 3, 1, 1, 1
\[\text{EI}(\mathbf{x}) = (\mu(\mathbf{x}) - f^* - \xi) \cdot \Phi(z) + \sigma(\mathbf{x}) \cdot \phi(z)\]